Presentation is loading. Please wait.

Presentation is loading. Please wait.

Structure Alignment in Polynomial Time Rachel Kolodny Stanford University Nati Linial The Hebrew University of Jerusalem.

Similar presentations


Presentation on theme: "Structure Alignment in Polynomial Time Rachel Kolodny Stanford University Nati Linial The Hebrew University of Jerusalem."— Presentation transcript:

1 Structure Alignment in Polynomial Time Rachel Kolodny Stanford University Nati Linial The Hebrew University of Jerusalem

2 Problem Statement 2 structures in R 3 A={a 1,a 2,…,a n }, B={b 1,b 2,…,b m } Find subsequences s a and s b s.t the substructures {a s a (1),a s a (2),…, a s a (l) },{b s b (1),b s b (2),…, b s b (l) } are similar

3 Motivation Structure is better conserved than amino acid sequence –Structure similarity can give hints to common functionality/origin Allows automatic classification of protein structure

4 Correspondence  Position Given a correspondence the rotation and translation that minimize the cRMS distance can be calculated Kabsch, W. (1978).

5 Position  Correspondence Given a rotation and translation one can calculate the alignment that optimizes a (separable) score –Using dynamic programming –Essentially similar to sequence alignment Example score

6 Score  cRMS We want to give “bonus points” for longer correspondences –e.g. corresponding ONE atom from each structure has 0 cRMS Even better scores ? –vary gap penalty depending on position in structure –Incorporate sequence information

7 Score  cRMS A specific correspondence

8 Previous Work Distance MatricesHeuristics in rotation and translation space DALI [Holm and Sander 93] CONGENEAL [Yee & Dill 93] SSAP [Taylor & Orengo 89] Nussinov-Wolfson [89,93] Godzik [93] … STRUCTAL [Subibiah et al 93] COMPARER [Sali & Blundell 90] LOCK [Singh & Brutlag 97] CE [Shindyalov & Bourne 98] Taylor (??) [93] Zu-Kang & Sipppl 96 (?) … *most data taken from Orengo 94

9 “…It can be proved that, for these reasons, finding an optimal structural alignment between two protein structures is an NP hard problem and thus there are no fast structural alignment algorithms that are guaranteed to be optimal within any given similarity measure…” Adam Godzik ‘The structural alignment between two proteins: Is there a unique answer’ 1996 “There is no exact solution to the protein structure alignment problem, only the best solution for the heuristics used in the calculation.” Shindyalov & Bourne ‘Protein Structure Alignment by Incremental Combinatorial (CE) of the Optimal Path’ 1998

10 Exponentially many Focus on Scoring Functions

11 Exponentially many Focus on Scoring Functions

12 Exponentially many All Maxima are interesting Noisy data !!

13 Good scoring functions Each of the functions is well-behaved –Satisfies Lipschitz condition Thus, the maximum over a finite set is well-behaved In each dimension two points at distance  have function values that vary by O(n  ) Need O(n) samples in every dimension

14 Sampling is Sufficient

15 Polynomial Algorithm Sample in rotation and translation space –compute best score (and alignment) for each sample point Return maximum score Need O(n 6 n 2 ) time and O(n 2 ) space

16 Internal Distance Matrices Invariant to position and rotation of structures  can be compared directly Find largest common sub-matrices (LCM) whose distances are roughly the same

17 LCM is NP-complete Harder than MAX- CLIQUE Matrices encode distances that are positive, symmetric and obey triangle inequality 011111 101111 110111 111011 111101 111110 0123233452 1012112341 2103223452 3230123452 2121012341 3122101231 3233210122 4344321013 5455432104 2122112340

18 Example 1dme 28 amino acids 1jjd 51 amino acids Best STRUCTAL score 149 Best score found by exhaustive search 197

19 Heuristic Consider only translations that positions an atom from protein A on an atom of protein B O(m*n) instead of O((n+m) 3 )


Download ppt "Structure Alignment in Polynomial Time Rachel Kolodny Stanford University Nati Linial The Hebrew University of Jerusalem."

Similar presentations


Ads by Google