Download presentation
Presentation is loading. Please wait.
Published bySage Bradwell Modified over 10 years ago
1
Segment Alignment (SEA) Yuzhen Ye Adam Godzik The Burnham Institute
2
Outline A new look at the local structure prediction Network matching problem Practical issues Applications
3
GSDKKGNGVALMTTLFADN EEEEEE HHHHHHHHHHHHHH EEEEEE LLHHHHHHHHLLL LHHHHHLLLL LLLEEEEEEEEE LLLLL Description of local structure one or many answers? GSDKKGNGVALMTTLFADN LLHHHHHHHHLLLEEEEEE A prediction HHHHHHHHLLLLLHHHHHH Real structure
4
Motivation A natural description of local structures: keep the segment information of local structures Keep uncertainties in local structure predictions: drawbacks of prediction programs and intrinsic uncertainties of local structures in absence of global interactions Incorporating the protein local structure in protein sequence comparison may help to detect the distant homologies and to improve their alignments (for homology modeling)
5
Proteins are described as a network of PLSSs (predicted local structure segments)
6
Protein comparison problem is equivalent to a network matching problem Given two networks of PLSSs, find two optimal paths from the source to the sink in each of the networks, whose corresponding PLSSs are most similar to each other. It does not follow the typical position-by-position alignment mode
7
Solving the network matching problem: dynamic programming V(i,j) i j V(i 1,j 1 ) V(i 1,j 2 ) V(i 3,j 1 ) V(i 3,j 2 ) (i-1) 1 i1i1 3 2 1 2 4 (i-1) 3, (i-1) 4 i2i2
8
Example: (1e68A,1nkl) Each protein is represented as a collection of potentially overlapping and contradictory PLSSs (a network). SEA finds an optimal alignment between these two proteins Simultaneously, SEA identifies the optimal subset of PLSSs (a path in the network) describing each protein. 1e68A: Bacteriocin As-48 1nkl : Nk-lysin
9
General performance of SEA incorporating different local structure diversities
10
Keeping local structure diversity helps improve alignment quality alignment between -repressor from E.coli (1lliA) and 434 repressor (1r69)
11
Stable region Variable region Local structure information is crucial for improving alignments, especially in the more divergent regions 1esfA: straphylococcal enterotoxin 2tssA: toxic shock syndrome toxin-1
12
Practical issue: local structural prediction Searching I-site database (web-server or standalone program) Our solution: FragLib –using sensitive profile-profile alignment program FFAS to predict local structures
13
Applications Distant homology detection Local structure prediction Improving alignments for protein modeling
14
Reference A segment alignment approach to protein comparison (Bioinformatics, April issue) Web server http://ffas.ljcrf.edu/sea
15
Related work Spliced sequence alignment –Gelfand et al., 1996, PNAS; Novichkov et al., 2001 –Assembling genes from alternative exons Jumping alignment –Spang R, Rehmsmeier M, Stoye J. JCB, 2002 –Computes a local alignment of a single sequence and a multiple alignment –The sequence is at each position aligned to one sequence of the multiple alignment (reference sequence) instead of a profile Partial order alignment –Lee C, Grasso C, Sharlow MF, Bioinformatics, 2002 –Multiple alignment
16
Acknowledgements Dariusz Plewczyński Iddo Friedberg Łukasz Jaroszewski Weizhong Li This project is supported by SPAM grant GM63208
Similar presentations
© 2024 SlidePlayer.com Inc.
All rights reserved.