Presentation is loading. Please wait.

Presentation is loading. Please wait.

CS273a Lecture 9/10, Aut 10, Batzoglou Multiple Sequence Alignment.

Similar presentations


Presentation on theme: "CS273a Lecture 9/10, Aut 10, Batzoglou Multiple Sequence Alignment."— Presentation transcript:

1 CS273a Lecture 9/10, Aut 10, Batzoglou Multiple Sequence Alignment

2 CS273a Lecture 9/10, Aut 10, Batzoglou CS273a Lecture 10, Fall 2010 Evolution at the DNA level …ACGGTGCAGTTACCA… …AC----CAGTCCACCA… Mutation SEQUENCE EDITS REARRANGEMENTS Deletion Inversion Translocation Duplication

3 CS273a Lecture 9/10, Aut 10, Batzoglou CS273a Lecture 10, Fall 2010 Orthology and Paralogy HB Human WB Worm HA1 Human HA2 Human Yeast WA Worm Orthologs: Derived by speciation Paralogs: Everything else Orthologs: Derived by speciation Paralogs: Everything else

4 CS273a Lecture 9/10, Aut 10, Batzoglou CS273a Lecture 10, Fall 2010 Orthology, Paralogy, Inparalogs, Outparalogs

5 CS273a Lecture 9/10, Aut 10, Batzoglou CS273a Lecture 10, Fall 2010

6 CS273a Lecture 9/10, Aut 10, Batzoglou CS273a Lecture 10, Fall 2010 Genome Evolution – Macro Events Inversions Deletions Duplications

7 CS273a Lecture 9/10, Aut 10, Batzoglou CS273a Lecture 10, Fall 2010 Synteny maps Comparison of human and mouse

8 CS273a Lecture 9/10, Aut 10, Batzoglou CS273a Lecture 10, Fall 2010 Synteny maps

9 CS273a Lecture 9/10, Aut 10, Batzoglou CS273a Lecture 10, Fall 2010 Synteny maps

10 CS273a Lecture 9/10, Aut 10, Batzoglou CS273a Lecture 10, Fall 2010 Synteny maps

11 CS273a Lecture 9/10, Aut 10, Batzoglou CS273a Lecture 10, Fall 2010 Building synteny maps Recommended local aligners BLASTZ  Most accurate, especially for genes  Chains local alignments WU-BLAST  Good tradeoff of efficiency/sensitivity  Best command-line options BLAT  Fast, less sensitive  Good for comparing very similar sequences finding rough homology map

12 CS273a Lecture 9/10, Aut 10, Batzoglou CS273a Lecture 10, Fall 2010 Index-based local alignment Dictionary: All words of length k (~10) Alignment initiated between words of alignment score  T (typically T = k) Alignment: Ungapped extensions until score below statistical threshold Output: All local alignments with score > statistical threshold …… query DB query scan Question: Using an idea from overlap detection, better way to find all local alignments between two genomes?

13 CS273a Lecture 9/10, Aut 10, Batzoglou CS273a Lecture 10, Fall 2010 Local Alignments

14 CS273a Lecture 9/10, Aut 10, Batzoglou CS273a Lecture 10, Fall 2010 After chaining

15 CS273a Lecture 9/10, Aut 10, Batzoglou Chaining local alignments 1.Find local alignments 2.Chain -O(NlogN) L.I.S. 3.Restricted DP

16 CS273a Lecture 9/10, Aut 10, Batzoglou Progressive Alignment When evolutionary tree is known:  Align closest first, in the order of the tree  In each step, align two sequences x, y, or profiles p x, p y, to generate a new alignment with associated profile p result Weighted version:  Tree edges have weights, proportional to the divergence in that edge  New profile is a weighted average of two old profiles x w y z Example Profile: (A, C, G, T, -) p x = (0.8, 0.2, 0, 0, 0) p y = (0.6, 0, 0, 0, 0.4) s(p x, p y ) = 0.8*0.6*s(A, A) + 0.2*0.6*s(C, A) + 0.8*0.4*s(A, -) + 0.2*0.4*s(C, -) Result: p xy = (0.7, 0.1, 0, 0, 0.2) s(p x, -) = 0.8*1.0*s(A, -) + 0.2*1.0*s(C, -) Result: p x- = (0.4, 0.1, 0, 0, 0.5)

17 CS273a Lecture 9/10, Aut 10, Batzoglou CS273a Lecture 10, Fall 2010 Threaded Blockset Aligner Human–Cow HMR – CD Restricted Area Profile Alignment

18 CS273a Lecture 9/10, Aut 10, Batzoglou CS273a Lecture 10, Fall 2010 Reconstructing the Ancestral Mammalian Genome Human: C Baboon: C Cat: C Dog: G C C or G G

19 CS273a Lecture 9/10, Aut 10, Batzoglou CS273a Lecture 10, Fall 2010 Neutral Substitution Rates

20 CS273a Lecture 9/10, Aut 10, Batzoglou CS273a Lecture 10, Fall 2010 Finding Conserved Elements (1) Binomial method  25-bp window in the human genome  Binomial distribution of k matches in N bases given the neutral probability of substitution

21 CS273a Lecture 9/10, Aut 10, Batzoglou CS273a Lecture 10, Fall 2010 Finding Conserved Elements (2) Parsimony Method  Count minimum # of mutations explaining each column  Assign a probability to this parsimony score given neutral model  Multiply probabilities across 25-bp window of human genome A C A A G

22 CS273a Lecture 9/10, Aut 10, Batzoglou CS273a Lecture 10, Fall 2010 Finding Conserved Elements

23 CS273a Lecture 9/10, Aut 10, Batzoglou CS273a Lecture 10, Fall 2010 Finding Conserved Elements (3) GERP

24 CS273a Lecture 9/10, Aut 10, Batzoglou CS273a Lecture 10, Fall 2010 Phylo HMMs HMM Phylogenetic Tree Model Phylo HMM

25 CS273a Lecture 9/10, Aut 10, Batzoglou CS273a Lecture 10, Fall 2010 Finding Conserved Elements (3)

26 CS273a Lecture 9/10, Aut 10, Batzoglou CS273a Lecture 10, Fall 2010 How do the methods agree/disagree?

27 CS273a Lecture 9/10, Aut 10, Batzoglou CS273a Lecture 10, Fall 2010 Statistical Power to Detect Constraint L N C: cutoff # mutations D: neutral mutation rate  : constraint mutation rate relative to neutral

28 CS273a Lecture 9/10, Aut 10, Batzoglou CS273a Lecture 10, Fall 2010 Statistical Power to Detect Constraint L N C: cutoff # mutations D: neutral mutation rate  : constraint mutation rate relative to neutral


Download ppt "CS273a Lecture 9/10, Aut 10, Batzoglou Multiple Sequence Alignment."

Similar presentations


Ads by Google