Presentation is loading. Please wait.

Presentation is loading. Please wait.

Content of the previous class Introduction The evolutionary basis of sequence alignment The Modular Nature of proteins.

Similar presentations


Presentation on theme: "Content of the previous class Introduction The evolutionary basis of sequence alignment The Modular Nature of proteins."— Presentation transcript:

1 Content of the previous class Introduction The evolutionary basis of sequence alignment The Modular Nature of proteins

2 Optimal alignment methods

3 Number of distinct alignments is large So it is usually of interest to identify the ‘‘best’’ one among them The Needleman-Wunsch algorithm is an application of a best-path strategy It is called dynamic programming to the problem of finding optimal sequence alignments

4 It is important that optimal methods always report the best alignment that can be achieved, even if it has no biological meaning. The one with the highest score is reported as the optimal local alignment. When searching for local alignments there may be several significant alignments so it is a mistake to look only at the optimal one.

5 Alignment methods 1. Dot matrix analysis 2. The dynamic programming a). Global alignment of sequences by Needleman and Wunsch b). Local alignment by Smith and Waterman 3. Word or k-tuple methods, such as used by the programs FASTA and BLAST

6 Dot matrix analysis

7 A dot matrix analysis is a method for comparing two sequences to look for possible alignment of characters between the sequences First described by Gibbs and McIntyre (1970) Dot matrix method displays any possible sequence alignments as diagonals on the matrix.

8 Dot matrix analysis can readily reveal the presence of insertions/deletions and direct and inverted repeats.

9

10

11

12 Tolls DNA Strider used on a Macintosh computer. The program DOTTER has interactive features for the UNIX The Genetics Computer Group programs(GCG) COMPARE and DOTPLOT also perform a dot matrix analysis.

13 Dynamic programming

14 Dynamic programming is a computational method that is used to align two protein or nucleic acid sequences. It provides the very best or optimal alignment between sequences The method compares every pair of characters in the two sequences and generates an alignment.

15 This alignment will include matched and mismatched characters and gaps in the two sequences The method has been proven mathematically to produce the best or optimal alignment between two sequences under a given set of match conditions.

16 The dynamic programming method, first used for global alignment of sequences by Needleman and Wunsch (1970) and For local alignment by Smith and Waterman (1981), provides one or more alignments of the sequences

17 Global Alignment: Needleman- Wunsch Algorithm The optimal score at each matrix position is calculated by adding the current match score to previously scored positions and subtracting gap penalties Each matrix position may have a positive or negative score, or 0.

18 Local Alignment: Smith-Waterman Algorithm Local alignments are usually more meaningful than global matches because they include patterns that are conserved in the sequences.

19 The rules for calculating scoring matrix values are slightly different The most important differences being (1)the scoring system must include negative scores for mismatches, and (2) when a dynamic programming scoring matrix value becomes negative, that value is set to zero which has the effect of terminating any alignment up to that point


Download ppt "Content of the previous class Introduction The evolutionary basis of sequence alignment The Modular Nature of proteins."

Similar presentations


Ads by Google