Presentation is loading. Please wait.

Presentation is loading. Please wait.

GA for Sequence Alignment  Pair-wise alignment  Multiple string alignment.

Similar presentations


Presentation on theme: "GA for Sequence Alignment  Pair-wise alignment  Multiple string alignment."— Presentation transcript:

1 GA for Sequence Alignment  Pair-wise alignment  Multiple string alignment

2 Pairwise Sequence Alignment  VNRLQQNIVSLEVDHKVANYKP  VNRLQQSIVSLRDAFNDGELD HRVLNYKP  Solving by a dynamic programming using Dayhoff matrics  Each pairwise alignment needs O(n 1 n 2 )  VNRLQQNIVSL__________EVDHKVANYKP  VNRLQQSIVSLRDAFND GELD HRVLNYKP

3 How to implement a GA ?  Representation  Fitness  Operators design  Selection strategy

4 Pair-wise Alignment: Representation  How do you think?  For example (my intuitively way) –Guess a length n –Chromosome

5 Pair-wise Alignment: Representation  So the chromosome becomes:  You can also use the gap position (1,2,4,5,6,8….) (2,4,5,7,8,10….)

6 Pair-wise Alignment: Fitness Function  Simplest –Match : 1 –Dismatch : -2 –Gap : -1  Using the scoring matrix –Protein : PAM,… –DNA: substitution matrix  Summarize the total score.

7 Pair-wise Alignment: Genetic Operators  All our previous operators. –Image one!!!  Selection –Try it!!!

8 Conclusion About Pair-wise Alignment  DP can solve it in O(NM)  GA can’t have too much advantage.

9 RPCVCPVLRQAAQ s 1 RPCVC_ P__VLRQAAQa 1 RPCACCPVLRQVVQ s 2 RPCACCP__VLRQVVQa 2 KPCLCPRQLRQV s 3 KPCLC_ P RQLRQV_ _a 3 KPCCPRQAAQ s 4 KPC_C_ P____ RQAAQa 4 SA

10 Multiple String Alignment: Representation  How do you think?  For example (my intuitively way) –Guess a length n –Chromosome

11 Multiple String Alignment: Representation  So the chromosome becomes:  You can also use the gap position –Need fewer space –Some good operators….. (1,2,4,5,6,8….) (2,4,5,7,8,10….) …

12 Multiple String Alignment: Fitness Function  The most hard part  You can never know what is the real scoring system! Even biologists!!!  Approximation –Using SOP (sum of pairs) The most widely used Using PAM,… –Motif-based…


Download ppt "GA for Sequence Alignment  Pair-wise alignment  Multiple string alignment."

Similar presentations


Ads by Google