Presentation is loading. Please wait.

Presentation is loading. Please wait.

Splicing Exons: A Eukaryotic Challenge to Gene Prediction Ian McCoy.

Similar presentations


Presentation on theme: "Splicing Exons: A Eukaryotic Challenge to Gene Prediction Ian McCoy."— Presentation transcript:

1 Splicing Exons: A Eukaryotic Challenge to Gene Prediction Ian McCoy

2

3 Gene Prediction Genes must be identified to make the genome useful Computational Problem: Take a seemingly random sequence of characters, millions or billions of bases long, and find the genes.

4 A Serious Complication Only 3% of the human genome contains genes

5 Similarity-Based Approach Instead of looking for a gene for a target protein directly, use a protein in a related organism. Find all local similarities between a genomic sequence and the target protein sequence. All substrings that exhibit a certain level of similarity will be called putative exons.

6 Exon-Chaining Problem 1. Use brute force to generate a set of putative exons. 2. Represent each exon with three parameters (l,r,w). 3. Find a maximum set of nonoverlapping putative exons.

7 Formulate as Graph Problem Create a graph G with 2n verticies: n vertices are starting(left) positions of exons and n vertices are ending(right) positions of exons. The set of left and right interval ends is sorted into increasing order. There are edges between each l i and r i of weight w i for I from 1 to n; and 2n-1 additional edges of weight 0 connecting adjacent vertices.

8 Input: A set of weighted intervals (putative exons) Output: The length of the maximum chain of intervals from this set

9 Dynamic Programming Algorithm ExonChaining (G, n) //Graph, number of intervals 1 for i ← 1 to 2n 2 s i ← 0 3 for i ← 2 to 2n 4 if vertex v i in G corresponds to right end of the interval I 5 j ← index of vertex for left end of the interval I 6 w ← weight of the interval I 7 s j ← max {s j + w, s i-1 } 8 else 9 s i ← s i-1 10 return s 2n

10

11 Shortcomings A large number of short exons will decrease the efficacy of our method for finding putative exons. Exons may be out of order.

12 Any Questions? Jones, Neil C., and Pavel A. Pevzner. An Introduction to Bioinformatics Algorithms. Cambridge: MIT Press, 2004. (p.200-203)


Download ppt "Splicing Exons: A Eukaryotic Challenge to Gene Prediction Ian McCoy."

Similar presentations


Ads by Google