Presentation is loading. Please wait.

Presentation is loading. Please wait.

Physical Mapping of DNA BIO/CS 471 – Algorithms for Bioinformatics.

Similar presentations


Presentation on theme: "Physical Mapping of DNA BIO/CS 471 – Algorithms for Bioinformatics."— Presentation transcript:

1 Physical Mapping of DNA BIO/CS 471 – Algorithms for Bioinformatics

2 Physical Mapping2 Landmarks on the genome  Identify the order and/or location of sequence landmarks on the DNA BamH1 – GGATCC Restriction Enzyme DigestsHybridization Mapping yxzw Probes Clones

3 Physical Mapping3 Producing a map of the genome x i j a m u zd w f e m u zd

4 Physical Mapping4 Restriction Fragment Mapping 38610 45117 31526373152637 A: B: A + B:

5 Physical Mapping5 Set Partitioning  The Set Partition Problem Input: X = {x 1, x 2, x 3, … x n } Output: Partition of X into Y and Z such that  Y =  Z  This problem is NP Complete  Suppose we have X = {3, 9, 6, 5, 1} Can we recast the problem as a double digest problem?

6 Physical Mapping6 Reduction from Set Partition  X = {3, 9, 6, 5, 1} 13569 12 13569 19356 19356

7 Physical Mapping7 NP-Completeness  Suppose we could solve the Double Digest Problem in polynomial time… Instance of Set Partition Convert* Instance of Double Digest Solve in polynomial time *polynomial time conversion

8 Physical Mapping8 Hybridization Mapping  The sequence of the clones remains unknown  The relative order of the probes is identified  The sequence of the probes is known in advance yxzw Probes Clones

9 Physical Mapping9 Interval graph representation a b d c e b d ace Becomes a graph coloring problem, which is (you guessed it) NP- Complete

10 Physical Mapping10 Simplifying Assumptions  Probes are unique Hybridize only once along the target DNA  There are no errors  Every probe hybridizes at every possible position on every possible clone

11 Physical Mapping11 Consecutive Ones Problem (C1P) Clones: Probes Rearrange the columns such that all the ones in every row are together:

12 Physical Mapping12 An algorithm for C1P 1.Separate the rows (clones) into components 2.Permute the components 3.Merge the permuted components S 1 = {1, 2, 4, 5, 7, 9} S 2 = {2, 3, 4, 5, 6, 7, 8, 9}

13 Physical Mapping13 Partitioning clones into components  Component graph G c Nodes correspond to clones Connect l i and l j iff:

14 Physical Mapping14 Component Graph l1l1 l8l8 l4l4 l5l5 l2l2 l3l3 l6l6 l7l7     Here, connected components are labeled with greek letters.

15 Physical Mapping15 Assembling a component  Consider only row 1 of the following:  Placing all of the ones together, we can place columns 2, 7, and 8 in any order … 0 1 1 1 0 … {2, 7, 8} {2, 7, 8} {2, 7, 8} l 1  l1l1 l2l2 l3l3

16 Physical Mapping16 Row 2  Because of the way we have constructed the component, l 2 will have some columns with 1’s where l 1 has 1’s, and some where l 2 does not.  Shall we place the new 1’s to the right or left?  Doesn’t matter because the reverse permutation is the same answer.

17 Physical Mapping17 Adding row 2  Placing column 5 to the left partially resolves the {2, 7, 8} columns … 0 0 1 1 1 0 … {5} {2, 7} {2, 7} {8} l 1  … 0 1 1 1 0 0 … l 2  S 1 = {2, 7, 8} S 2 = {2, 5, 7}

18 Physical Mapping18 Additional Rows  Select a new row k from the component such that edges (i, j) and (i, k) exist for two already added rows i, and k.  Look at the relationship between i and k, and between i and j to determine if k goes on the same side or the opposite side of i as j. i j k

19 Physical Mapping19 Definitions  Let  Place i on the same side as j if  Else, place on the opposite side i j k

20 Physical Mapping20 Placing Rows Place k on the same side as j if  Or: i i  j j  k k  So we place l 3 on the same side of l 2 as l 1, which is the right. l1l1 l2l2 l3l3 i i 

21 Physical Mapping21 Placing rows  Repeat for every row in the component … 0 0 1 1 1 0 0 0 … {5} {2} {7} {8} {1,4} {1,4} l 1  … 0 1 1 1 0 0 0 0 … l 2  … 0 0 0 1 1 1 1 0 … l 3 

22 Physical Mapping22 Joining Components  New graph: G m – the merge graph (directed) Nodes are connected components of G c Edge ( ,  ) iff every set in  is a subset of a set in  l1l1 l2l2 l3l3   

23 Physical Mapping23 Constructing G m l1l1 l8l8 l4l4 l5l5 l2l2 l3l3 l6l6 l7l7        

24 Physical Mapping24 Properties of G m  All rows in  will share the same disjoint/subset relationship with each row of  Different compenents  disjoint or subset Same component  shares a 1 column That column matches in a row of , then subset, else disjoint    

25 Physical Mapping25 Ordering components  Vertices without incoming edges: freeze their columns  Process the rest in topological order.  is a singleton and a subset    

26 Physical Mapping26 Ordering Components (2)  Find the leftmost column with a 1  In the current assembly, find the rows that contain all ones for that column  Merge the columns 


Download ppt "Physical Mapping of DNA BIO/CS 471 – Algorithms for Bioinformatics."

Similar presentations


Ads by Google