Presentation is loading. Please wait.

Presentation is loading. Please wait.

Physical Mapping Problem

Similar presentations


Presentation on theme: "Physical Mapping Problem"— Presentation transcript:

1 Physical Mapping Problem D7526010@csie.ntu.edu.tw

2 Problem Definition Physical mapping 的定義

3 A B C D E F G H J K L M N O P Q H J DNA Fragment of DNA

4 Why We Need Physical Mapping 可以利用這個地圖將 DNA 做完全排序 可以知道基因到底如何對人類產生作用 利用人造蛋白質... 等等來改進遺傳體質

5 AGACTAGTCGTAACGATCGCTAATTTAAGGCTACT..... 人類染色體 ( 約 bp) Physical map ( 約 bp) DNA Sequencing ( 約 bp)

6 Why We Need Physical Mapping 可以利用這個地圖將 DNA 做完全排序 可以知道基因到底如何對人類產生作用 利用人造蛋白質... 等等來改進遺傳體質 可以得知基因 ( 或標記 ) 的大約位置 對於一些遺傳疾病可以得到較多的資訊 可以幫助偵測是否具有遺傳疾病

7 A B C D E F G H J K L M N O P Q H J DNA Fragment of DNA α

8

9 target DNA 加入酵素

10 Partial Digest Problem by single enzyme A restriction sites: a1< a2< a3<.....< ap multiset of fragment lengths {aj- ai,i<j}

11 target DNA

12 Double Digest Problem (DDP) Clones first completely digested by enzyme A,then by B, finally A and B together restriction sites: by A: a 1 < a 2 < a 3 <.....< a p by B: b 1 < b 2 < b 3 <.....< b q by A+B : c 1 < c 2 < c 3 <.....< c p+q Reconstruct the restriction sites from these multisets

13 Example : DDP Enzyme A 3 6 8 10 Enzyme B 4 5 7 11 Enzyme A+B 1 2 3 5 6 7

14 Solution

15 Double Digest Problem (DDP)

16 target DNA........

17 By Probe Approach

18 ATGCGCTAACTGGACTTCAAGCCTAAACTGCATCAGACTT................. TACGCGATTGACCTGAAGT........ Complementary probe target DNA The Spirit of Hybridization

19 target DNA A B C D E F GHGH I J 1234512345

20 12345 A B C D E F G H I J

21 12345 A111 B1 C111 D11 E11 F11 G11 H111 I111 J1

22 JDFIEGACHBJDFIEGACHB 1 2 3 4 5

23 12345 B1 H11 C111 A111 G11 E11 I111 F111 D11 J1

24 1 2 3 4 5 6 A、CA、C C、D、EC、D、E E、FE、F A、F、GA、F、G G、H、IG、H、I E、F、I、J、KE、F、I、J、K 1 2 3 4 5 6 A、B、CA、B、C C、D、EC、D、E E、FE、F F、GF、G G、H、IG、H、I I、J、KI、J、K False Negative

25 1 2 3 4 5 6 A、CA、C C、D、EC、D、E E、FE、F A、F、GA、F、G G、H、IG、H、I E、F、I、J、KE、F、I、J、K 1 2 3 4 5 6 A、B、CA、B、C C、D、EC、D、E E、FE、F F、GF、G G、H、IG、H、I I、J、KI、J、K False Positive

26 1 2 3 4 5 6 A、CA、C C、D、EC、D、E E、FE、F A、F、GA、F、G G、H、IG、H、I E、F、I、J、KE、F、I、J、K 1 2 3 4 5 6 A、B、CA、B、C C、D、EC、D、E E、FE、F F、GF、G G、H、IG、H、I I、J、KI、J、K Chimeric Clones

27 5 6 3 4 1 2 A、B、CA、B、C C、D、EC、D、E E、FE、F F、GF、G I、J、KI、J、K G、H、IG、H、I Clones Probes 1 2 3 4 5 6 A B C D E F G H I J K

28 5 6 3 4 1 2 A、B、CA、B、C C、D、EC、D、E E、F、KE、F、K I、J、K、F、GI、J、K、F、G I、J、KI、J、K G、H、IG、H、I Clones Probes 1 2 3 4 5 6 A B C D E F G H I J K

29 How To Use Traveling Salesman Problem to Solve Physical Mapping Problem

30 How to Convert to TSP? Hamming distance

31 ABCDEFGHIJ A0 B20 C020 D3330 E23320 F233200 G2312220 H12235530 I244111340 J4231333220 12345 A111 B1 C111 D11 E11 F11 G11 H111 I111 J1

32 How to Convert to TSP? Hamming distance Cycle weight = number of gaps transitions +2n

33 ABCDEFGHIJ A0 B20 C020 D3330 E23320 F233200 G2312220 H12235530 I244111340 J4231333220 12345 A111 B1 C111 D11 E11 F11 G11 H111 I111 J1

34 How to Convert to TSP? Hamming distance Cycle weight = number of gaps transitions +2n So, minimize the cycle weight is to the gap number

35

36

37 Our approach We also convert it to optimization problem Using more complicated model Using Genetic Algorithm to solve it. F(A) = X*C(A)+Y*P(A)+Z*N(A)+T*M(A)+ P*L(A).

38 (a) (b) The results of our approach tested on simulated data. The false negative rate is set as 0.1. The false positive rate is 0.05. The false negative rate is set as 0.1. The false positive rate is 0.01.

39 Experimental Results of our GA tested on Real data from chromosome 1 (a) It shows the results of our GA run with the data which is a contig with about 95 clones and about 120 probes (b) It shows the results of our GA run with the data which is a contig with about 172 clones and about 136 probes


Download ppt "Physical Mapping Problem"

Similar presentations


Ads by Google