Presentation is loading. Please wait.

Presentation is loading. Please wait.

DNA Computing: Mathematics with Molecules Russell Deaton Professor Comp. Sci. & Engr. The University of Arkansas Fayetteville, AR 72701

Similar presentations


Presentation on theme: "DNA Computing: Mathematics with Molecules Russell Deaton Professor Comp. Sci. & Engr. The University of Arkansas Fayetteville, AR 72701"— Presentation transcript:

1 DNA Computing: Mathematics with Molecules Russell Deaton Professor Comp. Sci. & Engr. The University of Arkansas Fayetteville, AR 72701 rdeaton@uark.edu

2 What is DNA Computing (DNAC) ? The use of biological molecules, primarily DNA, DNA analogs, and RNA, for computational purposes.

3 Why Nucleic Acids? Density (Adleman, Baum): –DNA: 1 bit per nm 3, 10 20 molecules –Video: 1 bit per 10 12 nm 3 Efficiency (Adleman) –DNA: 10 19 ops / J –Supercomputer: 10 9 ops / J Speed (Adleman): –DNA: 10 14 ops per s –Supercomputer: 10 12 ops per s

4 What makes DNAC possible? Great advances in molecular biology –PCR (Polymerase Chain Reaction) –DNA Microarrays –New enzymes and proteins –Better understanding of biological molecules Ability to produce massive numbers of DNA molecules with specified sequence and size DNA molecules interact through template matching reactions

5 What is a the typical methodology? Encoding: Map problem instance onto set of biological molecules and molecular biology protocols Molecular Operations: Let molecules react to form potential solutions Extraction/Detection: Use protocols to extract result in molecular form

6 P HYSICAL S TRUCTURE OF DNA Nitrogenous Base 34 Å Major Groove Minor Groove Central Axis Sugar-Phosphate Backbone 20 Å 5’ C 3’ OH 3’ 0H C 5’ 5’ 3’ 5’

7 What is an example? “Molecular Computation of Solutions to Combinatorial Problems” Adleman, Science, v. 266, p. 1021.

8

9 Algorithm Generate Random Paths through the graph. Keep only those paths that begin with v in and end with v out. If graph has n vertices, then keep only those paths that enter exactly n vertices. Keep only those paths that enter all the vertices at least once. In any paths remain, say “Yes”; otherwise, say “No”

10 I NTER-STRAND H YDROGEN B ONDING AdenineThymine to Sugar-Phosphate Backbone to Sugar-Phosphate Backbone (+)(-) (+)(-) Hydrogen Bond GuanineCytosine to Sugar-Phosphate Backbone to Sugar-Phosphate Backbone (-) (+) (-) (+) (-)

11 S TRAND H YBRIDIZATION A B a b A B a b b B a A HEAT COOL b a A B OR 100° C

12 DNA L IGATION  ’’ ’’ ’’ ’’ Ligase Joins 5' phosphate to 3' hydroxyl ’’ ’’  

13 Encoding 0 1 2 ‘GCATGGCC ‘AGCTTAGG ‘ATGGCATG CCGGTCGA’ CCGGTACC’ ‘GCATGGCCAGCTTAGG CCGGTCGA’ ‘GCATGGCCATGGCATG CCGGTACC’ 0021

14

15 Massively Parallel Search V1V1 E 0->1 V0V0 V2V2 V3V3 V4V4 V5V5 V6V6 E 1->2 E 2->3 E 3->4 E 4->5 E 5->6 V6V6 E 0->6 V0V0 V3V3 E 0->3 V0V0 V2V2 V3V3 V4V4 V5V5 V6V6 E 3->2 E 2->3 E 3->4 E 4->5 E 5->6 V5V5 E 4->5 V4V4 V1V1 V2V2 E 5->1 E 1->2

16 Algorithm Generate Random Paths through the graph. Keep only those paths that begin with v in and end with v out. If graph has n vertices, then keep only those paths that enter exactly n vertices. Keep only those paths that enter all the vertices at least once. In any paths remain, say “Yes”; otherwise, say “No”

17 DNA Polymerase

18 POLYMERASE CHAIN REACTION

19 Start = V0, Stop = V6 V1V1 E 0->1 V0V0 V2V2 V3V3 V4V4 V5V5 V6V6 E 1->2 E 2->3 E 3->4 E 4->5 E 5->6 V6V6 E 0->6 V0V0 V3V3 E 0->3 V0V0 V2V2 V3V3 V4V4 V5V5 V6V6 E 3->2 E 2->3 E 3->4 E 4->5 E 5->6 V5V5 E 4->5 V4V4 V1V1 V2V2 E 5->1 E 1->2

20 Algorithm Generate Random Paths through the graph. Keep only those paths that begin with v in and end with v out. If graph has n vertices, then keep only those paths that enter exactly n vertices. Keep only those paths that enter all the vertices at least once. In any paths remain, say “Yes”; otherwise, say “No”

21 G EL E LECTROPHORESIS - SIZE SORTING Buffer Gel Electrode Samples Faster Slower

22 Right Length V1V1 E 0->1 V0V0 V2V2 V3V3 V4V4 V5V5 V6V6 E 1->2 E 2->3 E 3->4 E 4->5 E 5->6 V6V6 E 0->6 V0V0 V3V3 E 0->3 V0V0 V2V2 V3V3 V4V4 V5V5 V6V6 E 3->2 E 2->3 E 3->4 E 4->5 E 5->6

23 Algorithm Generate Random Paths through the graph. Keep only those paths that begin with v in and end with v out. If graph has n vertices, then keep only those paths that enter exactly n vertices. Keep only those paths that enter all the vertices at least once. In any paths remain, say “Yes”; otherwise, say “No”

24 A NTIBODY A FFINITY CACCATGTGAC GTGGTACACTG B PMP + Anneal CACCATGTGAC GTGGTACACTG B + CACCATGTGAC GTGGTACACTG B PMP Bind Add oligo with Biotin label Heat and cool Add Paramagnetic-Streptavidin Particles Isolate with Magnet N S

25 Every Vertex V1V1 E 0->1 V0V0 V2V2 V3V3 V4V4 V5V5 V6V6 E 1->2 E 2->3 E 3->4 E 4->5 E 5->6 V3V3 E 0->3 V0V0 V2V2 V3V3 V4V4 V5V5 V6V6 E 3->2 E 2->3 E 3->4 E 4->5 E 5->6

26 Algorithm Generate Random Paths through the graph. Keep only those paths that begin with v in and end with v out. If graph has n vertices, then keep only those paths that enter exactly n vertices. Keep only those paths that enter all the vertices at least once. In any paths remain, say “Yes”; otherwise, say “No”

27 Hamiltonian Path V1V1 E 0->1 V0V0 V2V2 V3V3 V4V4 V5V5 V6V6 E 1->2 E 2->3 E 3->4 E 4->5 E 5->6

28 Mismatches

29 DNA Word Design Importance of Template-Matching Hybridization Reactions in DNA Computing (DNAC) Sequence design should implement DNAC architecture. –Planned Hybridizations –Problem Size –Subsequent Processing Reactions Designed sequences should minimize unplanned “cross-hybridizations.” Consequences of Bad Designs: Errors and Poor Efficiency

30 DNA Word Design Design problem is hard. As number of sequences required to represent the problem increases, this constraints increasingly conflicts with the requirement of non-crosshybridization. How much of DNA sequence space is available for computation?

31 Why In Vitro? In Vitro Selection and Evolution PCR as tool for selection Ability to synthesis huge, random starting populations Mutagenesis Oligos manufactured under conditions for use Use massive parallelism of DNAC to solve word design problem

32 Protocol Outline Start with huge population of random sequences with attached primers. Anneal rapidly to quench oligos in mismatched configurations. Using temperature as a control, melt most mismatched pairs. Amplify and purify Repeat

33

34

35 Experimental Results

36

37 Latest Results

38 DNA Memories

39 Overview Input DNAs (Unknown Seq.) Sequences Comple- mentary to Input DNAs Memory DNA Strands (With the 3’ end Comple- mentary to the Input DNAs) New Unknown Input DNAs Learning Recall Output Tag 1 Random Probe Separates Memory DNA Strands that Match or Partially Match the New Inputs from Those That Don’t Match Labeled Tag Sequence Complements

40 Learning Learning: Information acquired from examples rather than programmed Protocol to store input DNAs (possibly of unknown sequence) Higher level representation of the input sequences Not individual sequence memories but whole populations Clustering of input sequences in vitro Massively random and parallel copying or sampling depending on number of inputs and probes

41

42 Base-by-Base Amplification Input DNA Tag ProbeExtension

43 Sampling Input DNA Tag ProbeExtension

44 Energy Surface Manipulation through Learning Energy Input Sequence Energy Input Sequence Before LearningAfter Learning

45 Tags Non-Crosshybridizing Sequences Convenient for Input/Output in absence of input sequence information Manipulate memory without input sequences Implement DNA 2 DNA Computations (Landweber and Lipton, DNA 3)

46 Recall Hybridization to retrieve memories Similar sequences patterns matched Pattern matching done against whole memory Single memory associated with single tags Memory composite of output on multiple tags

47

48

49 Experiments Test learning and recall with plasmid Test of sensitivity in concentration Test coverage of input sequence space with: –Plasmids (5k bp) –E. Coli (5M bp) Test sequence resolution of protocols

50

51 Learning Input 1 is a 3 kb linear DNA (pBluescript) Input 2 is a 5 kb linear DNA (  x 174) I1 I2 I1I2 Input DNAs 40 60 100 M1 M2 Memory DNAs

52 Input 1 Input 2 Stained Gel Blot with Memory 1 (pBluescript) Blot with Memory 2 (  x 174) Plasmid inputs learned, similar sequences recalled, and dissimilar not matched. Recall

53 Concentration Sensitivity 1.Plasmids digested with Hpa II 2.1  g pBluescript 3.10ng - 800ng  x 174 4.Blotted with  x 174 memory 5.1%  x 174 detected in background of pBluescript

54 Input Space Coverage Randomly digested input Learning on both inputs Blots nearly identical

55 E. coli 1.E. coli digested 2.219bp fragment of  x 174 added 3.Learning with and without fragment 4.Fragment distinguished when learned

56 Application

57 Team Russell Deaton, University of Arkansas, Computer Science and Engineering Junghuei Chen, University of Delaware, Chemistry and Biochemistry Hong Bi, University of Delaware, Chemistry and Biochemistry Max Garzon, University of Memphis, Computer Science Harvey Rubin, University of Pennsyvania, School of Medicine David Wood, University of Delaware, Computer and Information Science

58 Acknowledgement This work was supported by the NSF QuBIC program, award number EIA- 0130385


Download ppt "DNA Computing: Mathematics with Molecules Russell Deaton Professor Comp. Sci. & Engr. The University of Arkansas Fayetteville, AR 72701"

Similar presentations


Ads by Google