Vladimir V. Ufimtsev Adviser: Dr. V. Rykov A Mathematical Theory of Communication C.E. Shannon Main result: Entropy function - average value of information.

Slides:



Advertisements
Similar presentations
DNA! spooled gene chromosomes chromatin double helix.
Advertisements

Genetic control of protein structure and function
Nucleic Acid Structure and Function. Function of DNA (DeoxyriboNucleic Acid) Contains sections called “genes” that code for proteins. These genes are.
Nucleic Acids. Elements Nucleic Acids Contain C, H, O,N, P.
The structure of DNA.
DNA: The Genetic Material Chapter DNA Structure DNA is a nucleic acid. The building blocks of DNA are nucleotides, each composed of: –a 5-carbon.
DNA: Deoxyribonucleic Acid Biology. Structure of DNA DNA nucleotide has 3 parts: Sugar molecule Deoxyribose Phosphate group Nitrogen-containing base DNA.
1 Review List the chemical components of DNA Relate Cause and Effect Why are hydrogen bonds so essential to the structure of DNA 2 Review Describe the.
Chapter 12 The Structure of DNA. DNA the Genetic Carrier! Now, thanks to Griffith, Avery, Hershey and Chase’s experiment Biologists are equipped with.
Warm Up Where is DNA located within a cell? Why is DNA important?
AS Biology. Gnetic control of protein structure and function Genetic control of protein structure and function.
What you’ll learn How to relate the structure of DNA to its function How to explain the role of DNA in protein production. How to distinguish among different.
DNA The molecule of heredity. The molecules of DNA is the information for life (determine an organism’s traits) DNA achieves its control by determining.
The Central Dogma of Molecular Biology DNA → RNA → Proteins Biology II D. Mitchell.
The Structure of DNA The building block of DNA (and RNA) is the nucleotide. Each nucleotide has 3 parts: A sugar (deoxyribose in DNA) A phosphate group.
Coding for Life Introduction Nucleic acids are very special chemicals in cells which have the information... to control cellular activities, and concerned.
Notes: Pages 6 & 7.  1. 5-Carbon Sugar called DEOXYRIBOSE  2. Phosphate Group  3. Nitrogen Base (A, T, C, or G)
 DNA (deoxyribonucleic acid) is a two stranded molecule called double helix  Each strand are made of smaller parts called nucleotides  The two strands.
National 5 Biology Course Notes Part 4 : DNA and production of
DNA (deoxyribonucleic acid) consists of three components.
DNA Deoxyribose Nucleic Acid. DNA (deoxyribonucleic acid) Genetic Information in the form of DNA is passed from parent to offspring. Genes are the code.
Assessment Statements: Describe the structure of DNA.
AS Biology. Gnetic control of protein structure and function DNA STRUCTURE.
DNA Structure RHSA.
DNA Structure.
Introduction to DNA (Deoxyribonucleic acid). What do you know?
DNA C ODES B ASED ON H AMMING S TEM S IMILARITIES A.G. Dyachkov 1, A.N. Voronina 1 1 Dept. of Probability Theory, MechMath., Moscow State University, Russia.
Chap. 10 : Nucleic Acids & Protein Synthesis I. DNA – deoxyribonucleic acid - function – store and use information to direct activities of the cell and.
Defined: Molecule that stores genetic information Nucleotides: Three repeating units of nucleic acids –1) Sugar –2) Phosphate Group –3) Nitrogen Base.
Understanding DNA Deoxyribonucleic Acid Double Helix: Two strands twisted around each other like a winding staircase.
Characteristic of Life!!
DNA Structure and Function FA Learning Goal:. Every nucleotide is made up of… 1.Sugar 2.Phosphate 3.Nitrogen base 4.All of the above.
© 2010 Pearson Education, Inc. DNA: STRUCTURE DNA: Deoxyribonucleic acid Was known to be a chemical in cells by the end of the nineteenth century Has the.
General, Organic, and Biological Chemistry Copyright © 2010 Pearson Education, Inc.1 Chapter 21 Nucleic Acids and Protein Synthesis 21.3DNA Double Helix.
Do you like to read mystery novels or watch people on television solve crimes? Detectives search for clues that will help them solve the mystery. Geneticists.
DNA –Was known as a chemical in cells by the end of the nineteenth century –Has the capacity to store genetic information –Can be copied and passed from.
DNA Structure, Function & Replication. DNA stands for… DeoxyriboNucleic Acid.
The Structure of DNA Read the title aloud to students.
DNA Introduction. What is DNA? Genetic information of life Type of Nucleic Acid Double Stranded.
DNA (Deoxyribonucleic Acid). What is DNA? DNA is an encoded molecule that determines traits by giving instructions to make proteins.
DNA: Structure and Replication DNA DNA. DNA is often called the blueprint of life. DNA contains the instructions for making proteins within the cell.
DNA: STRUCTURE AND REPLICATION. DNA: The Code of Life  DNA is the molecule that contains all of the hereditary material for an organism  It is found.
DNA, RNA & Protein Synthesis. A. DNA and the Genetic Code 1. DNA controls the production of proteins by the order of the nucleotides.
Nucleic Acids by Anthony Carpi, Ph.D. from edited by Jungho Kim.
GENETICS. Objectives: Objective 10- Identify the differences between DNA & RNA. Objective Identify the mechanisms through which DNA can be mutated.
Dr Nazir A. Zafar Advanced Algorithms Analysis and Design Advanced Algorithms Analysis and Design By Dr. Nazir Ahmad Zafar.
DNA Deoxyribonucleic acid. DNA structure DNA is a nucleic acid –composed of many nucleotides –A nucleotide is composed of a sugar (deoxyribose), a phosphate.
Defined: Molecule that stores genetic information Nucleotides: Three repeating units of nucleic acids –1) Sugar –2) Phosphate Group –3) Nitrogen Base.
DNA: The Molecule of Heredity
10.2 DNA and RNA are polymers of nucleotides
Chapter 12-1, Part III DNA Structure.
The Genetic Material DNA Structure.
DNA! spooled gene chromosomes chromatin double helix.
Nucleic Acids NUCLEIC ACIDS AND DNA.
Genetics.
DNA Structure and Replication Notes
DNA Replication & Protein Synthesis
DNA and Replication.
Deoxyribonucleic Acid
DNA vs. RNA.
DNA & Replication.
DNA (Deoxyribonucleic Acid)
DNA & RNA Notes Unit 3.
DNA Structure.
The Structure of DNA (Ch 12.2)
Nucleic Acids & Protein Synthesis
Replication Makin’ copies
DNA Chapter 12.
Structure and Replication
DNA.
Presentation transcript:

Vladimir V. Ufimtsev Adviser: Dr. V. Rykov

A Mathematical Theory of Communication C.E. Shannon Main result: Entropy function - average value of information obtained from a channel. Error Detecting and Error Correcting Codes R.W. Hamming Main result: Matrices that can be used to encode messages and provide more reliable transmission across a channel. A structure for Deoxyribose Nucleic Acid J. D. WATSON, F. H. C. CRICK, M. H. F. Wilkins, R. E. Franklin, Main result: Structure found for the building block of life. There’s Plenty of Room at the Bottom R.P. Feynman Main result: Anticipated Science at the nanoscale ( meters).

Letdenote a set consisting of all vectors (codewords) of length n built over i.e. Letsuch that: 1) 2) 3) Let be such that: is referred to as a Code of length n, size M, and minimum distance d.

Volume of the sphere around x, of radius d: A sphere in centered at x having radius d: A space is HOMOGENEOUS when the volume of a sphere does not depend on where it is centered i.e. A space is NON - HOMOGENEOUS when the volume of a sphere does depend on where it is centered.

For any code there are 3 conflicting parameters; Length: n Size: M Minimum distance: d The aim of coding theory is: Given any 2 parameters, find the optimal value for the 3 rd. We need small n for fast transmission, large M for as much information as possible to be encoded and large d so that we can detect and correct many errors.

Exact formulas for sphere volumes and code sizes are extremely difficult to obtain sometimes. In most cases only upper and lower bounds can be obtained for these parameters. We will be working in a NON-HOMOGENEOUS space making the obtainment of exact formulas for sphere volumes and code sizes VERY HARD. Hamming Upper Bound on Code Size in with any metric: Varshamov-Gilbert Lower Bound on Code Size in with any metric:

Let G be a simple graph on vertices and e edges. G contains an M-clique if: CLIQUES:

If: Then there exists a code of size M.

Let Then: Hence there exists a code of size M and so:

The rules of base pairing (nucleotide paring): A - T: adenine (A) always pairs with thymine (T) C - G: cytosine (C) always pairs with guanine (G)

Each base has a bonding surface Bonding surface of A is complementary to that of T (2 bonds) Bonding surface of G is complementary to that of C (3 bonds) Hybridization is a process that joins two complementary opposite polarity single strands into a double strand through hydrogen bonds.

Orientation of single DNA strands is important for hybridization.

Direct Shifted Folded Loop

Interest into DNA computing was sparked in 1994 by Len Adleman. Adleman showed how we can use DNA molecules to solve a mathematical problem. (Hamiltonian path problem). DNA computing relies on the fact that DNA strands can be represented as sequences of bases (4-ary sequences) and the property of hybridization. In Hybridization, errors can occur. Thus, error-correcting codes are required for efficient synthesis of DNA strands to be used in computing.

Sequence is a subsequence of if and only if there exists a strictly increasing sequence of indices: Such that: is defined to be the set of longest common subsequences of and is defined to be the length of the longest common subsequence of and

X = ( A T C T G A T ) Z = ( T C G T ) - subsequence of X X = ( A T C T G A T ) Y = ( T G C A T A ) ( T C A T )– L (X,Y) LCS(X,Y) = 4

Original Insertion-Deletion metric (Levenshtein 1966): This metric results from the number of deletions and insertions that need to be made to obtain ‘ y ’ from ‘ x ’. For vectors that have the same length: the number of deletions that will be made is: likewise, the number of insertions that will be made is:

A common subsequence is called a common stacked pair subsequence of length between x and y if two elements, are consecutive in x and consecutive in y or if they are non -consecutive in x and or non- consecutive in y, then and are consecutive in x and y. Let, denote the length of the longest sequence occurring as a common stacked pair subsequence subsequence z between sequences x and y. The number, is called a similarity of blocks between x and y. The metric is defined to be

The upper bound for the average sphere volume in this metric will be: The Varshamov-Gilbert bound becomes:

ACGT A C G T Thermodynamic weight of virtual stacked pairs. Can use statistical estimation of sphere volume.

There are many possibilities for metrics on the space of DNA sequences. All discussed metrics are non-homogeneous i.e. the sizes of the spheres in the metric spaces depend on the location of their centers. A universal method that will allow us to calculate lower bounds for optimal code sizes was given.

Length (n)Min. size Minimum distance (d) = 6

Length (n)Min. size Minimum distance (d) = 7

Length (n)Min. size Minimum distance (d) = 8

Length (n)Min. size Minimum distance (d) = 9

Length (n)Min. size Minimum distance (d) = 10

LengthLCSMin dist.SizeV-G bound