Individual sequences in large sets of gene sequences may be distinguished efficiently by combinations of shared sub-sequences Mark J Gibbs, John S Armstrong.

Presentation transcript:

Individual sequences in large sets of gene sequences may be distinguished efficiently by combinations of shared sub-sequences. Mark J Gibbs, John S Armstrong and Adrian J Gibbs. BMC Bioinformatics 2005, 6:90. Presented by Miguel Gonzalez.

Outline: Background, Results, Discussion

Background: Organism identification, Comparative gene sequencing, DNA probes

The Problem! Identifying organisms with contemporary biological research methods is too time-consuming and expensive. Complex techniques are usually involved.

The Solution Develop a method for identifying sequences that does not depend on extremely specific probes. Probes can be found that bind to more than one target sequence, producing unique binding patterns, or fingerprints.
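The fingerprint idea can be illustrated with a small sketch. The binding table below is entirely hypothetical (the target and probe names are made up for illustration and do not come from the paper). It shows that no probe needs to be specific to a single target, as long as the combined binding pattern of each target is unique.

```python
# Hypothetical binding table (illustrative only, not data from the paper):
# each target maps to the set of probes that bind it.
binding = {
    "target_A": {"probe_1", "probe_2"},
    "target_B": {"probe_1"},
    "target_C": {"probe_2", "probe_3"},
    "target_D": {"probe_3"},
}
probes = ["probe_1", "probe_2", "probe_3"]


def fingerprint(bound, probe_order):
    """Return a tuple of 0/1 flags, one per probe, in a fixed probe order."""
    return tuple(int(p in bound) for p in probe_order)


fingerprints = {t: fingerprint(b, probes) for t, b in binding.items()}

# Every probe binds two targets, yet every target has a distinct pattern.
all_unique = len(set(fingerprints.values())) == len(fingerprints)

for target, pattern in fingerprints.items():
    print(target, pattern)
print("all fingerprints unique:", all_unique)
```

Here three non-specific probes separate four targets, whereas one target-specific probe per target would need four.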

Hypothesis Sequences can be identified efficiently using distinguishing sub-sequences (DSSs).

Strategy The study uses the methods of taxonomy, in which targets are identified by combinations of characters shared by different members of a group of target organisms. The advantage is that fewer characters and questions are required to identify an individual target.

Strategy The minimum number of characters for this method is given by the binary logarithm X = log2(Y), where X is the number of characters and Y is the number of targets. Example: 10 characters could identify a set of 1024 targets.
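As a worked check of this bound, the sketch below computes the smallest whole number of binary (present/absent) characters needed for a given number of targets. The function name `min_characters` is an assumption made for illustration. Rounding up to a whole number is also an added step, since log2(Y) is not an integer for most values of Y.

```python
def min_characters(num_targets):
    """Smallest number of binary characters that can give every target
    a distinct present/absent pattern, i.e. ceil(log2(num_targets))."""
    if num_targets < 1:
        raise ValueError("num_targets must be at least 1")
    # (n - 1).bit_length() equals ceil(log2(n)) for n >= 1 and avoids
    # floating-point rounding.
    return (num_targets - 1).bit_length()


# The slide's example, followed by set sizes of 96, 92 and 201 targets.
for y in (1024, 96, 92, 201):
    print(y, "targets ->", min_characters(y), "characters")
```

This is a theoretical lower bound only. It says nothing about whether sub-sequences with the required sharing pattern actually exist in a given data set.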

Testing Hypothesis Three sets of cytochrome oxidase c1 (CO1) sequences were used: animal, insect, and moth. CO1-animal had 96 species; CO1-insect had 92 species; CO1-moth had 201 species.

Target Sequence ClustalX was run on each of the 3 sets of sequences to find a target region within the sequences. Pools of sub-sequences were created, ranging in length from 6 to 31 nucleotides. From these pools, distinguishing sub-sequences were identified.
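The pooling step can be sketched as follows. This is a minimal illustration and not the authors' implementation: the toy sequences are invented, the function names are hypothetical, and a sub-sequence is assumed here to be "distinguishing" when it occurs in at least one sequence of the set but not in all of them. The toy example uses length 4 for brevity, whereas the study used lengths of 6 to 31 nucleotides.

```python
def subsequence_pool(seq, k):
    """Return the set of all sub-sequences of length k found in seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def distinguishing_subsequences(seqs, k):
    """Map each length-k sub-sequence to the names of the sequences that
    contain it, keeping only those absent from at least one sequence
    (assumed working definition of 'distinguishing')."""
    carriers = {}
    for name, seq in seqs.items():
        for sub in subsequence_pool(seq, k):
            carriers.setdefault(sub, set()).add(name)
    return {sub: names for sub, names in carriers.items()
            if len(names) < len(seqs)}


# Invented toy target regions (not real CO1 data).
toy_seqs = {
    "s1": "ATGCCGTA",
    "s2": "ATGACGTA",
    "s3": "TTGACGTA",
}

dss = distinguishing_subsequences(toy_seqs, k=4)
for sub in sorted(dss):
    print(sub, sorted(dss[sub]))
```

In this toy set, `CGTA` occurs in every sequence and so is dropped, while `TGAC` is shared by `s2` and `s3` only and can separate that pair from `s1`.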

Results

Discussion A method was produced for finding sub-sequences that distinguish the gene sequences, or groups of gene sequences, from which they came. Sequence diversity and sub-sequence length were found to be major factors influencing the number of sub-sequences available as probe targets.