Comparative Genomics.

Slides:



Advertisements
Similar presentations
System Biology October 2013 Gustavo de Souza IMM, OUS.
Advertisements

DNA BLAST Lab.
A Lite Introduction to (Bioinformatics and) Comparative Genomics Chris Mueller August 10, 2004.
A new method of finding similarity regions in DNA sequences Laurent Noé Gregory Kucherov LORIA/UHP Nancy, France LORIA/INRIA Nancy, France Corresponding.
Phylogenetic Trees Understand the history and diversity of life. Systematics. –Study of biological diversity in evolutionary context. –Phylogeny is evolutionary.
Basics of Comparative Genomics Dr G. P. S. Raghava.
Summer Bioinformatics Workshop 2008 Comparative Genomics and Phylogenetics Chi-Cheng Lin, Ph.D., Professor Department of Computer Science Winona State.
. Class 1: Introduction. The Tree of Life Source: Alberts et al.
Non-coding RNA William Liu CS374: Algorithms in Biology November 23, 2004.
1 Gene Finding Charles Yan. 2 Gene Finding Genomes of many organisms have been sequenced. We need to translate the raw sequences into knowledge. Where.
Introduction to Genomics, Bioinformatics & Proteomics Brian Rybarczyk, PhD PMABS Department of Biology University of North Carolina Chapel Hill.
Gene Regulatory Networks - the Boolean Approach Andrey Zhdanov Based on the papers by Tatsuya Akutsu et al and others.
Comparative Genomics Bio Informatics Scott Gulledge.
Genomics, Proteomics and Metabolomics. Genomics l The complete set of DNA found in each cell is known as the genome l Most crop plant genomes have billions.
Bryan Heck Tong Ihn Lee et al Transcriptional Regulatory Networks in Saccharomyces cerevisiae.
Goals of the Human Genome Project determine the entire sequence of human DNA identify all the genes in human DNA store this information in databases improve.
Journal club 06/27/08. Phylogenetic footprinting A technique used to identify TFBS within a non- coding region of DNA of interest by comparing it to the.
BLAST: A Case Study Lecture 25. BLAST: Introduction The Basic Local Alignment Search Tool, BLAST, is a fast approach to finding similar strings of characters.
Genomics Lecture 8 By Ms. Shumaila Azam. 2 Genome Evolution “Genomes are more than instruction books for building and maintaining an organism; they also.
* only 17% of SNPs implicated in freshwater adaptation map to coding sequences Many, many mapping studies find prevalent noncoding QTLs.
Finish up array applications Move on to proteomics Protein microarrays.
Next Generation Sequencing and its data analysis challenges Background Alignment and Assembly Applications Genome Epigenome Transcriptome.
Remember the limitations? –You must know the sequence of the primer sites to use PCR –How do you go about sequencing regions of a genome about which you.
Using Mixed Length Training Sequences in Transcription Factor Binding Site Detection Tools Nathan Snyder Carnegie Mellon University BioGrid REU 2009 University.
Genomics and Arabidopsis. What is ‘genomics’? Study of an organism’s entire genome –All the DNA encoded in the organism –Nucleus, mitochondria, chloroplasts.
Biotechnology. Polymerase Chain Reaction PCR is the cloning of DNA (amplification). Copies are made and the amount of DNA can be rapidly increased. Useful.
Genomic and comparative genomic analysis BIO520 BioinformaticsJim Lund.
Introduction to Bioinformatics Dr. Rybarczyk, PhD University of North Carolina-Chapel Hill
MEME homework: probability of finding GAGTCA at a given position in the yeast genome, based on a background model of A = 0.3, T = 0.3, G = 0.2, C = 0.2.
Central dogma: the story of life RNA DNA Protein.
EB3233 Bioinformatics Introduction to Bioinformatics.
Genome annotation and search for homologs. Genome of the week Discuss the diversity and features of selected microbial genomes. Link to the paper describing.
Human Genomics. Writing in RED indicates the SQA outcomes. Writing in BLACK explains these outcomes in depth.
Bioinformatics Lecture to accompany BLAST/ORF finder activity
No reference available
The Future of Genetics Research Lesson 7. Human Genome Project 13 year project to sequence human genome and other species (fruit fly, mice yeast, nematodes,
Nonlinear differential equation model for quantification of transcriptional regulation applied to microarray data of Saccharomyces cerevisiae Vu, T. T.,
Gene models and proteomes for Saccharomyces cerevisiae (Sc), Schizosaccharomyces pombe (Sp), Arabidopsis thaliana (At), Oryza sativa (Os), Drosophila melanogaster.
Starter What do you know about DNA and gene expression?
Other uses of DNA microarrays
Biotechnology and Bioinformatics: Bioinformatics Essential Idea: Bioinformatics is the use of computers to analyze sequence data in biological research.
Introduction Biodiversity, in the simplest terms, means variation in living systems and that is extremely important to any habitat that wants to continue.
Visualizing Biosciences Genomics & Proteomics. “Scientists Complete Rough Draft of Human Genome” - New York Times, June 26, 2000 The problem: –3 billion.
BLAST: Basic Local Alignment Search Tool Robert (R.J.) Sperazza BLAST is a software used to analyze genetic information It can identify existing genes.
Basics of Comparative Genomics
Departments of Biology and Mathematics
Bioinformatics Madina Bazarova. What is Bioinformatics? Bioinformatics is marriage between biology and computer. It is the use of computers for the acquisition,
Pipelines for Computational Analysis (Bioinformatics)
Genomes and their evolution
“Proteomics is a science that focuses on the study of proteins: their roles, their structures, their localization, their interactions, and other factors.”
Predicting Active Site Residue Annotations in the Pfam Database
Robert Britton - Instructor Debora Rodrigues - TA
1 Department of Engineering, 2 Department of Mathematics,
Bioinformatics and BLAST
Overview Bioinformatics: Analyzing biological data using statistics, math modeling, and computer science BLAST = Basic Local Alignment Search Tool Input.
1 Department of Engineering, 2 Department of Mathematics,
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Introduction to Bioinformatics II
1 Department of Engineering, 2 Department of Mathematics,
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
The Future of Genetic Research
Bioinformatics Vicki & Joe.
Bioinformatics, Vol.17 Suppl.1 (ISMB 2001) Weekly Lab. Seminar
Basics of Comparative Genomics
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Vermont Genetics Network Outreach Proteomics Module
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Vermont Genetics Network Outreach Proteomics Module
Presentation transcript:

Comparative Genomics

What is Comparative Genomics? It is the comparison of one genome to another. Genomics DNA (Gene) Functional Transcriptomics RNA Proteomics PROTEIN Metabolomics METABOLITE Transcription Translation Enzymatic reaction

How is it different from what we’ve already covered and been doing in class?

How is it different from what we’ve already covered and been doing in class? When we BLAST a sequence is that comparative genomics?

Difference is in Scale and Direction Other “omics” Comparative One or several genes compared against all other known genes. Entire Genome compared to other entire genomes. Use information from many genomes to learn more about the individual genes. Use genome to inform us about the entire organism.

What are some questions that comparative genomics can address? How has the organism evolved? What differentiates species? Which non-coding regions are important? Which genes are required for organisms to survive in a certain environment?

Different Questions Require Different Comparisons From: Hardison. Plos Biology. Vol 1 (2): 156-160

From: Miller et al. Annu. Rev. Genom. Human. Genet. 2004.5:15-56.

Example of a Comparative Genomic Study: Kellis et al. 2003. Nature. 423:241-254. Purpose of the paper is to use the genomes of three closely related Saccharomyces strains to inform our knowledge about S. cerevisiae.

First step is to align the genomes as above First step is to align the genomes as above. Several webtools are available for genome alignment and other comparative tasks. For some go to http://www.dcode.org/

By comparing the genomes they not only found numerous new regulatory motifs, but they were able to reduce the total S. cerevisiae gene estimate by 500. The authors could have also focused their comparisons on what parts of these similar genomes account for each strain’s uniqueness instead just focusing on gaining insights into S. cerevisiae.

99% of protein coding genes in humans have homologs in mice 99% of protein coding genes in humans have homologs in mice. And 80% are thought to be 1:1 homologs. From: http://www.macdevcenter.com/pub/a/mac/2004/06/29/bioinformatics.html

The Saccharomyces study could have also taken the comparative approach to gain insights into the functional similarities that defines the organisms as relatives. Recently 5 Campylobacter species’ genomes were sequenced, and analyzed to determine what is the core genetic blueprint of the genus (as well as differences between the species).

From: Fouts et al. 2005. PLoS Biology. 3(1):72-85.

Authors could have also taken a greater focus onto how the Campylobacter species evolved (or Saccharomyces). How each one evolved, and the steps by which they diverged from a common ancestor.

How did E. coli 0157:H7 evolve? From: Wick et al. 2005. Journal of Bacteriology. 187 (5)

What about the other “omics”? Genomics DNA (Gene) Functional Transcriptomics RNA Proteomics PROTEIN Metabolomics METABOLITE Transcription Translation Enzymatic reaction

From: Redfern et al. 2005. Journal of Chromatography B. 815:97-107

Downsides to the technique Genetic Drift - how can we tell what differences are really selection and important to organism function and not a result of genetic drift? Computationally intensive - large amounts of data that are being compared, still coming up with the tools to process and compare genomes. In order for the comparisons to be statistically relevant many more genomes will need to be sequenced.