Cyber Metagenomics; Challenge to See The Unseen Majority in The Ocean

Slides:



Advertisements
Similar presentations
Calit2 " Talk Nortel Visiting Team December 12, 2005 Dr. Larry Smarr Director, California Institute for Telecommunications and Information.
Advertisements

Advancing the Metagenomics Revolution Invited Talk Symposium #1816, Managing the Exaflood: Enhancing the Value of Networked Data for Science and Society.
The Coming Revolution in Environmental Awareness
Presentation for the Microbe Project Interagency Team
Sequencing Genomics: The New Big Data Driver IntermezzoTalk SURFnet7, Part of GigaPort3 Utrecht, Netherlands December 7, 2011 Dr. Larry Smarr Director,
Microbial Metagenomics Drives a New Cyberinfrastructure Invited Talk School of Biological Sciences University of California, Irvine March 3, 2006 Dr. Larry.
Calit2s Program in Nano-science, Nano-engineering, and Nano-medicine Invited Talk Review of Nano-cancer project April 11, 2006 Dr. Larry Smarr Director,
Creating a Community Cyberinfrastructure for Advanced Marine Microbial Ecology Research and Analysis (a.k.a. CAMERA) Invited Talk Honoring David Kingsbury.
Cyberinfrastructure for Advanced Marine Microbial Ecology Research and Analysis (CAMERA) Invited Talk CONNECT Board Meeting La Jolla, CA April 26, 2006.
Exploring Our Inner Universe Using Supercomputers and Gene Sequencers Physics Department Colloquium UC San Diego October 24, 2013 Dr. Larry Smarr Director,
Collaborations Between Calit2, SIO, and the Venter Institutea Beginning " Talk to the Venter Institute Board La Jolla, CA December 5, 2005 Dr. Larry Smarr.
The CAMERA Project Metagenomics 2006 Oct 3-5, 2006 Paul Gilna, Calit2, UCSD.
JGI Timeline 1997 JGI April 2003 Human Genome Program Officially Ended Human Genome Program Officially Launched 1990 Joint Genome Institute ………………….(JGI)
© Wiley Publishing All Rights Reserved. Using Nucleotide Sequence Databases.
Genomics: READING genome sequences ASSEMBLY of the sequence ANNOTATION of the sequence carry out dideoxy sequencing connect seqs. to make whole chromosomes.
Tucson High School Biotechnology Course Spring 2010.
DESIGNING THE MICROBIAL RESEARCH COMMONS: AN INTERNATIONAL SYMPOSIUM NATIONAL ACADEMY OF SCIENCES, WASHINGTON, DC, 8-9 OCTOBER 2009 Paul Gilna, B.Sc.,
Genomics at the Speed of Light: Understanding the Living Ocean The Gordon and Betty Moore Foundation 2nd Annual Marine Microbiology Investigator Symposium.
Microbial Metagenomics and Human Health Invited Talk Health Sciences Advisory Board School of Medicine University of California, San Diego May 8, 2006.
Cyberinfrastructure for Advanced Marine Microbial Ecology Research and Analysis (CAMERA) Invited Keynote Annual Meeting CENIC 2006 Oakland, CA March 13,
Systems Biology Existing and future genome sequencing projects and the follow-on structural and functional analysis of complete genomes will produce an.
Annotating Metagenomes Using the NMPDR Rob Edwards Department of Computer Sciences, San Diego State University Mathematics and Computer Sciences Division,
Central Dogma Information storage in biological molecules DNA RNA Protein transcription translation replication.
C A M E R A A Metagenomics Resource for Marine Microbial Ecology July 27, 2007 Paul Gilna UCSD/Calit2 Saul A. Kravitz J. Craig Venter Institute.
The Sorcerer II Global Ocean Sampling Expedition: Metagenomic Characterization of Viruses within Aquatic Microbial Samples Shannon J. Williamson, Douglas.
The Sorcerer II Global ocean sampling expedition Katrine Lekang Global Ocean Sampling project (GOS) Global Ocean Sampling project (GOS) CAMERA CAMERA METAREP.
Microbial Genomes Features Analysis Role of high-throughput sequencing Yeast - the eukaryotic model microbe Databases –TIGR CMR –NCBI Microbial Genomes.
Zachary Bendiks. Jonathan Eisen  UC Davis Genome Center  Lab focus: “Our work focuses on genomic basis for the origin of novelty in microorganisms (how.
“ OptIPuter Tech Transfer to the Broader e-Science and HPC Communities " OptIPuter All Hands Meeting La Jolla, CA December 20, 2006 Dr. Larry.
Environmental Genome Shotgun Sequencing of the Sargasso Sea
Building a Community Cyberinfrastructure to Support Marine Microbial Ecology Metagenomics Center for Earth Observations and Applications Advisory Committee.
Topic 6 Growth & Reproduction of Bacteria
es/by-sa/2.0/. Metagenomics Prof:Rui Alves Dept Ciencies Mediques Basiques, 1st Floor, Room.
Presentation Title April 4, 2002 CAMERA- Metagenomics meets the Cyberinfrastructure David T. Kingsbury Gordon and Betty Moore Foundation BERAC - October.
Genomics at the Speed of Light: Understanding the Living Ocean Invited Talk JASON Summer Program La Jolla, CA July 12, 2006 Dr. Larry Smarr Director, California.
Microbial taxonomy and phylogeny
Development of Bioinformatics and its application on Biotechnology
Molecular Microbial Ecology
Beyond the Human Genome Project Future goals and projects based on findings from the HGP.
Ocean & Climate Atmospheric CO 2, DMS, … Ocean/Atmosphere Circulation Dust-Iron Influx, pH Ocean Nutrient Fields Ecosystem State Biomass Primary Productivity.
The Metagenomics RAST server: Annotation, Analysis, and Comparisons Perfect for Pyrosequencing Rob Edwards Department of Computer Science, San Diego State.
Probes can be designed in an evolutionary hierarchy.
“Quantified Self- On Being a Personal Genomic Observatory” Keynote in the “Humans as Genomic Observatories” Meeting Session in the Genomics Standards Consortium.
The Sargasso Sea “Metagenome”
Building a Community Cyberinfrastructure to Support Marine Microbial Ecology Metagenomics Invited Talk 2006 Synthetic Biology Symposium Aliso Creek Inn.
Innovative Research Alliances Invited Talk IUCRP Fellows Seminar UCSD La Jolla, CA July 10, 2006 Dr. Larry Smarr Director, California Institute for Telecommunications.
Building a Community Cyberinfrastructure to Support Marine Microbial Ecology Metagenomics Invited Talk Metagenomics 2006 UCSD La Jolla, CA October.
“Living in a Microbial World” Global Health Program Council on Foreign Relations New York, NY April 10, 2014 Dr. Larry Smarr Director, California Institute.
Chapter 21 Eukaryotic Genome Sequences
EBI is an Outstation of the European Molecular Biology Laboratory. Bioinformatics Challenges in Data Handling and Presentation to the Bioinformaticists.
Big Picture Of ≈1.7 million species classified so far, roughly 6000 are microbes True number of microbes is obviously larger than 6000 “Imagine if our.
Current Challenges in Metagenomics: an Overview Chandan Pal 17 th December, GoBiG Meeting.
“Observing the Dynamics of the Human Immune System Coupled to the Microbiome in Health and Disease” CASIS Workshop on Biomedical Research Aboard the ISS.
Sara E. Richardson Calit2 Summer Undergraduate Research Scholarship Program Advisor: Jurgen Schulze Ivl.calit2.net/wiki CAMERA is.
“ Collaborations Between Calit2, SIO, and the Venter Institute—a Beginning " Talk to the UCSD Representative Assembly La Jolla, CA November 29, 2005 Dr.
“CAMERA Goes Live!" Presentation with Craig Venter National Press Club Washington, DC March 13, 2007 Dr. Larry Smarr Director, California Institute for.
es/by-sa/2.0/. Metagenomics Prof:Rui Alves Dept Ciencies Mediques Basiques, 1st Floor, Room.
Bioprospecting Lecture 17. Marine sponges with cancer promise Hundreds of compounds isolated from natural environments are in use or in development for.
Environmental Genome Shotgun Sequencing of the Sargasso Sea Venter et. al (2004) Presented by Ken Vittayarukskul Steven S. White.
“Genomics: The CAMERA Project" Invited Talk 5 th Annual ON*VECTOR International Photonics Workshop UCSD February 28, 2006 Dr. Larry Smarr Director,
High throughput biology data management and data intensive computing drivers George Michaels.
Metagenomics The study of metagenomes, genetic material recovered directly from environmental samples. Term: Coined in 1998 to refer to the idea that a.
“ Building an Information Infrastructure to Support Microbial Metagenomic Sciences " Presentation to the NBCR Research Advisory Committee UCSD La Jolla,
Invited Talk Metagenomics 2006 UCSD La Jolla, CA October 4, 2006
Environmental Genome Shotgun Sequencing of the Sargasso Sea
Seminar in Bioinformatics (236818)

“Building an Information Infrastructure to Support Genetic Sciences"
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Metagenomics Microbial community DNA extraction
Presentation transcript:

Cyber Metagenomics; Challenge to See The Unseen Majority in The Ocean Kayo Arima California Institute for Telecommunications and Information Technology (Calit2)-University of California, San Diego Division

Science Falkowski and Vargas 304 (5667): 58 Looking Back Nearly 4 Billion Years In the Evolution of Microbe Genomics Eukaryote has the nuclei . Prokaryotes has genes but no nuclear membrane. Science Falkowski and Vargas 304 (5667): 58

Much of Genome Work Has Occurred in Animals Evolution is the Principle of Biological Systems: Most of Evolutionary Time Was in the Microbial World You Are Here Much of Genome Work Has Occurred in Animals Source: Carl Woese, et al

Two completely different approach to get microbial genomic information Microbial whole genomics Metagenomics Environmental sample Culture (grow) in lab Isolate the colony Culture the isolated colony DNA extraction Enz. digestion Shotgun sequencing Gene assembly Environmental sample DNA extraction Enz. digestion Shotgun sequencing Scaffold assembly                  Source: Karin Remington J. Craig Venter Institute

Down Side of Metagenomics Often fragmentary Often highly divergent Rarely any known activity No chromosomal placement No organism of origin Ab initio ORF predictions Huge data

GenBank Protein Data Bank Genomic Data Is Growing Rapidly, But Metagenomics Will Vastly Increase The Scale… 100 Billion Bases! 35,000 Structures GenBank Protein Data Bank www.ncbi.nlm.nih.gov/Genbank www.rcsb.org/pdb/holdings.html Total Data < 1TB

First Genome 1995 6 Genomes/ Year 2000 Full Genome Sequencing is Exploding: Most Sequenced Genomes are Bacterial First Genome 1995 6 Genomes/ Year 2000 Ongoing Genomes Completed Genomes 90 Metagenomes Total 422 Total 1665 www.genomesonline.org

Marine Metagenomics Microbes account for more than 90% of ocean biomass, mediate all biochemical cycles in the oceans and are responsible for 98% of primary production in the sea. Metagenomics is a breakthrough sequencing approach to examine the open-space microbial species without the need for isolation and lab cultivation of individual species.

PI Larry Smarr Paul Gilna Ex. Dir. PI Larry Smarr

Marine Genome Sequencing Project Measuring the Genetic Diversity of Ocean Microbes Sorcerer II Data from this area has already reach to 10% of GenBank. The Entire Data Will Double Number of Proteins in Embank!

Sample Metadata from GOS Site Metadata Location (lat/long, water depth) Site characterization (finite list of types plus “other”) Site description (free text) Country Sampling Metadata Sample collection date/time Sampling depth Conditions at time of sampling (e.g., stormy, surface temperature) Sample physical/chemical measurements (T (oC), S (ppt), chl a (mg m-3), etc) “author” Experimental Parameters Filter size Insert size

Flat File Server Farm W E B PORTAL Traditional User Response Request Dedicated Compute Farm (1000 CPUs) TeraGrid: Cyberinfrastructure Backplane (scheduled activities, e.g. all by all comparison) (10000s of CPUs) Data- Base 10 GigE Fabric Calit2’s Direct Access Core Architecture Will Create Next Generation Metagenomics Server Source: Phil Papadopoulos, SDSC, Calit2 + Web Services Sargasso Sea Data Sorcerer II Expedition (GOS) JGI Community Sequencing Project Moore Marine Microbial Project NASA Goddard Satellite Data Community Microbial Metagenomics Data Web (other service) Local Cluster Environment Direct Access Lambda Cnxns

Who is there? Marine Metagenomics Metabolic pathway discovery Drug discovery Microbial genetic survey Environmental survey Symbiosis Who is there? Evolution study Endosymbiosis Organism discovery Microbial genomic survey Bioenergy discovery Biogeochemistry mapping Marine conservation