Professional Development Course 1 – Molecular Medicine Genome Biology June 12, 2012 Ansuman Chattopadhyay, PhD Head, Molecular Biology Information Services.

Presentation transcript:

Professional Development Course 1 – Molecular Medicine Genome Biology June 12, 2012 Ansuman Chattopadhyay, PhD Head, Molecular Biology Information Services Health Sciences Library System University of Pittsburgh

Genomic achievements since the Human Genome Project

Objective: Organism Whole Genome Sequence Databases; Genome Browsers

Topics
• Genome Sequencing Projects
• NCBI Genome resources
• Integrated Microbial Genomes
• UCSC Genome Bioinformatics
• Genome Browsers
• UCSC Genome Browser
• UCSC Table Browser
• NCBI Map Viewer
• Generic Genome Browser (GBrowse)

Genome Biology Human Genome Project Video

Chromosome Structure

Genome Biology: Karyotype Adapted from NHGRI Trisomy 21 Monosomy X

Genome Biology: Karyotype NHGRI

Genome Biology: Molecular Cloning p53, CFTR, NFkB 8 September,

Genome Biology: Time Line
1976 RNA Bacteriophage MS2
Human Genome Draft Seq
2003 Published Complete Human Ref Genome
2007 Diploid Genome seq of an Individual Human
2011 Published Complete Genomes: 1863 organisms
1995 Haemophilus influenzae
2008 Jim Watson Genome
Yeast
C. elegans
2002 Drosophila

DNA Sequencing Cost

Oxford Nanopore: A 20-node installation, using 8,000-nanopore cartridges, is expected to deliver a complete human genome at 50-fold coverage in 15 minutes, according to the company, or 3 terabases of data per day, based on a sequencing speed of 300 bases per second. For that setup, the cost per gigabase is expected to be under $10.

Organism Whole Genome Sequences

Organism Whole Genome Sequences Human Mouse Rat Dog Cow Chimp Rabbit ……..

Genomes OnLine Database (GOLD) Global comprehensive access to information regarding complete and ongoing genome projects, as well as metagenomes & metadata

Genome Resources

Search for organism’s whole genome sequence

Genome Resources
NCBI:
• Genomes Resources: Link
• Genome:
JGI: Integrated Microbial Genomes: Link

NCBI Genome

NCBI BioProject
Query: Check the status of genome sequencing for an organism, such as honey bee.
Answer:
• Enter search term under BioProject
• Select the appropriate organism
• The BioProject summary page will provide information on available projects and sequencing status
• Click on Project Type for more detailed information
• Explore Related Resources
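The search step in this walkthrough can also be scripted. The following is a minimal sketch, assuming NCBI's E-utilities ESearch endpoint and its `db`, `term`, and `retmode` parameters (none of which appear on the slide). The helper name `bioproject_search_url` is hypothetical, and the snippet only builds the request URL; it does not contact NCBI.

```python
from urllib.parse import urlencode

# Assumed endpoint: NCBI E-utilities ESearch (not named on the slide).
ESEARCH = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/esearch.fcgi"

def bioproject_search_url(organism: str) -> str:
    """Build an ESearch URL that looks up BioProject records for an organism."""
    params = {"db": "bioproject", "term": organism, "retmode": "json"}
    return f"{ESEARCH}?{urlencode(params)}"

# Same example organism as the slide.
url = bioproject_search_url("honey bee")
print(url)
```

Opening the printed URL in a browser, or fetching it with any HTTP client, should return the matching BioProject identifiers, which can then be inspected on the BioProject summary page as described above.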

Link to the video tutorial:
Resources: NCBI Genome Project; NCBI Genome
Find the genomic sequence for an organism, such as rabbit.

NCBI Genome Project A collection of complete and in-progress large-scale sequencing, assembly, annotation, and mapping projects for cellular organisms. The database is organized into organism-specific overviews that function as portals for browsing and retrieving projects pertaining to each organism. CLICK Rabbit

NCBI Genome Project : Rabbit Genome

Link to the video tutorial:
Resources: Integrated Microbial Genomes (IMG)
Find the genomic sequence for a bacterium, such as Salmonella enterica.

Human genome sequence

Genomic achievements since the Human Genome Project

Genome Biology: Structural Variations

Genome Reference Consortium Link to the PLoS Biology paper on the GRC :

NCBI Genome Resources

What is a Genome Browser?
Genome Browsers enable researchers to visualize & browse entire genomes with annotated data including:
• gene prediction and structure
• proteins
• expression
• regulation
• variation
• comparative analysis
• etc.
Annotated data is usually from multiple diverse sources.

Eukaryotic Genome Browsers Display: Vertical Display: Horizontal

Non-vertebrate Genome Browsers

Genome Browsers
The Big Three
• NCBI Map Viewer
• UCSC Genome Browser
• EBI Ensembl
Generic Genome Browser (GBrowse)
Display: Vertical Display: Horizontal

UCSC Genome Browser

UCSC Genome Browser Default Tracks

UCSC Genome Browser Page
Groups of data (Tracks):
• mRNA and EST Tracks
• Expression (such as microarray)
• Comparative Genomics (as a group, individual species)
• Variation and Repeats (including SNPs, copy number variation)
• ENCODE Tracks
• Phenotype and Disease Tracks
• Regulation (including TFBS)

Navigating the Human Genome: Browse the region of human chromosome 7 between 54,318,043 and 55,974,438 bp (chr7:54,318,043-55,974,438)
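A position string such as chr7:54,318,043-55,974,438 is easy to take apart programmatically. The sketch below is illustrative only: the function name `parse_region` is hypothetical, and it assumes the browser's position box uses 1-based coordinates with both endpoints included.

```python
import re

def parse_region(region: str):
    """Split 'chr7:54,318,043-55,974,438' into (chrom, start, end) with integer coordinates."""
    match = re.fullmatch(r"(\w+):([\d,]+)-([\d,]+)", region.strip())
    if match is None:
        raise ValueError(f"not a chrom:start-end region: {region!r}")
    chrom, start, end = match.groups()
    # Thousands separators are allowed in the position box; strip them before converting.
    return chrom, int(start.replace(",", "")), int(end.replace(",", ""))

chrom, start, end = parse_region("chr7:54,318,043-55,974,438")
length = end - start + 1  # assumes both endpoints are included (1-based, inclusive)
print(chrom, start, end, length)
```

Under that assumption the region on the slide spans 1,656,396 bp.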

Link to the video tutorial:
Resource: UCSC Genome Browser
Browse the region of human chromosome 7 between 54,318,043 and 55,974,438 bp. What genes are present in this region?

UCSC Genome Browser

UCSC Genome Browser: Navigating a Genomic Region

UCSC Genome Browser: Navigating a Genomic Region What genes are present in this region?

Bioinformatics Institutions

UCSC Genome Browser: Navigating a Genomic Region What is RefSeq ?

NCBI Sequence Databases
• GenBank: archival database of nucleotide sequences from >160,000 organisms (More info)
• RefSeq: non-redundant, expert-verified database of reference sequences, based on GenBank records (More info)

International Nucleotide Sequence Database Collaboration

Primary Vs Derivative databases

RefSeq Scope & Accessions
Genomic DNA
  NC_ complete genome, complete chromosome, complete plasmid
  NG_ genomic region
  NT_ genomic contig
mRNA: NM_
Protein: NP_
more about RefSeq scope and accessions...
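Because the molecule type is encoded in the accession prefix, a short lookup is enough to classify an accession. This is a minimal sketch covering only the five prefixes listed on the slide; the function name `refseq_type` is hypothetical and the accession strings are placeholders for illustration, not real records.

```python
# Prefix meanings copied from the slide; other RefSeq prefixes exist but are not covered here.
REFSEQ_PREFIXES = {
    "NC_": "genomic DNA: complete genome, chromosome, or plasmid",
    "NG_": "genomic DNA: genomic region",
    "NT_": "genomic DNA: genomic contig",
    "NM_": "mRNA",
    "NP_": "protein",
}

def refseq_type(accession: str) -> str:
    """Return the molecule type implied by the first three characters of an accession."""
    prefix = accession.strip().upper()[:3]
    return REFSEQ_PREFIXES.get(prefix, "unknown prefix")

# Placeholder accessions, for illustration only.
for acc in ("NM_123456.1", "NP_123456.1", "NC_123456.1", "XX_123456.1"):
    print(acc, "->", refseq_type(acc))
```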

RefSeq Status Codes
• Provisional
• Reviewed
• Predicted
• Genome Annotation
more about RefSeq status codes

UCSC Genome Browser: Navigating a Genomic Region

Display Options
• Hide: removes a track from view
• Dense: all items collapsed into a single line
• Squish: each item = separate line, but 50% height + packed
• Pack: each item separate, but efficiently stacked (full height)
• Full: each item on separate line
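These display modes can also be requested when linking to the browser. The sketch below rests on several assumptions that are not on the slide: that the `hgTracks` URL accepts `db`, `position`, and `<track name>=<mode>` parameters, that `hg19` is the assembly in use, and that `refGene` is the name of the RefSeq genes track. The function name `browser_url` is hypothetical.

```python
from urllib.parse import urlencode

# The five display options listed on the slide.
VISIBILITY_MODES = ("hide", "dense", "squish", "pack", "full")

def browser_url(db: str, position: str, tracks: dict) -> str:
    """Build an hgTracks URL; `tracks` maps a track name to one of the display modes."""
    for name, mode in tracks.items():
        if mode not in VISIBILITY_MODES:
            raise ValueError(f"{name}: unknown display mode {mode!r}")
    params = {"db": db, "position": position, **tracks}
    return "https://genome.ucsc.edu/cgi-bin/hgTracks?" + urlencode(params)

# Assumed assembly (hg19) and track name (refGene); adjust to the session in use.
url = browser_url("hg19", "chr7:54318043-55974438", {"refGene": "pack"})
print(url)
```

Validating the mode before building the link catches typos such as "packed" early, since a misspelled mode would otherwise only show up as a track that does not change in the browser.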

UCSC Genome Browser: Navigating a Genomic Region

Gene Description

Gene Description
• Informative description
• other resource links
• microarray data
• mRNA secondary structure
• links to sequences
• protein domains/structure
• orthologs in other species
• Gene Ontology™ descriptions
• mRNA descriptions
• pathways
• genetic association studies
• comparative toxicology
• gene model

UCSC Genome Browser: Navigating a Genomic Region Find SNPs present in this region

Link to the video tutorial: File: UCSC_part2.swf
Resource: UCSC Genome Browser
Browse the region of human chromosome 7 between 55,033,691 and 55,282,150 bp. What genetic variations are present in this region? Retrieve the DNA sequence of this genomic region, showing SNPs in red and all gene exons in blue.
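The red/blue highlighting asked for here is done inside the browser, but the idea can be sketched in a few lines. The example below is a toy illustration, not the browser's own method: the function name `colorize`, the short sequence, and the exon and SNP coordinates are all made up, and positions are 0-based offsets into the sequence string.

```python
def colorize(seq: str, exons, snps) -> str:
    """Return HTML with SNP bases in red and exon bases in blue.

    exons: list of (start, end) half-open, 0-based offsets into seq
    snps:  set of 0-based offsets into seq
    """
    out = []
    for i, base in enumerate(seq):
        if i in snps:
            # SNPs take priority so they stay visible inside exons.
            out.append(f'<span style="color:red">{base}</span>')
        elif any(start <= i < end for start, end in exons):
            out.append(f'<span style="color:blue">{base}</span>')
        else:
            out.append(base)
    return "".join(out)

# Toy input: one exon covering offsets 1-2, one SNP at offset 2.
html = colorize("ACGT", exons=[(1, 3)], snps={2})
print(html)
```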

UCSC Genome Browser: Navigating a Genomic Region

BLAT: Map a protein sequence into the genome

UCSC Blat: Place a Peptide Seq into the Genome
Peptide Seq: NKSSHFYSNVGLQIQTYELQESNVQLKLTVVET
Nucleotide seq: AAATCCTCACATTTTTACTCAAATGTTGGACTTCAAATTCAGACATATGAACTTCAGGAAAGCAATGTTCA
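Before submitting either sequence to BLAT, it can be useful to check that the two agree. The sketch below translates the nucleotide sequence from the slide with the standard genetic code and looks for the result inside the peptide. The function name `translate` is hypothetical; the code assumes reading frame 1 and drops any incomplete trailing codon.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def translate(dna: str) -> str:
    """Translate DNA in frame 1; whitespace is ignored and a partial last codon is dropped."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

PEPTIDE = "NKSSHFYSNVGLQIQTYELQESNVQLKLTVVET"
NUCLEOTIDE = "AAATCCTCACATTTTTACTCAA ATGTTGGACTTCAAATTCAGACAT ATGAACTTCAGGAAAGC AATGTTCA"

protein = translate(NUCLEOTIDE)
print(protein)                # KSSHFYSNVGLQIQTYELQESNV
print(protein in PEPTIDE)     # True
```

The nucleotide fragment as shown encodes residues 2 through 24 of the peptide, so the two inputs should land on the same genomic location.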

Link to the video tutorial: File: Blat.swf
Resource: UCSC BLAT
Place an mRNA or peptide sequence into the human genome

UCSC Blat

UCSC Blat Peptide Seq: NKSSHFYSNVGLQIQTYELQESNVQLKLTVVET

Thank you! Any questions? Carrie Iwema, Ansuman Chattopadhyay