A Gentle Introduction to UCSC Genome Browser 陳任志, 游岳齊.


Options
I. Genome Browser
II. ENCODE
III. Blat
IV. Table Browser
V. Gene Sorter
VI. In Silico PCR
VII. Proteome Browser
VIII. Utilities
IX. Downloads

I. Genome Browser
Human (Homo sapiens) Genome Browser Gateway
– Displays any requested section of the human genome
Non-Standard Join Certificates
– Some sequence joins between adjacent clones in this assembly could not be computationally validated
– For these joins, the sequencing center responsible for the chromosome provides an electronic certificate
– The certificate should state why the submitter thinks the join is valid

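Query
Clade: a group of organisms that share a common ancestor
– vertebrate: animals with a backbone
– deuterostome: animals whose mouth forms second during embryonic development
– insect
– nematode: roundworms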

Chimp: chimpanzee
Rhesus: rhesus macaque
Opossum: opossum
X. tropicalis: frog
Tetraodon: pufferfish
Fugu: pufferfish

Assembly date Display image width

Entire chromosome – chr7 (all of chromosome 7)
Cytological band – 20p13 (region for band p13 on chr 20)
Chromosomal coordinate range – chr3: (first million bases of chr 3, counting from the p-arm telomere)
mRNA, EST, or STS marker
Keywords from the GenBank description of an mRNA (huntington)
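The query forms listed above can be told apart with a few patterns. The following is a minimal sketch, not the Genome Browser's own parser: the function name `parse_position` and the returned dictionary layout are hypothetical, and the range example `chr3:1-1,000,000` is an assumed illustration of "the first million bases", since the slide does not show the exact string.

```python
import re

# Hypothetical patterns for the query forms named on the slide.
_CHROM = re.compile(r"^chr(\d+|[XYM])$")
_RANGE = re.compile(r"^(chr(?:\d+|[XYM])):([\d,]+)-([\d,]+)$")
_BAND = re.compile(r"^(\d+|[XY])([pq]\d+(?:\.\d+)?)$")


def parse_position(query):
    """Classify a position query as range, chromosome, band, or keyword."""
    query = query.strip()
    m = _RANGE.match(query)
    if m:
        # Commas are accepted as thousands separators and removed.
        start = int(m.group(2).replace(",", ""))
        end = int(m.group(3).replace(",", ""))
        if start > end:
            raise ValueError("start must not exceed end")
        return {"kind": "range", "chrom": m.group(1), "start": start, "end": end}
    if _CHROM.match(query):
        return {"kind": "chromosome", "chrom": query}
    m = _BAND.match(query)
    if m:
        return {"kind": "band", "chrom": "chr" + m.group(1), "band": m.group(2)}
    # Anything else is treated as a marker name or keyword search.
    return {"kind": "keyword", "text": query}
```

Anything that is not a recognizable coordinate falls through to a keyword search, which mirrors how "huntington" is used on the slide.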

Search Result
– Position
– Zoom in/out
– Restriction Enzyme
– mRNA
– Conservation
– SNPs

Display option

Genome Browser Query (x)

Genome Browser Results #1 (x)

Genome Browser Results #2 (x)

Genome Browser Details (x)


II. ENCODE
– Stands for “Encyclopedia Of DNA Elements”
– A public research consortium whose project aims to identify all functional elements in the human genome sequence
– Launched by the National Human Genome Research Institute (NHGRI)
– Conducted in three phases:
  – pilot project phase (survey existing methods)
  – technology development phase (develop new methods)
  – planned production phase (…)

ENCODE Formats
Browser Extensible Data Format (BED) – for efficient access to genomic annotations
General Feature Format (GFF) – for data where there is a set of linked features
Gene Transfer Format (GTF) – a refinement of GFF that tightens the specification
Multiple Alignment Format (MAF) – a series of multiple alignments in one format
Wiggle Format (WIG) – for continuous-valued data in track format
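As an illustration of the simplest of these formats, a BED line carries at least three whitespace-separated fields: chromosome, start, and end, where the start is 0-based and the end is exclusive. The sketch below reads the first four fields only; the names `BedRecord`, `parse_bed_line`, and `bed_length`, and the sample line in the usage note, are hypothetical.

```python
from typing import NamedTuple


class BedRecord(NamedTuple):
    chrom: str
    start: int  # 0-based, inclusive
    end: int    # exclusive
    name: str   # optional fourth BED field, empty if absent


def parse_bed_line(line):
    """Parse the first three or four fields of one BED line."""
    fields = line.split()
    if len(fields) < 3:
        raise ValueError("a BED line needs at least chrom, start, end")
    name = fields[3] if len(fields) > 3 else ""
    return BedRecord(fields[0], int(fields[1]), int(fields[2]), name)


def bed_length(record):
    """Number of bases covered; no +1 because the end is exclusive."""
    return record.end - record.start
```

For a made-up line such as "chr7 1000 2000 featureA", `bed_length` gives 1000 bases.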

ENCODE Options
Regions (hg16) – old database (+mRNA, EST, & STS markers)
Regions (hg17) – new database (+mRNA, EST, & STS markers)
Data Status – the current status of ENCODE datasets
Downloads – sequence and annotation data downloads
Submission – for the submission of ENCODE-related data

ENCODE Query+Results

ENCODE Details hg16

ENCODE Details hg17

III. Blat
– BLAST-Like Alignment Tool, not BLAST
– Designed to quickly find sequences of 95% or greater similarity and of length 40 bases or more
– Use: paste in a query sequence to find its location in the genome
– The in-memory index of the genome takes up just under 1 GB of RAM
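To make the "95% similarity over 40 bases" figure concrete, the sketch below checks whether two already-aligned, equal-length sequences fall in that range. It is only an arithmetic illustration under the assumption of an ungapped alignment, not Blat's search algorithm; both function names are hypothetical.

```python
def percent_identity(a, b):
    """Percentage of positions at which two equal-length sequences agree."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    if not a:
        return 0.0
    matches = sum(1 for x, y in zip(a.upper(), b.upper()) if x == y)
    return 100.0 * matches / len(a)


def in_blat_design_range(a, b, min_identity=95.0, min_length=40):
    """True if the pair is at least 40 bases long and at least 95% identical."""
    return len(a) >= min_length and percent_identity(a, b) >= min_identity
```

With 40 bases, two mismatches still leave 95% identity, while three mismatches drop it to 92.5%.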

Blat Query
– Query sequence
– Upload file

Blat Results
– Browser view
– Detail view

Blat Result Browse

Blat Result Details

IV. Table Browser
Used to:
– get the data associated with a track in text format
– calculate intersections between tracks
– retrieve the DNA sequence covered by a track
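An intersection between two tracks can be pictured as the overlap of two sets of intervals. The sketch below assumes each track is a list of (chrom, start, end) tuples with half-open coordinates and no overlaps within a single track; the function name `intersect_tracks` is hypothetical and this is not the Table Browser's implementation.

```python
def intersect_tracks(track_a, track_b):
    """Return the regions covered by both tracks.

    Each track is a list of (chrom, start, end) tuples, half-open,
    assumed non-overlapping within the same track.
    """
    a = sorted(track_a)
    b = sorted(track_b)
    i = j = 0
    shared = []
    while i < len(a) and j < len(b):
        chrom_a, start_a, end_a = a[i]
        chrom_b, start_b, end_b = b[j]
        if chrom_a != chrom_b:
            # Advance whichever track is on the earlier chromosome.
            if chrom_a < chrom_b:
                i += 1
            else:
                j += 1
            continue
        start = max(start_a, start_b)
        end = min(end_a, end_b)
        if start < end:
            shared.append((chrom_a, start, end))
        # Move past the interval that finishes first.
        if end_a <= end_b:
            i += 1
        else:
            j += 1
    return shared
```

Sorting both tracks first lets a single pass find every overlap.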

Table Browser Query

Table Browser Results

Table Browser Options
Describe Table Schema – schema for SQL table format
Filter
– regular expression filter
– range control
Intersection??
Correlation??
Summary Statistics
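The two filter styles, a regular expression on a text field and a range limit on coordinates, can be sketched on rows held in memory. The row layout (dictionaries with `name`, `chrom`, `start`, `end`) and the function `filter_rows` are assumptions made for illustration; the Table Browser applies its filters to database tables.

```python
import re


def filter_rows(rows, name_pattern=None, chrom=None, start=None, end=None):
    """Keep rows whose name matches the pattern and which overlap the range."""
    regex = re.compile(name_pattern) if name_pattern else None
    kept = []
    for row in rows:
        if regex and not regex.search(row["name"]):
            continue
        if chrom is not None and row["chrom"] != chrom:
            continue
        # Range control: drop rows lying entirely outside [start, end).
        if start is not None and row["end"] <= start:
            continue
        if end is not None and row["start"] >= end:
            continue
        kept.append(row)
    return kept
```

Each argument is optional, so the regular expression filter and the range control can be used alone or together.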

Table Browser Schema

Table Browser Filter

Table Browser Intersection??

Table Browser Correlation??

Table Browser Summary Statistics

V. Gene Sorter
– Displays a sorted table of genes that are related to one another
– Expression is color-coded:
  – a highly expressed gene is colored red
  – a less expressed gene is shown in green

Gene Sorter Query

Gene Sorter Results

Gene Sorter Details #1

Gene Sorter Details #2

VI. In Silico PCR
– In-Silico PCR searches a sequence database with a pair of PCR primers
– Returns: a sequence output file in FASTA format containing all sequences in the database that lie between and include the primer pair
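The idea of returning everything "between and including the primer pair" can be sketched with exact string matching. This is a simplified illustration, not the UCSC tool: it searches one strand only, tolerates no mismatches, and the names `reverse_complement` and `in_silico_pcr` as well as the default `max_product_size` are assumptions.

```python
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq):
    """Reverse complement of a DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def in_silico_pcr(template, forward, reverse, max_product_size=4000):
    """Find products bounded by the forward primer and the reverse primer.

    Returns (start, end, sequence) tuples; each sequence includes both
    primer sites. Only exact matches on the given strand are reported.
    """
    template = template.upper()
    forward = forward.upper()
    # The reverse primer anneals to the opposite strand, so on the given
    # strand its site appears as the reverse complement.
    reverse_site = reverse_complement(reverse.upper())
    products = []
    start = template.find(forward)
    while start != -1:
        site = template.find(reverse_site, start + len(forward))
        while site != -1:
            end = site + len(reverse_site)
            if end - start > max_product_size:
                break
            products.append((start, end, template[start:end]))
            site = template.find(reverse_site, site + 1)
        start = template.find(forward, start + 1)
    return products
```

Lowering `max_product_size` below a product's length removes it from the result, which corresponds to the "Max product size" field of the query form.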

PCR
PCR: polymerase chain reaction, which makes many copies of a specific DNA sequence

In Silico PCR Query
– Two primer sequences
– Max product size
– Number of matches

In Silico PCR Results
– Melting temperature
– Match in uppercase
– Mismatch in lowercase
– Forward primer
– Reverse primer

VII. Proteome Browser
The UCSC Proteome Browser Gateway provides a wealth of protein information, presented as graphical images and links to external internet sites
– SwissProt information
– Proteome browser tracks
– Protein property histograms
– UCSC links / Domain information
– Comparative 3D structures
– Pathways / FASTA format

Proteome Browser Query
– Swiss-Prot/TrEMBL protein ID

Proteome Browser Tracks
– polarity
– hydrophobicity
– cysteines
– glycosylation

Proteome Browser Histograms

Proteome Browser 3D structures

VIII. Utilities
Some tools (for preparing input)
– Batch Coordinate Conversion (liftOver): converts genome coordinates and genome annotation files between assemblies
  – Why? Occasionally, a chunk of sequence may be moved to an entirely different chromosome as the map is refined
– DNA Duster formatting tool
– Protein Duster formatting tool
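Coordinate conversion between assemblies can be pictured as a lookup in a table of corresponding blocks. The sketch below is a toy model under stated assumptions: the block layout, the function `lift_position`, and all coordinates in the usage note are hypothetical, strand changes are ignored, and the real liftOver tool works from alignment files rather than a hand-written table.

```python
def lift_position(blocks, chrom, pos):
    """Map a position in the old assembly to the new assembly.

    blocks: list of (old_chrom, old_start, old_end, new_chrom, new_start)
    with half-open old intervals. Returns (new_chrom, new_pos), or None
    if the position is not covered by any block.
    """
    for old_chrom, old_start, old_end, new_chrom, new_start in blocks:
        if chrom == old_chrom and old_start <= pos < old_end:
            # Keep the same offset inside the block.
            return new_chrom, new_start + (pos - old_start)
    return None
```

With a made-up table in which the old chr1 block 1000 to 2000 now sits on chr5 starting at 500, old position chr1:1500 maps to chr5:1000, which is the kind of move between chromosomes the slide mentions.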

IX. Downloads
Offers downloads of complete genomes
– Human
– Chimpanzee
– Rhesus
– Dog
– Cow
– Mouse
– Rat
– Opossum
– Chicken