Sackler Medical School

Slides:



Advertisements
Similar presentations
© Wiley Publishing All Rights Reserved. Using Nucleotide Sequence Databases.
Advertisements

Basic Genomic Characteristic  AIM: to collect as much general information as possible about your gene: Nucleotide sequence Databases ○ NCBI GenBank ○
Peter Tsai, Bioinformatics Institute.  University of California, Santa Cruz (UCSC)  A rapid and reliable display of any requested portion of genomes.
1 Computational Molecular Biology MPI for Molecular Genetics DNA sequence analysis Gene prediction Gene prediction methods Gene indices Mapping cDNA on.
Psi-BLAST, Prosite, UCSC Genome Browser Lecture 3.
1 Computational Molecular Biology MPI for Molecular Genetics DNA sequence analysis Gene prediction methods Gene indices Mapping cDNA on genomic DNA Genome-genome.
PROMoter SCanning/ANalysis tool. Goal Creating a tool to analyse a set of putative promoter sequences and recognize known and unknown promoters, with.
Visualization of genomic data Genome browsers. UCSC browser Ensembl browser Others ? Survey.
Tutorial 7 Genome browser. Free, open source, on-line broswer for genomes Contains ~100 genomes, from nematodes to human. Many tools that can be used.
Genome Browsers Carsten O. Daub Omics Science Center RIKEN, Japan May 2008.
InterPro/prosite UCSC Genome Browser Exercise 3. Turning information into knowledge  The outcome of a sequencing project is masses of raw data  The.
Copyright OpenHelix. No use or reproduction without express written consent1 Organization of genomic data… Genome backbone: base position number sequence.
Visualization of genomic data Genome browsers. How many have used a genome browser ? UCSC browser ? Ensembl browser ? Others ? survey.
UCSC Genome Browser Tutorial
How to access genomic information using Ensembl August 2005.
Prosite and UCSC Genome Browser Exercise 3. Protein motifs and Prosite.
Genome Browsers UCSC (Santa Cruz, California) and Ensembl (EBI, UK)
Visualization of genomic data Genome browsers. UCSC browser Ensembl browser Others ? Survey.
Day 2! Chapter 15 Eukaryotic Gene Regulation Almost all the cells in an organism are genetically identical. Differences between cell types result from.
Doug Brutlag Professor Emeritus Biochemistry & Medicine (by courtesy) Genome Databases Computational Molecular Biology Biochem 218 – BioMedical Informatics.
Making Sense of DNA and protein sequence analysis tools (course #2) Dave Baumler Genome Center of Wisconsin,
NGS Analysis Using Galaxy
The Genome Genome Browser Training Materials developed by: Warren C. Lathe, Ph.D. and Mary Mangan, Ph.D. Part 1.
Locating genes in Plasmodium falciparum You have seen how artemis is used to view, analyse and annotate bacterial genomes, but now we are going to move.
Databases in Bioinformatics and Systems Biology Carsten O. Daub Omics Science Center RIKEN, Japan May 2008.
Arabidopsis Genome Annotation TAIR7 Release. Arabidopsis Genome Annotation  Overview of releases  Current release (TAIR7)  Where to find TAIR7 release.
Eukaryotic Gene Expression The “More Complex” Genome.
The UCSC Genome Browser Introduction
is accessible at: The following pages are a schematic representation of how to navigate through ALE-HSA21.
GeneWise and Artemis Exercises Spliced Alignment using GeneWise Click on the GeneWise hyperlink on the course links page,
UCSC Genome Browser 1. The Progress 2 Database and Tool Explosion : 230 databases and tools 1996 : first annual compilation of databases and tools.
Copyright OpenHelix. No use or reproduction without express written consent1.
Genomics and Personalized Care in Health Systems Lecture 5 Genome Browser Leming Zhou, PhD School of Health and Rehabilitation Sciences Department of Health.
Copyright OpenHelix. No use or reproduction without express written consent1.
Welcome to DNA Subway Classroom-friendly Bioinformatics.
Browsing the Genome Using Genome Browsers to Visualize and Mine Data.
Molecular Biology in a Nutshell (via UCSC Genome Browser) Personalized Medicine: Understanding Your Own Genome Fall 2014.
Copyright © 2009 Pearson Education, Inc. Regulation of Gene Expression in Eukaryotes Chapter 17 Lecture Concepts of Genetics Tenth Edition.
Mark D. Adams Dept. of Genetics 9/10/04
数据库使用 杨建华 2010/9/28. Outline of the Topics UCSC and Ensembl Genome Browser (Blat vs Blast vs Blastz vs Multiz) 挖掘数据用 Table Browser 或 BioMart 用户友好化你的数据.
Genes and Genomes. Genome On Line Database (GOLD) 243 Published complete genomes 536 Prokaryotic ongoing genomes 434 Eukaryotic ongoing genomes December.
Tools in Bioinformatics Genome Browsers. Retrieving genomic information Previous lesson(s): annotation-based perspective of search/data Today: genomic-based.
How can we find genes? Search for them Look them up.
Copyright OpenHelix. No use or reproduction without express written consent1.
Copyright OpenHelix. No use or reproduction without express written consent1.
Bioinformatics Workshops 1 & 2 1. use of public database/search sites - range of data and access methods - interpretation of search results - understanding.
Copyright OpenHelix. No use or reproduction without express written consent1.
GENE REGULATION RESULTS IN DIFFERENTIAL GENE EXPRESSION, LEADING TO CELL SPECIALIZATION Eukaryotic DNA.
-1- Module 3: RNA-Seq Module 3 BAMView Introduction Recently, the use of new sequencing technologies (pyrosequencing, Illumina-Solexa) have produced large.
UCSC Genome Browser Zeevik Melamed & Dror Hollander Gil Ast Lab Sackler Medical School.
Tools in Bioinformatics Genome Browsers. Retrieving genomic information Previous lesson(s): annotation-based perspective of search/data Today: genomic-based.
Finding genes in the genome
Accessing and visualizing genomics data
BIOINFORMATICS Ayesha M. Khan Spring 2013 Lec-8.
Genomes at NCBI. Database and Tool Explosion : 230 databases and tools 1996 : first annual compilation of databases and tools lists 57 databases.
The Genome Genome Browser Training Materials developed by: Warren C. Lathe, Ph.D. and Mary Mangan, Ph.D. Part 2.
Visualization of genomic data Genome browsers. How many have used a genome browser ? UCSC browser ? Ensembl browser ? Others ? survey.
Visualization of genomic data Genome browsers. UCSC browser Ensembl browser Others ? Survey.
Using public resources to understand associations Dr Luke Jostins Wellcome Trust Advanced Courses; Genomic Epidemiology in Africa, 21 st – 26 th June 2015.
Gene Regulation, Part 2 Lecture 15 (cont.) Fall 2008.
The regulation of Caspase 8 chIP-seq motifs mRNA expression DNA methylation.
? ? Individual 1Individual 2 1. Questions This is a pedigree for a disease involving a mutation within an imprinted gene. The disease manifests only when.
Regulation of Gene Expression by Eukaryotes
Visualization of genomic data
Visualization of genomic data
Genome Center of Wisconsin, UW-Madison
Introduction to G-OnRamp
Daily Warm-Up Thursday, January 9th
Ensembl Genome Repository.
Follow-up from last night: XSEDE credits
Presentation transcript:

Sackler Medical School UCSC Genome Browser Dror Hollander Gil Ast Lab Sackler Medical School

Understanding the Genome histone midifications genes GC content promoters repetitive elements conservation SNPs nucleosome occupancy miRNA gene expression non-coding RNA secondary structure alternative splicing exon-intron structure How can you examine a genomic segment while taking all of these factors into account? DNA protein RNA

Lecture Overview UCSC Genome Browser Galaxy Interface & selected tracks Detecting alternative splicing events Chromatin organization & epigenetics BLAT PCR Galaxy

UCSC Genome Browser (64 eukaryote genome genomes) track track Basic design: “the Genome Browser stacks annotation tracks beneath genome coordinate positions, allowing rapid visual correlation of different types of information” (64 eukaryote genomes) genome track track

Genome Browsing…

Basic Genome Browser Interface chromosomal position genomic coordinates zoom start codons in green mark and drag here to zoom in stop codons in red UCSC Genes track Refseq Genes track Black - feature has a corresponding entry in the Protein Data Bank (PDB) Dark blue - transcript has been reviewed or validated by either the RefSeq, SwissProt or CCDS staff Medium blue - other RefSeq transcripts Light blue - non-RefSeq transcripts intron CDS UTR gene direction (> / <)

Basic Genome Browser Interface Configure track visualization:

Basic Genome Browser Interface “RefSeq track shows known human protein-coding and non-protein-coding genes taken from the NCBI RNA reference sequences collection (RefSeq). The data are updated daily” “The UCSC track shows gene predictions based on data from RefSeq, Genbank, CCDS and UniProt… includes both protein-coding and putative non-coding transcripts… Compared to RefSeq, this gene set has generally about 10% more protein-coding genes, approximately five times as many putative non-coding genes, and about twice as many splice variants” Let’s examine a few examples online… Press on track name for full visualization Examples: UCSC: show RefSeq description, GeneCards & GO annotation RefSeq: show summary and genomic alignment Show gene with alternative ATG: ?????

Basic Genome Browser Interface A few more tracks worth mentioning: miRNA (Genes and Gene Prediction Tracks -> sno/miRNA) conservation (Comparative Genomics -> Conservation) Expression tracks Regulation tracks (chromatin structure and modifications, DNA methylation, etc.; includes ENCODE data) RNA secondary structure (Genes and Gene Prediction Tracks -> EvoFold) SNPs (Variation and Repeats -> SNPs) Mention that additional tracks are available for the hg18 genome

Basic Genome Browser Interface Get genomic DNA for the viewed coordinates: Convert sequence to a different genome assembly or genome: Convert browser window to an image file:

Lecture Overview UCSC Genome Browser Galaxy Interface & selected tracks Detecting alternative splicing events Chromatin organization & epigenetics BLAT PCR Galaxy

Detecting Alternative Splicing Events Via Human mRNAs & Spliced ESTs tracks (mRNA and EST Tracks) “The mRNA track shows alignments between human mRNAs in GenBank and the genome” “…alignments between human expressed sequence tags (ESTs) in GenBank and the genome… ESTs are single-read sequences, typically about 500 bases in length” gene DNA Mention the different fields in each mRNA / EST mRNA

Detecting Alternative Splicing Events Via Alt Events track (Genes and Gene Prediction Tracks) – based on UCSC genes gene DNA cassetteExon >

Detecting Alternative Splicing Events Via Burge RNA-seq track (Expression) Click on the track name to choose tissues gene DNA Burge RNA-seq

Different Alternative Splicing Types Exon skipping Alternative splice site (3’) Intron retention Exon skipping (PTBP1; chr19:804,980-806,685) Alternative 3’ splice site; (PTBP1; chr19:804,980-806,685) Alternative 5’ splice site; (SLC11A2; chr12:51,401,760-51,402,755) Intron retention; (CD163; chr12:7,639,792-7,641,193)

Lecture Overview UCSC Genome Browser Galaxy Interface & selected tracks Detecting alternative splicing events Chromatin organization & epigenetics BLAT PCR Galaxy

Histone Modifications

Transcription Factor Binding

DNA Methylation

Lecture Overview UCSC Genome Browser Galaxy Interface & selected tracks Detecting alternative splicing events Chromatin organization & epigenetics BLAT PCR Galaxy

BLAT BLAT = Blast-Like Alignment Tool BLAT query BLAT query BLAT = Blast-Like Alignment Tool BLAT is designed to find similarity of >95% on DNA and >80% for protein CACGCCCAGACCTGCCTTCCGGGGACAGCCAGCCCTCGCTGGACCAGACCATGGCCGCGGCCTTCGGTGCACCTGGTATAATCTCAGCCTCTCCGTATGCAGGAGCTGGTTTCCCTC

Lecture Overview UCSC Genome Browser Galaxy Interface & selected tracks Detecting alternative splicing events Chromatin organization & epigenetics BLAT PCR Galaxy

amplicon in fasta format PCR coordinates strand primers amplicon in fasta format amplicon Primers: CREB-ATF2_F: cggccaatgtagctcctcta Tm= 60.37 CREB-ATF2_R: CTCAATGGGTGCGTCAACTA Tm= 59.72 temperatures

Lecture Overview UCSC Genome Browser Galaxy Interface & selected tracks Detecting alternative splicing events Chromatin organization & epigenetics BLAT PCR Galaxy

Galaxy “Galaxy allows you to do analyses you cannot do anywhere else without the need to install or download anything. You can analyze multiple alignments, compare genomic annotations, profile metagenomic samples and much much more...”

Galaxy – What Is It Good for? Getting the best out of UCSC Operating on UCSC data Supports operations both at the interval level, and at the sequence level Designed for biologists!

Galaxy – Typical Workflow Extract sets of coordinates either upload from computer or from UCSC table browser Operate on different sets of coordinates (intersect, subtract etc.) Fetch genomic sequences of coordinates