Module 4: Understanding KO designs Mark Thomas Wellcome Trust Sanger Institute.



Aims
- Understand design criteria and selection of critical exons
- View designs using genome browsers
- Assess the effect of a particular design on gene transcripts
- Search for design/allele information using BioMart

Design criteria
One design, three alleles:
i) KO-first tagged allele
ii) Wild-type conditional allele
iii) Null allele

Design criteria
- Tagged allele: requires insertion of a reporter cassette, ideally located 5’
- Wild-type conditional allele: requires native splicing to be maintained
- Null allele: requires a critical exon whose deletion induces a frameshift
Other considerations:
- alternative splicing (e.g. nAGxxnAG, exon skipping)
- repeat elements
- conserved elements
- targeting efficiency (e.g. insertion size)
- domain structure
- translation reinitiation
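The frameshift requirement for the null allele comes down to simple arithmetic: removing an exon shifts the downstream reading frame unless its length is a multiple of three. A minimal sketch, assuming every exon in the list is fully coding; the function names and exon lengths are hypothetical and are not taken from any design pipeline:

```python
def causes_frameshift(exon_length_bp: int) -> bool:
    """True if deleting an exon of this length shifts the reading frame.

    One codon is 3 bp, so only lengths that are not a multiple of 3
    change the frame of everything downstream.
    """
    return exon_length_bp % 3 != 0


def candidate_critical_exons(coding_exon_lengths):
    """Return 1-based indices of exons whose deletion would shift the frame.

    Assumption: every exon in the list is fully coding.
    """
    return [
        index
        for index, length in enumerate(coding_exon_lengths, start=1)
        if causes_frameshift(length)
    ]


# Hypothetical exon lengths in bp, for illustration only.
example_lengths = [120, 97, 150, 88]
candidates = candidate_critical_exons(example_lengths)
print(candidates)
```

This covers only the frameshift criterion. The other considerations listed above (splicing, repeats, conservation, insertion size) would still need to be checked separately.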

Design criteria

Viewing designs
- Links to genome browsers
- Export to CSV files

Viewing designs Design x

Viewing designs
Tracks: Conservation, Repeats, Design, Gene Models, mRNA transcripts

Viewing designs
Tracks: Conservation, Repeats, Design, mRNA transcripts, Gene Models, U-oligos, D-oligos

Viewing designs
Tracks: Conservation, Repeats, Design, mRNA transcripts, Gene Models

Assessing CE selection
Transcript and Protein identifier links

Assessing CE selection
Exons link

Assessing CE selection
Exon sizes (bp), Intron/Exon phases, U-oligos (intron 2), D-oligos
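Exon sizes and phases are linked: once the coding exon lengths are known, the phase at each exon boundary follows from a running total modulo 3. A minimal sketch, assuming "phase" means the number of coding bases preceding a boundary, modulo 3; genome browsers may use a slightly different convention, and the lengths are hypothetical:

```python
def exon_phases(coding_exon_lengths, start_phase=0):
    """Return a (start_phase, end_phase) pair for each coding exon.

    Assumption: phase is the number of coding bases before the boundary,
    modulo 3, so the end phase of one exon is the start phase of the next.
    """
    phases = []
    phase = start_phase
    for length in coding_exon_lengths:
        end_phase = (phase + length) % 3
        phases.append((phase, end_phase))
        phase = end_phase
    return phases


# Hypothetical coding exon lengths in bp, for illustration only.
example_phases = exon_phases([120, 97, 150, 88])
for number, (start, end) in enumerate(example_phases, start=1):
    print(f"exon {number}: start phase {start}, end phase {end}")
```

An exon whose start and end phases differ changes the frame of the downstream exons when it is removed, which is the property a critical exon needs.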

Assessing CE selection
Coding Exons, Protein Summary

KO: transcripts in VEGA

Fahd2a, KO:Fahd2a, Critical Exon

KO: transcripts in VEGA
Fahd2a, KO:Fahd2a, Exon 3 - frameshifted

KO: transcripts in VEGA
Fahd2a, KO:Fahd2a, Exon 3, Exon 3 - Alt. frame, Exon 1

Custom design tool

Artificial intron designs
AG|G or AG|A
Knockout-first, wt conditional, Null allele
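One reading of the AG|G / AG|A notation is that an artificial intron is placed inside an exon between an AG dinucleotide and a following G or A. Under that assumption, candidate positions can be listed with a short scan. The sequence and function name are hypothetical, and this is an interpretation of the slide rather than the actual design tool:

```python
import re


def artificial_intron_sites(exon_sequence: str):
    """Return 0-based split positions where the exon reads AG|G or AG|A.

    Each position is the index of the base immediately after 'AG',
    i.e. where the sequence would be divided into exon 'a' and exon 'b'.
    """
    sequence = exon_sequence.upper()
    # Lookahead keeps the G/A out of the match so adjacent sites are all found.
    return [match.end() for match in re.finditer(r"AG(?=[GA])", sequence)]


# Hypothetical exon fragment, for illustration only.
example_sites = artificial_intron_sites("CCAGGTTAGACC")
print(example_sites)
```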

Artificial intron designs
Exon 1, Exon 1b, 1a, Targeted allele

Using BioMart to search
- Simple searches using knockoutmouse.org
- More complex searches using biomart.org

Simple searches
- Simple searches using knockoutmouse.org
- Browse by product, phenotype, location or name


Simple searches
Results page displays data from multiple resources


BioMart searches
BioMart allows users to query databases using a standard interface.

BioMart searches
GWAS Central (gwascentral.org): searching for interesting gene targets
Step 1. Use filters to restrict search parameters

BioMart searches
Select attributes for results output

BioMart searches
Apply additional filters to further restrict results

BioMart searches
Results from GWAS Central
knockoutmouse.org

BioMart searches
knockoutmouse.org: vectors, ES cells and mice available

BioMart searches
BioMart builds queries using a series of dropdown menus, identifying:
- Database selection
- Filters: define the search terms
- Attributes: select the results to be returned
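The same three choices (dataset, filters, attributes) can also be written down as an XML query document, which is the form BioMart web services commonly accept. A minimal sketch that only builds the XML and does not contact any server; the dataset, filter and attribute names are placeholders rather than real IKMC names, and the `Query` header values are assumptions:

```python
import xml.etree.ElementTree as ET


def build_query(dataset, filters, attributes):
    """Build a BioMart-style XML query string.

    dataset    -- name of the dataset to search (placeholder here)
    filters    -- dict of filter name -> value; restricts the search
    attributes -- list of attribute names; the columns to return
    """
    query = ET.Element("Query", {
        "virtualSchemaName": "default",   # assumed default schema
        "formatter": "TSV",               # tab-separated results
        "header": "1",
        "uniqueRows": "1",
        "datasetConfigVersion": "0.6",
    })
    dataset_node = ET.SubElement(
        query, "Dataset", {"name": dataset, "interface": "default"}
    )
    for name, value in filters.items():
        ET.SubElement(dataset_node, "Filter", {"name": name, "value": value})
    for name in attributes:
        ET.SubElement(dataset_node, "Attribute", {"name": name})
    return ET.tostring(query, encoding="unicode")


# Placeholder names: check the target mart for the real ones.
xml_text = build_query(
    "example_dataset",
    {"example_gene_filter": "Fahd2a"},
    ["example_attribute_1", "example_attribute_2"],
)
print(xml_text)
```

The names offered in the dropdown menus of the web interface are the ones to substitute for the placeholders.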

BioMart searches
Search results are returned in a table containing links to data sources

BioMart searches
Different databases/datasets can be linked in BioMart, allowing results to be combined.
IKMC dataset, Europhenome dataset
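Linking two datasets amounts to joining their result tables on a shared identifier. A minimal sketch of that idea using plain dictionaries; the rows, column names and gene labels are made-up placeholders, not real IKMC or Europhenome records, and BioMart performs the linking server-side rather than like this:

```python
def link_datasets(left_rows, right_rows, key):
    """Inner-join two lists of row dicts on a shared key column."""
    # Index the right-hand table by key so each lookup is a dict access.
    index = {}
    for row in right_rows:
        index.setdefault(row[key], []).append(row)

    combined = []
    for row in left_rows:
        for match in index.get(row[key], []):
            combined.append({**row, **match})
    return combined


# Placeholder rows, for illustration only.
product_rows = [
    {"gene": "GeneA", "product": "ES cells"},
    {"gene": "GeneB", "product": "vector"},
]
phenotype_rows = [
    {"gene": "GeneA", "phenotype": "placeholder phenotype"},
]

linked = link_datasets(product_rows, phenotype_rows, key="gene")
print(linked)
```

Only genes present in both tables survive the join, so here the combined result keeps GeneA and drops GeneB.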

Resources
- Ensembl
- VEGA: vega.sanger.ac.uk
- UCSC: genome.ucsc.edu
- MGI
- IKMC portal: knockoutmouse.org
- Europhenome
- GWAS Central: gwascentral.org
- BioMart: biomart.org

Genbank
GenBank files can be downloaded for every vector in IKMC

Genbank

Design criteria
Max 3.5 kb

Translation Re-initiation
Critical exons should be as 5’ as possible, unless the resulting KO translation is shorter than 35 aa. Under these conditions translation may be reinitiated, producing an N-terminally truncated protein.
Example gene: Bcam

Translation Re-initiation
ATGGAACCCCCTGACGCCCGCGCAGGGCTGCTGTGGCTCACCT
TCCTGCTGTCGGGCTACTCAG
GTTGATGGTACTGGGGCTCGACACCGTCTGGCTTCTGTGGAACCACAGGGCTCAGAGTTC
CTGGGCACAGTCCACTCTCTGGGCCGCGTACCCCCATACGAGGTAGACTCTCGTGGGCGC
CTGGTGATAGCAAAGGTCCAGGTGGGCGATGGACGGGACTACGTGTGCGTAGTGAAGGCT
GGGGCAGCGGGTACCTCAGAGGCCACCTCAAGTGTCCGTGTGTTTG

Translation:
 1 MEPPDARAGL LWLTFLLSGY SG*WYWGSTP SGFCGTTGLR VPGHSPLSGP
51 RTPIRGRLSW APGDSKGPGG RWTGLRVRSE GWGSGYLRGH LKCPCV
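The 35 aa rule can be checked directly on the sequence shown above by translating from the ATG to the first stop codon and counting residues. A minimal sketch using the standard genetic code; the function names are hypothetical, and only the start of the slide's sequence is used, since it already contains the first in-frame stop:

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_to_stop(dna: str) -> str:
    """Translate from the first base until the first in-frame stop codon."""
    peptide = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        peptide.append(amino_acid)
    return "".join(peptide)


def reinitiation_risk(dna: str, threshold_aa: int = 35) -> bool:
    """True if the peptide before the first stop is shorter than the threshold.

    Note: a fragment with no stop codon is judged on its full length,
    so pass enough sequence to reach the first stop.
    """
    return len(translate_to_stop(dna)) < threshold_aa


# First three lines of the KO transcript sequence from the slide.
KO_CDS_START = (
    "ATGGAACCCCCTGACGCCCGCGCAGGGCTGCTGTGGCTCACCT"
    "TCCTGCTGTCGGGCTACTCAG"
    "GTTGATGGTACTGGGGCTCGACACCGTCTGGCTTCTGTGGAACCACAGGGCTCAGAGTTC"
)

peptide = translate_to_stop(KO_CDS_START)
print(peptide, len(peptide), reinitiation_risk(KO_CDS_START))
```

For this sequence the peptide before the stop is 22 residues long (MEPPDARAGLLWLTFLLSGYSG), matching the SG* position in the slide's translation and falling below the 35 aa threshold.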