Presentation is loading. Please wait.

Presentation is loading. Please wait.

BRUDNO LAB: A WHIRLWIND TOUR Marc Fiume Department of Computer Science University of Toronto.

Similar presentations


Presentation on theme: "BRUDNO LAB: A WHIRLWIND TOUR Marc Fiume Department of Computer Science University of Toronto."— Presentation transcript:

1 BRUDNO LAB: A WHIRLWIND TOUR Marc Fiume Department of Computer Science University of Toronto

2 1. what we do, our tools 2. Savant Genome Browser Outline Savant Genome Browser - http://compbio.cs.toronto.edu/savant/

3 WHAT WE DO Savant Genome Browser - http://compbio.cs.toronto.edu/savant/

4 main focus: genomic analysis using output from high- throughput sequencing (HTS) machines high throughput: sequence billions of nucleotides per week poor data quality: “reads” are shorter; error profiles are poorly understood What we do Savant Genome Browser - http://compbio.cs.toronto.edu/savant/

5 HTS Pipeline Savant Genome Browser - http://compbio.cs.toronto.edu/savant/

6 What to do with all these reads? Savant Genome Browser - http://compbio.cs.toronto.edu/savant/

7 1. Assembly Savant Genome Browser - http://compbio.cs.toronto.edu/savant/ ASSEMBLY: reconstruct the donor’s genome “HapSembler”: specialized for highly polymorphic species

8 2. Alignment Savant Genome Browser - http://compbio.cs.toronto.edu/savant/ ALIGNMENT find region in a “reference” genome that matches closely with each read; suggests similar origin from “donor” “SHRiMP”: Short Read Mapping Package

9 3. Genetic Variation Discovery Savant Genome Browser - http://compbio.cs.toronto.edu/savant/ GENETIC VARIATION DISCOVERY find differences between two genomes between donor and reference between two samples (e.g. tumour vs. normal) “VARiD”, “MODiL”, and “CNVer”

10 Genetic Variation Single Nucleotide Polymorphism (SNP): genomes have different nucleotides at corresponding positions VARiD – VARiation IDentification Insertions and Deletions (Indels): genomes have additional sequence put in or sequence taken out at corresponding locations MODiL – Mixtures of Distributions Indel Locator Copy Number Variation (CNV): genomes have a different number of the same sequence CNVer Savant Genome Browser - http://compbio.cs.toronto.edu/savant/

11 Our Bioinformatics Tools READ MAPPING (SHRiMP) SNP DETECTION (VARiD) SNP DETECTION (VARiD) INDEL DETECTION (MODiL) INDEL DETECTION (MODiL) CNV DETECTION (CNVer) CNV DETECTION (CNVer) ASSEMBLY (HapSembler) ASSEMBLY (HapSembler) VISUALIZATION (SAVANT) VISUALIZATION (SAVANT) COMPRESSION

12 SAVANT GENOME BROWSER Savant Genome Browser - http://compbio.cs.toronto.edu/savant/

13 Genome Browsing, the old way Savant Genome Browser - http://compbio.cs.toronto.edu/savant/

14 Challenge presented by HTS datasets genomic data is generated in high volumes HTS machines generate billions of bases per run interpretation and analysis challenge typical pipeline employs many separate tools for computation and visualization Savant Genome Browser - http://compbio.cs.toronto.edu/savant/

15 Tools for HTS data analysis ToolCostComputationVisualization Read Alignment e.g. Bowtie, BWA FreeYN File Format Conversion e.g. Galaxy, SAMTools FreeYN Other Comand-line Tools e.g. Genetic Variation Discovery, Comparitive Genomics, etc. FreeYN UCSC Genome BrowserFreeNY Integrative Genomics ViewerFreeNY GBrowseFreeNY CLC Genomics Workbench$$$YY Savant Genome Browser - http://compbio.cs.toronto.edu/savant/ substantial disconnect between the processes of computational analysis and visualization

16 Tools for Genomic Data Analysis ToolCostComputationVisualization Read Alignment e.g. Bowtie, BWA FreeYN File Format Conversion e.g. Galaxy, SAMTools FreeYN Other Comand-line Tools e.g. Genetic Variation Discovery, Comparitive Genomics, etc. FreeYN UCSC Genome BrowserFreeNY Integrative Genomics ViewerFreeNY GBrowseFreeNY CLC Genomics Workbench$$$YY Savant Genome BrowserFreeYY Savant Genome Browser - http://compbio.cs.toronto.edu/savant/ substantial disconnect between the processes of computational analysis and visualization

17 ASIDE: Cytoscape? platform for visual analysis of networks extensive plugin framework Savant Genome Browser - http://compbio.cs.toronto.edu/savant/ Bader Lab

18 Savant Genome Browser platform for integrated visual analysis of genomic data feature-rich genome browser computationally extensible via plugin framework Savant Genome Browser - http://compbio.cs.toronto.edu/savant/

19 (Very) Short List of Features FASTA, BED, WIG, GFF, tab-delimited, BAM local and remote Data Format Support pack, squish for BED and GFF tracks mismatch, SNP, matepair modes for BAM tracks Alternative Visualization Modes very fast data access (<1s) small memory footprint (<250 MB) Speed and Interactivity sessions, bookmarking of interesting regions, track locking, data selection Extras ~ none, works on all major operating systems System Requirements Savant Genome Browser - http://compbio.cs.toronto.edu/savant/

20 FEATURE DEMONSTRATION Savant Genome Browser - http://compbio.cs.toronto.edu/savant/ INTERFACE HTS READ ALIGNMENTS EXAMPLE PLUGIN: SNP FINDER

21 Power of visual analytics task: find the correct parameter for command-line tool Savant Genome Browser - http://compbio.cs.toronto.edu/savant/

22 Plugin Framework unlocks the potential for performing visual analytics beneficial for both users and tool developers tool developers: simple platform for development and dissemination of work plugin development is easy API contains over a hundred prebuilt functions (e.g. get track data, add bookmarks, draw custom graphics, etc.) Savant Genome Browser - http://compbio.cs.toronto.edu/savant/

23 CONCLUSIONS Savant Genome Browser - http://compbio.cs.toronto.edu/savant/

24 Conclusions Savant is a platform for integrated visualization and analysis of genomic data stand-alone genome browser novel features: e.g. table view, visualization modes, data selection, etc. computationally extensible through plugin framework makes interpretation and analysis of genomic data easier and more efficient Savant Genome Browser - http://compbio.cs.toronto.edu/savant/

25 Acknowledgements RecepAndrewVladMike Brudno YueMarc Vanessa Orion JoeNilgun Paul Vera MiskoYoni

26 Thanks! Savant Genome Browser - http://compbio.cs.toronto.edu/savant/


Download ppt "BRUDNO LAB: A WHIRLWIND TOUR Marc Fiume Department of Computer Science University of Toronto."

Similar presentations


Ads by Google