Data Integration & Data Mining Tool Donald Dunbar BHF CoRE Bioinformatics Team Edinburgh Bioinformatics Meeting April 2013.

Slides:



Advertisements
Similar presentations
Introductory to database handling Endre Sebestyén.
Advertisements

Bioinformatics Platform Three-tier Architecture Object-based Relational Database implemented using Oracle Middleware implemented using Entity-Class Operations,
Discovery Studio AtlasStore: Protein/Ligand Database Steve Potts, Ph.D., MBA Product Manager Biological Informatics
Bioinformatics for genomics Kickoff Bioinformatics Expertise Center 10 November 2009 Judith Boer Dept. of Human Genetics.
Bioinformatics (and Systems Biology?) in Biomedical Research Donald Dunbar Systems Biology Club 30th November 2005.
Pathways analysis Iowa State Workshop 11 June 2009.
Provenance in a Collaborative Bio-database RAASWiki Donald Dunbar & Jon Manning Queen’s Medical Research Institute University of Edinburgh Use Cases for.
Modeling Functional Genomics Datasets CVM Lesson 3 13 June 2007Fiona McCarthy.
Centers of Excellence for Influenza Research and Surveillance 6 th Annual Meeting Aug 1, 2012 Status of IRD Development.
Pathways & Networks analysis COST Functional Modeling Workshop April, Helsinki.
Encyclopedia of Genetics, Genomics, Proteomics and Bioinformatics USC School of Medicine Library.
Bioinformatics at WSU Matt Settles Bioinformatics Core Washington State University Wednesday, April 23, 2008 WSU Linux User Group (LUG)‏
Gene function analysis Stem Cell Network Microarray Course, Unit 5 May 2007.
Five Slides About EGAN Jesse Paquette UCSF Helen Diller Family Comprehensive Cancer Center
Where we are and where we are going From biology to data and back again Chris Evelo Department of Bioinformatics - BiGCaT Maastricht University.
Data Mining in Ensembl with EnsMart. 2 of 24 All genes from a candidate region Genes with a particular protein domain Members of a protein family Genes.
August 29, 2002InforMax Confidential1 Vector PathBlazer Product Overview.
Larry Lam Southern California Bioinformatics Summer Institute 2009 Graeber Lab – Crump Institute for Molecular Imaging UCLA A Data Management and Analysis.
Data retrieval BioMart Data sets on ftp site MySQL queries of databases Perl API access to databases Export View.
341: Introduction to Bioinformatics Dr. Natasa Przulj Deaprtment of Computing Imperial College London
Using the Drupal Content Management Software (CMS) as a framework for OMICS/Imaging-based collaboration.
Genome database & information system for Daphnia Don Gilbert, October 2002 Talk doc at
BTN323: INTRODUCTION TO BIOLOGICAL DATABASES Day2: Specialized Databases Lecturer: Junaid Gamieldien, PhD
Introduction: Drupal is a free and open-source content management system (CMS). A content management system(CMS) is a computer program that allows publishing,
Copyright OpenHelix. No use or reproduction without express written consent1.
EGAN: Exploratory Gene Association Networks by Jesse Paquette Biostatistics and Computational Biology Core Helen Diller Family Comprehensive Cancer Center.
BIF Group Project Group (A)rabidopsis: David Nieuwenhuijse Matthew Price Qianqian Zhang Thijs Slijkhuis Species: C. Elegans Project: Advanced.
Basic features for portal users. Agenda - Basic features Overview –features and navigation Browsing data –Files and Samples Gene Summary pages Performing.
Intralab Workshop - Reactome CMAP Chang-Feng Quo June 29 th, 2006.
ChipDB: An interactive database system for high- throughput expression analysis Peter Young, John Barnett, Bing Ren, Ezra Jennings and Richard Young Whitehead.
Regulatory Genomics Lab Saurabh Sinha Regulatory Genomics Lab v1 | Saurabh Sinha1 Powerpoint by Casey Hanson.
Taverna Workflow. A suite of tools for bioinformatics Fully featured, extensible and scalable scientific workflow management system – Workbench, server,
MMAP: mouse Metabolomics Analysis Platform Preeti Bais 09/09/2014.
Copyright OpenHelix. No use or reproduction without express written consent1.
1 of 38 Data Mining in Ensembl with BioMart. 2 of 38 Simple Text-based Search Engine.
Copyright © 2009 Pearson Education, Inc. Genomics, Bioinformatics, and Proteomics Chapter 21 Lecture Concepts of Genetics Tenth Edition.
Browsing the Genome Using Genome Browsers to Visualize and Mine Data.
DAVID R. SMITH DR. MARY DOLAN DR. JUDITH BLAKE Integrating the Cell Cycle Ontology with the Mouse Genome Database.
MIAMExpress and the development of annotation ontologies for gene expression experiments Ele Holloway Microarray Informatics European Bioinformatics Institute.
Gramene Objectives Provide researchers working on grasses and plants in general with a bird’s eye view of the grass genomes and their organization. Work.
Bioinformatics Core Facility Guglielmo Roma January 2011.
DAVID R. SMITH DR. MARY DOLAN DR. JUDITH BLAKE Integrating the Cell Cycle Ontology with the Mouse Genome Database.
EuPathDB: an integrated resource and tool for eukaryotic pathogen bioinformatics Aurrecoechea C., Heiges M., Warrenfeltz S. for the EuPathDB team CTEGD,
Data provenance in biomedical discovery Donald Dunbar Queen’s Medical Research Institute University of Edinburgh Workshop on Principles of Provenance in.
Analysis of GEO datasets using GEO2R Parthav Jailwala CCR Collaborative Bioinformatics Resource CCR/NCI/NIH.
Ela Hunt, MRC research fellow Department of Computing Science SyntenyVista BIOINFORMATICS RESEARCH CENTRE.
Generic Database. What should a genome database do? Search Browse Collect Download results Multiple format Genome Browser Information Genomic Proteomic.
Copyright OpenHelix. No use or reproduction without express written consent1.
Data Mining at PLEXdb : Plant and Plant Pathogen Gene Expression Database.
A collaborative tool for sequence annotation. Contact:
Copyright OpenHelix. No use or reproduction without express written consent1.
CBioPortal Web resource for exploring, visualizing, and analyzing multidimentional cancer genomics data.
Copyright OpenHelix. No use or reproduction without express written consent1 1.
Accessing and visualizing genomics data
CCLE Cancer Cell Line Encyclopedia Alexey Erohskin.
Bioinformatics Shared Resource Introduction to Gene Expression Omnibus (GEO) bsrweb.sanfordburnham.org
Microarray Technology and Data Analysis Roy Williams PhD Sanford | Burnham Medical Research Institute.
National Cancer Institute Uma Mudunuri ABCC, NCI-Frederick ISRCE Monthly Meeting, Nov 9th 2010 bioDBnet The biological DataBase network.
CellExpress Tutorial A Comprehensive Microarray-Based Cancer Cell Line and Clinical Sample Gene Expression Analysis Online System :8080 NTU.
Networks and Interactions
Hub Updates for Year 3 Carl Kesselman.
Using the Drupal Content Management Software (CMS) as a framework for OMICS/Imaging-based collaboration.
Functional Annotation of the Horse Genome
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Network biology An introduction to STRING and Cytoscape
Gene Safari (Biological Databases)
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Presentation transcript:

Data Integration & Data Mining Tool Donald Dunbar BHF CoRE Bioinformatics Team Edinburgh Bioinformatics Meeting April 2013

BHF CoRE Bioinformatics

Data Integration and our Data Mining Tool Our strategy is to help biologists make the most of their ‘-omics’ data We analyse array and sequence data using current methods Biologists mine their results in a custom built, secure web based platform We help integrate other relevant data from biologist’s lab and the literature

Data Mining Tool Wish list – Web accessible – Secure – Complex queries across datasets – Technology agnostic – Query cross species – Annotation, statistics and graphs – Links to external databases – Include downstream tools

Data Mining Tool Login via EASE or htaccess (+ vpn) Built in PHP with mySQL back end Generic database structure for statistics – counts, intensity, fold change, p-value Separate annotation tables Includes experiment details and QC info Query builder type interface Output as tables with links Gene set enrichment, heat-map and literature

Data Integration Across technologies – array, sequencing – gene expression, methylation, proteomics, genetics Across species – Human, mouse, rat, fly, fish At the gene level – Probe level for within array – Entrez gene within species – orthologous groups across species

Development New platform: Drupal – Some nice features – New look and feel Web services – interactions, diseases, TF binding sites, miRNA… More use of literature data – Top 10 co-cited on gene detail page – Better visualisation – Better text mining Correlation data (expression profiles) – searchable with other stats Cross experiment gene sets

Thanks Jon Manning John Mullins Our collaborators British Heart Foundation

| “Providing bioinformatics services to biology teams throughout the research process”