Sage Bionetworks A non-profit organization with a vision to enable networked team approaches to building better models of disease BIOMEDICINE INFORMATION.

Presentation transcript:

Sage Bionetworks: A non-profit organization with a vision to enable networked team approaches to building better models of disease. BIOMEDICINE, INFORMATION, COMMONS, INCUBATOR. Data Repository; Discovery Platform; Building Disease Maps; Commons Pilots.

What is the problem? Our current models of disease biology are primitive and limit doctors' understanding and ability to treat patients. Current incentives reward those who silo information and work in closed systems.

Biological System; Data; Analysis. Iterative Networked Approaches to Generating, Analyzing, and Supporting New Models. Uncouple the automatic linkage between the data generators, analyzers, and validators.

Two approaches to building common scientific and technical knowledge. First: a text summary of the completed project, assembled after the fact. Second (Social Coding): every code change versioned; every issue tracked; every project the starting point for new work; all evolving and accessible in real time.

Synapse is GitHub for Biomedical Data. Social Science: data and code versioned; analysis history captured in real time; work anywhere, and share the results with anyone. Social Coding: every code change versioned; every issue tracked; every project the starting point for new work; all evolving and accessible in real time.

Data Analysis with Synapse: Run Any Tool On Any Platform; Record in Synapse; Share with Anyone.
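The run, record, share loop on this slide can be pictured as a small provenance log: each stored result gets a new version and remembers which data it used and which code produced it. The following is a minimal in-memory sketch of that idea only. It is not the Synapse API; `Record`, `ProvenanceLog`, `record`, `history`, and the file names are all hypothetical names chosen for illustration.

```python
import hashlib
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass(frozen=True)
class Record:
    """One versioned result plus the inputs and code that produced it."""
    name: str
    version: int
    checksum: str
    used: tuple       # names of input data (hypothetical labels)
    executed: tuple   # names of the code that was run (hypothetical labels)


@dataclass
class ProvenanceLog:
    """Hypothetical stand-in for a shared store; not the Synapse API."""
    _records: Dict[str, List[Record]] = field(default_factory=dict)

    def record(self, name, content: bytes, used=(), executed=()) -> Record:
        """Store a new version of `name` and capture how it was made."""
        versions = self._records.setdefault(name, [])
        rec = Record(
            name=name,
            version=len(versions) + 1,  # versions count up from 1
            checksum=hashlib.sha256(content).hexdigest(),
            used=tuple(used),
            executed=tuple(executed),
        )
        versions.append(rec)
        return rec

    def history(self, name) -> List[Record]:
        """Return every recorded version of `name`, oldest first."""
        return list(self._records.get(name, []))


# Illustrative usage with made-up file names and contents.
log = ProvenanceLog()
log.record("predictions.csv", b"first run",
           used=["expression.tsv"], executed=["fit_model.R"])
log.record("predictions.csv", b"second run",
           used=["expression.tsv"], executed=["fit_model.R"])
for rec in log.history("predictions.csv"):
    print(rec.version, rec.checksum[:8], rec.used, rec.executed)
```

Keeping the inputs and the executed code next to every version is what lets a later reader trace a result back to its sources, whichever tool or platform produced it.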

Demo

Analysis Records and Visualizations

Scalable Analysis Pipelines Full case study at

Additional Data Analysis Tools and Capabilities

Digital Publication Builder

What is the problem? Our current models of disease biology are primitive and limit doctors' understanding and ability to treat patients. Current incentives reward those who silo information and work in closed systems.

The Solution: Competitions to crowd-source research in biology and other fields
• Why competitions?
  - Objective assessments
  - Acceleration of progress
  - Transparency
  - Reproducibility
  - Extensible, reusable models
  - Intensity and focus
  - Parallel efforts
• Competitions in biomedical research
  - CASP (protein structure)
  - Foldit / EteRNA (protein / RNA structure)
  - CAGI (genome annotation)
  - Assemblathon / Alignathon (genome assembly / alignment)
  - SBV Improver (industrial methodology benchmarking)
  - DREAM (co-organizer of Sage/DREAM competition)
• Generic competition platforms
  - Kaggle, InnoCentive, MLComp

Sage/DREAM Challenge: Details and Timing
Phase 1: July through end of September 2012
• Training data: 2,000 breast cancer samples from the METABRIC cohort
  - Gene expression
  - Copy number
  - Clinical covariates
  - 10-year survival
• Synapse hosting
  - Sage curated and normalized data
  - Data available via download or API (R)
  - Models implemented in R and conforming to interface
  - 2,000-core cloud computing integration with Google Compute Engine (donation)
  - Real-time scoring of model predictions and posting to leaderboard
• Will evaluate accuracy of models to predict survival in:
  - Held-out samples from METABRIC
  - Other datasets
Phase 2: October 1 through November 12, 2012
• Evaluation of models in novel dataset
• Validation data: ~500 fresh frozen tumors from Norway group with:
  - Clinical covariates
  - 10-year survival
• Gene expression and copy number data to be generated for model evaluation
  - Sent to Cancer Research UK to generate data at same facility as METABRIC
  - Models built on training data evaluated on newly generated data
• Winners announced at November 12 DREAM conference
• Winners to be published in Science Translational Medicine
• Synapse as alternative to traditional peer review
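Scoring survival predictions in real time needs a metric that copes with censored follow-up, where a patient is still alive at last contact. The slide does not name the metric, so the sketch below assumes a concordance index: the fraction of comparable patient pairs whose predicted risk ordering agrees with their observed survival ordering. The function names `concordance_index` and `leaderboard`, the team names, and the toy data are hypothetical and only illustrate the idea.

```python
from itertools import combinations


def concordance_index(times, events, risks):
    """Fraction of comparable pairs ranked correctly by predicted risk.

    times:  observed follow-up time per patient
    events: True if death was observed, False if censored
    risks:  predicted risk score (higher means shorter expected survival)
    """
    concordant = 0.0
    comparable = 0
    for i, j in combinations(range(len(times)), 2):
        # Order the pair so that `a` has the shorter follow-up time.
        a, b = (i, j) if times[i] < times[j] else (j, i)
        # A pair is comparable only when the times differ and the
        # shorter time is an observed event rather than a censoring.
        if times[a] == times[b] or not events[a]:
            continue
        comparable += 1
        if risks[a] > risks[b]:
            concordant += 1.0
        elif risks[a] == risks[b]:
            concordant += 0.5  # tied predictions count as half
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable


def leaderboard(submissions, times, events):
    """Score every submission and return (name, score) pairs, best first."""
    scored = [(name, concordance_index(times, events, risks))
              for name, risks in submissions.items()]
    return sorted(scored, key=lambda item: item[1], reverse=True)


# Toy held-out data, invented for illustration only.
times = [2, 4, 6, 8]
events = [True, True, False, True]
submissions = {
    "team_a": [4, 3, 2, 1],   # risk falls as survival time rises
    "team_b": [1, 2, 3, 4],   # risk ordering fully reversed
    "team_c": [1, 1, 1, 1],   # uninformative constant prediction
}
for name, score in leaderboard(submissions, times, events):
    print(f"{name}: {score:.2f}")
```

A score of 0.5 corresponds to random or constant predictions, which gives participants an immediate baseline to beat when they read the leaderboard.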

Sage-DREAM Breast Cancer Prognosis Challenge: one month of building better disease models together
Challenge Launch, July 17: 154 participants; 27 countries
August 17 Status: 268 participants; 32 countries; 290 models posted to Leaderboard
breast cancer data

Funding Acknowledgements
Synapse Team: Chris Barr, Matt Furia, John Hill, Jay Hodgson, Bruce Hoff, Michael Kellen, Bennett Ng, Geoff Shannon, Xavier Schildwachter, Eric Wu

Slide Backups