MAGNet Center: Andrea Califano NCIBI: Brian Athey Simbios: Russ Altman Creating a DBP Community to Enhance the NCBC Biomedical Impact NCBC Work Group Report,

Slides:



Advertisements
Similar presentations
DELOS Highlights COSTANTINO THANOS ITALIAN NATIONAL RESEARCH COUNCIL.
Advertisements

NA-MIC National Alliance for Medical Image Computing National Alliance for Medical Image Computing: NAMIC Ron Kikinis, M.D.
Kensington Oracle Edition: Open Discovery Workflow Meets Oracle 10g Professor Yike Guo.
NCBC “Big P” Activities Russ Altman on behalf of NCBC PIs and NIH/NCBC Staff.
Priority Research Direction: Portable de facto standard software frameworks Key challenges Establish forums for multi-institutional discussions. Define.
Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
1 Richard White Design decisions: architecture 1 July 2005 BiodiversityWorld Grid Workshop NeSC, Edinburgh, 30 June - 1 July 2005 Design decisions: architecture.
Systems Biology Existing and future genome sequencing projects and the follow-on structural and functional analysis of complete genomes will produce an.
National Cancer Institute U.S. DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health NCI Perspective on Informatics and Clinical Decision.
1 genSpace: Community- Driven Knowledge Sharing for Biological Scientists Gail Kaiser’s Programming Systems Lab Columbia University Computer Science.
GeWorkbench caGrid TeraGrid Integration Scott Oster Ohio State University – Dept. of Biomedical Informatics Christine Hung Columbia University – JCSB/C2B2.
Fungal Semantic Web Stephen Scott, Scott Henninger, Leen-Kiat Soh (CSE) Etsuko Moriyama, Ken Nickerson, Audrey Atkin (Biological Sciences) Steve Harris.
ONCOMINE: A Bioinformatics Infrastructure for Cancer Genomics
BIOCMS: Resource Integration and Web Application Framework for Bioinformatics DHUNDY R BASTOLA †, *, ANIL KHADKA †, MOHAMMAD SHAFIULLAH † AND HESHAM ALI.
Using the Drupal Content Management Software (CMS) as a framework for OMICS/Imaging-based collaboration.
>>> Korean BioInformation Center >>> KRIBB Korea Research institute of Bioscience and Biotechnology GS2PATH: Linking Gene Ontology and Pathways Jin Ok.
9/30/2004TCSS588A Isabelle Bichindaritz1 Introduction to Bioinformatics.
Ch10. Intermolecular Interactions and Biological Pathways
Cytoscape A powerful bioinformatic tool Mathieu Michaud
NA-MIC National Alliance for Medical Image Computing The NA-MIC Kit National Alliance for Medical Image Computing.
NA-MIC National Alliance for Medical Image Computing National Centers for Biomedical Computing Software and Data Integration Working.
Biomedical Computation at NIH Michael Marron National Center for Research Resources National Institutes of Health.
NA-MIC National Alliance for Medical Image Computing NA-MIC Software Engineering Bill Lorensen GE Research NA-MIC Engineering Core PI.
Department of Biomedical Informatics Service Oriented Bioscience Cluster at OSC Umit V. Catalyurek Associate Professor Dept. of Biomedical Informatics.
GTL Facilities Computing Infrastructure for 21 st Century Systems Biology Ed Uberbacher ORNL & Mike Colvin LLNL.
EGAN: Exploratory Gene Association Networks by Jesse Paquette Biostatistics and Computational Biology Core Helen Diller Family Comprehensive Cancer Center.
Networks and Interactions Boo Virk v1.0.
NA-MIC National Alliance for Medical Image Computing Core 1b – Engineering Highlights, Aims and Architecture Will Schroeder Kitware.
Integrated Biomedical Information for Better Health Workprogramme Call 4 IST Conference- Networking Session.
Markus Dolensky, ESO Technical Lead The AVO Project Overview & Context ASTRO-WISE ((G)A)VO Meeting, Groningen, 06-May-2004 A number of slides are based.
Taverna and my Grid Open Workflow for Life Sciences Tom Oinn
GenePattern Overview for MAGE-TAB Workshop Ted Liefeld January 24, 2007.
Helping scientists collaborate BioCAD. ©2003 All Rights Reserved.
Taverna Workflow. A suite of tools for bioinformatics Fully featured, extensible and scalable scientific workflow management system – Workbench, server,
IPlant cyberifrastructure to support ecological modeling Presented at the Species Distribution Modeling Group at the American Museum of Natural History.
1 Web: Steve Brewer: Web: EGI Science Gateways Initiative.
Building and Running caGrid Workflows in Taverna 1 Computation Institute, University of Chicago and Argonne National Laboratory, Chicago, IL, USA 2 Mathematics.
National Centers for Biomedical Computing Software and Data Integration Working Group Peter Lyster (Chair) NCBC Workshop Wednesday December 16 (2005)
GeWorkbench Highlights caBIG ® Molecular Analysis Tools Knowledge Center AACR Annual Meeting, April 3, 2011.
Agent-based methods for translational cancer multilevel modelling Sylvia Nagl PhD Cancer Systems Science & Biomedical Informatics UCL Cancer Institute.
What is an Ontology? An ontology is a specification of a conceptualization that is designed for reuse across multiple applications and implementations.
Towards a Common Annotation Framework for Knowledge Acquisition College Station, Texas, 2014.
Ontologies GO Workshop 3-6 August Ontologies  What are ontologies?  Why use ontologies?  Open Biological Ontologies (OBO), National Center for.
Facilitate Scientific Data Sharing by Sharing Informatics Tools and Standards Belinda Seto and James Luo National Institute of Biomedical Imaging and Bioengineering.
Bioinformatics Core Facility Guglielmo Roma January 2011.
Protein Information Resource Protein Information Resource, 3300 Whitehaven St., Georgetown University, Washington, DC Contact
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Biological Signal Detection for Protein Function Prediction Investigators: Yang Dai Prime Grant Support: NSF Problem Statement and Motivation Technical.
Nature Reviews/2012. Next-Generation Sequencing (NGS): Data Generation NGS will generate more broadly applicable data for various novel functional assays.
Core 2: Bioinformatics NCBO-Berkeley. Core 2 Specific Aims 1.Apply ontologies  Software toolkit for describing and classifying data 2.Capture, manage,
Structural Models Lecture 11. Structural Models: Introduction Structural models display relationships among entities and have a variety of uses, such.
Cooperative experiments in VL-e: from scientific workflows to knowledge sharing Z.Zhao (1) V. Guevara( 1) A. Wibisono(1) A. Belloum(1) M. Bubak(1,2) B.
GeWorkbench John Watkinson Columbia University. geWorkbench The bioinformatics platform of the National Center for the Multi-scale Analysis of Genomic.
Ontologies Working Group Agenda MGED3 1.Goals for working group. 2.Primer on ontologies 3.Working group progress 4.Example sample descriptions from different.
Scientific Workflow systems: Summary and Opportunities for SEEK and e-Science.
AWT Ivo D. Dinov, Ph.D., CCB Chief Operations Officer PI: Arthur W. Toga, Ph.D. Co-PI: Tony F. Chan, Ph.D.
Slicer 3 Ron Kikinis, Steve Pieper. CTK Workshop Heidelberg, June 29/30, 2009 Slicer Goals  Stable, Usable, Cross Platform, End-User Software for Medical.
GeWorkbench Overview Support Team Molecular Analysis Tools Knowledge Center Columbia University and The Broad Institute of MIT and Harvard.
NA-MIC National Alliance for Medical Image Computing Core 1b – Engineering Computational Platform Jim Miller GE Research.
CBioPortal Web resource for exploring, visualizing, and analyzing multidimentional cancer genomics data.
Accessing and visualizing genomics data
High Risk 1. Ensure productive use of GRID computing through participation of biologists to shape the development of the GRID. 2. Develop user-friendly.
NA-MIC National Alliance for Medical Image Computing UCSD / BIRN Coordinating Center NAMIC Group Site PI: Mark H. Ellisman Site Project.
Effect of Alcohol on Brain Development NormalFetal Alcohol Syndrome.
All Hands Meeting 2005 BIRN-CC: Building, Maintaining and Maturing a National Information Infrastructure to Enable and Advance Biomedical Research.
KnowEnG: A SCALABLE KNOWLEDGE ENGINE FOR LARGE SCALE GENOMIC DATA
Pipeline Execution Environment
Using the Drupal Content Management Software (CMS) as a framework for OMICS/Imaging-based collaboration.
Automated Analysis and Code Generation for Domain-Specific Models
Building a “System” Moving from writing a program to building a system. What’s the difference?! Complexity, size, complexity, size complexity Breadth.
Presentation transcript:

MAGNet Center: Andrea Califano NCIBI: Brian Athey Simbios: Russ Altman Creating a DBP Community to Enhance the NCBC Biomedical Impact NCBC Work Group Report, 18 July 2006

Workgroup Goals Problem: The NCBCs have no workgroup to help build the DBP community Goal 1: To determine the Mission and Goals for the Applications of Systems Biology, Modeling and Analysis Working Group Goal 2: To determine how this group would interact with the 2 other NCBC Working Groups to define key sets of:  Data  Tools  Methodologies  Ontologies  DBPs Goal 3: Identify NCBC DBPs that are highly motivated to participate in the Working Group Goal 4: How to link to external communities (e.g., DREAM-like activities)? Additionally: Discuss as a focused example “Molecular Interaction Maps” in the context of the DBPs

DBP Success Depends on the Availability of an Integrated Resourceome (Not a priority for Core I/Core II Projects) Integrated Computational Biology Platform Support for gene expression data, physics- based simulations, image analysis, sequences, pathways, structure, etc. (40+ visualization and analysis modules). Access to local and remote data sources and analytical services. Support for workflow scripting. Integration with grid infrastructures. Development framework Open source development. Modular/extensible architecture, supporting pluggable components with configurable user interface. Easy integration of 3 rd party components. … The integration process must be driven by the DPB requirements rather than by Core I/II activities.

We Must Work With the “Yellow Pages” WG to Assemble an Indexable List of Most Useful Tools and Platforms Many Toolkit and platforms  Internal SimTk Genopia VTK/ITK Brainsuite geWorkbench GenePattern MiMI SAGA miBLAST MarkerInfoFinder  External GenePattern Systems Biology Workbench myGRID Cytoscape and ISB tools How do we make these tools interoperable? This must be DBP-driven because other cores (I/II) do not necessarily depend on tool interoperability.

GenePattern/geWorkbench Interoperability: An Opportunity and a Starting Point KNN WV SVM SOM GSEA ARACNE SPLASH PCA GenePattern Module Repository Wrap geWorkbench modules as GenePattern tasks Execute GenePattern modules from within geWorkbench

Interactions with the Scientific Ontology Working public DSDataSet publish(...) { DSDataSet dataSet; // do some work that assigns a value to dataSet. return dataSet; public void receive(DSDataSet dataSet, Object source) { // Consume the argument dataSet, as appropriate } Provide re-usable models of common bioinformatics concepts:  Data: sequence, expression, genotype, structure, proteomics  Complex data structures: patterns, clusters, HMMs, PSSMs, alignments  Algorithms: Clustering, matching, discovery, normalization, filtering Provide a foundation for the development of interoperable geWorkbench components Endorsed by multiple communities (caBIG, AMDeC, NCBCs) Component A Component B

Identifying Specific Tools There are tools, databases, and methods that have universal value across different DBPs.  What are they?  Which NCBC or external community is producing them  What can we do to standardize their use across the community. An Example:  Molecular Interaction Maps

A Relevant Example That Was Discussed Molecular Interaction Maps are becoming the equivalent of an anatomy atlas to map specific measurements in a functional context; e.g. QTLs, expression profiles, etc. Discussion Goal: To determine how relevant these maps are to the DBPs of the various NCBCs Limitations: Many Interactomes are limited because they are (1) too generic (e.g. missing cellular and molecular context), (2) poorly annotated (e.g. linked only to the specific data used to produce them), (3) limited to pairwise interactions, (4) lacking quality control/validation, and (5) not associated to the investigation of specific biological/biomedical problems.

Example: From Molecular Interaction Maps to Molecular Interaction Knowledge Bases What does it take to turn a ridiculome into a relevantome?  Quality control metrics (recall/precision)  Context specificity Cellular: Is the interaction specific to a cellular phenotype Molecular: Is the interaction dependent on the availability of other molecular species  Links to data (and literature)  Links to analysis of biomedical problems  Focus on specific features (e.g. mechanisms)

A Potential Template for NCBC Knowledge Bases: MAGNet Human B Lymphocytes Dataset Integrative Framework  Bayesian Evidence integration of pairwise interactions Protein-Protein, Protein-DNA Prior Knowledge Incorporation Context Specific  ARACNE, GeneWays, REDUCE B-Cell data or B-cell specific criteria  Linked to one of the largest B-Cell expression profiles microarray dataset, ChIP- Chip assays (MYC/BCL6), miRNA profiles, and Literature Captures Multi-variate dependencies  Three-way interactions via MINDY and MATRIXReduce Post-translational modulation of transcriptional regulation  Combinatorial transcriptional regulation  Signal transduction control of Transcriptional Regulation I.e. the Transferome meets the Transcriptome Links to literature (via GeneWays, NCIBI, I2B2, GATE, etc.) Other examples? Oncomine (NCIBI), GenePattern ALL/AML, Others? Example

Some Key Observations from Attendees: Systems Biology name is too narrow. Think of Alternatives:  “Working group to Biomedical Impacts of Computational Biology at NCBCs” or  NCBC Biomedical Impact Workgroup Is the intramural program a better place to create atlases and knowledge bases, since it’s not RO1 funding? They could implement contract mechanisms with extramural researchers to leverage outside expertise Keep in mind that we need to understand what will you deliver at the end of 4 years, positioning each NCBC for renewal. Which communities are using the tools? Are they better off? Individual centers can work to create a specific resourceome that can be linked and accessible to others Many working group members had a strong interest in “multi- scale” modeling and biological context

Outcomes: Create a DBP community within the NCBCs:  ACTION: Make an interactome map of the existing DBPs with potential synergies to be published in Symbios magazine Use this forum to inform target biological communities (not just NCBCs). E.g. DREAM meeting. Organize a coordinated effort to evaluate the tools and technologies and make them interoperable  ACTION: Coordinate the DBP requirements to drive the integration of specific tools and data resources Integrate data and annotation in knowledge bases and models for related DBPs. Identify other common tools, data, and methods Drop the Systems Biology name: Use something like:  “NCBC Biomedical Impact Workgroup” Commit to a regular T-con and virtual (Wiki) participation Consider a yearly retreat of NCBC DBPs possibliy in collaboration with other NIH roadmap activities (e.g. ICBPs)

NCBC DBP Interactome I: Useful Starting Point Peter Woolf, NCIBI

Distribution Model: How Can the 7 NCBCs Effectively Interoperate? Informatics for Integrating Biology and the Bedside (i2b2) Isaac Kohane, PI Center for Computational Biology (CCB) Arthur Toga, PI Multiscale Analysis of Genomic and Cellular Networks (MAGNet) Andrea Califano, PI National Alliance for Medical Imaging Computing (NA-MIC) Ron Kikinis, PI The National Center For Biomedical Ontology (NCBO) Mark Musen, PI Physics-Based Simulation of Biological Structures (SIMBIOS) Russ Altman, PI National Center for Integrative Biomedical Informatics (NCIBI) Brian D. Athey, PI