Towards a Common Annotation Framework for Knowledge Acquisition College Station, Texas, 2014.

Slides:



Advertisements
Similar presentations
1 Summary Slides from Enhancing Organism Based Disease Knowledge Using Biological Taxonomy, and Environmental Ontologies Ken Baclawski Northeastern University.
Advertisements

1 From Grids to Service-Oriented Knowledge Utilities research challenges Thierry Priol.
DG INFSO- Grid Research & Infrastructures: W. Boch, M. Campolargo 1 Delivery of Industrial-strength Grid Middleware: establishing an effective European.
DELOS Highlights COSTANTINO THANOS ITALIAN NATIONAL RESEARCH COUNCIL.
INFRASTRUCTURE CALLS IN Support to existing research infrastructures Integrating Activities Topics opened in Call FP7-INFRASTRUCTURES
Course: e-Governance Project Lifecycle Day 1
CBio Meeting, March 2-3, 2006 CHISEL Group Dept of Computer Science University of Victoria, Canada Visualization of ontologies and data annotations.
Service Lifecycle Cradle 2 Grave Romy Bolton – University of Iowa Bernard Gulachek – University of Minnesota.
OMG Architecture Ecosystem SIG Federal CIO Council Data Architecture Subcommittee May 2011 Cory Casanave.
Systems Engineering in a System of Systems Context
Systems Biology of Inflammation A systems biology approach is an iterative process that includes: identification of component parts and interactions. integration.
Systems Biology Existing and future genome sequencing projects and the follow-on structural and functional analysis of complete genomes will produce an.
Data Conservancy: A Life Sciences Perspective Sayeed Choudhury Johns Hopkins University
Office of Science Office of Biological and Environmental Research Susan K. Gregurick, Ph.D. Program Manager Computational Biology & Bioinformatics Biological.
Fungal Semantic Web Stephen Scott, Scott Henninger, Leen-Kiat Soh (CSE) Etsuko Moriyama, Ken Nickerson, Audrey Atkin (Biological Sciences) Steve Harris.
AceMedia Personal content management in a mobile environment Jonathan Teh Motorola Labs.
Advancing Government through Collaboration, Education and Action Financial Innovation and Transformation Shared Services Workshop March 17, 2015.
Smart Integrated Infrastructure The Progression of Smart Grid Presentation to National League of Cities Martin G. Travers – President, Telecommunications.
RDA Wheat Data Interoperability Working Group Outcomes RDA Outputs P5 9 th March 2015, San Diego.
RDA Wheat Data Interoperability Working Group Outcomes RDA Outputs P5 9 th March 2015, San Diego.
Advances in Cyberinfrastructure with a Focus on Data: a U.S. National Science Foundation Overview Alliance for Permanent Access to Records of Science in.
ISBE An infrastructure for European (systems) biology Martijn J. Moné Seqahead meeting “ICT needs and challenges for Big Data in the Life Sciences” Pula,
1 Common Challenges Across Scientific Disciplines Laurence Field CERN 18 th November 2013.
- 1 - Roadmap to Re-aligning the Customer Master with Oracle's TCA Northern California OAUG March 7, 2005.
Ensemble Computing in the National Science Digital Library (NSDL)
A complementary view from the DIGOIDUNA study Paolo Bouquet, University of Trento, Italy SMART 2010/0054.
Sharing Research Data Globally Alan Blatecky National Science Foundation Board on Research Data and Information.
Cover of Politico, July 11, 2009 Catalyzing Community Change.
 Copyright 2005 Digital Enterprise Research Institute. All rights reserved. Semantic Web services Interoperability for Geospatial decision.
Headquarters U. S. Air Force I n t e g r i t y - S e r v i c e - E x c e l l e n c e © 2008 The MITRE Corporation. All rights reserved From Throw Away.
Jarek Nabrzyski, Ariel Oleksiak Comparison of Grid Middleware in European Grid Projects Jarek Nabrzyski, Ariel Oleksiak Poznań Supercomputing and Networking.
Stimulating Peripheral Activity to Relieve Conditions (SPARC) RFA-RM Funding Opportunity Announcement Information to Applicants A New Common Fund.
Public Health Tiger Team we will start the meeting 3 min after the hour DRAFT Project Charter April 15, 2014.
Capture the Movement: Banner 7.0 and Beyond Susan LaCour, Senior Vice President, Solutions Development California Community Colleges Banner Group.
Abstract We present two Model Driven Engineering (MDE) tools, namely the Eclipse Modeling Framework (EMF) and Umple. We identify the structure and characteristic.
Integrated Projects & Networks of Excellence Examples in IST.
1 4/23/2007 Introduction to Grid computing Sunil Avutu Graduate Student Dept.of Computer Science.
ASCAC-BERAC Joint Panel on Accelerating Progress Toward GTL Goals Some concerns that were expressed by ASCAC members.
University of Illinois at Urbana-Champaign BeeSpace Navigator v4.0 and Gene Summarizer beespace.uiuc.edu `
1 Direction scientifique Networks of Excellence objectives  Reinforce or strengthen scientific and technological excellence on a given research topic.
Catawba County Board of Commissioners Retreat June 11, 2007 It is a great time to be an innovator 2007 Technology Strategic Plan *
© 2006 STEP Consortium ICT Infrastructure Strand.
NGCWE Expert Group EU-ESA Experts Group's vision Prof. Juan Quemada NGCWE Expert Group IST Call 5 Preparatory Workshop on CWEs 13th.
Enabling the Future Service-Oriented Internet (EFSOI 2008) Supporting end-to-end resource virtualization for Web 2.0 applications using Service Oriented.
PoDAG XXI: SEEDS SEED: NSIDC Potential Interactions NSIDC DAAC should prepare an evaluation of their desired future roles in "core activities" and in mission.
Mercury Program Margin Management Tool (MMT) 10-January-2014.
Scientific Workflow systems: Summary and Opportunities for SEEK and e-Science.
Connect. Communicate. Collaborate Operations of Multi Domain Network Services Marian Garcia Vidondo, DANTE COO TNC 2008, Bruges May.
An approach to carry out research and teaching in Bioinformatics in remote areas Alok Bhattacharya Centre for Computational Biology & Bioinformatics JAWAHARLAL.
11 ITLC – Middleware Report May 27, 2010 The work of a subgroup of ITAG.
Foundational Program Overview September  2004 Copyright RosettaNet. RosettaNet Foundational Programs Program Overview ProgramPhase InvestigateDesignImplement.
Module 5: Future 1 Canadian Bioinformatics Workshops
NCP Info DAY, Brussels, 23 June 2010 NCP Information Day: ICT WP Call 7 - Objective 1.3 Internet-connected Objects Alain Jaume, Deputy Head of Unit.
A Reference Model for RDA & Global Data Science Yin ChenWouter Los Cardiff University University of Amsterdam 1.
Technology-enhanced Learning: EU research and its role in current and future ICT based learning environments Pat Manson Head of Unit Technology Enhanced.
Infrastructure Breakout What capacities should we build now to manage data and migrate it over the future generations of technologies, standards, formats,
High Risk 1. Ensure productive use of GRID computing through participation of biologists to shape the development of the GRID. 2. Develop user-friendly.
Towards an Enterprise Architecture for Wits In the context of the new Student Information System programme Prof Derek W. Keats Deputy Vice Chancellor (Knowledge.
Cyberinfrastructure Overview of Demos Townsville, AU 28 – 31 March 2006 CREON/GLEON.
BG 5+6 How do we get to the Ideal World? Tuesday afternoon What gaps, challenges, obstacles prevent us from attaining the vision now? What new research.
Realising MRC’s Vision in Health and Bioinformatics MRC Open Council Meeting July 2014 Janet Valentine Head of Population Health and Informatics.
1 Acquisition Automation – Challenges and Pitfalls Breakout Session # E11 Name: Jim Hargrove and Allen Edgar Date: Tuesday, July 31, 2012 Time: 2:30 pm-3:45.
1 This Changes Everything: Accelerating Scientific Discovery through High Performance Digital Infrastructure CANARIE’s Research Software.
Toward High Breakthrough Collaboration (HBC) Susan Turnbull Program Manager Advanced Scientific Computing Research March 4, 2009.
NETWORKS OF EXCELLENCE KEY ISSUES David Fuegi
Use of Learning Analytics in Massively Open Online Courses.
EarthCube Sustaining the Geosciences for 21 st Century Challenges Credits: from top to bottom: NOAA Okeanos Explorer Program (CC BY-SA 2.0), NASA/Kathryn.
Towards a unified MOD resource: An Overview
Integrating MBSE into a Multi-Disciplinary Engineering Environment A Software Engineering Perspective Mark Hoffman 20 June 2011 Copyright © 2011 by Lockheed.
Ontology-Based Information Integration Using INDUS System
Presentation transcript:

Towards a Common Annotation Framework for Knowledge Acquisition College Station, Texas, 2014

Goals 1. Capture the biology 2. Do this efficiently 3. Maximise impact 4. Do this in a future-proofy way

1. Capture the biology

2. Maximise efficiency Software engineering ◦ We are resource-limited for developers ◦ Reuse components, share APIs, eliminate overlap Knowledge Acquisition ◦ Resource-limited for curators/editors ◦ Automate where appropriate  Data-driven (see SAB report) ◦ Coordinate teams  Eliminate redundancy SAB report:: - Data driven curation - Making use of hi-throughput data - GBA, proteomics, clustering (Nexo) SAB report:: - Data driven curation - Making use of hi-throughput data - GBA, proteomics, clustering (Nexo)

3. Maximise impact Not just about number of annotations Can we incorporate impact into annotation process? SAB report:: - annotations - enabling users to make discoveries - ease of access to extended annotations SAB report:: - annotations - enabling users to make discoveries - ease of access to extended annotations

4. Future proofing Don’t over-fit requirements to what we do today Conservative predictions ◦ Integration of curation into publications and even experiment portion of data lifecycle ◦ Less resources for retrospective curation ◦ Increased pressure to interoperate across informatics systems ◦ More high-throughput data ◦ Individual gene  network view

How close are we?

Annotation Tool Landscape Previously ◦ Multiple tools with highly redundant functionality Now ◦ Converging towards smaller number of tools each with their own specific niche  Specifically: migration from MOD-centric  protein2go  (see Kimberley’s presentation)  Remaining challenges:  Still redundancy  Indirect interoperation  Stovepipes

Toolscape* *with apologies to gonuts

Toolscape

How do these tools interoperate? File-level export-transport-import Peer to peer Common service layer

Current data architecture is suboptimal

The Vision

Orion March 2014

Progress with respect to grant GO Proposal ◦ Timeline yr2 “prototype 2 nd generation annotation tool”

Idealized plan Split CCC into a UI widget and textpresso services Integrate protein2go and Orion into common framework Merge in other curation efforts ◦ Phenotype ◦ Expression Work with bioinformatics community on data-driven acquisition services

Will we be successful? Strengths ◦ Many pieces are in place ◦ Leverage work done in annotations and ontology Weaknesses ◦ Lack of resources (see next slide) ◦ Disjointed distributed teams, different goals Opportunities ◦ Technology Synergy (EBI-RDF, Monarch) ◦ Data-driven methods, exploit community Threats ◦ Other aspects of GO are neglected ◦ Aiming too high ◦ (conversely) overfitting to today’s requirements ◦ As yet unknown leap-frogger

Addressing the weaknesses Resource-limitation ◦ The time is right to get the funding  US: BD2K (May-July deadlines)  Europe: ? Integrating teams ◦ Rallying around common goal

The fallback position