
Aug. 20, JPL, SoCalBSI '09

The power of bioinformatics tools in cancer research
Early Detection Research Network, JPL
Mentors: Dr. Chris Mattmann, Andrew Hart
Andrew Clark
Southern California Bioinformatics Summer Institute, 2009

Agenda
- Introduction
  - Biomarkers and cancer research
  - Early Detection Research Network (EDRN)
  - The NCI & JPL
  - EDRN Infrastructure
- Project objective
  - eCAS Curator additions
- The eCAS
  - Catalog and Archive Service
  - Data curation
- Architectural & design considerations
  - Software engineering
  - Meta-data processing
- Results & conclusions
- Acknowledgements

Introduction
- Biomarkers and cancer research
  - Constant research is underway to discover and identify reliable biomarkers of cancer in the human body.
- What is a biomarker?
  - “A biological molecule found in blood, other body fluids, or tissues that is a sign of a normal or abnormal process, or of a condition or disease.”
  - source:

Biomarker research
- The more information that is collected and shared between research sites and medical laboratories:
  - The more effective diagnosis will become.
  - The more specialized treatments can be devised to minimize the devastating effects of cancer on its host.

The Early Detection Research Network
- The NCI is concerned with managing biomarker research data and disseminating information to the public.
- The NCI formed the EDRN in 1999 “to provide up-to-date information on biomarker research” to the scientific and medical communities and to the general public.
  - source:

The Jet Propulsion Laboratory
- FFRDC (federally funded research and development center), operated by Caltech for NASA
- JPL’s technology for cataloging and managing extremely large sets of data provided the underlying infrastructure needed by the EDRN to accomplish its own mission.

The EDRN Infrastructure
- My mentors, Dr. Chris Mattmann and Andrew Hart, and their team continue to develop the underlying software grid.
- JPL software engineers work with bioinformatics experts to develop the public interface to the EDRN, a web-based portal available to the general public:

Project objective
- Overall:
  - To participate as a bioinformatics software engineer at JPL.
  - To contribute to the EDRN software infrastructure.
- Specifically:
  - Improve the functionality of the eCAS Curator.

EDRN Catalog and Archive Service
- JPL software customized for cataloging and archiving biomarker data, including specimen details, specimen images and related information.
[Figure: eCAS workflow diagram. Labels: A, B, C; Research data; 1. Data Ingestion; EDRN Staging Server; Pre-release data; xml Dataset meta-data; Curator; 2. Curation (meta-data edits, pub. survey & cross reference, expert review); 3. Product Release; Released data; EDRN Public Portal; WWW]

eCAS data curation
- Data ingested from research sites undergoes a curation phase before its publication to the public portal.
[Figure: eCAS workflow diagram, repeated from the previous slide]

eCAS Curator
- The curation activities would benefit from additional software tools as part of the overall eCAS workflow.
[Figure: eCAS workflow diagram, repeated from the previous slide]

Architectural & design considerations
- Software engineering:
  - EDRN tools are primarily web applications
  - Design and integrate modular components
- Meta-data management:
  - Meta-data: information that describes the content of other information.
  - Meta-data management is crucial to the data curation and the operation of the EDRN system.

Data curation with eCAS
[Figure: eCAS workflow diagram, callout 1 (Data Ingestion)]
- Internal EDRN policy files contain meta-data definitions and configuration details that describe the dataset expected from each research site.
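The idea of a policy file that describes what each site's dataset should contain can be illustrated with a minimal sketch. The transcript does not show the actual EDRN policy format, so the XML layout, the element and attribute names, and the helper functions below are all hypothetical and chosen only for illustration.

```python
import xml.etree.ElementTree as ET

# Hypothetical policy document. The real EDRN policy file layout is not
# shown in the transcript; these names are assumptions for illustration.
POLICY_XML = """
<policy site="A">
  <element name="ProtocolName" required="true"/>
  <element name="SpecimenType" required="true"/>
  <element name="Notes" required="false"/>
</policy>
""".strip()


def required_fields(policy_xml):
    """Return the names of the meta-data elements the policy marks as required."""
    root = ET.fromstring(policy_xml)
    return [
        element.get("name")
        for element in root.findall("element")
        if element.get("required") == "true"
    ]


def missing_fields(policy_xml, metadata):
    """Return required field names that are absent or empty in a dataset's meta-data."""
    return [name for name in required_fields(policy_xml) if not metadata.get(name)]
```

A check like `missing_fields(POLICY_XML, {"ProtocolName": "Lung study"})` would report that `SpecimenType` still needs to be supplied before the record is complete.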

Data curation with eCAS
[Figure: eCAS workflow diagram, callout 2 (Curation)]
- Curators edit and revise dataset meta-data to make the final product records complete and accurate.

Data curation with eCAS
[Figure: eCAS workflow diagram, callout 3 (Product Release)]
- Accepted data is made available through the web portal.
- Meta-data definitions provide searchable fields and descriptions of dataset contents to portal users.
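The three numbered stages in the diagram (ingestion, curation, release) can be sketched as a small state model. This is not the eCAS implementation; the `Stage`, `Dataset`, `curate`, `release`, and `search` names are hypothetical, and the sketch only assumes that a dataset must pass curation before release and that the portal searches released data only.

```python
from dataclasses import dataclass, field
from enum import Enum


class Stage(Enum):
    INGESTED = 1   # 1. Data Ingestion: pre-release data on the staging server
    CURATED = 2    # 2. Curation: meta-data edited and reviewed
    RELEASED = 3   # 3. Product Release: visible on the public portal


@dataclass
class Dataset:
    name: str
    metadata: dict = field(default_factory=dict)
    stage: Stage = Stage.INGESTED


def curate(dataset, edits):
    """Apply a curator's meta-data edits and mark the dataset as curated."""
    if dataset.stage is Stage.RELEASED:
        raise ValueError("released datasets are not edited in this sketch")
    dataset.metadata.update(edits)
    dataset.stage = Stage.CURATED


def release(dataset):
    """Release a dataset, but only after it has been through curation."""
    if dataset.stage is not Stage.CURATED:
        raise ValueError("dataset must be curated before release")
    dataset.stage = Stage.RELEASED


def search(datasets, field_name, value):
    """Portal-style search: match a meta-data field, released datasets only."""
    return [
        d.name
        for d in datasets
        if d.stage is Stage.RELEASED and d.metadata.get(field_name) == value
    ]
```

In this model a freshly ingested dataset never appears in `search` results; it shows up only after `curate` and then `release` have been called on it.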

A dataset policy file

Dataset meta-data configuration

Curator tool
- Browser-based meta-data editor.

Curator tool
- Selecting datasets for meta-data editing
- Meta-data items retrieved from backend.

Results and conclusions
- Final result
  - Meta-data management tool integrated with the eCAS and curation functionality incorporated into the workflow.

Conclusion
- The goal of software engineering in bioinformatics should be to:
  - support scientists’ activities
  - facilitate better research and collaboration
  - simplify/bring clarity to complex tasks

Conclusion
The combined effectiveness of software tools and expert curation makes the EDRN a more powerful scientific resource that helps drive progress in biomarker research.

Acknowledgements
- Thanks to my mentors and supporters at JPL:
  - Chris Mattmann, Andrew Hart
- Thanks to the SoCalBSI faculty and staff:
  - Dr. Momand, Drs. Johnston, Dr. Sharp, Dr. Warter-Perez, Ronnie Cheng
- Thanks to the SoCalBSI funding sources:
  - The National Science Foundation
  - The National Institutes of Health
  - Economic and Workforce Development