Implementing a National Data Infrastructure: Opportunities for the BIO Community Peter McCartney Program Director Division of Biological Infrastructure.

Slides:



Advertisements
Similar presentations
21 st Century Science and Education for Global Economic Competition William Y.B. Chang Director, NSF Beijing Office NATIONAL SCIENCE FOUNDATION.
Advertisements

The Data Conservancy: A Digital Research and Curation Virtual Organization D4Science World User Meeting November 25, 2009.
Data Conservancy and the US NSF DataNet Initiative 2010 JISC/CNI Conference July 1, 2010 Sayeed Choudhury Johns Hopkins University.
Maines Sustainability Solutions Initiative (SSI) Focuses on research of the coupled dynamics of social- ecological systems (SES) and the translation of.
Xsede eXtreme Science and Engineering Discovery Environment Ron Perrott University of Oxford 1.
1 US activities and strategy :NSF Ron Perrott. 2 TeraGrid An instrument that delivers high-end IT resources/services –a computational facility – over.
Joint CASC/CCI Workshop Report Strategic and Tactical Recommendations EDUCAUSE Campus Cyberinfrastructure Working Group Coalition for Academic Scientific.
Presentation at WebEx Meeting June 15,  Context  Challenge  Anticipated Outcomes  Framework  Timeline & Guidance  Comment and Questions.
Spatial Decisions INNOVATING GEOSPATIAL INTELLIGENCE PLENARY II 17 September GIS Asia. Hanoi Kapil Chaudhery Spatial Decisions.
Funding Opportunities at NSF Jane Silverthorne International Arabidopsis Consortium Workshop January 15, 2011.
1 Cyberinfrastructure Framework for 21st Century Science & Engineering (CIF21) NSF-wide Cyberinfrastructure Vision People, Sustainability, Innovation,
1 Cyberinfrastructure Framework for 21st Century Science & Engineering (CF21) IRNC Kick-Off Workshop July 13,
Structural Genomics – an example of transdisciplinary research at Stanford Goal of structural and functional genomics is to determine and analyze all possible.
NSF and Environmental Cyberinfrastructure Margaret Leinen Environmental Cyberinfrastructure Workshop, NCAR 2002.
Office of Science Office of Biological and Environmental Research J Michael Kuperberg, Ph.D. Dan Stover, Ph.D. Terrestrial Ecosystem Science AmeriFlux.
NSF on the web- An indispensable resource
Roles and Goals Greg Riccardi. iDigBio People University of Florida o Larry Page, Jose Fortes, Pamela Soltis, Bruce McFadden, Renato Figueiredo, Reed.
1 Building National Cyberinfrastructure Alan Blatecky Office of Cyberinfrastructure EPSCoR Meeting May 21,
Drivers for a PRAGMA Biodiversity Science Expedition Reed Beaman Florida Museum of Natural History University of Florida.
Transforming Data-Driven Publications and Decision Support Joan L. Aron, Ph.D. Consultant Federal Big Data Working Group COM.BigData 2014.
Harnessing the Power of Environmental Data for Decision-Making IABIN Phase II.
The BIO Directorate Microbial Biology Emphasis BIO Advisory Committee April, 2005.
Advances in Cyberinfrastructure with a Focus on Data: a U.S. National Science Foundation Overview Alliance for Permanent Access to Records of Science in.
Bill Newhouse Program Lead National Initiative for Cybersecurity Education Cybersecurity R&D Coordination National Institute of Standards and Technology.
Unidata Policy Committee Meeting Bernard M. Grant, Assistant Program Coordinator for the Atmospheric and Geospace Sciences Division May 2012 NSF.
The FY 2009 Budget Thomas N. Cooley, NSF Council of Colleges of Arts and Sciences March 13, 2008.
GTL Facilities Computing Infrastructure for 21 st Century Systems Biology Ed Uberbacher ORNL & Mike Colvin LLNL.
 Unsolicited proposals  1x year:  preproposals (4 pages);  invited full (~30 pages)  Some special competitions  E.g., sustainability, climate investment.
Microbial Biology at the National Science Foundation Dr. Lita M. Proctor Division of Biological Infrastructure Biosciences Directorate National Science.
Imagine a World…. With easy, unlimited access to scientific data from any field Where you can easily plot data of interest and display it any way you want.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Collaborating with iPlant.
IPlant cyberifrastructure to support ecological modeling Presented at the Species Distribution Modeling Group at the American Museum of Natural History.
The Environmental Genomics Thematic Programme Data Centre Dawn Field, Director.
National Science Foundation Experimental Program to Stimulate Competitive Research (NSF EPSCoR) May 24, 2012 National Academies 1.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Collaborating with iPlant.
The Future of the iPlant Cyberinfrastructure: Coming Attractions.
NSF/BIO National Synthesis Centers Judith A. Verbeke, Ph.D. Division Director (Acting) Division of Biological Infrastructure.
Judith E. Skog Biological Sciences Directorate Emerging Frontiers Division H. Richard Lane Geological Sciences Directorate Earth Systems Science.
Organizational Structure Coordination and Leadership Group (CLG) AD Council BIOCISEEHRENGGEOMPSSBE OIIA Charge: Coordinating NSF’s cyberinfrastructure.
E-Science and Technology Infrastructure for Biodiversity and Ecosystem Research.
The iPlant Collaborative Using iPlant for sharing, managing, and analyzing ecological data Ramona Walls Presented at ESA 2014 – Ignite session August 12,
Soil and Water Conservation Modeling: MODELING SUMMIT SUMMARY COMMENTS Dennis Ojima Natural Resource Ecology Laboratory COLORADO STATE UNIVERSITY 31 MARCH.
08/05/06 Slide # -1 CCI Workshop Snowmass, CO CCI Roadmap Discussion Jim Bottum and Patrick Dreher Building the Campus Cyberinfrastructure Roadmap Campus.
Overview of NSF and the Directorate for Biological Sciences (BIO) Overview of NSF and the Directorate for Biological Sciences (BIO) Tom Brady Division.
TCUP Leadership Forum January 3, 2014 Sylvia M. James, Ed.D. Division Director, Human Resource Development (HRD)
The Long Tail of Sample-based Data in the Next Decade FROM DARKNESS TO LIGHT Kerstin Lehnert
Breakout # 1 – Data Collecting and Making It Available Data definition “ Any information that [environmental] researchers need to accomplish their tasks”
GEOSCIENCE NEEDS & CHALLENGES Dogan Seber San Diego Supercomputer Center University of California, San Diego, USA.
Context: The Strategic Plan for Establishing the Network Integrated Biocollections Alliance Judith E. Skog, Office of the Assistant Director, Biological.
Chaitan Baru Senior Advisor for Data Science CISE Directorate National Science Foundation NIEHS Webinar October 27, 2015 Image Credit: Exploratorium. Integrating.
Cyberinfrastructure: Many Things to Many People Russ Hobby Program Manager Internet2.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop BISQUE.
1 e-Arts and Humanities Scoping an e-Science Agenda Sheila Anderson Arts and Humanities Data Service Arts and Humanities e-Science Support Centre King’s.
1 Why is Digital Curation Important for Workforce and Economic Development? Alan Blatecky Office of Cyberinfrastructure Symposium on Digital Curation in.
Forging the eXtremeDigital (XD) Program Barry I. Schneider Program Director, Office of CyberInfrastructure January 20, 2011.
Cultural Heritage in Tomorrow ’s Knowledge Society Cultural Heritage in Tomorrow ’s Knowledge Society Claude Poliart Project Officer Cultural Heritage.
High throughput biology data management and data intensive computing drivers George Michaels.
Big Data in Indian Agriculture D. Rama Rao Director, NAARM.
Transforming Science Through Data-driven Discovery Workshop Overview Ohio State University MCIC Jason Williams – Lead, CyVerse – Education, Outreach, Training.
Navigating NSF Programs Esin Gulari Dean, College of Engineering & Science Clemson University.
The National Science Foundation Independent Federal Agency Support for all fields of fundamental science and engineering.
South Big Data Innovation Hub
Greater Peterborough Region DNA Cluster
Joslynn Lee – Data Science Educator
GO-FAANG Workshop 7-8 October 2015
Natural History Collections (NHC) Biodiversity Data Informatics 101
Cyberinfrastructure for the Life Sciences
A Funders Perspective Maria Uhle Co-Chair, Belmont Forum Directorates for Geosciences, US National Science Foundation.
Grand Challenges in e-Science
BCoN Data Integration Workshop, University of Kansas, Feb 13-14, 2018
Presentation transcript:

Implementing a National Data Infrastructure: Opportunities for the BIO Community Peter McCartney Program Director Division of Biological Infrastructure CASC

National Data Infrastructure Acquisition & Generation Storage & Curation Analysis, Modeling & Visualization Data Policy Education & Workforce Foundational Research in Cyber-technologies Collaboration, Partnerships & Grand Challenges

NITRD Big Data R&D Strategies Strategy I: Create next generation capabilities by leveraging emerging Big Data foundations, technologies, processes, and policies (Foundational Research) Strategy II: In addition to the generation of knowledge from data, also emphasize using trustworthy data and resulting knowledge to make decisions and take confident action (Grand Challenges) Strategy III: Ensure the long term sustainability, access, and development of high value data sets and data resources (NDI) Strategy IV: Improve the national landscape for Big Data education and training to fulfill increasing demand for both deep analytical talent and analytical capacity for the broader workforce (Ed& Workforce) Strategy V?: (Data Policy)

Biology as an Information Science Life exists because of the ability to encode, exchange, and interpret information. Bioinformatics programs in BIO support: Development of methods to represent and manipulate biological information, rules, and processes in digital form Development of tools and resources to support biolological research using computational methods.

Populations & Community Ecology Ecosystem Science Evolutionary Processes Molecular Biophysics Research Resources Genetic Mechanisms Systematic Biology & Biodiversity Neural Systems Cellular Dynamics and Function Synthetic and Systems Biology Plant Genome Research Program Developmental Systems Physiological and Structural Systems

BIO Grand Challenges Understanding the Brain Understanding Biological Diversity Interactions of the Earth, Climate, and Biosphere Phenomics: Genotype to Phenotype. Synthetic Biology

innovative sustaining general BIO-specific large small life cycle scale scope CI for Life Sciences Portfolio Balance

Implementing a National Data Infrastructure: Acquisition and Generation Instrumentation Observing & experimental infrastructure (NEON), New molecular technologies(Cryo EM) Digitization Imaging technologies & feature extraction (Bisque, ADBC) Data Mining Annotation, Knowledgebases (Phenoscape) Computational approaches Protein structure prediction (Bio XFEL). Crowd sourcing Citizen science networks (eBird)

Implementing a National Data Infrastructure: Curation & Storage Curation (Science communities)  Standards (metadata, formats, APIs, QAQC, etc)  Portals (DataOne, Arabidopsis Information Portal, Biodiversity portals)  Data repositories (PDB, TAIR, Gramene, REDfly Storage Infrastructure (Shared infrastructure)  Tools (data management technologies, cyber security, identity management, DOI’s, etc)  Storage capacity (xSede partners, campuses, clouds)

Implementing a National Data Infrastructure: Modeling and Analysis Modeling and Analytic environments Tools organized around bio research communities (bioKepler, Galaxy, Predictive Ecosystem Analyzer) Computational gateways Connecting users to shared infrastructure (iPlant, CIPRES, Neuro Science Gateway)

Advances in Biological Informatics Innovation Awards – smaller, shorter projects, emphasis on innovative, high risk research to develop new approaches. Development Awards – larger efforts focused on delivery of a database, software tool or informatics resource. Sustaining Awards – limited funds for operations and maintenance of critical infrastructure

Mapping ABI Tracks across NSF BIO – PDB, NEON, iDigBio, iPlant, GoLife, PGRP, Centers MPS – Math BIO. CDS&E ENG – Bioengineering, Synthetic Bio CISE – IIA, BigData, GEO - Earthcube, GeoInformatics, BCO DMO Crosscutting – SI2, DIBBS, BioMAPS, CDS&E International - BBSRC