Joint SCAR/COMNAP Delegates Meeting (SCAR Lecture): A Strategy for Data and Information Management in the 21st Century, 9th July 2010, Kim Finney

Joint SCAR/COMNAP Delegates Meeting (SCAR Lecture)
A Strategy for Data and Information Management in the 21st Century
9th July 2010
Kim Finney (Manager, Australian Antarctic Data Centre & Chief Officer, SCAR Standing Committee on Antarctic Data Management)

In 1962 John F. Kennedy announced that a man would be put on the moon by the end of the decade.
"Well, space is there, and we're going to climb it, and the moon and the planets are there, and new hopes for knowledge and peace are there" (JFK, 1962)
But... if we wanted a challenge... we have one in our own backyard. Fifty years on, we may now know more about the surface of the moon than we do about our Antarctic and Southern Ocean environments.

We have better maps of the moon than of Antarctica.
– LIMA (15 m spatial accuracy) of Antarctica
– Lunar Reconnaissance Orbiter (1.0 m spatial accuracy)

Only a small fraction of the Southern Ocean seafloor topography has been surveyed by ships.
– Satellite altimetry is helping to fill in the broad-scale features >10–15 km in width (Sandwell and Smith)

We can land a rover on Mars, but:
Transport in Antarctica is still difficult, subject to severe restrictions and limitations resulting from weather and terrain extremes.
We still struggle to develop underwater technology for sampling biodiversity.
– The most promising AUVs still have power, sea-state, instrument, speed and navigation limitations. Sensors are mainly for physical parameter detection.
[Images: Rover, AUV]

Why are we not further advanced in our understanding of, and access to, the Antarctic environment?
– Do we lack a grand collective vision?
– How well are we collaborating (scientifically and logistically)?
– Do we still like to treat Antarctica as a heroic frontier for testing the resilience of man?

What has this to do with data management?
– It's no longer about heroic polar men blazing trails into the unknown, collecting small amounts of data.
– It's about launching autonomous mobile and fixed sensors to all points of interest, sharing the vast volumes of data generated, and piecing these data together at local, regional, continental and global scales.
– It's the era of networked data and visionary collaboration. Heroes in this age will be those who have the skills, vision and technological innovation to build and exploit these data networks.

21st Century Data Management
In the next 10 years most scientists working on Antarctic data will never travel to the ice.
– Advantageous perhaps for those countries without physical ice-based research facilities. Could they contribute instead to data network building?
[Image: Marine Sensor Network (courtesy of IMOS)]

Data managers won't operate as an adjunct to science. The new polar scientist will, by default, be data-management literate and proficient with data networks.

Data production in many disciplines is doubling annually (UK e-infrastructure Steering Group, 2006).
– Data stores need to be optimised for the disciplines they support and the access paradigms expected by those communities.
Copied from Kirk Borne (2008):
– Computing power doubles every 18 months (100X in 10 yrs)
– I/O bandwidth grows 10% p/a (3X in 10 yrs)
– Data doubles every year (1000X in 10 yrs)
NSCA example:
– First 19 yrs generated = 1 PB
– Year 20 (2007) = 2 PB
– Year 21 (2008) = 4 PB
– Year 2025 = ? PB
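The growth figures on this slide can be checked with a few lines of arithmetic. The sketch below is illustrative only: the function names are made up for this example, and the 2025 figure is a naive extrapolation that assumes the yearly doubling in the slide's example simply continues, which the slide itself leaves as an open question.

```python
def doubling_factor(doubling_time_years: float, years: float) -> float:
    """Growth factor after `years` for a quantity that doubles every `doubling_time_years`."""
    return 2.0 ** (years / doubling_time_years)


def compound_factor(rate_per_year: float, years: float) -> float:
    """Growth factor after `years` at a fixed fractional annual growth rate."""
    return (1.0 + rate_per_year) ** years


def yearly_volume_pb(year: int, base_year: int = 2007, base_pb: float = 2.0) -> float:
    """Petabytes generated in `year`, ASSUMING output doubles every year.

    The defaults come from the slide's example (2 PB in 2007); the
    doubling assumption beyond 2008 is ours, not the slide's.
    """
    return base_pb * 2.0 ** (year - base_year)


if __name__ == "__main__":
    print(f"Computing power over 10 yrs: {doubling_factor(1.5, 10):.0f}X")
    print(f"I/O bandwidth over 10 yrs:   {compound_factor(0.10, 10):.1f}X")
    print(f"Data volume over 10 yrs:     {doubling_factor(1.0, 10):.0f}X")
    print(f"Naive 2025 extrapolation:    {yearly_volume_pb(2025):,.0f} PB")
```

Under these assumptions the quoted multipliers are roundings: 10% a year compounds to roughly 2.6X over a decade, and annual doubling gives 1024X. The point of the slide survives the rounding: data volume outpaces both computing power and I/O bandwidth by orders of magnitude.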

Scientific communities will become dependent on very large, openly accessible databases.
– This necessitates stable financial support for repositories.

Datasets are becoming very complex: multi- or hyper-dimensional.
– They will require dimensionality reduction via machine discovery of patterns, substructures and correlations in the data (Djorgovski, 2009).
– This requires even more emphasis on skills in data visualisation, algorithm development, data access, data description, stable repositories and distributed computing.

How Prepared Are We?
SCAR/COMNAP Report Card:
1. Collaborative logistical infrastructure development and utilisation
2. Pan-Antarctic observation network
3. National investment in polar data management repositories
4. Data sharing and access
5. Investment in building professional skills in data analysis and/or data management
(Snowflake scores are out of 10.)

Investment In Repositories
Source: DIMS (2009)
Two or fewer staff in all but the UK and Australian centres.
Approximately 33 nations participate in SCAR.
Belgium: the SCAR MarBIN Data Centre is only on temporary funding.

What Is Required?
SCAR has already invested in developing a Data and Information Management Strategy (DIMS).
– Individual academic institutions are not best placed to manage long-term repositories (or to develop sustainable national infrastructure).
– National Antarctic Programs (as represented through COMNAP) are better positioned.
– Both SCAR and COMNAP have much to gain by pursuing DIMS in unison.
– SCAR has the vision BUT COMNAP has the capacity and capability.

[Diagram labels: Antarctic Master Directory (NASA); Human Readable Metadata; registering metadata; National Data Centres]
A human user can search a metadata catalogue, but data might not be linked to these descriptive records (only 53% of records have data).
The DIMS Implementation Plan is designed to move us from a metadata-centric infrastructure to...

...an infrastructure that delivers data/information through specialised, networked, national or institutional data portals.
Scientists are able to use a data discovery portal from one country that can also access data from another country's data store.
[Diagram labels: Antarctic Master Directory (with a registry interface) (NASA); Service Registry; Data Portal; Portals; data store; data store; Harvests from; Standard machine-to-machine interfaces; Standard Interfaces; Protocols]
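The networked architecture described on this slide can be sketched in miniature. What follows is a hypothetical illustration, not the DIMS design or the Antarctic Master Directory API: every class and function name (`Record`, `DataStore`, `InMemoryStore`, `ServiceRegistry`, `Portal`, `linked_fraction`) is invented here, and in-memory lists stand in for real national data centres and web services.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Protocol


@dataclass(frozen=True)
class Record:
    """A dataset description, plus whether actual data is linked to it."""
    title: str
    country: str
    has_data: bool


class DataStore(Protocol):
    """The shared machine-to-machine interface each store is assumed to expose."""

    def search(self, keyword: str) -> List[Record]: ...


@dataclass
class InMemoryStore:
    """Stand-in for a national data centre (a real one would be a web service)."""
    records: List[Record]

    def search(self, keyword: str) -> List[Record]:
        needle = keyword.lower()
        return [r for r in self.records if needle in r.title.lower()]


@dataclass
class ServiceRegistry:
    """Lists the stores that exist, so portals need no hard-coded endpoints."""
    stores: Dict[str, DataStore] = field(default_factory=dict)

    def register(self, name: str, store: DataStore) -> None:
        self.stores[name] = store


@dataclass
class Portal:
    """A discovery portal that fans one query out to every registered store."""
    registry: ServiceRegistry

    def discover(self, keyword: str) -> List[Record]:
        hits: List[Record] = []
        for store in self.registry.stores.values():
            hits.extend(store.search(keyword))
        return hits


def linked_fraction(records: List[Record]) -> float:
    """Share of records that have data attached (0.0 for an empty list)."""
    if not records:
        return 0.0
    return sum(r.has_data for r in records) / len(records)


if __name__ == "__main__":
    # Demo data is invented purely for illustration.
    registry = ServiceRegistry()
    registry.register("store_a", InMemoryStore([Record("Sea ice thickness", "AU", True)]))
    registry.register("store_b", InMemoryStore([Record("Sea ice extent", "NL", False)]))
    hits = Portal(registry).discover("sea ice")
    print([r.title for r in hits], linked_fraction(hits))
```

The design point is that the portal depends only on the registry and the common `search` interface, so a portal built in one country can reach another country's store without knowing anything about how that store is implemented.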

"What if I don't have anywhere to put my data, but I'm happy to share it?"
"I wonder what data is out there for the polar regions?"
[Diagram labels: Tom; Jerry; data bucket; Internet Cloud; Submit data; Metadata Catalogue & Registry; Register in; Discover/retrieve polar data; Search; Publish to]
...AND provides virtual physical storage for orphan data.
...AND allows us to search for data using public search engines.

Possible approach:
– COMNAP's Data Management Expert Group (DMEG) reviews the SCAR DIMS and Implementation Plan for its fit with COMNAP business objectives.
– National members collaborate to resource project(s) in the Plan, focussed on delivering outputs/outcomes for a specific program of science that is supported by both SCAR and COMNAP members (this constrains infrastructure development to meeting immediate user needs).
– Projects are run as internationally managed collaborations, signed off by MOU and subject to project management.
– COMNAP and SCAR jointly review the function, structure and role of SCADM, SCAGI and DMEG with a view to streamlining approaches to Antarctic data infrastructure development.

Conclusion
The data deluge is already here! Our ability to manage and harness this deluge will be a key determinant of the quantity of high-quality science we can produce in the 21st century.
Effectively sharing logistical/research management information underpins how we can collectively get more value out of existing and future investments made in deploying to Antarctica.
Data infrastructures have to be planned, designed, funded and managed. They are expensive, but the pain can be shared!
Let's make sure we know more about Antarctica than the moon before we are due to land there again (2020?).