Dimitris Koureas, PhD Natural History Museum London Linking layers of biodiversity data: Informatics challenges for the long tail research RDA - Long Tail.

Slides:



Advertisements
Similar presentations
Virtual Biodiversity ViBRANT 8th e-Infrastructure Concertation Meeting CERN, Geneva, November 4-5th 2010 Vincent S. Smith Natural History Museum, UK
Advertisements

Maines Sustainability Solutions Initiative (SSI) Focuses on research of the coupled dynamics of social- ecological systems (SES) and the translation of.
Don’t make me think Biodiversity data publishing made easy Vince Smith, Alice Heaton, Laurence Livermore, Simon Rycroft, Ben Scott & Lyubomir Penev* The.
Publish or perish? Linking Scratchpads and the new Biodiversity Data Journal for streamlining publication of botanical data D.N Koureas 1, L. Penev 2 &
Facilitating biodiversity science through
PlantCollections A Community Solution An Institute of Museum and Library Services National Leadership Grant Building Digital Resources.
Vision and Ambition for LifeWatch ICT Infrastructure Axel Poigné (Fraunhofer IAIS) Vera Hernández-Ernst (Fraunhofer IAIS) Alex Hardisty (Cardiff University)
Scratchpads Publishing biodiversity: The interplay between Scratchpads and the Biodiversity Data Journal Dr Dimitrios Koureas Biodiversity Informatics.
EU BON citizen science gateway Veljo Runnel University of Tartu Natural History Museum.
Dimitris Koureas, Vince Smith & Simon Rycroft Natural History Museum London Linking data, services and communities using Virtual Research Environments.
Data Publishing Workflows: Strategies and Standards
Fourth Annual Summit | Feb | Tucson, AZ Scratchpads for community involvement for natural history collections Dr Dimitris Koureas Biodiversity.
Harnessing the Power of Environmental Data for Decision-Making IABIN Phase II.
Scratchpads Publication Module - A paradigm shift in publishing RBG Kew, Seminar,
1 European policies for e- Infrastructures Belarus-Poland NREN cross-border link inauguration event Minsk, 9 November 2010 Jean-Luc Dorel European Commission.
11 th GBIF Global NODES Meeting Incentivising and Strategising Publishing of Biodiversity Data Vishwas Chavan Senior Programme Officer for Digitisation.
Virtual Biodiversity ViBRANT Vince Smith & Dave Roberts Natural History Museum, London ViBRANT Virtual Biodiversity.
General strategy. Introduction Global “financial crisis” Beginning to cascade into GBIF Now thinking about the forward strategy and next work programme.
Online tools and standards for Biodiversity data in the Semantic Web Dr Dimitris Koureas Biodiversity Informatics Group | Department of Life Sciences The.
@dimitriskoureas making small data… big. Publications based on countless specimens, images, maps, keys and datasets Typically generated by small communities.
Session Chair: Peter Doorn Director, Data Archiving and Networked Services (DANS), The Netherlands.
1 Collaborations and Partnerships John Broome CODATA-International.
NHM Digital Collection Programme Ian Owens, Natural History Museum, London Digital Specimen 2014, Berlin, September 2014.
IPlant cyberifrastructure to support ecological modeling Presented at the Species Distribution Modeling Group at the American Museum of Natural History.
ViBRANT Virtual Biodiversity Research Project overview Isabella Van de Velde Royal Belgian Institute of Natural Sciences, Brussels.
Enhancing formal and professional training capacity in Biodiversity Informatics: Collaboration and funding opportunities Dimitris Koureas Natural History.
Scratchpads The virtual research environment for biodiversity data Simon Rycroft, Dave Roberts, Vince Smith, Alice Heaton, Katherine Bouton, Laurence Livermore,
BY INNOCENT AKAMPURIRA, UgaBIF NODE MANAGER, UNCST 2011 TDWG CONFERENCE NEW ORLEANS, USA 16 TH – 21 ST OCTOBER
Every datum counts! Capitalising on small contributions to the big dreams of mobilising biodiversity information Vishwas Chavan, Eamonn O’ Tuama, Samy.
Encyclopedia of Life Established May 2007 First version of portal went online Feb year goals –Assemble infinitely expandable web pages for all.
TWC Deep Earth Computer: A Platform for Linked Science of the Deep Carbon Observatory Community Xiaogang (Marshall) Ma, Yu Chen, Han Wang, Patrick West,
An Introduction to Scratchpads: Making your data work for you Laurence Livermore Natural History Museum, London Joinville, Brazil.
E-Science and Technology Infrastructure for Biodiversity and Ecosystem Research.
LifeWatch E-Science and Observatory Infrastructure for Biodiversity & Ecosystem Science Olaf Bánki.
Soil and Water Conservation Modeling: MODELING SUMMIT SUMMARY COMMENTS Dennis Ojima Natural Resource Ecology Laboratory COLORADO STATE UNIVERSITY 31 MARCH.
DataONE: Preserving Data and Enabling Data-Intensive Biological and Environmental Research Bob Cook Environmental Sciences Division Oak Ridge National.
Biodiversity Read the lesson title aloud to students.
Symposium on Global Scientific Data Infrastructures Panel Two: Stakeholder Communities in the DWF Ann Wolpert, Massachusetts Institute of Technology Board.
Don’t make me think Biodiversity Data Publishing Made Easy Laurence Livermore, Vince Smith, Alice Heaton, Simon Rycroft, Ed Baker, Ben Scott & Lyubomir.
Scratchpads and the new Biodiversity Data Journal Biodiversity Data Publishing made… easier Dimitris Koureas Natural History Museum London.
Dr Dimitris Koureas Lead of Research Data & Partnerships Natural History Museum London Executive Secretary, Biodiversity Information Standards (TDWG) Co-chair,
Virtual Biodiversity ViBRANT Vocabularies, Standards, merging and linking Data Olaf Banki University of Amsterdam ViBRANT Virtual Biodiversity.
Building Scientific Workflows for the Fisheries and Aquaculture Management Community based on Virtual Research Environments Pedro Andrade (CERN)
GBIFS Seminar with the Science Committee and the Nodes Strategy Group Analysis of the content published by the GBIF network – Better understanding what’s.
GLOBAL BIODIVERSITY INFORMATION FACILITY Vishwas Chavan Senior Programme Officer for DIGIT 10 th Meeting of the GBIF Participant Node Managers Committee.
Scratchpads An online platform for biodiversity data Laurence Livermore Biodiversity Informatics | Department of Life Sciences Natural History Museum London.
Scratchpads Virtual Research Environments for taxonomic and biodiversity related data.
Introductory remarks Wouter Los LifeWatch Infrastructure for Biodiversity and Ecosystem Research.
The Global Scene Wouter Los University of Amsterdam The Netherlands.
Nordic Cooperation on Biodiversity Informatics Hannu Saarenmaa NordBIN meeting Uppsala /03.
Open Science (publishing) as-a-Service Paolo Manghi (OpenAIRE infrastructure) Institute of Information Science and Technologies Italian Research Council.
An Open Data Platform in the framework of the EGI-LifeWatch Competence Centre Fernando Aguilar Jesús Marco
Fedora Commons Overview and Background Sandy Payette, Executive Director UK Fedora Training London January 22-23, 2009.
Henning Scholz Museum für Naturkunde Berlin. Mission & Vision Mission: To understand the Earth and its life, in dialog with its people. Vision: As a research.
Coordination and Policy Development in Preparation for a European Open Biodiversity Knowledge Management System Supported by the European Commission through.
Norman Morrison Senior Research Fellow, The University of Manchester Biodiversity Virtual e-Laboratory An e-Infrastructure and e-Science environment supporting.
Coordination and Policy Development in Preparation for a European Open Biodiversity Knowledge Management System Supported by the European Commission through.
Kathleen Shearer Data management: The new frontier for libraries.
EGI-InSPIRE RI EGI Compute and Data Services for Open Access in H2020 Tiziana Ferrari Technical Director, EGI.eu
EGI-InSPIRE EGI-InSPIRE RI EGI strategy towards the Open Science Commons Tiziana Ferrari EGI-InSPIRE Director at EGI.eu.
Ecological Niche Modelling in the EGI Cloud Federation
Dimitris Koureas Lead, Research Data and Partnerships
IaaS Layer – Solutions for “Enablers”
Virtual Research Environments The story of Scratchpads
Brief introduction to the project
EC FP7 - Cooperation Theme 6: Environment (incl. climate change)
Who’s Who in Bioinformatics: The European Landscape
LifeWatch Cloud Computing Workshop
Bird of Feather Session
Digital Objects: The Science
Presentation transcript:

Dimitris Koureas, PhD Natural History Museum London Linking layers of biodiversity data: Informatics challenges for the long tail research RDA - Long Tail IG breakout session Amsterdam, 23 Sep 2014

The problem – Capturing and integrating biodiversity data How to we join up these activities?How do we use this as a tool? Species conservation & protected areas Impacts of human development Biodiversity & human health Impacts of climate change Food, farming & biofuels Invasive alien species What infrastructures do we need? (technologies, tools, standards…) What processes do we need? (Modelling, workflows…) What data do we need? (Genes, localities…)

LinkD Challenge 1: mobilising data at all scales

LinkD Challenge 2: linking & aggregating data at different scales National Efforts c.5M (e.g. NHM Data Portal) Communities c.50k (e.g. Scratchpads) Global Efforts c.500M (e.g. GBIF Data Portal)

LinkD Challenge 3: Synthesising data, e.g. modelling human pressures on biodiversity Projecting Responses of Ecological Diversity In Changing Terrestrial Systems 2M records, 19k sites, 34k spp. Management Practices EcosystemsAgro-systems Small aggregated datasets Species richness in different ecosystems Land-use change Pollution Invasive species Infrastructure Models to predict how biodiversity responds to human pressures

The problem – integrating biodiversity research Figure from Costello M.J et al, doi: /science

c new sp and subsp. described every year c new sp and subsp. described every year The problem – integrating biodiversity research

Key problems Landscape is complex, fragmented & hard to navigate Many audiences (policy makers, scientists, amateurs, citizen scientists) Many scales (global solutions to local problems) Figure adapted from Peterson et al 2010 An informaticians view of biodiversity

Investigator-focused 'small data‘ Locally generated 'invisible data' 'incidental data' dark data 20% 80% Published and discoverable data Dark data more important mainly due to their volume 1 1 Heidorn PB. Library Trends 57:

Incentives for mobilising long-tail research Leverage effort and data impact Increase exposure and citability of work Provide easy to use and long-lasting VRE Promote the culture of openness in science

Increase exposure and citability of work Scholarly data publication Enable easy publication of data and data descriptors Link data journals with data sources (repositories, VREs) using common data exchange standards Small data contributions

Leverage effort and data impact Virtual Research Environments Empower researchers through development and deployment of service-driven digital research environments 515 Scratchpad Communities by 6,321 active registered users covering 176,950 taxa in 932,296 pages. 134 paper citations in 2013 In total more than 2,500,000 visitors

Leverage effort and data impact Long tail data External data & services

Leverage effort and data impact Enable long tail researchers to do science online by processing own data together with data from cross-disciplinary sources Provide workflows for the processing of data in major areas of biodiversity research: ecological niche modelling, ecosystem functioning, and taxonomy. The BioVeL approach Design and Construct – Run – Share and Discover scientific workflows

Leverage effort and data impact A highly dynamic but fragmented landscape

Data curation Data curation Data publishing Data publishing Data mobilisation & generation Data mobilisation & generation Data analysis Data analysis Leverage effort and data impact Seamless virtual research environments that incentivise mobilisation of long tail research

H VRE Proposal: LinkD Topic: EINFRA Virtual Research Environments Estimated Budget: € 8-9 m Consortium: c. 24 partners LinkD Linking data, services and communities for predictive modelling of the biosphere Deliver a coherent and accessible ecosystem of federated services and deploy a network of research and collaboration enabling tools to support scientific excellence towards the long term vision of predicting modelling of the biosphere Builds upon: ViBRANT | BioVeL | pro-iBiosphere | EU-BON Strategic links to: ESFRI projects (incl. LifeWatch, ELIXIR)