Published by Asher Bond. Modified over 9 years ago.
1
Dimitris Koureas, PhD
Natural History Museum London
Linking layers of biodiversity data: Informatics challenges for long tail research
RDA - Long Tail IG breakout session
Amsterdam, 23 Sep 2014
2
The problem – Capturing and integrating biodiversity data
How do we join up these activities? How do we use this as a tool?
Species conservation & protected areas
Impacts of human development
Biodiversity & human health
Impacts of climate change
Food, farming & biofuels
Invasive alien species
What infrastructures do we need? (technologies, tools, standards…)
What processes do we need? (modelling, workflows…)
What data do we need? (genes, localities…)
3
LinkD Challenge 1: mobilising data at all scales
4
LinkD Challenge 2: linking & aggregating data at different scales
Communities: c. 50k (e.g. Scratchpads)
National efforts: c. 5M (e.g. NHM Data Portal)
Global efforts: c. 500M (e.g. GBIF Data Portal)
5
LinkD Challenge 3: synthesising data, e.g. modelling human pressures on biodiversity
PREDICTS: Projecting Responses of Ecological Diversity In Changing Terrestrial Systems (www.predicts.org.uk)
2M records, 19k sites, 34k spp.
Small aggregated datasets: management practices, ecosystems, agro-systems
Species richness in different ecosystems
Human pressures: land-use change, pollution, invasive species, infrastructure
Models to predict how biodiversity responds to human pressures
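The step from small aggregated datasets to species richness per ecosystem can be sketched as a toy example. This is only an illustration under stated assumptions, not the PREDICTS method: the record layout, the function name richness_by_land_use, and all sample values are hypothetical.

```python
from collections import defaultdict

# Hypothetical records pooled from several small datasets:
# (dataset, site, land_use, species). All values are made up.
records = [
    ("study_a", "site1", "primary forest", "Species A"),
    ("study_a", "site1", "primary forest", "Species B"),
    ("study_a", "site2", "cropland", "Species A"),
    ("study_b", "site3", "primary forest", "Species C"),
    ("study_b", "site4", "cropland", "Species A"),
    ("study_b", "site4", "pasture", "Species D"),
]


def richness_by_land_use(rows):
    """Count the distinct species recorded in each land-use class."""
    species = defaultdict(set)
    for _dataset, _site, land_use, name in rows:
        # A set keeps each species once, however many datasets report it.
        species[land_use].add(name)
    return {land_use: len(names) for land_use, names in species.items()}


if __name__ == "__main__":
    for land_use, richness in sorted(richness_by_land_use(records).items()):
        print(land_use, richness)
```

Pooling species into a set per land-use class gives raw richness only. A real synthesis would also need to account for differences in sampling effort between the contributing studies.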
6
The problem – integrating biodiversity research
Figure from Costello M.J. et al., 2013. doi: 10.1126/science.1230318
7
The problem – integrating biodiversity research
c. 17,000 new species and subspecies described every year
8
An informatician's view of biodiversity
Key problems
Landscape is complex, fragmented & hard to navigate
Many audiences (policy makers, scientists, amateurs, citizen scientists)
Many scales (global solutions to local problems)
Figure adapted from Peterson et al. 2010
9
Investigator-focused 'small data'
Locally generated 'invisible data'
'Incidental data'
Chart labels: dark data, 20%, 80%, published and discoverable data
Dark data are more important, mainly due to their volume [1]
[1] Heidorn PB. Library Trends 57:280-299
10
Incentives for mobilising long-tail research
Leverage effort and data impact
Increase exposure and citability of work
Provide easy-to-use and long-lasting VREs
Promote the culture of openness in science
11
Increase exposure and citability of work
Scholarly data publication
Enable easy publication of data and data descriptors
Link data journals with data sources (repositories, VREs) using common data exchange standards
Small data contributions
12
Leverage effort and data impact
Virtual Research Environments
Empower researchers through development and deployment of service-driven digital research environments
515 Scratchpad communities with 6,321 active registered users, covering 176,950 taxa in 932,296 pages
134 paper citations in 2013
In total, more than 2,500,000 visitors
13
Leverage effort and data impact
Long tail data
External data & services
14
Leverage effort and data impact
The BioVeL approach
Enable long tail researchers to do science online by processing their own data together with data from cross-disciplinary sources
Provide workflows for processing data in major areas of biodiversity research: ecological niche modelling, ecosystem functioning, and taxonomy
Design and Construct – Run – Share and Discover scientific workflows
15
Leverage effort and data impact
A highly dynamic but fragmented landscape
16
Leverage effort and data impact
Data mobilisation & generation
Data curation
Data analysis
Data publishing
Seamless virtual research environments that incentivise mobilisation of long tail research
17
H2020 2015 VRE Proposal: LinkD
LinkD: Linking data, services and communities for predictive modelling of the biosphere
Topic: EINFRA-9-2015 Virtual Research Environments
Estimated budget: €8-9 m
Consortium: c. 24 partners
Deliver a coherent and accessible ecosystem of federated services, and deploy a network of research and collaboration-enabling tools, to support scientific excellence towards the long-term vision of predictive modelling of the biosphere
Builds upon: ViBRANT | BioVeL | pro-iBiosphere | EU-BON
Strategic links to: ESFRI projects (incl. LifeWatch, ELIXIR)