Presentation is loading. Please wait.

Presentation is loading. Please wait.

Lifemapper 2.0 Using and Creating Geospatial Data and Open Source Tools for the Biological Community Aimee Stewart, CJ Grady, Dave Vieglais, Jim Beach.

Similar presentations


Presentation on theme: "Lifemapper 2.0 Using and Creating Geospatial Data and Open Source Tools for the Biological Community Aimee Stewart, CJ Grady, Dave Vieglais, Jim Beach."— Presentation transcript:

1 Lifemapper 2.0 Using and Creating Geospatial Data and Open Source Tools for the Biological Community Aimee Stewart, CJ Grady, Dave Vieglais, Jim Beach Natural History Museum and Biodiversity Institute University of Kansas

2 Overview Overall Goals History Current version Implementation Future

3 Niche Modeling Yeah yeah yeah Data –Environmental –Occurrence Computational limitations

4 Lifemapper 1.0 NSF funded Experimental app. Successful DC project Enthusiastic users Limited by –Data quality –Architectural decisions

5 Lifemapper 2.0 Demo pipeline processing specimen data from GBIF cache Funded by NSF/EPSCoR Simpler, controlled architecture Goals –On demand computation –Model archive –Data and analysis service

6

7 Components Cluster Spatial data library (SDL) Workflow controller Open-source Python

8 Operation Client –retrieves point data –constructs request –sends job to cluster by REST Cluster front end receives /schedules job Cluster nodes –retrieve environmental data –dispatch job to OM Client –polls for status –retrieves and stores model/projection

9 Data Environmental data –URL in job, retrieved via WCS by node –Caches on nodes for efficiency Point data –Could be REST or WFS URL Result data –Model (ruleset) stored on file system –Projection (raster map) registered in SDL

10 Cluster 64 node, 128 processors 2 TB storage NPACI Rocks Sun Grid Engine scheduler HTTP REST service –Run openModeller (GARP or other algorithm) –Get status –Get result data

11 Spatial Data Library Mapserver with custom python W*S Layer metadata in PostGIS Independent service - so could –be standalone –be one of multiple SDLs servicing pipeline Will have –search/query webservice –browsable web interface

12 Workflow Controller Could simply generate jobs … so easy to integrate Currently –Harvests from GBIF –Generates jobs per species –Reproduce LM1 but with refined data and scalable system

13

14 Overall system Standalone system –Only outside connection is REST service –Easily moved to smaller/larger system –Or multiple systems for failover for high demand Interface - easier than existing SOAP Designed to provide high throughput, not rapid evaluation of single model Open Source Software

15 Implementation Status Core components operational –harvest data –generate jobs –output projections –store back to SDL No user interface yet W*S so existing viz solutions easy

16 What does the future hold? Fine-tune Taxonomic resolution Data cleaning Multiple –algorithms –projection scenarios Analysis services

17 Acknowledgements Funding –NSF Award (EPSCoR 0553722) –Kansas Technology Enterprise Corporation openModeller –CRIA and more Original GARP –David Stockwell, SDSC Environmental data –Climate Research Unit –Int. Panel on Climate change –Normalization BDWorld, Tim Sutton, Pete Brewer GBIF and contributing collections Lifemapper1 Team, especially Ricardo Pereira

18 Questions?


Download ppt "Lifemapper 2.0 Using and Creating Geospatial Data and Open Source Tools for the Biological Community Aimee Stewart, CJ Grady, Dave Vieglais, Jim Beach."

Similar presentations


Ads by Google