InvertNet: Year 2 Progress & Plans Chris Dietrich, David Raila and Omar Sobh University of Illinois iDigBio HUB Summit II, Gainesville FL.

Slides:



Advertisements
Similar presentations
Avoiding CMS Pitfalls Presented by John Piechowski, Managing Director – Northwoods Software.
Advertisements

CLEARSPACE Digital Document Archiving system INTRODUCTION Digital Document Archiving is the process of capturing paper documents through scanning and.
Don’t make me think Biodiversity data publishing made easy Vince Smith, Alice Heaton, Laurence Livermore, Simon Rycroft, Ben Scott & Lyubomir Penev* The.
Virtualizing Entomology Collection Student: Di Wang (Alan) Sponsors: John Marris: Curator, Entomology Research Museum Stuart Charters: Department of Applied.
The Documentum Team Lance Callaway, Brooke Durbin, Perry Koob, Lorie McMillin, Jennifer Song Missouri University of Science and Technology Rolla, Missouri.
Publish or perish? Linking Scratchpads and the new Biodiversity Data Journal for streamlining publication of botanical data D.N Koureas 1, L. Penev 2 &
InvertNet: A New Platform for Biodiversity Research and Outreach Chris Dietrich Illinois Natural History Survey University of Illinois ECN 2011, Reno.
BiodIS K-State Biodiversity Information System David Allen and Mike Haddock K-State Libraries Coalition for Networked Information December 15, 2009.
Presentation Outline  Project Aims  Introduction of Digital Video Library  Introduction of Our Work  Considerations and Approach  Design and Implementation.
Presentation Outline  Project Aims  Introduction of Digital Video Library  Introduction of Our Work  Considerations and Approach  Design and Implementation.
Mgt 240 Lecture Website Construction: Software and Language Alternatives March 29, 2005.
Data-PASS Shared Catalog Micah Altman & Jonathan Crabtree 1 Micah Altman Harvard University Archival Director, Henry A. Murray Research Archive Associate.
Chapter 5 Application Software.
Welcome to the Minnesota SharePoint User Group. Introductions / Overview Project Tracking / Management / Collaboration via SharePoint Multiple Audiences.
Currently 7 Thematic Collection Networks with 130 participating institutions A dvancing D igitization of B iodiversity C ollections (ADBC NSF Program)
Fourth Annual Summit | Feb | Tucson, AZ Scratchpads for community involvement for natural history collections Dr Dimitris Koureas Biodiversity.
Public Participation in Digitization of Biodiversity Specimens Workshop Julie Speelman September 28, 2012.
Advanced Computing and Information Systems laboratory iDigBio Cloud and Appliances: Concept, Processes and Progress Jose Fortes (on behalf of the iDigBio.
Species Banks a GBIF mechanism to provide electronic access to quality species information Peter H. Schalk, Marc Brugman ETI, University of Amsterdam Tinde.
SobekCM’s Community Ecosystems & Socio-Technical Practices Presented by Mark V. Sullivan June 10 th, 2014 Sobek image created by Jeff Dahl and is shared.
Implementation of HUBzero as a Knowledge Management System in a Large Organization HUBBUB Conference 2012 September 24 th, 2012 Gaurav Nanda, Jonathan.
BISQUE: Enabling Cloud and Grid Powered Image Analysis Ramona Walls iPlant Collaborative
SharePoint and SharePoint Online: Today and what's next? Presented by Luke Abeling – IT Platforms.
The Macroalgal Digitization Project Chris Neefus, Department of Biological Sciences University of New Hampshire, Durham, New Hampshire.
SCAN Survey Results: Engaging the Public with Insect Digitization Workflows Dr. Melody Basham Hasbrouck Insect Collection Outreach Specialist Project Director.
Enabling Cloud and Grid Powered Image Phenotyping Nirav Merchant iPlant Collaborative
BIRN Update Carl Kesselman Professor of Industrial and Systems Engineering Information Sciences Institute Fellow Viterbi School of Engineering University.
TECHNOLOGY SUPPORT FOR ESSSS Progress, Issues, and Challenges Marshall Breeding Director for Innovative Technology and Research Vanderbilt University Library.
Ms. Irene Onyancha ISTD/Library & Information Management Services United Nations Economic Commission for Africa The Second Session of the Committee on.
University of Florida Florida State University
The Global Video Grid: DigitalWell Update & Plan For SRB Integration Myke Smith, Manager Streaming Media Technologies University of Washington / ResearchChannel.
IPlant cyberifrastructure to support ecological modeling Presented at the Species Distribution Modeling Group at the American Museum of Natural History.
PLoS ONE Application Journal Publishing System (JPS) First application built on Topaz application framework Web 2.0 –Uses a template engine to display.
Enabling Cloud and Grid Powered Image Phenotyping Martha Narro iPlant Collaborative Adapted.
Data Management BIRN supports data intensive activities including: – Imaging, Microscopy, Genomics, Time Series, Analytics and more… BIRN utilities scale:
Battle of the Collaborators Which collaboration tool is right for you? Sam Johnson, John Alexander, and Trisha Gordon explore many of the online collaborative.
Scratchpads The virtual research environment for biodiversity data Simon Rycroft, Dave Roberts, Vince Smith, Alice Heaton, Katherine Bouton, Laurence Livermore,
Introduction to Omeka. What is Omeka? - An Open Source web publishing platform - Used by libraries, archives, museums, and scholars through a set of commonly.
Students: Anurag Anjaria, Charles Hansen, Jin Bai, Mai Kanchanabal Professors: Dr. Edward J. Delp, Dr. Yung-Hsiang Lu CAM 2 Continuous Analysis of Many.
© Paradigm Publishing Inc. 5-1 Chapter 5 Application Software.
Encyclopedia of Life Established May 2007 First version of portal went online Feb year goals –Assemble infinitely expandable web pages for all.
IPlant Collaborative Hands-on Cyberinfrastructure Workshop – Part 2 R. Walls University of Arizona Biodiversity Information Standards (TDWG) Sep. 29, 2015,
CONTENT DISCOVERY, SERVICES, AND SUSTAINED ACCESS Timothy Cole, William Mischo, Beth Sandore, Sarah Shreeves ~ University of Illinois Library
Digital Commons & Open Access Repositories Johanna Bristow, Strategic Marketing Manager APBSLG Libraries: September 2006.
An Introduction to Scratchpads: Making your data work for you Laurence Livermore Natural History Museum, London Joinville, Brazil.
Implementing an Institutional Repository: Part III 16 th North Carolina Serials Conference March 29, 2007 Resource Issues.
ALA Institutional Repository Update ALA Archives at the University of Illinois Urbana-Champaign Chris Prom Cara Bertram Denise Rayman.
DUNN & WILSON PROJECT Tales from outside the Square.
Presented by Jens Schwidder Tara D. Gibson James D. Myers Computing & Computational Sciences Directorate Oak Ridge National Laboratory Scientific Annotation.
GBIF Data Access and Database Interoperability 2003 Work Programme Overview Donald Hobern, GBIF Programme Officer for Data Access and Database Interoperability.
HUBzero® Platform for Scientific Collaboration Copyright © 2012 HUBzero Foundation, LLC HUBbub 2012, Sept 24-25, Indianapolis 1 Welcome to the World of.
Scratchpads and the new Biodiversity Data Journal Biodiversity Data Publishing made… easier Dimitris Koureas Natural History Museum London.
Context: The Strategic Plan for Establishing the Network Integrated Biocollections Alliance Judith E. Skog, Office of the Assistant Director, Biological.
Enabling Cloud and Grid Powered Image Phenotyping
Internet Documentation and Integration of Metadata (IDIOM) Presented by Ahmet E. Topcu Advisor: Prof. Geoffrey C. Fox 1/14/2009.
Cyberinfrastructure: Many Things to Many People Russ Hobby Program Manager Internet2.
DANIELA KOLAROVA INSTITUTE OF INFORMATION TECHNOLOGIES, BAS Multimedia Semantics and the Semantic Web.
The library is open Digital Assets Management & Institutional Repository Russian-IUG November 2015 Tomsk, Russia Nabil Saadallah Manager Business.
BEN Tools & Isovera Services Isovera Consulting Cal Collins, Shakib Mostafa, Sergey Demidenko Feb
Riccardi: DIALOGUE Workshop August 1, 2005 Supported by NSF BDI 1 Representing and Using Phylogenetic Characters in Morphbank Greg Riccardi, David Gaitros,
1 « Luxembourg, 18 April 2007 « Virtual Library of Official Statistics « Dissemination Working Group.
DESIGN AND DEVELOPMENT OF NOAA VIRTUAL LIBRARIES: THE INTERSECTION OF TRADITIONAL LIBRARY KNOWLEDGE AND CUTTING EDGE INFORMATION TECHNOLOGIES Dottie Anderson.
IPlant Collaborative Tools and Services Workshop Overview of the iPlant Discovery Environment Sriram Srinivasan.
5/29/2001Y. D. Wu & M. Liu1 Content Management for Digital Library May 29, 2001.
CENTRAL/WESTERN MASSACHUSETTS AUTOMATED RESOURCE SHARING Digital Repositories Build It & They Will Come Michael J. Bennett Access Services Supervisor C/WMARS,
Crowd-sourcing, Public Participation, and Data Enrichment – Using crowd-sourcing tools Biological Collections Digitisation in the Pacific , Symposium.
Joslynn Lee – Data Science Educator
UNC Digital Library Project
Content Management Systems
Archbold Biological Station
Presentation transcript:

InvertNet: Year 2 Progress & Plans Chris Dietrich, David Raila and Omar Sobh University of Illinois iDigBio HUB Summit II, Gainesville FL

InvertNet Rationale Vast majority of specimens in U.S. collections are invertebrates primarily insects and related arthropods less than 5% available online only label data usually provided Most invertebrate biodiversity research is specimen-based all knowledge of many species is embodied in collections Existing digitization methods are inadequate slow and expensive ($1+ per specimen) risk of damage to specimens from handling iDigBio Summit 2

InvertNet Goals Digitize all holdings of 22 midwestern arthropod collections (50 million + specimens) Specimen images and metadata (label info) Drawers, vials, slides Advanced imaging (including 3D) Best quality at reasonable cost (~$0.10/specimen) Provide access to images and other data via online virtual museum browsable/searchable/zoomable web interface link to other data providers (GBIF, national ADBC HUB, etc.) Provide platform for research and development of additional tools and resources Data mining and analysis Community building, collaboration, and support Education, outreach, and reference iDigBio Summit 2

InvertNet UIUC Team Chris Dietrich – Director Systematic Entomologist John Hart – CoPI Computer Science - Graphics Nahil Sobh – CoPI Computational Multiscale Nanosystems Umberto Ravaioli – CoPI Computational Multiscale Nanosystems David Raila – Senior Collaborator Computer Science – Sr. Research Programmer Others Programmers, research assistants, hourlies iDigBio Summit 2

InvertNet Collaborating Curators CollaboratorInstitution A. CognatoMSU G. Courtney, J. VanDykISU J. HollandPurdue R. Holzenthal, P. Tinerella Minnesota P. JohnsonSDSU H. Klompen, M. DalyOSU J. Rawlins, R. Davidson, J. Fetzner Carnegie Museum D. Rider, G. FauskeNDSU A. ShortKansas R. SitesMissouri D. YoungWisconsin- Madison J. ZaspelWisconsin- Oshkosh G. ZolnerowichKSU

Additional Collections Eastern Illinois University Western Illinois University Southern Illinois University Illinois State University Milwaukee Public Museum Northern Michigan University U North Dakota Valley City State University U Hawaii (added this year)

Year 1 Accomplishments: Digitization Workflows Implemented digitization workflows for slide-mounted specimensand specimens stored in vials Tested drawer digitization hardware Established web portal at UIUC using HUBzero platform -Community development for collaborators -Digitization workflow -Searchable/browsable web interface for images and label data Staging pinned collections for digitization -basic housekeeping (drawer and unit tray labels, updating nomenclature, organizing identified material) -curator exchanges to upgrade curatorial status of focal taxa Develop training materials for participants InvertNet Digitization Workshop – Spring 2012

Digitization Workflows: Slides Designed new, less expensive template for arranging sets of 20 slides on flatbed scanner Published workflow description on InvertNet.org ( Published training video demonstration of entire procedure ( iDigBio Summit 2

Digitization Workflows: Vials Developed new workflow that does not require removing labels from vials and allows multiple vials to be scanned simultaneously Published workflow description on InvertNet.org ( Published training video demonstration of entire procedure ( iDigBio Summit 2

Drawer Digitization Custom designed precision robotics system Precision machine hardware and machine control software High-res industrial camera with low- distortion telecentric lens State of the art computer vision system (OpenCV) Feature detection+image processing Integrated and customized for InvertNet Easy to use – automated iDigBio Summit 2 Delta robot

OpenCV – Computer Vision Library High performance vision library Feature/object detection, image processing, image registration/metrics … Maintained and growing InvertNet Uses Autofocus Stitching Auto-calibration - drawers Real-time quality monitoring/adjustment during capture Key specimen additional processing iDigBio Summit 2

Digitization Workflow Testbed: 3D Reconstruction Disney research SIGGRAPH 2010 Computes 3D model from multiple images at known positions Testing of capture positions needed UIUC I2PC reference algorithm in place Working on parallelization for performance, optimization for small-scale specimens Good initial results iDigBio Summit 2

Digitization Workflow: Advantages Meets cost target of 10 cents/specimen Provides rapid access to entire digitized collection Multiple images from different perspectives stitched together for 2D and 3D reconstruction and zoom capability 2D images of multiple units acquired simultaneously then segmented into individual database containers iDigBio Summit 2

Outreach Link to BugGuide: users compare photos of live bugs to images of identified specimens Crowd-sourcing label data capture (Zooniverse) iDigBio Summit 2

InvertNet IT Infrastructure Year One

InvertNet Infrastructure InvertNet Infrastructure Physical Rack Setup

Added Features in Year One Ingest Pages for Slides and Vials: Drag and Drop Chunked Uploading Tagging, Profiling, Batch Submission InvertNet Taxonomic Tree and Site Search: CoL Taxonomic Base Search terms autocompletion Search by site as well as the Digital Image Repository Zoomable Viewer: Tiled Pyramidal TIFF format This is a standard TIFF extension and is supported by most image processing applications including Photoshop, GIMP, VIPS and ImageMagick. The libtiff codec library is also perfectly capable of reading and writing such images.

Upcoming Features InvertNet Infrastructure Upgrade of base system to Hubzero1.1 Geo-located Storage for added redundancy - IdigBio Storage Burst CDN (Amazon API, GigenetCloud) Website: Ingest Pages for Drawers Responsive Design Segment, Annotation and Specimen Capture Tools Bug-Guide and Google Images tool for resources Taxonomic Collaboration: Method to have a taxonomic base that can be added onto with citing and reasons for addition or change extended by API for authorized others to interact with.

Join Us Registration is open to all and available now! iDigBio Summit 2

Acknowledgements Collaborators: J. Hart, N. Sobh, U. Ravaioli, C. Taylor, A. Cognato, G. Courtney, J. Holland, R. Holzenthal, P. Tinerella, P. Johnson, H. Klompen, M. Daly, J. Rawlins, R. Davidson, J. Fetzner, D. Rider, G. Fauske, A. Short, R. Sites, D. Young, J. Zaspel, G. Zolnerowich Funding: NSF ADBC program