Riccardi: DIALOGUE Workshop August 1, 2005 Supported by NSF BDI 1 Representing and Using Phylogenetic Characters in Morphbank Greg Riccardi, David Gaitros,

Slides:



Advertisements
Similar presentations
User needs assessment and preparing a dissemination plan John Tann Kolkata, June 2011 The Atlas is funded by the Australian Government.
Advertisements

TDWG GUID-2 June 10, 2006Jessie Kennedy/Rob Gales LSID Resolution In SEEK Taxon.
The Library of Life Federated Description Services and the Library of Life or What can we do with SDD anyway? Kevin Thiele Centre for Biological Information.
SCAMIT-Morphbank Workshop Custom Workbook Submission Monday 8 August 2011 SCCWRP, Costa Mesa, CA.
Morphbank Image Repository Plus… Paleocollections Workshop April 26 – 28, 2012 Deborah L. Paul Support from NSF grants: Biological Databases.
Ontology Classifications Acknowledgement Abstract Content from simulation systems is useful in defining domain ontologies. We describe a digital library.
Building a Digital Library with Fedora International Conference on Developing Digital Institutional Repositories Hong Kong December 9, 2004.
Open Statistics: Envisioning a Statistical Knowledge Network Ben Shneiderman Founding Director ( ), Human-Computer Interaction.
A LOOMING CRISIS: MAINTAINING ACCESS TO ELECTRONIC RESEARCH PRODUCTS Daphne Fautin University of Kansas Gail Kampmeier Illinois Natural History Survey.
BUSINESS DRIVEN TECHNOLOGY
Lecture Two Database Environment Based on Chapter Two of this book:
Software Development Unit 2 Databases What is a database? A collection of data organised in a manner that allows access, retrieval and use of that data.
Roles and Goals Greg Riccardi. iDigBio People University of Florida o Larry Page, Jose Fortes, Pamela Soltis, Bruce McFadden, Renato Figueiredo, Reed.
Introduction to UDDI From: OASIS, Introduction to UDDI: Important Features and Functional Concepts.
Database Environment 1.  Purpose of three-level database architecture.  Contents of external, conceptual, and internal levels.  Purpose of external/conceptual.
Fourth Annual Summit | Feb | Tucson, AZ Scratchpads for community involvement for natural history collections Dr Dimitris Koureas Biodiversity.
SERNEC Image/Metadata Database Goals and Components Steve Baskauf
Drivers for a PRAGMA Biodiversity Science Expedition Reed Beaman Florida Museum of Natural History University of Florida.
Database System Concepts and Architecture Lecture # 3 22 June 2012 National University of Computer and Emerging Sciences.
SCIENCE-DRIVEN INFORMATICS FOR PCORI PPRN Kristen Anton UNC Chapel Hill/ White River Computing Dan Crichton White River Computing February 3, 2014.
OCLC Online Computer Library Center CONTENTdm ® Digital Collection Management Software Ron Gardner, OCLC Digital Services Consultant ICOLC Meeting April.
Managing the Record of Research At the Smithsonian Using SIdora SAA Research Forum August 12, 2014.
Use case lessons: Components of the SEEK architecture Robert K. Peet University of North Carolina.
Morphbank Current Topics: Using Images & Metadata Biodiversity Informatics Course, 18 September 2009 Swedish Museum of Natural History (NRM), Stockholm.
Research Data Management At the Smithsonian Using SIdora Nano Tech Working Group May 15, 2014.
ISpheres Project. Project Overview iSpheresCore iSpheresImage Demonstration References.
1 XML as a preservation strategy Experiences with the DiVA document format Eva Müller, Uwe Klosa Electronic Publishing Centre Uppsala University Library,
Field Work, Herbaria, Databases, Floras, and Monographs for Plant Systematics Spring 2014.
Ontologizing morphological terms for Hymenoptera (Insecta) - implementing and benefiting from a controlled vocabulary Andrew R. Deans Gregory A. Riccardi.
Connecting Specimens, Images and Vocabulary Specify, Morphbank, Morphster Beach, Noble, Spears – KU Mast, Riccardi – FSU Miranker, Tirmizi UT.
PLoS ONE Application Journal Publishing System (JPS) First application built on Topaz application framework Web 2.0 –Uses a template engine to display.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
Scratchpads The virtual research environment for biodiversity data Simon Rycroft, Dave Roberts, Vince Smith, Alice Heaton, Katherine Bouton, Laurence Livermore,
Cynthia Parr Phenotype RCN NESCent 25 February 2013.
CBSOR,Indian Statistical Institute 30th March 07, ISI,Kokata 1 Digital Repository support for Consortium Dr. Devika P. Madalli Documentation Research &
CSIRO Marine Research Data Centre linked databases - CAAB, MarLIN and Divisional Data Warehouse.
Data Integration and Management A PDB Perspective.
Presented by Scientific Annotation Middleware Software infrastructure to support rich scientific records and the processes that produce them Jens Schwidder.
Personalized Interaction With Semantic Information Portals Eric Schwarzkopf DFKI
Overview PlantCollections – Publish information about public garden collections – Using existing infrastructure Morphbank – Goals and capabilities of.
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 2 Database System Concepts and Architecture.
June 2006Image LSID resolvers Image LSID Resolution Prototypes Hui Dong, Bob Morris UMass Boston.
IDigBio is funded by a grant from the National Science Foundation’s Advancing Digitization of Biodiversity Collections Program (Cooperative Agreement EF ).
SELTMANN 2, Katja; PRIETO-MARQUEZ 1,*, Albert; RONQUIST 2, Fredrik; RICCARDI 3, Gregory A.; DEANS 5, Andy; JAMMIGUMPULA 2, Neelima; MAST 1, Austin; WINNER.
Scratchpads and the new Biodiversity Data Journal Biodiversity Data Publishing made… easier Dimitris Koureas Natural History Museum London.
© 2006 University of Kansas An LSID resolver for specimens and a digression into issues raised by the use of GUIDs Steve Perry
Research Data Management At the Smithsonian Using Sidora CNI December 10, 2013.
SCAMIT and Morphbank Nov City of San Diego Environmental Monitoring Lab Nov City of San Diego Environmental Monitoring Lab Nov
GigaScience ( is an online, open-access journal that includes, as part of its publishing activities, the database GigaDB.
MorphBank’s Approach to Determination Annotations Austin Mast | David Gaitros | Fredrik Ronquist | Peter Jörgensen | Corinne Jörgensen | Greg Riccardi.
A Dissertation defense presented to the Department of Computer Science
Converting an Existing Taxonomic Data Resource to Employ an Ontology and LSIDS Jessie Kennedy Rob Gales, Robert Kukla.
Scratchpads An online platform for biodiversity data Laurence Livermore Biodiversity Informatics | Department of Life Sciences Natural History Museum London.
NVS New Zealand National Vegetation Survey. What is NVS? NVS (National Vegetation Survey) – New Zealand’s largest archive facility for plot-based vegetation.
The Earth System Curator Metadata Infrastructure for Climate Modeling Rocky Dunlap Georgia Tech.
Developing our Metadata: Technical Considerations & Approach Ray Plante NIST 4/14/16 NMI Registry Workshop BIPM, Paris 1 …don’t worry ;-) or How we concentrate.
Botany 2012 Columbus, OH Deborah Paul, iDigInfo 0.
An Introduction to database system
Jessie Kennedy Rob Gales, Robert Kukla
Flanders Marine Institute (VLIZ)
OntoMorphBankSter: Image-driven Ontology and/or Ontology-driven Image Annotation Greg Riccardi, Austin Mast Florida State U Dan Miranker, Ferner Cilloniz,
UNC Digital Library Project
Semantic Database Builder
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 2 Database System Concepts and Architecture.
Data Management: The Data Repatriation Re-integration Step or …
An ecosystem of contributions
NSDL Data Repository (NDR)
Capturing and Organizing Scientific Annotations
Metadata The metadata contains
Chapter 2 Database Environment Pearson Education © 2014.
Presentation transcript:

Riccardi: DIALOGUE Workshop August 1, 2005 Supported by NSF BDI 1 Representing and Using Phylogenetic Characters in Morphbank Greg Riccardi, David Gaitros, Fredrik Ronquist, Austin Mast, Andrew Deans, Neelima Jammingumpula, Wilfredo Blanco, Katja Seltmann, Karolina Maneva- Jakimoska, Steve Winner

Greg Riccardi: TDWG 06 Supported by NSF BDI 2 Overview Morphbank goals Progress update GUID support Annotations and Associations

Greg Riccardi: TDWG 06 Supported by NSF BDI 3 Morphbank Goals Help biologists capture, organize, and manage phylogenetic information –Store and publish images –Provide tools to create and manipulate annotations and associations –Help move to digital basis of specimen analysis –Capture peoples’ knowledge of species Example of Tree of Life process –Specimens are photographed –Images and metadata entered into database –Features (character states) are identified in images –Character state matrices are created –Character matrices are processed to produce family trees Cipres, TreeBaseCipresTreeBase

Greg Riccardi: TDWG 06 Supported by NSF BDI 4 What is Morphbank Curated repository of biological digital media and associated information –Funded by NSF to develop technology and keep images –Acquire, Protect, Distribute, Archive –Add value to images by acquiring and managing annotations and other associations Tools to create and record information supported by images Seamless integration of research and publication Not primarily a tool development –Back end repository for many clients (some examples follow) –Some client tool development planned for Morphbank

Greg Riccardi: TDWG 06 Supported by NSF BDI 5 Morphbank Progress New interfaces Better search and Filter Collections Annotations

Greg Riccardi: TDWG 06 Supported by NSF BDI 6 Morphbank Image Display 2005 Some of the fly wings in developmental DB

Greg Riccardi: TDWG 06 Supported by NSF BDI 7 Conceptual Challenges Schema for media repository Relationships between data objects Acquiring and managing annotations and associations Searching and browsing information Managing classifications

Greg Riccardi: TDWG 06 Supported by NSF BDI 8 Browse by View View description is based on morphological classification

Greg Riccardi: TDWG 06 Supported by NSF BDI 9 Specimen Display Page

Greg Riccardi: TDWG 06 Supported by NSF BDI 10 Image Display Page

Greg Riccardi: TDWG 06 Supported by NSF BDI 11 Search for Images of Specimen

Greg Riccardi: TDWG 06 Supported by NSF BDI 12 Collection Page

Greg Riccardi: TDWG 06 Supported by NSF BDI 13 GUIDs at Morphbank Map relational database to Java object model Export Java objects as RDF Develop RDF schema for objects Use LSID software to publish RDF

Greg Riccardi: TDWG 06 Supported by NSF BDI 14 Sample RDF for an Image Width and Height set 829 Animalia

Greg Riccardi: TDWG 06 Supported by NSF BDI 15 What is an Annotation? An assertion of a relationship among objects –Someone claims that several objects are associated by a relationship and gives evidence of the connection –Includes record of author and date of assertion –Objects are often datasets with provenance –Annotations often assert quality characteristics of data objects Crucial social components –Attribution, confidence, and validity –Ontologies and compliance with standards –Establishment of object naming strategy –Security policies Feature Annotation –E.g., shows an area of interest in an image that displays a particular character state

Greg Riccardi: TDWG 06 Supported by NSF BDI 16 What is a Phylogenetic Character? A morphological feature –Relevant to taxa under a taxon –Value is discrete (set of states) or continuous A value of a character may represent a characteristic of some anatomical or morphological component of a collection of taxa The value of the character is selected by sorting specimens –In the digital world, sorting images

Greg Riccardi: TDWG 06 Supported by NSF BDI 17 Morphology Publication Example

Greg Riccardi: TDWG 06 Supported by NSF BDI 18 How to Create Characters and States Select a collection of taxa and one or more features of interest Collect images as appropriate Annotate images to identify location of feature Sort images into piles according to the character state Define a state for each pile –Name and describe the state

Greg Riccardi: TDWG 06 Supported by NSF BDI 19 Advantages of Collections Searching in large datasets is hard –Filtering doesn’t work, ranking is required Identifying similarity is hard –Character definitions shared between researchers Associations between objects –Google uses associations (links) for ranking –Collections provide semantically rich associations E.g. images that are part of a character state associated with a particular taxon As amount of annotation grows –Quality of searching grows

Greg Riccardi: TDWG 06 Supported by NSF BDI 20 Technical Challenges User interface quality is crucial –Users will provide the least amount of data possible –Good tools make it easy for users to provide more data Searching the image space –Searching for characters and states –Implementing a variety of classifications, including custom and temporary classifications GUIDs and data handles are crucial Schemas and performance

Greg Riccardi: TDWG 06 Supported by NSF BDI 21 Acknowledgements Thanks to the Morphbank development and research team –Fredrik Ronquist, Austin Mast, Andrew Deans, David Gaitros, Neelima Jammingumpula, Wilfredo Blanco, Katja Seltmann, Karolina Maneva- Jakimoska, Steve Winner, Debra Paul, Peter Jorgensen Supporting Organizations –National Science Foundation, BDI panel –Florida State University School of Computational Science –NESCent National Evolutionary Synthesis Center Morphbank collaborators and contributors –Angiosperm AToL project, DigiMorph project, Electronic Field Guide project, Hymenoptera AToL project, Lepidoptera AToL project, MorphoBank project., Peabody Museum of Natural History, Robert K. Godfrey Herbarium Online Database Project at Florida State University, Specimen Image Database project, Drosophila morphogenetics project at Florida State University, PEET project Monographic Research in Parasitic Hymenoptera, ZooBank