Integrating Live Plant Images with Other Types of Biodiversity Records Steve Baskauf Vanderbilt Dept. of Biological Sciences

Slides:



Advertisements
Similar presentations
Putting the Pieces Together Grace Agnew Slide User Description Rights Holder Authentication Rights Video Object Permission Administration.
Advertisements

What are GUIDs and Why Do We Need Them ??? Steve Baskauf Vanderbilt Dept. of Biological Sciences
OCLC Research TAI CHI Webinar 5/27/2010 A Gentle Introduction to Linked Data Ralph LeVan Sr. Research Scientist OCLC Research.
RDF Tutorial.
Semantic Web Introduction
Linked Data for Libraries, Archives, Museums. Learning objectives Define the concept of linked data State 3 benefits of creating linked data and making.
Ontology Notes are from:
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
The KB on its way to Web 2.0 Lower the barrier for users to remix the output of services. Theo van Veen, ELAG 2006, April 26.
Building a Digital Library with Fedora International Conference on Developing Digital Institutional Repositories Hong Kong December 9, 2004.
Introduction to HTML 2006 CIS101. What is the Internet? Global network of computers that are connected and communicate via a series of Protocols Protocols.
IST 221 Internet Concepts and Applications Internet, WWW and HTML 1.
Introduction to HTML 2004 CIS101. What is the Internet? Global network of computers that are connected and communicate via a series of Protocols Protocols.
The RDF meta model: a closer look Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations.
Center for Environmental Studies Arizona State University Digital Research Records at Center for Environmental Studies Peter McCartney.
Topics in this presentation: The Web and how it works Difference between Web pages and web sites Web browsers and Web servers HTML purpose and structure.
Module 2b: Modeling Information Objects and Relationships IMT530: Organization of Information Resources Winter, 2007 Michael Crandall.
CORE 2: Information systems and Databases HYPERTEXT/ HYPERMEDIA.
Web Programming Language Dr. Ken Cosh Week 1 (Introduction)
SERNEC Image/Metadata Database Goals and Components Steve Baskauf
INTRODUCTION TO WEB DATABASE PROGRAMMING
IT 210 The Internet & World Wide Web introduction.
Do LSIDs Need to Resolve? Joel Sachs
RDF (Resource Description Framework) Why?. XML XML is a metalanguage that allows users to define markup XML separates content and structure from formatting.
IDs in and out of the database Entomological Collections Network (ECN) 2012 November 10 – 11, Knoxville, TN Debbie Paul, Greg Riccardi.
RDA and Linking Library Data VuStuff III Conference Villanova University, Villanova, PA October 18, 2012 Dr. Sharon Yang Rider University.
Chapter 16 The World Wide Web Chapter Goals Compare and contrast the Internet and the World Wide Web Describe general Web processing Describe several.
Server-side Scripting Powering the webs favourite services.
16-1 The World Wide Web The Web An infrastructure of distributed information combined with software that uses networks as a vehicle to exchange that information.
CPS120: Introduction to Computer Science The World Wide Web Nell Dale John Lewis.
Internet Basics Dr. Norm Friesen June 22, Questions What is the Internet? What is the Web? How are they different? How do they work? How do they.
INF 384 C, Spring 2009 Ontologies Knowledge representation to support computer reasoning.
HOW WEB SERVER WORKS? By- PUSHPENDU MONDAL RAJAT CHAUHAN RAHUL YADAV RANJIT MEENA RAHUL TYAGI.
Full implementation of GUIDs at SERNEC institutions: A strategy that accommodates institutions of varying sizes and complex resource relationships Steven.
© 2012 IBM Corporation Best Practices for Publishing RDF Vocabularies Arthur Ryman,
Lesson 7 – World Wide Web. What is the World Wide Web?  The content of the worldwide web is held on individual web pages gathered together to form websites.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
Information Interchange on the Semantic Web an interactive talk by Piotr Kaminski, University of Victoria
LSIDs in a Nutshell Jun Zhao University of Manchester 1 st December, 2005.
INTRODUCTION TO WEB APPLICATION Chapter 1. In this chapter, you will learn about:  The evolution of the Internet  The beginning of the World Wide Web,
Alternative Architecture for Information in Digital Libraries Onno W. Purbo
TDWG Life Sciences Identifiers Applicability Statement Ben Richardson Review Manager, LSID Applicability Statement Western Australian Herbarium Department.
World Wide Web “WWW”, "Web" or "W3". World Wide Web “WWW”, "Web" or "W3"
Overview PlantCollections – Publish information about public garden collections – Using existing infrastructure Morphbank – Goals and capabilities of.
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
Introduction to the Semantic Web and Linked Data
The RDF meta model Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations of XML compared.
© 2006 University of Kansas An LSID resolver for specimens and a digression into issues raised by the use of GUIDs Steve Perry
Acronym Soup GBIF, TDWG & GUIDs Jerry Cooper. Global Biodiversity Information Facility (GBIF) Established in 2000 through non-binding MOU (25 countries.
THE SEMANTIC WEB By Conrad Williams. Contents  What is the Semantic Web?  Technologies  XML  RDF  OWL  Implementations  Social Networking  Scholarly.
Web Authoring with Dreamweaver. Unit Objectives  Be able to define keywords: HTML, HTTP (protocol), browser, web server, client/server, tag, attribute,
Interoperability How to Build a Digital Library Ian H. Witten and David Bainbridge.
AJAX. Ajax  $.get  $.post  $.getJSON  $.ajax  json and xml  Looping over data results, success and error callbacks.
Steven Perry Dave Vieglais. W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics Overview WASABI is a framework for.
DC Architecture WG meeting Wednesday Seminar Room: 5205 (2nd Floor)
Course on persistent identifiers, Madrid (Spain) Information architecture and the benefits of persistent identifiers Greg Riccardi Director Institute for.
Information Networks. Internet It is a global system of interconnected computer networks that link several billion devices worldwide. It is an international.
Tanenbaum & Van Steen, Distributed Systems: Principles and Paradigms, 2e, (c) 2007 Prentice-Hall, Inc. All rights reserved DISTRIBUTED SYSTEMS.
XML and Distributed Applications By Quddus Chong Presentation for CS551 – Fall 2001.
Dublin Core Basics Workshop Lisa Gonzalez KB/LM Librarian.
Setting the stage: linked data concepts Moving-Away-From-MARC-a-thon.
Web Programming Language
Linked Data Web that can be processed by machines
Introduction to Persistent Identifiers
Introduction to Persistent Identifiers
E-commerce | WWW World Wide Web - Concepts
Middleware independent Information Service
E-commerce | WWW World Wide Web - Concepts
Cataloging the Internet
Introduction to World Wide Web
Presentation transcript:

Integrating Live Plant Images with Other Types of Biodiversity Records Steve Baskauf Vanderbilt Dept. of Biological Sciences August 3, 2010

I. Challenges in Biodiversity Informatics Common interest in databasing metadata. Metadata describe resources and their properties. Resource: anything that can be assigned an identifier (e.g. a tree, a specimen, an image, a taxon, a name, etc.) Property: a string literal that describes the resource or a relationship between the subject resource and some other resource.

Example: Vanderbilt Arboretum 5935 identified and geolocated trees

Example subject resource (a tree) “native” literal property relationship property object resource (an image) text string establishmentMeans depiction

Relationship “graph” the tree (7-314) image (79657) “native” establishmentMeans depiction Tree IDEstablishment Means Image ID 7-314native native cultivated79684 Traditional database (typical for specimens)

Non-“flat” relationships in live-plant imaging live tree whole tree image leaf image bark image determination taxon standardized views Baskauf and Kirchoff (2008) Vulpina 7:16-30

Duplicate herbarium specimens live tree specimen image duplicate herbarium specimen at institution B herbarium specimen at institution A live tree same individual determination A taxon A determination B taxon B

live tree (individual organism) whole tree image leaf image bark image determination A taxon A specimen image herbarium specimen determination B taxon B Complex relationships individual-based organization system Baskauf (2010) Biodiversity Informatics 7:17-44

II. Building blocks of a Web-based metadata system 1.We need to be able to unambiguously identify the resources (globally unique identifiers =GUIDs) 2.We need standardized property definitions (e.g. Darwin Core terms) 3.We need a technological solution for communicating properties and relationships to a user anywhere (RDF/XML representation sent to user via the Internet) design principles

Building block #1: GUIDs A globally unique identifier (GUID) should be: 1.globally unique 2.actionable 3.persistent Anyone on the planet should be able to use the GUID to find out about the particular thing that it identifies, forever. That is a pretty tall order (but you can do it)!!!

1. How do you make an identifier globally unique? Create a locally unique identifier: – identifier (catalog number) unique within a collection, e.g. GIS tree ID number: – namespace (collection code) unique within the institution, e.g. vanderbilt vanderbilt/7-314 Make it globally unique by appending a domain name that you control, e.g. bioimages.vanderbilt.edu

Complete HTTP URI GUID combine “ with other pieces: This identifier looks like a URL! An HTTP URI is a uniform resource identifier as well as a resource locator (web address=URL).

2. What does actionable mean? Something happens when you put an actionable GUID in a Web browser (GUID is “resolved”). HTTP URIs –unlike LSIDs and DIOs, they work in any web browser –resolved using existing Internet infrastructure –consensus GUID of Linked Data (Semantic Web) community –

3. Persistent URIs always work URIs “break”: when filenames change: Javascript based URI: Independent of method: Both URIs eventually lead to the same page, but the second URI is simpler and won’t change. URIs “break”: when domain names disappear bioblitznashville.org vs. vanderbilt.edu Planning for URI permanence is important.

How long is “persistent”? Forever is a pretty long time. The Internet is only 40 years old and the Web only 20. Plan for your institution and domain name to last at least 10 years. Don’t change the URI of anything that you are trying to identify!

Building block #2: Standardized property definitions Recent consensus on metadata terms: Dublin Core Metadata Initiative (DCMI) = describes generic resources Friend-Of-A-Friend (FOAF) = describes people and their affiliations Darwin Core (DwC) = describes biodiversity resources Media Resources Task Group (MRTG) = describes media (e.g. images) in a biodiversity context

A property described by a metadata term: is an HTTP URI, e.g. has a definition that can be accessed via the Internet has an abbreviated form that usually makes sense to humans dwc: = so the abbreviated URI for the term is dwc:establishmentMeans

subject resource (tree) “native” object resource (image) establishmentMeans depiction native dwc:establishmentMeans foaf:depiction Resource Description Framework (RDF) graph Building block #3: Communicating relationships

native dwc:establishmentMeans foaf:depiction Resource Description Framework (RDF) graph RDF in XML format (a tiny snippet) native How do you translate relationships into language a computer can understand?

III. Why use a new way to describe metadata? People are good at figuring out what web pages mean. Computers (like a GoogleBot) have to guess what the information on a web page means. The Semantic Web (a.k.a. Web 2.0) provides a means to provide information to computers explicitly.

Content Negotiation, part 1 “I am a human. Send me web server GET MIME type: text/html I cannot send this guy a tree! Web page

Content Negotiation, part 2 web server GET MIME type: application/rdf+xml ! XML file “I am a computer. Send me

What’s so great about this? A computer can crawl the Web and discover metadata about resources that are identified by HTTP URI GUIDs. RDF metadata from many sources can be assembled into a database (RDF “triple store”). The database can be searched or used to generate web content. Source data does not need to be “sent” to the database; any “semantic web client” can retrieve it at will. The format is standard, no special communication protocols are required.

Why would this benefit me now? RDF/XML metadata files for numerous resources can be transformed directly into web pages using a single program file. single web page using XSLT and/or AJAX

Benefits (cont.) Branding in the URI.

Benefits (cont.) HTTP URI GUIDs provide direct access to metadata about a resource to anyone with Internet access. –Clickable attribution link in website –Reference link in publication PDF –Physical QR codes for Smart Phone access

QR code on a museum display