BioImage Database Project Director Image Bioinformatics Laboratory, Oxford e-Science Centre Department of Zoology, University of Oxford Oxford OX1 3PS,

Slides:



Advertisements
Similar presentations
OAI from 50,000 Feet OAI develops and promotes interoperability solutions that aim to facilitate the efficient dissemination of content. Begun in 1999.
Advertisements

IATUL Porto, May 21, 2006 DOI and e-Science Dr Anne E Trefethen Oxford e-Research Centre
The Messy World of Grey Literature in Cyber Security 8 th Grey Literature Conference 4-5 December 2006 New Orleans, Louisiana Patricia Erwin – I3P Senior.
NATIONAL LIBRARY OF MEDICINE PubMed Central Edwin Sequeira National Library of Medicine May 26, 2004.
Rclis in vision and reality Thomas Krichel
Imagining the Future. WORLD WIDE WEB Tim Berners-Lee invented the World Wide Web.World Wide Web A graduate of Oxford University, England, in 1989, Tim.
Specimen Image Database (SID) Vince Smith*, Simon Rycroft & Rod Page University of Glasgow, Glasgow, UK *Present Address: Illinois Natural History Survey,
MY NCBI (module 4.5). MODULE 4.5 PubMed/How to Use MY NCBI Instructions - This part of the: course is a PowerPoint demonstration intended to introduce.
MY NCBI (module 4.5). MODULE 4.5 PubMed/How to Use MY NCBI Instructions - This part of the: course is a PowerPoint demonstration intended to introduce.
MY NCBI (module 4.5). MODULE 4.5 PubMed/How to Use MY NCBI Instructions - This part of the: course is a PowerPoint demonstration intended to introduce.
S.J. Coles a*, M.B. Hursthouse a, R.A. Stephenson a, P. Cliff b, E. Lyon b, M. Patel b J. Downing c & P. Murray-Rust.
© S.J. Coles 2006 Usability WS, NeSC Jan 06 Enabling the reusability of scientific data: Experiences with designing an open access infrastructure for sharing.
Prototype Knowledge Base: an on-line information service in dependability and security Hugh Glaser Electronics & Computer Science University of Southampton.
© S.J. Coles 2006 Institutional Data Repositories for Chemistry Simon Coles School of Chemistry, University of Southampton, U.K.
5 th grade Computer Terminology. Minimize Minimize refers to reducing a window to an icon, or a label at the bottom of the screen in your task bar, while.
A Unified Approach to Combat Counterfeiting: Use of the Digital Object Architecture and ITU-T Recommendation X.1255 Robert E. Kahn President & CEO CNRI,
UDDI v3.0 (Universal Description, Discovery and Integration)
DDI3 Uniform Resource Names: Locating and Providing the Related DDI3 Objects Part of Session: DDI 3 Tools: Possibilities for Implementers IASSIST Conference,
GUID-1 Workshop Welcome and Introduction Donald Hobern GBIF Program Officer for Data Access and Database Interoperability February 2006.
4.01 How Web Pages Work.
UDDI, Discovery and Web Services Registries. Introduction To facilitate e-commerce, companies needed a way to locate one another and exchange information.
IBM Watson Research © 2004 IBM Corporation BioHaystack: Gateway to the Biological Semantic Web Dennis Quan
The Thomson Reuters CITATION CONNECTION Digital Library st March – 3 rd April 2014, Jasná David Horký Country Manager – Central and Eastern Europe.
Globally Unique Identifiers and Life Science Identifiers Dave Thau University of Kansas California Academy of Sciences
1 CS 502: Computing Methods for Digital Libraries Lecture 17 Descriptive Metadata: Dublin Core.
Co-funded by the European Union under FP7-ICT Alliance Permanent Access to the Records of Science in Europe Network Co-ordinated by aparsen.eu #APARSEN.
1 CS 502: Computing Methods for Digital Libraries Lecture 4 Identifiers and Reference Links.
Introduction to UDDI From: OASIS, Introduction to UDDI: Important Features and Functional Concepts.
Introductory Overview
Chinese-European Workshop on Digital Preservation, Beijing July 14 – Network of Expertise in Digital Preservation 1 Persistent Identifiers Reinhard.
Ben Szekely, IBM Cambridge Adtech © 2006 IBM Corporation TDWG GUID WorkshopFebruary 1, 2006 LSID as a Technology Overview, Participation and Related Projects.
Databases and Library Catalogs Global Index Medicus/Global Health Library PubMed Source Bibliographic Database: International Health and Disability.
Publishing and Visualizing Large-Scale Semantically-enabled Earth Science Resources on the Web Benno Lee 1 Sumit Purohit 2
The World Wide Web (abbreviated as WWW or W3 and commonly known as the Web) is a system of interlinked hypertext documents accessed via the Internet.
Integrating Live Plant Images with Other Types of Biodiversity Records Steve Baskauf Vanderbilt Dept. of Biological Sciences
The DOI Standard Nettie Lagace NISO Associate Director for Programs CEAL Workshop on Electronic Resources Standards and Best Practices March.
Joint Declaration of Data Citation Principles Notes [1] CODATA 2013: sec 3.2.1; Uhlir (ed.) 2012, ch 14; Altman &
1 CS 502: Computing Methods for Digital Libraries Lecture 19 Interoperability Z39.50.
LSIDs in a Nutshell Jun Zhao University of Manchester 1 st December, 2005.
Copyright OpenHelix. No use or reproduction without express written consent1.
TDWG Life Sciences Identifiers Applicability Statement Ben Richardson Review Manager, LSID Applicability Statement Western Australian Herbarium Department.
MyGrid/Taverna Provenance Daniele Turi University of Manchester OMII f2f Meeting, London, 19-20/4/06.
Globally Unique Identifiers in Biodiversity Informatics Kevin Richards Landcare Research NZ TDWG 2008.
Corporation For National Research Initiatives Technical Issues in Electronic Publishing Corporation for National Research Initiatives William Y. Arms.
© 2006 University of Kansas An LSID resolver for specimens and a digression into issues raised by the use of GUIDs Steve Perry
Enriched Knowledge Service Platform and Cross-Database Search September, 2015.
KAnOE: Research Centre for Knowledge Analytics and Ontological Engineering Managing Semantic Data NACLIN-2014, 10 Dec 2014 Dr. Kavi Mahesh Dean of Research,
Globally Unique Identifiers: What, why, when, which and what now? Dave Thau University of Kansas
1 CS 502: Computing Methods for Digital Libraries Guest Lecture William Y. Arms Identifiers: URNs, Handles, PURLs, DOIs and more.
Joint Declaration of Data Citation Principles (Overview) The Data Citation Synthesis Group Joint Declaration.
Life Science Identifiers Chris Wroe (based on material from myGrid team and IBM Life Sciences)
Welcome to the Protein Database Tutorial. This tutorial will describe how to navigate the section of Gramene that provides collective information on proteins.
XML and Distributed Applications By Quddus Chong Presentation for CS551 – Fall 2001.
Developing our Metadata: Technical Considerations & Approach Ray Plante NIST 4/14/16 NMI Registry Workshop BIPM, Paris 1 …don’t worry ;-) or How we concentrate.
Java Web Services Orca Knowledge Center – Web Service key concepts.
Introduction to Networking
ELIXIR Core Data Resources and Deposition Databases
Introduction to Persistent Identifiers
Application of the Internet
VI-SEEM Data Repository
Who is Tim Berners-Lee? Born 8 June 1955 in London (age 59)
A step-by-step guide to DOI registration
Tech introduction.
Disseminating Service Registry Records
Lívia Vasas, PhD 2018 The Nation Library of Medicine and its databases Mozilla Firefox or Google Chrome Lívia Vasas, PhD.
Lesson 3 Bioinformatics Laboratory
Educational Computing
Area: ABS Unit: Scientific Method and Research Lesson #4
Internet Vocabulary Terms
Fundamental Science Practices (FSP) of the U.S. Geological Survey
Presentation transcript:

BioImage Database Project Director Image Bioinformatics Laboratory, Oxford e-Science Centre Department of Zoology, University of Oxford Oxford OX1 3PS, UK David Shotton IDF Members Meeting 22/06/04 © David Shotton 2004 The BioImage Database and identifiers

The nature of bioinformatics databases Provide Open Access, free to academics, for data generated by publicly funded research Few databases have assured long-term funding; many run on a shoestring None are run for the purpose of generating profit, although some generate income to offset costs by selling services to commercial companies Data are rarely replicated between databases (with the exception of basic sequence and crystal structure data), so little need for multiple resolution Many add value to raw data by providing expert annotations, organizing data according to protein families, etc. Databases typically use distinct data models and lack interoperability Some are actively adopting new Semantic Web technologies All are ripe for the use of universal resolvable identifiers

The aims of the BioImage Database Project The aims of the BioImage Database ( funded by the European COmmission ORIEL project ( are: To be a searchable database of high-quality multidimensional research images of biological specimens, both raw and processes, with detailed supporting metadata concerning: the biological specimen itself the experimental procedure details of image formation and subsequent digital processing the people, institutions and funding agencies involved the curation and provenance of the image and its metadata To integrate such multi-dimensional digital image data with other life science resources by providing links to literature and factual databases To store, use and conform with standard external identifiers such as DOIs where these are available, particularly when referencing articles

The basic BioImage metadata model (with thanks to ) Cell or organism Experiment or study Image capture Image sets of multidimensional images, including videos Subject or specimen Researcher Photographer or microscopist Camera or microscope Experimental conditions or manipulations

The BioImage home page Note the hierarchical browse categories and the alternative Browse / Search arrangement

Search result, showing Studies

What are Life Science Identifiers? LSIDs have been developed by IBM and I3C (the Interoperable Informatics Infrastructure Consortium; to serve the life sciences They uniquely identify single digital objects They provide persistent URNs resolvable though normal DNS mechanisms They are location independent They permit provenance records (versioning) While developed for the life sciences, they are in fact completely generic

What do LSIDs look like? A five-part format: urn:lsid:Authority:Namespace:Object_ID[:Revision-ID] urn:lsid: This network identifier (NID) is a mandatory prefix Authority is the root DNS name of the issuing authority Namespace is chosen by the issuing authority and constrains the scope of the object Object_ID is an alphanumeric object ID unique to the namespace Revision is an optional version of the object For example: urn:lsid:bioimage.org:BIOIMAGE:76 refers to entry 76 in the BioImage Database urn:lsid:ncbi.nlm.nih.gov:pubmed: references a PubMed article urn:lsid:ncbi.nlm.nig.gov:GenBank:T48601:2 refers to the second version of an entry in GenBank

How do they work? LSIDs are in a sense just a sociological con trick, since they are nothing more than cheap and cheerful URNs They can be published and resolved, either over the Web using DNS mechanisms, or using Web Services protocols (UDDI, WSDL, SOAP) If an LSID names an abstract concept, such as a protein name, for which multiple relevant datasets may exist, that LSID will not have any byte data associated with it, but instead will have metadata pointing to other LSIDs that themselves name concrete versions of the object, e.g. the protein sequence, the crystal structure. Those LSIDs do name actual byte data They share many of the weaknesses of conventional Web mechanisms, particularly regarding security and access control

Who are using LSIDs? LSIDs wer first introduced in 2003 LSIDs are being adopted by highly respected and influential leaders in the bioinformatics Web Services and Semantic Web community, including Mark Wilkinson of BioMOBY ( Carol Goble of MyGrid ( and and Eric Neumann, Global Head of Knowledge Management for Aventis They form the resolution mechanism behind Haystack, the first Semantic Web browser, based on Eclipse (haystack.lcs.mit.edu)

Why are people choosing LSIDs LSIDs can be retro-fitted to existing databases, permitting you to convert your own internal identifiers into unique resolvable identifiers, without altering your existing naming system Open Source software exists to permit you to establish DNS and Web Services resolution services for your URNs, such that anyone addressing your URL can send an LSID and have returned an RDF document describing what that LSID represents Like being a Web publisher, anyone can become an LSID registration agency No central third-party registration agency is required, and there are no fees to pay This no-cost decentralized mechanism, while lacking many of the safeguards and refinements of DOIs, has the same ingredients for success as Tim Berners-Lees original Web protocols We have adopted LSIDs for the BioImage Database and will establish our own LSID resolution authority as soon as we go public

End