Presentation on theme: "CLARIN Technical Infrastructure PIDs - How far are we?"— Presentation transcript:
CLARIN Technical Infrastructure PIDs - How far are we?
Usage I assume that we have a recording of an extinct language and some annotations that tell us what someone said about medicine etc researchers create relations that need to be preserved Video Recording Sound Recording Annotations Recording Session from Repository A from Repository B from Repository C How long?
Usage II Biological and cultural processes have evolved together, in a symbiotic spiral; they are now indissolubly linked, with human survival unlikely without such culturally produced aids as clothing, cooked food, and tools. The twelve original essays collected in this volume take an evolutionary perspective on human culture, examining the emergence of culture in evolution and the underlying role of brain and cognition. The essay authors, all internationally prominent researchers in their fields, draw on the cognitive sciences -- including linguistics, developmental psychology, and cognition -- to develop conceptual and methodological tools for understanding the interaction of culture and genome. They go beyond the "how" -- the questions of behavioral mechanisms -- to address the "why" -- the evolutionary origin of our psychological functioning. What was the "X-factor," the magic ingredient of culture -- the element that took humans out of the general run of mammals and other highly social organisms? Several essays identify specific behavioral and functional factors that could account for human culture, including the capacity for "mind reading" that underlies social and cultural learning and the nature of morality and inhibitions, while others emphasize multiple partially independent factors -- planning, technology, learning, and language. The X- factor, these essays suggest, is a set of cognitive adaptations for culture. ePublication Repository 1 eResource Repository 2 How long?
Usage III eResource2 Repository 2 Ontology open registry How long? eResource1 Repository 1
Currently almost 1 Mio PIDs DBD_RIF_14_12_01_064 Dutch Bilingualism Database, Ethnic Dutch, Session 64 ………. http://corpus1.mpi.nl/qfs1/media-archive/dbd_data/boumans/T- Cult/Metadata/../Media/dbd_rif_14_12_01_064.wav ……….
The Problem could use Cool URIs as the W3C TAG suggests to do addresses change too often and we cannot influence that perhaps some exceptions such as http://www.isocat.org/datcat/DC-1708 http://www.isocat.org/datcat/DC-1708 ??? you just change one entry in a database but there is a price of course
Many Suggestions URLs:http:/www.mpi.nl/imdi/doc/white-paper all HTTP URIs:http://www.isocat.org/isodcr#12345 W3C URNs:urn:nbn:nl:ui:13-54321 EU Libs etc Handles:hdl:1839/00-0000-0000-0005-82B0-2 many ARKs:http://ark.cdlib.org/ark:/13030/ft4w10060w few XRIs:xri://broadview.library.example.com/ ? (urn:isbn:0-395-36341-1) PURLs:http://purl.oclc.org/OCLC/PURL/FAQ many DOI:Handles + Business Model Publisher OpenURLs:parameterized http-get requests ? InfoURIsintegrate legacy material into Web ? etc
Evaluation StandardRobust Software Resolution System Resolution Type Security Admin Assoc Info Cost URLRFC2616noyes (DNS)singleno URN:ISSNISO2397no ? URN:ISBNISO2108no ? URN:NBNRFC3188no ? ? PURLno yessingleno HandleRFC3650yes multipleyes little DOIZ39.84…yesyes (Handle) multipleyes large ARKno (yes)multiple(no)yes? info URIRFC3668no ? XRIno ? ?? simple decision: need to have something robust now without expensive business model and dependencies
How to do you need to be registered at the PID service as accepted and trusted partner (trusted partners are only those who can demonstrate that they have a proper repository system) you have a set of resources which have URLs these resources have registered metadata descriptions you request for these resources PIDs by submitting the requested information such as URLs, MD5, minimal MD etc you can do this either manually or via an API you get back the PIDs from the service you enter these PIDs in the metadata description field (now everyone can use it for reference purposes) whenever you change the URLs you need to adapt the entry (probably use a ready-made mover)
Associated information want to check authenticity before copying etc (MD5 field) want to add citation data info extracted from metadata records want to solve the problem of having several centers manipulating the Handle record without interference want to add a pointer to access permission information proper monitoring services in MPG and CLARIN some money to create robust services
Short Overview GWDG Service http://handle.gwdg.de:8080/pidservice Java-Documentation http://handle.gwdg.de/javadocs/ 11858/00-ZZZZ-0000-0000-000C-7 -> 'View Handle' http://www.gwdg.de/aktuell/index.html -> 'Find Handle' service will be given also to CLARIN and probably for other research Infrastructure initiatives in Europe