Presentation is loading. Please wait.

Presentation is loading. Please wait.

Citing Archived Objects Daan Broeder MPI for Psycholinguistics DELAMAN meeting London 2006.

Similar presentations


Presentation on theme: "Citing Archived Objects Daan Broeder MPI for Psycholinguistics DELAMAN meeting London 2006."— Presentation transcript:

1 Citing Archived Objects Daan Broeder MPI for Psycholinguistics DELAMAN meeting London 2006

2 Referencing & Citing Information identifying the object –Unique identification Information where & how to find the object –Web & e. objects -> Information how to access it (click on a ref. in a document) Information about the object viewable in a document –Name, title,… emphasis on human readability –Proper acknowledgement to the author, depositor, hosting archive,… DELAMAN meeting London 2006

3 Identifying & Accessing the Object URL –Mixes id with protocol & location. In practice not a good idea because of linkrot http://myarchive/mycorpus/mysresource.wav PURL –Stable URLs, still mixes id and protocol http://purl.site.com/resources/abcbdbdf Handle System (CNRI) –Decouples identification from protocol/location 1839/00-0000-0000-0001-4C55-3 DELAMAN meeting London 2006

4 Accessing the Object URL –Easy access –Format & type info through mime-types but limited to that. Important for automatic tool deployment PURL –Easy access through HTTP redirect –Format & type as with URLs Handle System (DAM-LR) –Less easy access. Depends on separate resolver system –But a lot of flexibility, extra records allow: cater for object duplication via extra URLs extra type/format info. DELAMAN meeting London 2006

5 Human readable info In a document the id alone is not very informative: –http://corpus1.mpi.nl/qfs1/media- archive/dbd_data/boumans/Boumans_2003/Annotations/siblings.chahttp://corpus1.mpi.nl/qfs1/media- archive/dbd_data/boumans/Boumans_2003/Annotations/siblings.cha –1839/00-0000-0000-0001-4C55-3 Preferably also Citation info: –Depositor/Annotator: Louis Boumans –Project: DBD Corpus: Boumans 2003 –Archive: Max-Planck Institute for Psycholinguistics –…….

6 Citation Info Easy solution: Browsers & Doc readers know HTML. – Name: Hamadi sibling contact, Depositor/Annotator Louis Boumans, Project: DBD, Corpus: Boumans: 2003 http://.... Citation info downloadable from archive –Cut/paste, this makes it static. Citation info can be real-time updateable via archive service object id –Handle system supports extra records –Needs also be implemented in document viewer What kind of info makes sense for our kind of archive objects?

7

8 Citing object fragments Purpose: –Refer to parts or fragments of archive objects Encoding independent way!!! –Enable knowledgeable tools to interpret the fragment specification to visualise only the fragment Will be used in the ADDIT tool for commentary and relation drawing DELAMAN meeting London 2006

9 Citing object fragments Media file –Time segment or coordinate positions Text documents –Plain or formatted text via byte offsets –Mark-up like HTML/XML via X-Path specification Annotation files with complex structure (eaf) –Dependent on the format more difficult, information may be spread throughout the file –Need knowledge about internal structure DELAMAN meeting London 2006

10 Citing object fragments Example: hdl:1839/00-0000-0000-0001-4C55- 3#lines(10:20) Note: –Both “hdl” protocol and “lines” fragment identifier are not standard DELAMAN meeting London 2006

11 The End


Download ppt "Citing Archived Objects Daan Broeder MPI for Psycholinguistics DELAMAN meeting London 2006."

Similar presentations


Ads by Google