Archaeology and Terminology Ceri Binding Hypermedia Research Unit, University of Glamorgan, Wales, UK



Translation: “I am not in the office at the moment. Send any work to be translated”
Translation: “Pedestrians look left”
[Pictures from BBC news website]
Any process needs (human) validation...

STAR project - overview
- AHRC funded project in collaboration with English Heritage Centre for Archaeology, Portsmouth
- Aim: to investigate the potential of semantic technologies for widening access to digital archaeology resources, including disparate datasets and associated grey literature.

STAR - general architecture
[Architecture diagram; components as labelled on the slide]
- Sources: archaeological datasets (RRAD, RPRE, LEAP, STAN, IADB); grey literature reports; EH thesauri, glossaries
- Processing: data mapping / normalisation; indexing; conversion (SKOS)
- RDF-based common ontology data layer (CRM / CRMEH / SKOS)
- Data access layer: web services, SQL, SPARQL
- Applications: server side, rich client, browser

The Archaeological Archipelagos [Keith May, English Heritage]

English Heritage controlled vocabularies
- 27 glossaries, from English Heritage recording manuals (2006)
- 6 main thesauri used:
  - Monument Types thesaurus
  - Archaeological Sciences thesaurus
  - Evidence thesaurus
  - Main Building Materials thesaurus
  - MDA Object Types thesaurus
  - Timelines thesaurus
- Converted to SKOS format for use within STAR

Expressive vs. controlled vocabulary

“…how many of those writing [grey literature] reports would think to describe what they are recording/writing about using the same thesauri? […] it would have been a lot quicker and easier if standardised terminology had been used in the report text when describing types of monument, event and artefact, as well as dates/periods etc.” [G. Falkingham]

“Grey Literature is very often the only place where field workers have any opportunity to engage in creating their own narrative of the site, both of the archaeological event and of the archaeological story of the site itself. I think it would be throwing the baby out with the bath water to concentrate solely on the data without continuing to offer highly skilled and experienced fieldworkers the opportunity to actually tell us what they think the data means...” [S. Jeffrey]

Descriptive, semi-controlled vocabulary…

Column headings shown: Deposit Colour, Deposit Texture, Deposit Compaction

Colour values as recorded: (Reddy) Brown; 9Reddy) brown; Brown; Brown red; Brown/reddy; Dark brown; Dark brown/orange; Dark grey brown; Dark orange brown; Dark orange brown with darker patches; Dark orange loam; Dark orange/brown; Dark red brown; Grey brown; Grey/brown; Light brown; Light yellow brown; Medium brown; Mid brown; Mid red brown; Orange brown; Orange/brown; Orangy brown; Orangy brown, very light brown on edges and sides of profile; Red /brown; Red brown; Red/brown; Reddish brown; Reddy brown; Varies; Very light brown; White; Yellow brown; Yellow/orange brown

Compaction values as recorded: Firm; Friable; Friable to loose; Friable/loose; Friable-loose; Loose; Loose/friabe; Loose/friable; Plastic; Sticky; Sticky (wet); Sticky/firm; Varies

Worst of all worlds?

“…another of my examples has something about some flint that is ‘snuff coloured’ & I don’t know if I’ve ever seen snuff, let alone know what colour it is, or might have been over 150 years ago, and I would think it would make sense to take some kind of integrated approach from the outset, rather than the usual ‘bricolage’ of having one route for the archivists, another for those interested in searching spreadsheets, another for people interested in googling graphics, etc.” [G. Carver]

Terminology control for time periods
- Centuries
- BC / AD years
- 3 age system
- Monarchs / Roman emperors
- Cultural styles
- Geological periods
- Prefixes: pre, post, mid etc.
- Any combinations of these

Time period alignment – data cleansing / semantic enrichment
[Table with columns Object No, Period, MIN YEAR, MAX YEAR; the numeric values did not survive in the transcript. Legible Period entries include: “Mid 1st century AD”, “First half 1st centu”, “Mid first century AD”, “?1st century AD”, “Medieval”, “Romano-British”, “Modern?”, “post-mediaeval”]
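As a rough illustration of the cleansing step, the sketch below derives MIN YEAR / MAX YEAR values from free-text century expressions like those in the Period column. It is not the STAR implementation. The function name `century_range`, the use of negative numbers for BC years, the default of AD when no era is given, and the decision to ignore qualifiers such as “Mid” or “?” are all assumptions made for this example.

```python
import re

# Hypothetical helper, not part of STAR: spelled-out ordinals we recognise.
WORD_ORDINALS = {"first": 1, "second": 2, "third": 3, "fourth": 4, "fifth": 5}

CENTURY = re.compile(
    r"\b(?:(\d+)(?:st|nd|rd|th)|(first|second|third|fourth|fifth))"
    r"\s+century(?:\s+(AD|BC))?\b",
    re.IGNORECASE,
)

def century_range(text):
    """Return (min_year, max_year) for an 'Nth century [AD|BC]' expression.

    Assumptions: BC years are negative, a missing era means AD, and
    qualifiers ('Mid', 'First half', '?') are ignored, so the whole
    century is returned. Returns None when no century is recognised.
    """
    m = CENTURY.search(text)
    if not m:
        return None
    n = int(m.group(1)) if m.group(1) else WORD_ORDINALS[m.group(2).lower()]
    if n < 1:
        return None
    era = (m.group(3) or "AD").upper()
    if era == "AD":
        return ((n - 1) * 100 + 1, n * 100)
    return (-(n * 100), -((n - 1) * 100 + 1))

for value in ["Mid 1st century AD", "Mid first century AD", "Medieval"]:
    print(value, "->", century_range(value))
```

Named periods such as “Medieval” or “Romano-British” return None here; they would need a lookup against a period thesaurus rather than pattern matching, and truncated values such as “First half 1st centu” are left for human review.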

Time period relationships
[Diagram: periods positioned along a time axis relative to a reference period P1]
- occurs before P1*
- meets P1
- overlaps P1
- starts P1*
- equal to P1*
- occurs during P1*
- finishes P1*
- overlapped by P1
- met by P1
- occurs after P1*
- includes P1*
- started by P1*
- finished by P1*
[*Transitive]
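The thirteen relationships above can be decided from start and end years alone. The following is a minimal sketch, assuming each period is a `(start, end)` pair of numbers with `start < end`; the function name `relation_to` is hypothetical and the returned strings reuse the slide's labels.

```python
def relation_to(p, p1):
    """Name the relationship of period p to reference period p1.

    Both arguments are (start, end) pairs with start < end (an
    assumption of this sketch). Exactly one label is returned.
    """
    s, e = p
    s1, e1 = p1
    if not (s < e and s1 < e1):
        raise ValueError("each period needs start < end")
    if e < s1:
        return "occurs before"
    if e == s1:
        return "meets"
    if s > e1:
        return "occurs after"
    if s == e1:
        return "met by"
    if s == s1 and e == e1:
        return "equal to"
    if s == s1:
        return "starts" if e < e1 else "started by"
    if e == e1:
        return "finishes" if s > s1 else "finished by"
    if s > s1 and e < e1:
        return "occurs during"
    if s < s1 and e > e1:
        return "includes"
    # Remaining cases are partial overlaps.
    return "overlaps" if s < s1 else "overlapped by"

print(relation_to((5, 15), (10, 20)))   # overlaps
print(relation_to((12, 15), (10, 20)))  # occurs during
```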

Time Period Comparison – Closeness Calculation
[Diagram: periods P1, P2 and P3 on a time axis, with spans labelled IU, MP, NMP and D]
Match(P1, P2) = W1 × (MP / IU) + W2 × (IU / (NMP + IU)) + W3 × (IU / (D + IU))
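The formula can be evaluated directly once the four quantities are known. The transcript does not define MP, IU, NMP or D, so the sketch below treats them as plain non-negative numbers read off the diagram. The function name `match_score` and the default of three equal weights are assumptions for illustration only.

```python
def match_score(mp, iu, nmp, d, weights=(1 / 3, 1 / 3, 1 / 3)):
    """Evaluate W1*(MP/IU) + W2*(IU/(NMP+IU)) + W3*(IU/(D+IU)).

    The inputs are taken as given; how they are measured from two
    periods is not specified in the slide. Equal default weights are
    an assumption of this sketch.
    """
    if iu <= 0:
        raise ValueError("IU must be positive")
    if min(mp, nmp, d) < 0:
        raise ValueError("MP, NMP and D must be non-negative")
    w1, w2, w3 = weights
    return w1 * (mp / iu) + w2 * (iu / (nmp + iu)) + w3 * (iu / (d + iu))

# When MP equals IU and both NMP and D are zero, every term is 1,
# so the score equals the sum of the weights.
print(match_score(mp=100, iu=100, nmp=0, d=0))
print(match_score(mp=50, iu=100, nmp=50, d=0, weights=(0.5, 0.25, 0.25)))
```

With weights that sum to 1 the score stays between 0 and 1 as long as MP does not exceed IU, which makes scores for different candidate periods directly comparable.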

SKOS Concepts + CRM Entities
Time period concepts also have implicit spatio-temporal context
[Diagram; node and edge labels as shown on the slide]
- SKOS: skos:Concept, skos:broader
- CRM classes: crm:E2.TemporalEntity, crm:E4.Period, crm:E52.Time-Span, crm:E53.Place
- Typing and hierarchy: rdf:type, rdfs:subClassOf
- Context properties: crm:P4F.has_time-span, crm:P7F.took_place_at
- Temporal relation properties: crm:P115F.finishes, crm:P116F.starts, crm:P118F.overlaps, crm:P119F.meets

Time period alignment – data processing Align data relative to closest period concepts from English Heritage ‘Timelines’ thesaurus

Time period alignment - results
Data records relative to closest ‘known’ periods

Data aligned to closest ‘known’ periods

Timeline service test client

Semantic enrichment
Borderline between data cleansing and data creation…

Example free-text value: “Possibly fragment of belt buckle or nail”

Candidate thesaurus entries:
- BELT
- Belt Clasp -> use STRAP FITTING
- BUCKLE
- Buckle Plate -> use BUCKLE
- NAIL
- HOBNAIL
- SHOEING NAIL

“The single most useful thing you can do to ensure the long-term preservation of your data is to plan for it to be re-used” [Archaeology Data Service]
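The lookup implied by this slide can be sketched as follows: find thesaurus terms in a free-text value and resolve non-preferred terms through their “use” reference. The term lists come from the slide; the names `PREFERRED`, `USE`, `resolve` and `candidates`, and the whole-word matching rule, are assumptions of this example rather than the STAR tooling.

```python
import re

# Terms taken from the slide; the data structures are illustrative.
PREFERRED = {"BELT", "BUCKLE", "NAIL", "HOBNAIL", "SHOEING NAIL", "STRAP FITTING"}
USE = {"BELT CLASP": "STRAP FITTING", "BUCKLE PLATE": "BUCKLE"}

def resolve(term):
    """Map a term to its preferred form, or None if it is unknown."""
    key = " ".join(term.upper().split())
    if key in PREFERRED:
        return key
    return USE.get(key)

def candidates(text):
    """Return preferred terms whose labels occur as whole words in text."""
    upper = text.upper()
    found = set()
    for label in PREFERRED | set(USE):
        if re.search(r"\b" + re.escape(label) + r"\b", upper):
            found.add(resolve(label))
    return sorted(found)

print(candidates("Possibly fragment of belt buckle or nail"))
print(resolve("Belt Clasp"))
```

The output is only a list of suggestions. Choosing between BELT, BUCKLE and NAIL for a record described as “possibly” one of them is an interpretive decision, which is why the slide places this on the borderline between cleansing and creating data.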

Aligning controlled vocabularies
Different scope notes, same concepts? Different thesauri, same concepts?
- RCHME Monument Types: SARCOPHAGUS, SUNDIAL, WALL PAINTING, WHIPPING POST
- Archaeological Objects: SARCOPHAGUS, SUNDIAL, WALL PAINTING, WHIPPING POST
- RCHMS Monument Types
- RCHMW Monument Types

STAR general architecture
- STAR datasets: English Heritage thesauri (SKOS); archaeological datasets (CRM); grey literature indexing
- STAR web services
- STAR client applications: Windows applications; browser components; full text search; browse concept space; navigate via expansion; cross search archaeological datasets

Windows Client Applications
- Browse available thesauri
- Search across multiple thesauri
- Navigate via semantic expansion

Interactive tools to aid data entry

- Interactive selection from glossary/thesaurus concepts
- Filtered to concepts actually used in indexing
- Group / context types – from (enhanced) cuts and deposits glossary
- Context find materials – from building materials thesaurus
- Context find types – from MDA Object types thesaurus
- Context sample types – from existing data values...
Controlled types used in main search interface

Interactive tools to aid data entry

Summary
- Tension between expressive vs. controlled vocabulary, but general agreement on benefits of control
- Better coordination and alignment of controlled vocabularies would be beneficial
- Web services and interactive tools to aid data entry and search
- Issues encountered are not about particular technologies – more fundamental KO issues


Accommodating Approximation
crm:E52.Time-Span – modelling uncertainty
[Diagram: an approximate time period with zones of uncertainty at each end]
- Boundary dates: earliest start date, latest start date, earliest end date, latest end date
- crm:P81F.ongoing_throughout
- crm:P82F.at_some_time_within
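One way to read the four boundary dates is sketched below. It assumes the usual CIDOC CRM reading of the two properties: “ongoing throughout” is the span the period certainly covered (latest start to earliest end), and “at some time within” is the outer envelope (earliest start to latest end). The class name `ApproximatePeriod` and its field names are hypothetical.

```python
from dataclasses import dataclass
from typing import Optional, Tuple

@dataclass(frozen=True)
class ApproximatePeriod:
    """A period whose start and end are each known only within a range."""
    earliest_start: int
    latest_start: int
    earliest_end: int
    latest_end: int

    def __post_init__(self):
        if self.earliest_start > self.latest_start:
            raise ValueError("earliest_start must not exceed latest_start")
        if self.earliest_end > self.latest_end:
            raise ValueError("earliest_end must not exceed latest_end")

    def at_some_time_within(self) -> Tuple[int, int]:
        """Outer bounds: the period lies somewhere inside this span."""
        return (self.earliest_start, self.latest_end)

    def ongoing_throughout(self) -> Optional[Tuple[int, int]]:
        """Inner core the period certainly covered, or None if the
        uncertainty ranges leave no such core."""
        if self.latest_start > self.earliest_end:
            return None
        return (self.latest_start, self.earliest_end)

# Illustrative numbers only, not taken from the slides.
p = ApproximatePeriod(40, 50, 400, 420)
print(p.at_some_time_within(), p.ongoing_throughout())
```

Keeping both spans lets a search distinguish records that definitely fall in a period from those that only possibly do.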