C. Binding, K. May, R. Souza, D. Tudhope, A. Vlachidis


1 Semantic Technologies for Archaeology Resources: Results from the STAR Project
C. Binding, K. May¹, R. Souza, D. Tudhope, A. Vlachidis
Hypermedia Research Unit, University of Glamorgan
¹ English Heritage

2 STAR Project - Aims
Investigate semantic technologies for integrating and cross searching datasets and associated grey literature.
The current situation is one of fragmented datasets and applications, with different terminology systems.
There is a need for an integrative metadata framework. EH have designed an upper ontology based on the CRM standard.
STAR is a 3 year AHRC funded project in collaboration with English Heritage, which contributes datasets and domain expertise.
A more concrete aim is to demonstrate cross searching/browsing of domain specific datasets at a detailed, meaningful level.

3 STAR Project - General Architecture
Applications: Server Side, Rich Client, Browser
Web Services, SQL, SPARQL
RDF Based Semantic Layer (CRM / CRMEH / SKOS)
Indexing, Conversion, Data Mapping / Normalisation
STAN, RRAD, MoLAS, LEAP, RPRE, Grey literature, EH thesauri, glossaries

Grey literature: ongoing information extraction work involving GATE.
Datasets, thesauri and grey literature are all related to a common structure: the EH ontology (CRMEH) as overarching structure, with SKOS for controlled terminology (thesauri, glossaries).
Multiple disconnected databases: CRM is used as ‘semantic glue’ to pull the data together.
Data processing consists of: mapping, cleansing, normalisation, extraction, consolidation (to common ontology data layer).
Datasets: Raunds Roman, Raunds Prehistoric, Museum of London, Silchester Roman (LEAP - IADB), Stanwick sampling data.
Controlled vocabularies: Monument Types, Evidence, MDA Object Types, Building Materials, Archaeological Science, Timelines, EH Recording Manual glossaries.
Grey lit: Ontology-based Information Extraction applied to an extract of ADS OASIS reports (OASIS: Online AccesS to the Index of archaeological investigationS; ADS: Archaeological Data Service).
RDF layer: CIDOC Conceptual Reference Model (ISO 21127:2006) as an umbrella framework linking different datasets, grey literature and thesauri. The CRM-EH extension models the archaeological excavation process relating to stratigraphic relations and phasing information, finds recording and environmental sampling.
Services: Semantic search and browsing across datasets and grey literature.
Apps: Interactive research tools, cross platform/browser.

4 Grey Literature Information Extraction (Andreas Vlachidis, Renato Souza)
Looking to extract CRM-EH period, context, find and sample entities.
Aim to cross search within data.

5 Semantic Annotations to RDF triples
Example: “layers were found to contain a relatively large amount of broken, unabraded pottery”
The phrase generates three Semantic Annotation types aligned to the CRM-EH ontology: EHE1004.ContextFindDepositionEvent, EHE0005.Group, EHE0009.ContextFind

<crmeh:EHE0005.Group rdf:about="...">
  <dc:source rdf:resource="..." />
  <crm:P2F.has_type>
    <crm:E55.Type>
      <rdf:value>layers</rdf:value>
      <crmeh:EHP10F.is_represented_by rdf:resource="..." />
    </crm:E55.Type>
  </crm:P2F.has_type>
  <crm:P3F.has_note>
    <crm:E62.String>
      <rdf:value>...this deposit. These layers were found to cont...</rdf:value>
    </crm:E62.String>
  </crm:P3F.has_note>
</crmeh:EHE0005.Group>
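The shape of the annotation above can be reproduced with Python's standard library. This is only a sketch: the `rdf` and `dc` namespaces are the standard ones, but the `crm` and `crmeh` namespace URIs, the function name `annotation_to_rdf`, and every URI passed to it are placeholders, since the slide does not preserve the real values.

```python
import xml.etree.ElementTree as ET

RDF = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
DC = "http://purl.org/dc/elements/1.1/"
# Placeholder namespaces: the real CRM / CRM-EH URIs are not given in the slide.
CRM = "http://example.org/crm#"
CRMEH = "http://example.org/crmeh#"

for prefix, uri in (("rdf", RDF), ("dc", DC), ("crm", CRM), ("crmeh", CRMEH)):
    ET.register_namespace(prefix, uri)


def q(ns, local):
    """Build an ElementTree qualified name of the form {namespace}local."""
    return f"{{{ns}}}{local}"


def annotation_to_rdf(entity, about, source, term, concept, snippet):
    """Serialise one semantic annotation as RDF/XML (hypothetical helper)."""
    root = ET.Element(q(RDF, "RDF"))
    node = ET.SubElement(root, q(CRMEH, entity), {q(RDF, "about"): about})
    ET.SubElement(node, q(DC, "source"), {q(RDF, "resource"): source})

    # The annotated term and the thesaurus concept it is represented by.
    has_type = ET.SubElement(node, q(CRM, "P2F.has_type"))
    e55 = ET.SubElement(has_type, q(CRM, "E55.Type"))
    ET.SubElement(e55, q(RDF, "value")).text = term
    ET.SubElement(
        e55, q(CRMEH, "EHP10F.is_represented_by"), {q(RDF, "resource"): concept}
    )

    # The surrounding text snippet, kept as a note.
    has_note = ET.SubElement(node, q(CRM, "P3F.has_note"))
    e62 = ET.SubElement(has_note, q(CRM, "E62.String"))
    ET.SubElement(e62, q(RDF, "value")).text = snippet

    return ET.tostring(root, encoding="unicode")


xml_text = annotation_to_rdf(
    "EHE0005.Group",
    "http://example.org/annotation/1",   # placeholder subject URI
    "http://example.org/document/1",     # placeholder source document URI
    "layers",
    "http://example.org/concept/layer",  # placeholder thesaurus concept URI
    "...this deposit. These layers were found to cont...",
)
```

Building the tree through an XML library rather than by string concatenation keeps the output well-formed and handles escaping of the snippet text.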

6 Multiple views possible on underlying model
The previous prototype offered point to point browsing of the full CRM-EH ontology.
The current prototype presents a different search/browsing model in the user interface, based on use scenarios from STAR workshops.
Retains the benefits of CRM standard for inter-operability,

7 RDF Data Extraction Tool
RDF triples data is extracted from relational sources, using (Keith’s) mappings for guidance.
A custom data extraction tool transforms relational tabular data to RDF. The aim is greater consistency of extracted data.
The new (AHRC funded) project STELLAR aims to generalise and extend the STAR data extraction and mapping tool to facilitate use by third party data providers. The extracted data will be represented as CRM compliant Linked Data.
The application builds up each query by specifying one relationship from the model; here, selecting notes associated with contexts.
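As a rough illustration of the "one relationship per query" idea, the sketch below selects notes associated with contexts from a relational table and turns each row into a triple. It is not the STAR tool: the table and column names (`contexts`, `context_id`, `note`), the base URI, and the property URI are all invented for the example.

```python
import sqlite3

# Hypothetical URIs; the real STAR namespaces are not given in the slides.
BASE = "http://example.org/star/"
HAS_NOTE = "http://example.org/crm#P3F.has_note"


def extract_context_notes(conn):
    """Return (subject, predicate, object) triples, one per context note."""
    rows = conn.execute(
        "SELECT context_id, note FROM contexts "
        "WHERE note IS NOT NULL ORDER BY context_id"
    )
    # Subject URIs are minted in one place so they stay consistent across queries.
    return [(f"{BASE}context/{cid}", HAS_NOTE, note) for cid, note in rows]


# Small in-memory database standing in for a relational source.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE contexts (context_id INTEGER PRIMARY KEY, note TEXT)")
conn.executemany(
    "INSERT INTO contexts VALUES (?, ?)",
    [(1, "Dark silty layer"), (2, None), (3, "Pit fill")],
)
triples = extract_context_notes(conn)
```

Each further relationship from the model would get its own query function of the same shape, all sharing the same URI-minting pattern.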

8 Resultant extracted data (RDF/XML)
The resultant output data is valid RDF/XML (the application can also output N-Triples format). This approach to data extraction ensures consistent entity names, namespaces, URIs and overall syntax.
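For the N-Triples output, a minimal serialiser might look like the following. It is a sketch under assumptions: the function names and example URIs are hypothetical, only plain string literals are handled (no language tags or datatypes), and only the most common escapes are applied.

```python
def escape_literal(text):
    """Escape backslash, double quote and line breaks for an N-Triples literal."""
    return (
        text.replace("\\", "\\\\")
        .replace('"', '\\"')
        .replace("\n", "\\n")
        .replace("\r", "\\r")
    )


def to_ntriples(triples):
    """Serialise (subject, predicate, object, object_is_uri) tuples, one per line."""
    lines = []
    for subject, predicate, obj, obj_is_uri in triples:
        obj_term = f"<{obj}>" if obj_is_uri else f'"{escape_literal(obj)}"'
        lines.append(f"<{subject}> <{predicate}> {obj_term} .")
    return "\n".join(lines)


# Hypothetical URIs, for illustration only.
output = to_ntriples(
    [
        (
            "http://example.org/star/context/1",
            "http://example.org/crm#P3F.has_note",
            'Fill of "pit" 12',
            False,
        ),
        (
            "http://example.org/star/context/1",
            "http://example.org/crm#P2F.has_type",
            "http://example.org/concept/layer",
            True,
        ),
    ]
)
```

Routing every triple through one serialiser is what gives the consistency in URIs and syntax that the slide describes.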

9 STAR – Web Services and Client Applications
STAR Datasets: English Heritage thesauri (SKOS), Archaeological Datasets (CRM), Grey literature indexing
STAR Web Services: Full text search, Browse concept space, Navigate via expansion, Cross search archaeological datasets
STAR Client Applications: Windows applications, Browser components
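Navigation via expansion over a SKOS thesaurus can be pictured as following narrower-concept links outward from a starting term. The sketch below is an assumption about how such expansion might work, not the STAR service itself; the `NARROWER` hierarchy is made up for illustration and is not taken from the English Heritage thesauri.

```python
from collections import deque

# Invented sample hierarchy (concept -> narrower concepts), not real thesaurus data.
NARROWER = {
    "container": ["vessel"],
    "vessel": ["jar", "bowl"],
    "jar": [],
    "bowl": [],
}


def expand(term, narrower, max_depth=None):
    """Return the term plus its narrower concepts, breadth-first.

    max_depth limits how many levels of narrower links are followed;
    None means follow them all.
    """
    seen = [term]
    queue = deque([(term, 0)])
    while queue:
        current, depth = queue.popleft()
        if max_depth is not None and depth >= max_depth:
            continue
        for child in narrower.get(current, []):
            if child not in seen:
                seen.append(child)
                queue.append((child, depth + 1))
    return seen


all_terms = expand("container", NARROWER)
one_level = expand("container", NARROWER, max_depth=1)
```

A search for the starting term could then be widened to match any of the expanded terms, with `max_depth` controlling how far the expansion goes.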

10 STAR web browser based search interface
Screenshot panels: Search parameters, Group details, Context details, Search results, Sample details, Find details

11 Initial search
Search parameters

12 Context details

13 Context find details
Find details

14 Context sample details

15 Group details

16 Hypermedia Research Unit University of Glamorgan
Contact Information
Hypermedia Research Unit
University of Glamorgan
Pontypridd CF37 1DL
Wales, UK
C. Binding, K. May, R. Souza, D. Tudhope, A. Vlachidis
{cbinding, rsouza, dstudhope,

17 References
Binding C., Tudhope D., May K. Semantic Interoperability in Archaeological Datasets: Data Mapping and Extraction via the CIDOC CRM. Proceedings (ECDL 2008) 12th European Conference on Research and Advanced Technology for Digital Libraries, Aarhus, 280–290. Lecture Notes in Computer Science, 5173, Berlin: Springer. (preprint)
Binding C., Tudhope D. SKOS-based semantic web services: experiences from the STAR project. ISKO-UK KOnnecting KOmmunities Seminar: Sharing Vocabularies on the Web via SKOS, University College London.
May K., Binding C., Tudhope D. A STAR is born: some emerging Semantic Technologies for Archaeological Resources. Proceedings Computer Applications and Quantitative Methods in Archaeology (CAA2008), Budapest.
Tudhope D., Binding C., May K. Semantic interoperability issues from a case study in archaeology. In: Stefanos Kollias & Jill Cousins (eds.), Semantic Interoperability in the European Digital Library, Proceedings of the First International Workshop SIEDL 2008, 88–99, associated with 5th European Semantic Web Conference, Tenerife. (preprint)
Vlachidis A., Binding C., May K., Tudhope D. Excavating grey literature: A case study on rich indexing of archaeological documents by the use of Natural Language Processing techniques and knowledge based resources. Proceedings British Chapter of the International Society for Knowledge Organization (ISKO UK) Conference.

