Presentation is loading. Please wait.

Presentation is loading. Please wait.

BT Exact Technologies - Adastral Park, Ipswich July - October 2003 Linguistic Web Services for Semantic Web Dr. Vassil T. Vassilev London Metropolitan.

Similar presentations


Presentation on theme: "BT Exact Technologies - Adastral Park, Ipswich July - October 2003 Linguistic Web Services for Semantic Web Dr. Vassil T. Vassilev London Metropolitan."— Presentation transcript:

1 BT Exact Technologies - Adastral Park, Ipswich July - October 2003 Linguistic Web Services for Semantic Web Dr. Vassil T. Vassilev London Metropolitan University BT Short Term Research Fellowship

2 Part I Semantic Web and Linguistic Data Processing

3 Content 1 Project Background: Semantic Web and NLP RDF 2 RDF – Lingua Franca of Semantic Web 3 The need for linguistic support of Semantic Web WordNet: 4 WordNet: Universal Linguistic Resource u u WordNet as a model of the word semantics u u WordNet as an online thesaurus u u WordNet as a relational database Step One: 5 Step One: Putting WordNet on the Web Step Two: 6 Step Two: Extending WordNet Step Three: 7 Step Three: LinguaShare Problems and Directions 8 Problems and Directions

4 1 Project Background: Semantic Web and NLP Semantic Web: Model-driven framework for semantically rich data processing over the Web – u RDF u RDF – Dublin Core (1999), W3C (1999) u DAML u DAML – DARPA (2000); OIL – FP5 (2000)http://www.w3c.org/2001/sw/http://www.dublincore.org/documents/dces/ Semantic Thesaurus: Linguistic database containing word meanings and semantic relations u WordNet u WordNet – George Miller, Princeton Univ. (1990) u EuroWordNet u EuroWordNet – FP4 (1997); BalkaNet – FP5 (2000)http://www.cogsci.princeton.edu/~wn/http://www.hum.uva.nl/~ewn#EuroWordnet

5 1.1. Semantic data processing over the Web u Syntactic markupRDFTopic Maps u Syntactic markup of the data (RDF,Topic Maps) u meta-language RDFSDAML u Using a kind of a meta-language (schema) for providing intended semantics of the data represented (RDFS, DAML) u ontologies KIFOILOWL u Specify domain ontologies for representing the restrictions, dependencies, regularities and rules for inference (KIF, OIL, OWL)

6 Layer Cake (McGuiness, 2002)

7 1.2. Computer-based semantic thesaurus u Explaining the meaning of the words u Finding other words with the (synonyms) u Finding other words with the same meaning (synonyms) u Finding of other words synonymous usage) u Finding of other words with similar meaning in the same context (synonymous usage) u semantic referencing u Finding of semantically independent, related or dependent word forms (semantic referencing)

8 EXAMPLE: Type inference through analysis of the argument structure of verb phrases and their syntactic appearance in texts: u u The varieties of argument structure for EVENT-verbs suggests seven major subtypes: PHENOMENON, ASPECTUAL, STATE, ACT, PSYCHOLOGICAL_EVENT, CHANGE and CAUSE_CHANGE u u Based on them, we can differentiate COGNITIVE_EVENT (experiencer is syntactic subject, e.g. fear) from ACT (experiencer is syntactic object, e.g., frighten) Determining ontological information using lexical information

9 1.3 Project definition Aims: u utilizing the full potential of WordNet multilingual thesauri as an universal linguistic ontology for semantic verification of specialist terminology u embedding it in applications for semantic data processing over the Web u using contemporary Semantic Web Services technologies and tools Methodology: u Analytical research (WordNet) u Modeling (relational models, UML) u Software prototyping (Tomcat, MySQL)

10 2 RDF – Lingua Franca of Semantic Web  Language to describe resources primarily on the Web (has semantics); can be used not only on the Web – e.g. Dublin Core for library catalogues  Use XML as a syntax representation of RDF statements (serialization syntax); there are alternative serializations (e.g. triplets), but XML is the most popular  The language can formulate statements about the language itself (meta-description); RDF Schema or RDFS  The statements can be stored, processed and transported over the Web (data persistence)

11 2.1 RDF Model Resources – Things being described by RDF expressions. Resources are named by URIs Examples: HTML document, XML element within the document, Collection of pages, Book Properties – Specific attributes or relations used to describe a resource. Attributes and relations can be also used as resources. Examples: Creator, Title, Name Values – Simply literals or references to resources Statements, e.g.Predicate(Property)  Subject(Resource) Object(Value) Subject(Resource) Object(Value)

12 Example “Vassil Vassilev whose e-mail is v.vassilev@londonmet.ac.uk is the creator of web page http://www.lgu.ac.uk/~vassil/index.html” Subject (Resource): ‘http://www.lgu.ac.uk/~vassil/index.html’ Predicate (Property): ‘Creator’ Object (Value): ‘Vassil Vassilev’

13 Graphical representation

14 Serialized representation in XML < <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:dc="http://purl.org/dc/elements/1.1/” xmlns:vcard="http://imc.org/vCard/3.0#"> http://www.lgu.ac.uk/~vassil/index.htm” <rdf:Description about=“http://www.lgu.ac.uk/~vassil/index.htm” Vassil Vassilev v.vassilev@londonmet.ac.uk

15 2.2 Semantic Web Applications  Context-based Information Retrieval (search after semantic patterns)  Personalized Information Delivery (data presentation based on user profiles)  User tracking (dynamic construction of user profiles based on log analysis)  Document summarizing (text generation based on models of the meaning)  Automatic translation (text transformation which uses meaning models)

16 2.3 Semantic Web Tools  Persistent storagequery interpreters  Persistent storage and query interpreters (XML databases/XQuery, RDF repositories/RQL)  Ontology visualizerseditors  Ontology visualizers and editors (OntoEdit, Protégé, etc.)  navigatorssearch engines  Ontology navigators and semantic search engines (AskJeeves, RDF Quiz, OntoSearch)  inference engines  Ontology-based inference engines (Cyc, Kaon, OMM)

17 Some observations Layers separation (data storage, data communication, information description, terminology definition, fact inference) Layers separation (data storage, data communication, information description, terminology definition, fact inference) Layers isolation (syntactic wrapping vs. semantic mapping) Layers isolation (syntactic wrapping vs. semantic mapping) Information processing concentrated on the most abstract level (ontology) Information processing concentrated on the most abstract level (ontology)  Hierarchy of languages SQL  XML  RDF  RDFS  OWL

18 3 The Need for Linguistic Support of Semantic Web u For combining multiple namespaces and names reconciliation u For combining multiple namespaces and syntactic names reconciliation u For word disambiguation in text analysis u semantic indexing u For semantic indexing of text corpora u For resolving semantic inaccuracies in texts (esp. similarity, alternatives, exclusion, generalization,etc) u For representing text meaning in transformations which use an intermediate u For representing text meaning in transformations which use an intermediate model of the meaning Why:

19

20

21 4 WordNet as Universal Linguistic Resource u Word forms (nouns, verbs, adjectives and adverbs) and lexical relations between them u Synsets and meaning relations (synonymy, antonymy, hyponymy, meronymy, troponimy, etc) u Lexical database (set of indexed files or a database) u Command language interface (originally Tcl/tk scripts for direct file manipulation, but APIs for Java and other languages also available) u Multi-lingual thesauri (network of WordNet databases for most of the languages)

22

23 4.1 WordNet semantics u Relational model transitive relations u Relational model with both standard (ATTRIBUTE, ANTONYM, ENTAILMENT, CAUSE) and transitive relations (HYPERNYM,HOLONYM, MERONYM) u first-order relational structures u Formally can be interpreted in first-order relational structures (Kripke structures) – requires modal logic u database u For adequate representation of the relations either object-relational, or relational database with additional indexing of the transitive relations (transitive closure) is necessary

24 Fig. 1 WordNet Relations

25 4.2 Relational schema of the original WordNet thesaurus word word represents the syntactic word forms divided into four main categories – noun phrases, verb phrases, adjectives and adverbs synset synset defines the different meaning sets used for giving semantic interpretation of the word forms sense sense many-to-many relationship between word forms and synsets lexrel lexrel purely lexical relationships which hold between the word forms semrel semrel semantic relationships between the word forms which contains the semantic thesaurus

26 Fig. 2 Relational schema of WordNet

27 5 Putting WordNet on the Web u Synchronous query/response model of working (CGI calls) u Purely relational database for storing the thesaurus (MySQL) u Front-end implemented as a set of servlets which query the thesaurus on behalf of other applications u XML format of the data returned as a result of the queries u Separated from the applications and use of independent server (Tomcat)

28 Tabl. 1 Servlets to explore word relations

29

30 Part II LinguaShare: Linguistic Web Service for Semantic Web


Download ppt "BT Exact Technologies - Adastral Park, Ipswich July - October 2003 Linguistic Web Services for Semantic Web Dr. Vassil T. Vassilev London Metropolitan."

Similar presentations


Ads by Google