Presentation is loading. Please wait.

Presentation is loading. Please wait.

INLS 520 – Erik Mitchell INLS 520 Information Organization.

Similar presentations


Presentation on theme: "INLS 520 – Erik Mitchell INLS 520 Information Organization."— Presentation transcript:

1 INLS 520 – Erik Mitchell INLS 520 Information Organization

2 INLS 520 – Erik Mitchell Review Controlled vocabularies –Term Lists, Hierarchies, Trees, Paradigms, Facets, Folksonomies Knowledge organization systems –Term Lists, Thesauri, Taxonomies, Ontologies

3 INLS 520 – Erik Mitchell Today Poster topic review Poster creation concepts Automation in metadata & organization –RDF –OWL Guest Speaker – Barrie Hayes

4 Creating poster presentations Content –Methodological approach Title, question, overview, methods, findings, observations –Conceptual approach Title, question, model or framework Structure Balancing words & images Document flow Examples - 1, 2, 3, 41234 INLS 520 – Erik Mitchell

5 Creating poster presentations Technology –Creation tools PowerPoint Templates in blackboardblackboard An image editor HTML Display –Flickr, PrintedFlickrPrinted Some guidelines 1, 2, 3, 41234 INLS 520 – Erik Mitchell

6 Automation Group discussion –Based on the articles by Hlava, Hearst, and Stearns what are some of the primary uses of automation in information organization? –Is automation primarily a tool to create representations of resources or to enable retrieval of resources? INLS 520 – Erik Mitchell

7 Automatic Indexing Automatic Extraction/representation –Lancaster calls this: “Words or phrases appearing in a text are extracted and used to represent the content of the text as a whole” (Lancaster 284) Automatic Classification/Categorization –The computer compares terms in the document against thesauri or controlled vocabularies to map a word in the resource to an accepted index term (Lancaster 287) Automatic Abstract/Surrogate Generation –The extraction of sentences from documents to create an abstract (Lancaster 298) Automatic index/tool Generation –Creating spelling dictionaries, word lists, indexes, etc to be used for other automation techniques Latent Semantic Indexing - Concepts are extracted and analyzed to detect relationships. These relationships are then used to help identify & rank documents (overview)overview INLS 520 – Erik Mitchell

8 Automatic metadata use Metadata Harvesting –Using automated techniques to discover and extract metadata from documents OAI/PMH Knowledge representation & discovery –Using structured ontologies to make statements and inferences about resources –OWL OntologiesOWL Ontologies Metadata interoperability –Using structured metadata to automatically connect with other metadata systems –OpenURL, XML SchemaOpenURL Transformation –Creating new documents through the automatic extraction, analysis, and processing of resources –XML/XSL INLS 520 – Erik Mitchell

9 Metadata standards XML –Structure & syntax but no ‘semantic constraints’ XML Schema –Restrict structure of XML, extend XML with datatypes RDF –Data-model showing relationships among resources RDF Schema –‘Vocabulary for describing properties and classes of RDF resources’ OWL –Added vocabulary for describing properties & classes Relationships, cardinality, equality, charicteristics INLS 520 – Erik Mitchell

10 INLS 520 – Erik Mitchell RDF Subject, property, object triples Transmitted in xml RDFS extends RDF with an ontology language –Properties, specialization OWL –More powerful extension of RDFS –Uses same syntax of RDF

11 INLS 520 – Erik Mitchell RDF Model Webpage: http://www.stuff.com “Saki Knafo” Author (Value) Object (Property type) Predicate (Resource) Subject “The author of the stuff webpage is Saki Knafo” - A literal, a triple, a statement

12 INLS 520 – Erik Mitchell How is RDF different? RDF is a descriptive model that –Allows variable contextualized description –Deconstructs the descriptive process –Allows more granular automated processing of data –Uses exact markup to indicate the context of values (namespaces, schemas) –Bags, Sequences, Alternative values, parseType

13 INLS 520 – Erik Mitchell Encoding RDF in XML <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:dcterms="http://purl.org/dc/terms/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#"> The Hang: The Island of Black Jeans SAKI KNAFO Sun, 16 Sep 2007 01:04:40 GMT descriptive content

14 INLS 520 – Erik Mitchell Iterative RDF description <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:vcard="http://dli.grainger.uiuc.edu/publications/metadatacasestudy/dc_schemas/v card.xsd" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#"> The Hang: The Island of Black Jeans http://www.stuff.com Sun, 16 Sep 2007 01:04:40 GMT descriptive content rdf:about="http://dli.grainger.uiuc.edu/publications/metadatacasestudy/dc_,,,"> Saki Knafo knafo@www.nytimes.com

15 INLS 520 – Erik Mitchell RDFS RDF Schema –Defines additional rdf elements that help type relationships Special Classes –Based on RDF Classes / Properties / Attributes with additional http://www.w3schools.com/rdf/rdf_reference.asp Allows the creation of vocabularies / ontologies

16 INLS 520 – Erik Mitchell Ontology Definitions “The study of being or existence” “A conceptualization of a specification” (Gruber)Gruber “An ontology formally defines a common set of terms that are used to describe and represent a domain.” (OWL)

17 INLS 520 – Erik Mitchell Ontology Concepts Classes –Names of objects in the domain Relationships between classes Connections between classes Properties of classes Background or identifying knowledge of these objects Constraints on these properties & relationships Limits and parameters of the relationships

18 INLS 520 – Erik Mitchell A good ontology has Features: –Meaningful – all classes have instances –Accurate / correct –Non-redundant – each class/instance is represented in a single way –Rich in description – context, content Enabled functionality: –Able to use queries to connect new pieces of information –Use XML & definitions to integrate knowledge across domains

19 INLS 520 – Erik Mitchell Ontology Continuum Keyword Lists Basic Thesauri Complex Thesauri Taxonomies Simple Ontologies (wordnet) Complex Ontologies (OWL)

20 INLS 520 – Erik Mitchell SHOE Ontology project – Possible to build an ontology for anything –Simple HTML Ontology Extensions (SHOE) Project http://www.cs.umd.edu/projects/plus/SHOE/ http://www.cs.umd.edu/projects/plus/SHOE/html-pages.html Sample projects –Beer Ontology http://www.cs.umd.edu/projects/plus/SHOE/onts/index.html#beer –Document Ontology http://www.cs.umd.edu/projects/plus/SHOE/onts/docmnt1.0.html

21 INLS 520 – Erik Mitchell Ontology Concepts Multiple inheritance Vertical and horizontal relationships Decomposed subject/object Predicate based description (isRelatedto, hasVersion)

22 INLS 520 – Erik Mitchell Creating an Ontology Determine Scope of field, define boundaries Check for existing ontologies, vocabularies Select a top-down/bottom-up approach –Identify concepts, vocabulary, parameters, constraints Identify relationships –Multiple hierarchies, inheritance Build, test, maintain

23 INLS 520 – Erik Mitchell OWL (Web Ontology Language) An ontolgy that is geared towards representing information on the web –Classes, properties, and relationships that describe URIs and their facets. Based on the Triple concept –Subject, Predicate, Object –3 versions: OWL-Lite, OWL-DL, OWL-Full Formatted in RDF/XML –Uses RDF and RDFS as a foundation –Adds new elements in the owl namespace

24 INLS 520 – Erik Mitchell OWL Versions OWL-Lite –Simple hierarchies, constraints OWL-DL –Uses description logics Logic-based semantic markup based on first-order predicate logic –Still guarantees finite relationship processing –Adds ‘reasoning’ capacity to infer information/relaitonships OWL-Full –Most complex –Open ended, possible to get into infinite processing

25 INLS 520 – Erik Mitchell OWL Example <rdf:RDF xmlns:rdf=http://www.w3.org/1999/02/22-rdf-syntax-ns#http://www.w3.org/1999/02/22-rdf-syntax-ns# xmlns:rdfs="http://www.w3.org/2000/01/rdfschema#" xmlns:owl=http://www.w3.org/2002/07/owl#http://www.w3.org/2002/07/owl# xmlns=http://www.w3.org/2001/sw/BestPractices/OEP/SimplePartWhole/part.owl#http://www.w3.org/2001/sw/BestPractices/OEP/SimplePartWhole/part.owl# xml:base="http://www.w3.org/2001/sw/BestPractices/OEP/SimplePartWhole/part.owl"> 1.0 An ontology containing the basic part relations: partOf, hasPart, partOf_directly, and hasPart_directly. These are described in the accompanying note. Author: Chris Welty (Chris Welty)

26 OWL – Lite features Class A collection of things related to each other by properties rdfs:subClassOf A way of showing hierarchical class relationships rdf:Property A stated relationship between an thing and a value (hasChild, hasRelative, hasSibling, hasAge) Bi-directional, Transitive (hasAncestor), Rdf:subPropertyOf Similar to subClassOf, a way of showing property hierarchies Individual Instances of classes (objects) INLS 520 – Erik Mitchell

27 OWL relationships INLS 520 – Erik Mitchell Practical guide to OWL ontologies

28 INLS 520 – Erik Mitchell Some OWL Examples Airport Pizza

29 INLS 520 – Erik Mitchell Next Week(s) 10/28 – No class 11/4 – Metadata based services, guest speaker 11/11 – no class 11/18 – no class 11/25 – semantic web 12/2 – final projects due


Download ppt "INLS 520 – Erik Mitchell INLS 520 Information Organization."

Similar presentations


Ads by Google