StrategiesTaxonomy June 9, 2014Copyright 2014 Taxonomy Strategies. All rights reserved. The Search for Meaning and Semantics: Taxonomies Get It Done Joseph.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

DCMI Workshop on Metadata and Search Vendor Panel Presentation Bradley P. Allen
Classification & Your Intranet: From Chaos to Control Susan Stearns Inmagic, Inc. E-Libraries E204 May, 2003.
13 February 2014 The Role of Metadata in Transportation Data Programs Joseph A Busch, Taxonomy Strategies.
Stefania Bergamasco, Cecilia Colasanti An integrated approach to turn statistics into knowledge combining data warehouse, controlled vocabularies and advanced.
NERC DataGrid Vocabulary Workshop, RAL, February 25, 2009 NERC DataGrid Vocabulary Server Description.
Advanced Searching Engineering Village.
Objective Understand web-based digital media production methods, software, and hardware. Course Weight : 10%
Helping people find content … preparing content to be found Enabling the Semantic Web Joseph Busch.
Leveraging Your Taxonomy to Increase User Productivity MAIQuery and TM Navtree.
Engineering Village ™ ® Basic Searching On Compendex ®
SKOS and Other W3C Vocabulary Related Activities Gail Hodge Information International Assoc. NKOS Workshop Denver, CO June 10, 2005.
A Registry for controlled vocabularies at the Library of Congress
Overview of Search Engines
Tutorial 3: Adding and Formatting Text. 2 Objectives Session 3.1 Type text into a page Copy text from a document and paste it into a page Check for spelling.
IBM User Technology March 2004 | Dynamic Navigation in DITA © 2004 IBM Corporation Dynamic Navigation in DITA Erik Hennum and Robert Anderson.
Development Principles PHIN advances the use of standard vocabularies by working with Standards Development Organizations to ensure that public health.
PREMIS Tools and Services Rebecca Guenther Network Development & MARC Standards Office, Library of Congress NDIIPP Partners Meeting July 21,
Information Integration Intelligence with TopBraid Suite SemTech, San Jose, Holger Knublauch
A Scalable Application Architecture for composing News Portals on the Internet Serpil TOK, Zeki BAYRAM. Eastern MediterraneanUniversity Famagusta Famagusta.
Lesson 4: Using HTML5 Markup.  The distinguishing characteristics of HTML5 syntax  The new HTML5 sectioning elements  Adding support for HTML5 elements.
Terminology services and the DDC: the High-Level Thesaurus and beyond Presented to the symposium Dewey goes Europe: on the use and development of the Dewey.
Rutherford Appleton Laboratory SKOS Ecoterm 2006 Alistair Miles CCLRC Rutherford Appleton Laboratory Semantic Web Best Practices and Deployment.
 Copyright 2006 Digital Enterprise Research Institute. All rights reserved. Collaborative Building of Controlled Vocabularies Crosswalks Mateusz.
A J Miles Rutherford Appleton Laboratory SKOS Standards and Best Practises for USING Knowledge Organisation Systems ON THE Semantic Web NKOS workshop ECDL.
StrategiesTaxonomy November 5, 2013Copyright 2013 Taxonomy Strategies. All rights reserved. Taxonomies for Program Management Consistency in a Constantly.
Aardvark Anatomy of a Large-Scale Social Search Engine.
Lecture Four: Steps 3 and 4 INST 250/4.  Does one look for facts, or opinions, or both when conducting a literature search?  What is the difference.
Project Overview Bibliographic merging, Endeca, and Web application.
XML DTDs and other Alternatives: Vocabulary Markup Language (Voc-ML) Project & Friends Joseph A. Busch Director, Solutions Architecture NetLab and Friends.
Vocabularies in the VO Alasdair J G Gray Norman Gray Iadh Ounis.
Meta Tagging / Metadata Lindsay Berard Assisted by: Li Li.
NCSU Libraries Kristin Antelman NCSU Libraries June 24, 2006.
SUMMON ® 2.0 DISCOVERY REINVENTED. What is Summon 2.0? A new, streamlined, modern interface New and enhanced features providing layers of contextual guidance.
PLoS ONE Application Journal Publishing System (JPS) First application built on Topaz application framework Web 2.0 –Uses a template engine to display.
Incorporating ARGOVOC in DSpace-based Agricultural Repositories Dr. Devika P. Madalli & Nabonita Guha Documentation Research & Training Centre Indian Statistical.
Keyword vs. Controlled Vocabulary Searching 12 Basic Skills for IQ.
Definition of a taxonomy “System for naming and organizing things into groups that share similar characteristics” Taxonomy Architectures Applications.
The Internet 8th Edition Tutorial 4 Searching the Web.
Strategies LLC Taxonomy 28 August 2007Copyright 2007 Taxonomy Strategies LLC. All rights reserved. Metadata and Controlled Vocabularies Global Corporate.
Towards a semantic web Philip Hider. This talk  The Semantic Web vision  Scenarios  Standards  Semantic Web & RDA.
1 Metadata –Information about information – Different objects, different forms – e.g. Library catalogue record Property:Value: Author Ian Beardwell Publisher.
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
Click to edit Master title style © 2006 IBM Corporation Connecting the dots: Relationships and relevance with DITA maps Presented by Erik Hennum, IBM User.
Introduction to the Semantic Web and Linked Data
Microsoft FrontPage 2003 Illustrated Complete Integrating a Database with a Web Site.
JISC/NSF PI Meeting, June Archon - A Digital Library that Federates Physics Collections with Varying Degrees of Metadata Richness Department of Computer.
THE SEMANTIC WEB By Conrad Williams. Contents  What is the Semantic Web?  Technologies  XML  RDF  OWL  Implementations  Social Networking  Scholarly.
APS Taxonomy Project Arthur Smith, American Physical Society April 2014.
1 Open Ontology Repository initiative - Planning Meeting - Thu Co-conveners: PeterYim, LeoObrst & MikeDean ref.:
Presentation On HTML & Podcast Done by: Shamelia Young & Sheriece Williamson.
Trait ontology approach Marie-Angélique LAPORTE NCEAS June 7 th 2010.
RSS Interfaces and Standards Chander Iyer. Really Simple Syndication (RSS) Web data format providing users with frequently updated content. Make a collection.
PubMed …featuring more than 20 million citations for biomedical literature from MEDLINE, life science journals, and online books.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
MSG Reuse Catalog T.W. van den Berg 7 April 2010.
Semantic Web 06 T 0006 YOSHIYUKI Osawa. Problem of current web  limits of search engines Most web pages are only groups of character strings. Most web.
Semantic Web unleashes your data! The Semantic Web will transform the use of content. Semantic Web – is an extension of the current web. Semantic Web.
Semantic Interoperability in GIS N. L. Sarda Suman Somavarapu.
Understanding Web-Based Digital Media Production Methods, Software, and Hardware Objective
26/02/ WSMO – UDDI Semantics Review Taxonomies and Value Sets Discussion Paper Max Voskob – February 2004 UDDI Spec TC V4 Requirements.
SEMANTIC WEB Presented by- Farhana Yasmin – MD.Raihanul Islam – Nohore Jannat –
Semantic Web Technologies Readings discussion Research presentations Projects & Papers discussions.
Search can be Your Best Friend You just Need to Know How to Talk to it IW 306 Ágnes Molnár.
UNIFIED MEDICAL LANGUAGE SYSTEMS (UMLS)
Summon® 2.0 Discovery Reinvented
Prepared for Md. Zakir Hossain Lecturer, CSE, DUET Prepared by Miton Chandra Datta
Introduction to Semantic Metadata & Semantic Web
Objective Understand web-based digital media production methods, software, and hardware. Course Weight : 10%
PREMIS Tools and Services
Presentation transcript:

StrategiesTaxonomy June 9, 2014Copyright 2014 Taxonomy Strategies. All rights reserved. The Search for Meaning and Semantics: Taxonomies Get It Done Joseph Busch – Why Semantics Matter

2 Taxonomy Strategies The business of organized information Agenda v Why semantics matter (… a quick review from 2001) v What is semantic search, SKOS and Linked Data? v Some semantic search examples?

3 Taxonomy Strategies The business of organized information Why Semantics Matter May 20, 2001

4 Taxonomy Strategies The business of organized information When you own a Rembrandt you can spell his name any way you want.

5 Taxonomy Strategies The business of organized information But when you want to find a Rembrandt … you better spell his name correctly.

6 Taxonomy Strategies The business of organized information Vocabulary resources can help find the right artist even if their name is typed incorrectly.

7 Taxonomy Strategies The business of organized information Users cannot type in the complex queries needed to find all the relevant items... But this can be done automatically.

8 Taxonomy Strategies The business of organized information Complex queries are even more important when you search the entire web.

9 Taxonomy Strategies The business of organized information So you find Rembrandt the Dutch guy...

10 Taxonomy Strategies The business of organized information … And not Rembrandt the toothpaste.

11 Taxonomy Strategies The business of organized information Getty Vocabularies Linked Data Services February 19, 2014

12 Taxonomy Strategies The business of organized information Agenda v Why semantics matter v What is semantic search, SKOS and Linked Data? v Some semantic search examples?

13 Taxonomy Strategies The business of organized information Search Failure v 19% Character errors. (Young, et al) v 40% Vocabulary errors. (Seaman. Norgard, et al) v 20% Index confusion. v 21% Successful (Nielsen) 40% 20% 19% 21%

14 Taxonomy Strategies The business of organized information

15 Taxonomy Strategies The business of organized information Semantic search solution v Semantic search improves search accuracy by inferring the contextual meaning of terms via:  Disambiguation  Part of speech (POS) analysis  Synonyms, variations and quasi-synonyms  Concept matching  Natural language query analysis  Key sentence detection v Generate more consistent content to search on. v Correct user errors. v Map the language of users to the language of the target content. v Augment search results with linked data.

16 Taxonomy Strategies The business of organized information What semantics do for search? FunctionDescription Related searchQuery corrections … did you mean? Concept searchQuery expansion with synonyms, abbreviations, acronyms, etc. … do you also want? Ontology-based searchQuery expansion with narrower or broader terms; scoping exhaustive search results Faceted searchDynamic filtering of search results; online shopping ClusteringDynamically bucketing search results into pre- defined categories Stored queriesRSS feeds, alerts, SDI (selective dissemination of information), etc. PersonalizationWeighting search results based on explicit profiles and implicit data (where you’ve been and what you’ve done)

17 Taxonomy Strategies The business of organized information What is SKOS? v Provides the basis for any user, tool, or program to identify, define and link concept vocabularies. RelationshipDefinition ConceptA unit of thought, an idea, meaning, or category of objects or events. A Concept is independent of the terms used to label it. Preferred LabelA preferred lexical label for the resource such as a term used in a digital asset management system. Alternate LabelAn alternative label for the resource such as a synonym or quasi- synonym. Broader Concept Hierarchical link between two Concepts where one Concept is more general than the other. Narrower Concept Hierarchical link between two Concepts where one Concept is more specific than the other. Related Concept Link between two Concepts where the two are inherently "related", but that one is not in any way more general than the other.

18 Taxonomy Strategies The business of organized information lc:sh Fringe parking Park and ride systems Park and ride CONCEPT trt:Brddf Park & ride Park-n- ride altLabel prefLabel altLabel P&R system altLabel broader Parking trt:Brdd prefLabel

19 Taxonomy Strategies The business of organized information Why SKOS? According to Alistair Miles* (SKOS co-author) v Ease of combination with other standards  Vocabularies are used in great variety of contexts. – E.g., databases, faceted navigation, website browsing, linked open data, spellcheckers, etc.  Vocabularies are re-used in combination with other vocabularies. – E.g., Library of Congress Subject Headings + Transportation Research Thesaurus; USPS states + USPS zip codes + US Congressional districts; etc.Library of Congress Subject HeadingsTransportation Research ThesaurusUSPS statesUS Congressional districts v Flexibility and extensibility to cope with variations in structure and style  Variations between types of vocabularies – E.g., list vs. classification scheme  Variations within types of vocabularies – E.g., Z monolingual controlled vocabularies and the Transportation Research ThesaurusZ Transportation Research Thesaurus * Head of Epidemiological Informatics at Oxford University Wellcome Trust Centre for Human Genetics (formerly OUP Senior Computing Officer)

20 Taxonomy Strategies The business of organized information Why SKOS? (2) v Publish managed vocabularies so they can readily be consumed by applications  Identify the concepts – What are the named entities?  Describe the relationships – Labels, definitions and other properties  Publish the data – Convert data structure to standard format – Put files on an http server (or load statements into an RDF server) v Ease of integration with external applications  Use web services to use or link to a published concept, or to one or more entire vocabularies. – E.g., Google maps API, NY Times article search API, Linked open data; etc.Google maps APINY Times article search APILinked open data v A W3C standard like HTML, CSS, XML and RDF, RDFS, and OWL.

21 Taxonomy Strategies The business of organized information Agenda v Why semantics matter v What is semantic search, SKOS and Linked Data? v Some semantic search examples?

22 Taxonomy Strategies The business of organized information Taxonomy browser

23 Taxonomy Strategies The business of organized information Taxonomy-powered search results

24 Taxonomy Strategies The business of organized information Audience Products Location Organization Content Type Product Line Application Technology Industry Solution Person Oracle.com top-level taxonomy Has a Is a

25 Taxonomy Strategies The business of organized information Oracle event finder Filter on Location and Language More filters based on this result Results shown on Google maps UI Subscribe to RSS feed based on the criteria set on this page

26 Taxonomy Strategies The business of organized information APS Taxonomy browserTaxonomy browser

27 Taxonomy Strategies The business of organized information Linked data example APS Taxonomy Broad Subject Areas Methods & Theories Phenomena Physical Systems APS Taxonomy Broad Subject Areas Methods & Theories Phenomena Physical Systems Astronomical systems Atomic-scale objects Beams Complex systems Dynamical systems Electric & magnetic fields Engineered materials Fundamental particles Gases delete Information systems Liquids delete Materials Nonlinear system Nuclei Plasma Quasiparticles Astronomical systems Atomic-scale objects Beams Complex systems Dynamical systems Electric & magnetic fields Engineered materials Fundamental particles Gases delete Information systems Liquids delete Materials Nonlinear system Nuclei Plasma Quasiparticles Materials by Composition Materials by Dimensionality Materials by Property Materials by Structure Materials by Composition Materials by Dimensionality Materials by Property Materials by Structure Elements by Group Group 1 Group 2 Group 3 Group 4 Group 5 Group 6 Group 7 Group 8 Group 9 Group 10 Group 11 Group 12 Group 13 Group 14 Group 15 Group 16 Group 17 Group 18 Elements by Group Group 1 Group 2 Group 3 Group 4 Group 5 Group 6 Group 7 Group 8 Group 9 Group 10 Group 11 Group 12 Group 13 Group 14 Group 15 Group 16 Group 17 Group 18 Elements of the periodic table, and common isotopes Cadmium Copernicium Mercury Zinc Cadmium Copernicium Mercury Zinc 194Hg 196Hg 198Hg 199Hg 200Hg 201Hg 202Hg 204Hg 194Hg 196Hg 198Hg 199Hg 200Hg 201Hg 202Hg 204Hg A faceted taxonomy of concepts in physics

28 Taxonomy Strategies The business of organized information Paper submission tagging (prototype)Paper submission tagging

29 Taxonomy Strategies The business of organized information QUESTIONS Joseph A Busch Mobile

30 Taxonomy Strategies The business of organized information Session description v Semantic search – a phrase that is increasingly used in the popular as well as the professional literature. What does it look like, and how will it work. Panelists will present their visions of semantic search. Program is designed to be interactive with audience participation – suggestions for functions and features they see in the future.  What is semantic search?  What are the components of semantic search?  How can it be used in libraries?