Presentation is loading. Please wait.

Presentation is loading. Please wait.

U SING O NTOLOGICAL R ELATIONSHIPS TO P ROVIDE I NDEXING OF P LAIN T EXT S EARCHES 1 Research by Fletcher Liverance November.

Similar presentations


Presentation on theme: "U SING O NTOLOGICAL R ELATIONSHIPS TO P ROVIDE I NDEXING OF P LAIN T EXT S EARCHES 1 Research by Fletcher Liverance November."— Presentation transcript:

1 U SING O NTOLOGICAL R ELATIONSHIPS TO P ROVIDE I NDEXING OF P LAIN T EXT S EARCHES 1 Research by Fletcher Liverance fletcher.liverance@gmail.com November 14 th, 2011

2 H OW D OES A S EARCH E NGINE W ORK ? 2 PageRank 818374 712973 239977 997645 1925521 154211 65988 1. User submits a keyword based query to the search engine 2. The indexer locates all relevant pages containing those keywords 3. The database returns all pages found in the index 4. Pages are ranked and returned to the user

3 H OW D OES A S EARCH E NGINE W ORK ? Benefits Fast Machine learnable Straight forward Drawbacks Pattern matching Keyword based Garbage in, garbage out 3

4 G ARBAGE IN, G ARBAGE OUT 4 Scenario You saw this television series and you’d like to find out more about it, but you don’t know what the name of the series or any of the characters are. What do you do? http://www.dan-dare.org/FreeFun/Images/CartoonsMoviesTV/WinnieThePoohWallpaper1024.jpg

5 G ARBAGE IN, G ARBAGE OUT 5 POOR RESULTS!

6 G ARBAGE IN, G ARBAGE OUT 6 GOOD RESULTS!

7 S EMANTIC R ELATIONSHIPS 7 Winnie the Pooh Bear Yellow Disney PigletShirt RedPig isMadeBy isA hasClothing hasColor hasFriend isA Ontology “An ontology is a description (like a formal specification of a program) of the concepts and relationships that can exist for an agent or a community of agents.” http://www-ksl.stanford.edu/kst/what-is-an-ontology.html http://www-ksl.stanford.edu/kst/what-is-an-ontology.html Resource Description Framework (RDF) “RDF extends the linking structure of the Web to use URIs to name the relationship between things as well as the two ends of the link. Using this simple model, it allows structured and semi-structured data to be mixed, exposed, and shared across different applications.” http://www.w3.org/RDF/

8 S EMANTIC R ELATIONSHIPS 8 Winnie the Pooh Bear Yellow Disney PigletShirt RedPig isMadeBy isA hasClothing hasColor hasFriend isA How can we locate useful semantic relationships? Link Distance Link Direction Link Relationship Brown Mammal Company isA hasColor 0xFFFF00 hasRGB

9 M ODIFIED S EARCH I NDEXING 9 SearchRank 818374 712973 239977 997645 1925521 1. User submits a keyword based query to the search engine 4. Searches are ranked and returned to the user as additional search suggestions 2. Search analyzer creates additional searches based on ontological information 3. Search engine performs parallel searches of top search terms

10 C URRENT W ORK 10 NASA SWEET Ontologies 6000 concepts 200 ontologies Scientific Loose relationships National Oceanographic and Atmospheric Administration 30+ years of scientific research Text based Unsorted 2+ gigabytes Domain specific terminology

11 C HALLENGES & F UTURE W ORK 11 How to rank plain text No links or history No ‘page views’ Limited ontology coverage 6000 concepts in NASA SWEET ontologies ~170,000 words in the English language Many more unique names and scientific terms How can ontologies be automatically generated? Graph matching Identifying related terms in a large graph is difficult Multiple links per node, must identify appropriate links

12 Q & A 12


Download ppt "U SING O NTOLOGICAL R ELATIONSHIPS TO P ROVIDE I NDEXING OF P LAIN T EXT S EARCHES 1 Research by Fletcher Liverance November."

Similar presentations


Ads by Google