TANGO: Table Analysis for Generating Ontologies. David W. Embley (BYU) & George Nagy (RPI), under NSF Awards 0414644 and 0414854. September 23, 2007, NSF TANGO BYU/RPI.


1 TANGO: Table Analysis for Generating Ontologies. David W. Embley (BYU) & George Nagy (RPI), under NSF Awards 0414644 and 0414854. INFORMATION & KNOWLEDGE MANAGEMENT, Dr. Maria Zemankova. September 23, 2007. Parts: (a) Table Interpretation; (b) Query by Table.

2 TANGO STEPS (diagram). Main sequence: TABLE, INTERPRETED TABLE, MINI ONTOLOGY, GROWING ONTOLOGY. Other diagram labels: Wang Notation & XML; Wang Notation Tool; Ontology Editor; Annotated Semantic Web Pages; Standard Ontology Language (OWL); Ontology Based Web Services; Form Based Specification; Extraction Ontologies; Relational Databases; Query By Table.

3 TANGO steps diagram (same labels as slide 2), with a marker reading "This presentation".

4 (a) Table Interpretation (diagram). Labels: HTML web pages; Construct Wang notation; Matlab table; Extract table; Wang Notation XML table; Confirm or correct; Mini Ontology.

5 Median Income table: http://www40.statcan.ca/l01/cst01/famil108a.htm?sdi=median%20income

6 Median Income table from Statistics Canada, displayed in the TANGO Wang Notation Tool

7 Wang Notation: An abstract table is specified by an ordered pair (C, δ) (categories, delta). C is a finite set of labeled domains (headers, sub-headers of the table, etc.). δ maps each combination of labels drawn from the categories in C to the individual value in the corresponding table cell.

8 Categories: The previous table has two categories.
CATEGORY 1: (Region_Virtual, {(Canada,phi), (Newfoundland and Labrador,phi), (Prince Edward Island,phi), (Nova Scotia,phi), (New Brunswick,phi), (Quebec,phi), (Ontario,phi), (Manitoba,phi), (Saskatchewan,phi), (Alberta,phi), (British Columbia,phi), (Yukon Territory,phi), (Northwest Territories,phi), (Nunavut,phi)})
CATEGORY 2: (Year_Virtual, {(2001,phi), (2002,phi), (2003,phi), (2004,phi), (2005,phi)})
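The category listing above can be sketched as a small label tree. This is only an illustration, not the TANGO tool's internal format: the nested-dict layout, the `PHI` constant, and the `leaf_paths` helper are assumptions introduced here, with an empty dict standing in for phi (a label with no sub-labels).

```python
# Hypothetical representation: a category is (name, subtree); a subtree maps
# each label to its own subtree, and an empty dict plays the role of phi.
PHI = {}

def leaf_paths(name, subtree):
    """Return dotted paths from the category root to each leaf label."""
    paths = []
    for label, children in subtree.items():
        path = f"{name}.{label}"
        if children:          # non-empty subtree: descend into sub-labels
            paths.extend(leaf_paths(path, children))
        else:                 # phi: this label is a leaf
            paths.append(path)
    return paths

regions = [
    "Canada", "Newfoundland and Labrador", "Prince Edward Island",
    "Nova Scotia", "New Brunswick", "Quebec", "Ontario", "Manitoba",
    "Saskatchewan", "Alberta", "British Columbia", "Yukon Territory",
    "Northwest Territories", "Nunavut",
]
region_category = ("Region_Virtual", {r: PHI for r in regions})
year_category = ("Year_Virtual", {str(y): PHI for y in range(2001, 2006)})

region_paths = leaf_paths(*region_category)
year_paths = leaf_paths(*year_category)
print(len(region_paths), len(year_paths))  # 14 5
```

Each content cell is then addressed by one path from each category, such as `Year_Virtual.2001` together with `Region_Virtual.Canada`.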

9 Content (leaf) cells: Delta Notation for two (of 15) rows:
delta({Year_Virtual.2001,Region_Virtual.Canada})=53,500
delta({Year_Virtual.2002,Region_Virtual.Canada})=55,000
delta({Year_Virtual.2003,Region_Virtual.Canada})=56,000
delta({Year_Virtual.2004,Region_Virtual.Canada})=58,100
delta({Year_Virtual.2005,Region_Virtual.Canada})=60,600
delta({Year_Virtual.2001,Region_Virtual.Newfoundland and Labrador})=41,400
delta({Year_Virtual.2002,Region_Virtual.Newfoundland and Labrador})=43,200
delta({Year_Virtual.2003,Region_Virtual.Newfoundland and Labrador})=44,800
delta({Year_Virtual.2004,Region_Virtual.Newfoundland and Labrador})=46,100
delta({Year_Virtual.2005,Region_Virtual.Newfoundland and Labrador})=47,600
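A minimal sketch of the delta mapping for the two rows listed above, assuming a plain Python dict keyed by a `frozenset` of category paths. The key is a set because the notation writes the argument of delta as a set, so the order of the paths should not matter. The `lookup` helper is hypothetical.

```python
# Values copied from the slide; commas in the numbers are dropped.
rows = {
    "Canada": [53500, 55000, 56000, 58100, 60600],
    "Newfoundland and Labrador": [41400, 43200, 44800, 46100, 47600],
}

delta = {}
for region, values in rows.items():
    for year, value in zip(range(2001, 2006), values):
        key = frozenset({f"Year_Virtual.{year}", f"Region_Virtual.{region}"})
        delta[key] = value

def lookup(*paths):
    """Return the content cell addressed by one path per category."""
    return delta[frozenset(paths)]

# The order of the paths does not matter because the key is a set.
print(lookup("Region_Virtual.Canada", "Year_Virtual.2003"))  # 56000
```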

10 XML Representation: Schema for (1) table, (2) categories, (3) data cells, (4) augmentation … The XML file for this table has ~350 lines of Object Identifier tags.
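The slide does not show the schema itself, so the following is only a guess at the general shape of such a file, written with Python's standard `xml.etree.ElementTree`. Every element and attribute name (`table`, `categories`, `category`, `label`, `dataCells`, `cell`, `header`, `value`) is a placeholder invented for this sketch, and the augmentation part is left out entirely.

```python
import xml.etree.ElementTree as ET

def table_to_xml(categories, delta):
    """Serialize categories and content cells; all tag names are hypothetical."""
    table = ET.Element("table")
    cats = ET.SubElement(table, "categories")
    for name, labels in categories.items():
        cat = ET.SubElement(cats, "category", name=name)
        for label in labels:
            ET.SubElement(cat, "label").text = label
    cells = ET.SubElement(table, "dataCells")
    for paths, value in delta.items():
        cell = ET.SubElement(cells, "cell", value=str(value))
        for path in sorted(paths):  # sorted only to make the output stable
            ET.SubElement(cell, "header").text = path
    return table

categories = {"Year_Virtual": ["2001", "2002"], "Region_Virtual": ["Canada"]}
delta = {
    frozenset({"Year_Virtual.2001", "Region_Virtual.Canada"}): 53500,
    frozenset({"Year_Virtual.2002", "Region_Virtual.Canada"}): 55000,
}

root = table_to_xml(categories, delta)
xml_text = ET.tostring(root, encoding="unicode")
print(xml_text)
```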

11 Verification tool: category headers for a selected content cell

12 Verification tool: content cells for a selected header

13 Verification tool: hierarchical category structure for a selected content cell

14 (b) Query by Table (diagram). Labels: Query Table (Income; 2002, 2003, 2004, 2005); Interpret; QBT; Ontology from many tables; Database; filled table (Income: 2002 $4500, 2003 $3300, 2004 $1240, 2005 $3400).

15 Query Table: composed in MS-Excel by a person seeking information from an ontology compiled from many web tables

16 Display of automatically processed Query Table for human verification

17 Wang notation for Query Table

18 QBT identifies requested data

19 URLs of tables in the Example Database:
Median Total Income: http://www40.statcan.ca/l01/cst01/famil108a.htm?sdi=median%20income *
Number of Induced Abortions: http://www40.statcan.ca/l01/cst01/health40a.htm?sdi=abortions
Number of Divorces: http://www40.statcan.ca/l01/cst01/famil02.htm?sdi=number%20divorces
Infant Mortality Rate: http://www40.statcan.ca/l01/cst01/health21a.htm?sdi=infant%20mortality%20rate *
Trips By Canadians in Canada: http://www40.statcan.ca/l01/cst01/arts26a.htm
Number of Homicides: http://www40.statcan.ca/l01/cst01/legal12a.htm?sdi=homicide
Population: http://www40.statcan.ca/l01/cst01/demo02a.htm?sdi=population
Number of Persons with Diabetes: http://www40.statcan.ca/l01/cst01/health54a.htm?sdi=diabetes
Number of Persons with Asthma: http://www40.statcan.ca/l01/cst01/health50a.htm?sdi=asthma
University Degrees Awarded to Males: http://www40.statcan.ca/l01/cst01/educ51b.htm
University Degrees Awarded to Females: http://www40.statcan.ca/l01/cst01/educ51c.htm
Food services and drinking places (13 tables): http://www40.statcan.ca/l01/cst01/serv24j

20 Fields in the Example Database: IDENTIFIER, REGION, YEAR, NUMBER_OF_ABORTIONS, ABORTION_RATE, NUMBER_OF_DIVORCES, INFANT_MORTALITY_RATE, NUMBER_OF_TRIPS, MEDIAN_TOTAL_INCOME, POPULATION, NUMBER_OF_HOMICIDES, GENDER, INCIDENCE_OF_DIABETES, UNIVERSITY_DEGREES_AWARDED, INCIDENCE_OF_ASTHMA, RESTAURANT_OPERATING_REVENUE, RESTAURANT_OPERATING_EXPENSES, RESTAURANT_OPERATING_PROFIT_MARGIN, RESTAURANT_OPERATING_WAGES

21 QBT fills in requested data from Example Database
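How the fill-in step might look against a relational store can be sketched with Python's built-in `sqlite3`. This is not the QBT implementation: the table name `example`, the `fill_query_table` helper, and the whitelist are assumptions made for the sketch. The column names REGION, YEAR, and MEDIAN_TOTAL_INCOME come from the field list in this deck, and the sample values are the median-income figures shown in the delta notation for Canada and Newfoundland and Labrador.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE example (REGION TEXT, YEAR INTEGER, MEDIAN_TOTAL_INCOME INTEGER)"
)
conn.executemany(
    "INSERT INTO example VALUES (?, ?, ?)",
    [
        ("Canada", 2004, 58100),
        ("Canada", 2005, 60600),
        ("Newfoundland and Labrador", 2004, 46100),
        ("Newfoundland and Labrador", 2005, 47600),
    ],
)

# Column names cannot be bound as SQL parameters, so check them explicitly.
ALLOWED_FIELDS = {"MEDIAN_TOTAL_INCOME"}

def fill_query_table(conn, field, regions, years):
    """Return {(region, year): value}, with None where the database has no row."""
    if field not in ALLOWED_FIELDS:
        raise ValueError(f"unknown field: {field}")
    filled = {}
    for region in regions:
        for year in years:
            row = conn.execute(
                f"SELECT {field} FROM example WHERE REGION = ? AND YEAR = ?",
                (region, year),
            ).fetchone()
            filled[(region, year)] = row[0] if row else None
    return filled

filled = fill_query_table(conn, "MEDIAN_TOTAL_INCOME", ["Canada"], [2004, 2005, 2006])
print(filled)
```

Cells with no matching row come back as `None`, so a caller can tell missing data apart from a stored value.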

22 A current puzzle: How can QBT tell that these two query tables represent the same request? NB: Although plausible, both of these tables exemplify poor layout.

Year  Region   Gender  Diabetics
2002  Alberta  Male    XX
               Female  XX
      Ontario  Male    XX
               Female  XX

Year  Region   Diabetics
               Male  Female
2002  Alberta  XX
      Ontario  XX
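The slide leaves this puzzle open. One possible line of attack, offered only as an illustration and not as the authors' solution, is to reduce each layout to the set of label combinations that address its requested cells and then compare the sets. The helpers `fill_down`, `requests_long`, and `requests_wide` are hypothetical, and the sketch assumes that the second table asks for both a Male and a Female cell in each region row.

```python
def fill_down(rows):
    """Copy values from the row above into empty cells (spanning row headers)."""
    filled, previous = [], None
    for row in rows:
        if previous is not None:
            row = [cell or prev for cell, prev in zip(row, previous)]
        filled.append(list(row))
        previous = filled[-1]
    return filled

def requests_long(rows, measure):
    """One requested cell per row; every row header is part of its address."""
    return {frozenset(row + [measure]) for row in fill_down(rows)}

def requests_wide(rows, measure, subheaders):
    """One requested cell per (row, column sub-header) pair."""
    return {
        frozenset(row + [measure, sub])
        for row in fill_down(rows)
        for sub in subheaders
    }

# First layout: Gender is a row header.
table_a = [
    ["2002", "Alberta", "Male"],
    ["", "", "Female"],
    ["", "Ontario", "Male"],
    ["", "", "Female"],
]
# Second layout: Male and Female are sub-headers under Diabetics.
table_b = [["2002", "Alberta"], ["", "Ontario"]]

same = requests_long(table_a, "Diabetics") == requests_wide(
    table_b, "Diabetics", ["Male", "Female"]
)
print(same)  # True
```

Comparing bare label sets ignores which category a label belongs to, so a fuller treatment would compare category paths rather than raw strings.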

23 Next steps:
Complete the conversion of Wang/XML table descriptions to mini ontologies.
Improve the interface for generating a cumulative ontology from mini ontologies.
Implement database generation from the ontology.
Embed logging routines for statistical evaluation of time/error trade-offs.

