Download presentation
Presentation is loading. Please wait.
1
Mapping Target Schemas to Source Schemas Using WordNet Hierarchies Master’s Thesis Proposal David Jackman
2
Biskup-Embley Approach PublicationAbstract Title Author Publication Year Publication Name 1 1 1 0:1 1:* has Target View JournalEditor Title Publication Date 1 0:1 1:* has Article TitleAuthor has Abstract 1 1:* 1 1 0:1 1:* 1 has Source Schema Other Source Schemas
3
WordNet Background A lexical database for English Developed at Cognitive Science Lab at Princeton University Has become a standard in natural language research English terms are organized into synonym sets, each representing one underlying lexical concept
4
Framework Mapping Process Target Schema (XML) Integration Mappings (XML) Source Schema (XML)
5
Mapping Process Written in Java For this research, two procedures are used: –First creates mappings with a WordNet score –Second adds a context score to those mappings
6
JournalEditor Title Publication Date 1 0:1 1:* has Article TitleAuthor has Abstract 1 1:* 1 1 0:1 1:* 1 has PublicationAbstract Title Author Publication Year Publication Name 1 1 1 0:1 1:* has Target Source Publication#1 Work#2 Product#2 Creation#2 Piece#2 Article#1
7
JournalEditor Title Publication Date 1 0:1 1:* has Article TitleAuthor has Abstract 1 1:* 1 1 0:1 1:* 1 has PublicationAbstract Title Author Publication Year Publication Name 1 1 1 0:1 1:* has Target Source Target TermDistanceSource TermDistanceRoot Term Publication3Article2Creation Publication0Journal2Publication (Pub.) Name0Title2Name (Pub.) Name5(Pub.) Date4Social Group (Pub.) Name1Article4Language Unit (Pub.) Year0(Pub.) Date2Year Author0 0 Abstract0 0 2Title1Writing
8
JournalEditor Title Publication Date 1 0:1 1:* has Article TitleAuthor has Abstract 1 1:* 1 1 0:1 1:* 1 has PublicationAbstract Title Author Publication Year Publication Name 1 1 1 0:1 1:* has Target Source
9
Experiments Run in 4 domains: Music CDs, Genealogy, Real Estate, Travel Source schemas taken from existing web sites Target schemas taken from standard XML DTDs or created by independent third parties Combined WordNet & Context scores compared against mappings determined by a human expert
10
Conclusion Data integration process can become more automatic Combination with research being done in data extraction and data warehousing
Similar presentations
© 2024 SlidePlayer.com Inc.
All rights reserved.