Presentation is loading. Please wait.

Presentation is loading. Please wait.

ArchiWordNet Integrating WordNet with Domain-Specific Knowledge Luisa Bentivogli 1, Andrea Bocco 2, Emanuele Pianta 1 1 ITC-irst Trento, Italy 2 Politecnico.

Similar presentations


Presentation on theme: "ArchiWordNet Integrating WordNet with Domain-Specific Knowledge Luisa Bentivogli 1, Andrea Bocco 2, Emanuele Pianta 1 1 ITC-irst Trento, Italy 2 Politecnico."— Presentation transcript:

1 ArchiWordNet Integrating WordNet with Domain-Specific Knowledge Luisa Bentivogli 1, Andrea Bocco 2, Emanuele Pianta 1 1 ITC-irst Trento, Italy 2 Politecnico di Torino, Italy

2 GWC 2004 - Brno, January 20-23, 2004 Outline ArchiWordNet: a WordNet-like thesaurus Adopting and adapting the MultiWordNet model Integrating ArchiWordNet with MultiWordNet Conclusion and future work

3 GWC 2004 - Brno, January 20-23, 2004 Outline ArchiWordNet: a WordNet-like thesaurus Adopting and adapting the MultiWordNet model Integrating ArchiWordNet with MultiWordNet Conclusion and future work

4 GWC 2004 - Brno, January 20-23, 2004 ArchiWordNet: a WordNet-like thesaurus A bilingual English/Italian thesaurus for the “Architecture and Construction” domain –structured according to the WordNet model –fully integrated with MultiWordNet MultiWordNet A multilingual lexical database in which the Italian WordNet is strictly aligned with Princeton’s English WordNet.

5 GWC 2004 - Brno, January 20-23, 2004 Motivation Still Image Server, an architecture image archive available at the Polytechnic of Turin –need for a thesaurus: Image cataloguing (minimize subjectivity) Image retrieval (minimize ambiguity) No exhaustive thesauri for the architecture domain are available

6 GWC 2004 - Brno, January 20-23, 2004 Why (Multi)WordNet model? A rich and rigorous structure –synonyms –many relations explicitly and homogeneously encoded Allows for a more powerful and expressive retrieval mechanism –no ambiguities –extended search with related concepts Is more suitable for educational purposes

7 GWC 2004 - Brno, January 20-23, 2004 Why integrated with MultiWN? General and multilingual framework for the specialized knowledge Integrated access allowing for a more flexible retrieval of the information Information already existing in the generic (Multi)WordNet can be exploited in the creation of the specialized one

8 GWC 2004 - Brno, January 20-23, 2004 Outline ArchiWordNet: a WordNet-like thesaurus Adopting and adapting the MultiWordNet model Integrating ArchiWordNet with MultiWordNet Conclusion and future work

9 GWC 2004 - Brno, January 20-23, 2004 Adopting MultiWN model Sources: –Specialized sources Art and Architecture Thesaurus (AAT) Construction Indexing Manual of CI|SfB International and National standards (ISO, CEN, UNI) Architecture and Building Dictionaries Domain literature –MultiWN itself Issues: –Reorganize specialized sources to make them compatible with the MultiWN model –Modify MultiWN synsets to make them suitable for representing the specialized domain

10 GWC 2004 - Brno, January 20-23, 2004 Reorganizing domain-specific sources AAT hierarchy ArchiWN hierarchy

11 GWC 2004 - Brno, January 20-23, 2004 Tailoring MultiWN synsets MultiWN synsets considered appropriate by the domain experts are included into ArchiWN Several options are available: –add or delete synonyms to MultiWN synsets –modify MultiWN definitions of the synsets –delete and add relations between synsets

12 GWC 2004 - Brno, January 20-23, 2004 New relations for ArchiWN HAS FORM (n/n) –{tympanum} HAS-FORM {triangle, trigon, …} HAS ROLE (n/n) –{metal section} HAS-ROLE {upright, vertical} HAS FUNCTION (n/v) –{beam} HAS-FUNCTION {to hold, to support,…}

13 GWC 2004 - Brno, January 20-23, 2004 Outline ArchiWordNet: a WordNet-like thesaurus Adopting and adapting the MultiWordNet model Integrating ArchiWordNet with MultiWordNet Conclusion and future work

14 GWC 2004 - Brno, January 20-23, 2004 Integrating ArchiWN with MultiWN 5,000 terms grouped in 13 semantic areas => the main ArchiWN hierarchies Architectural styles Materials Construction products Techniques Tools Components of buildings Single buildings and building complexes Physical properties Conditions Disciplines People Documents Drawings and representations

15 GWC 2004 - Brno, January 20-23, 2004 Integration issues Identify the MultiWN nodes where to insert the ArchiWN hierarchies Include ArchiWN hierarchies in MultiWN Handle the overlaps between terms present in both MultiWN and ArchiWN Handle the possible inconsistencies in the hierarchies

16 GWC 2004 - Brno, January 20-23, 2004 The integration methodology Basic operations –performed on single MultiWN synsets Complex procedures (plug-in) –apply to entire hierarchies

17 GWC 2004 - Brno, January 20-23, 2004 Basic operations eclipse a synset tag a synset with the “architecture and construction” domain label add or delete relations to a synset add or delete synonyms in a synset modify the synset definition

18 GWC 2004 - Brno, January 20-23, 2004 Complex procedures Substitutive plug-in Integrative plug-in Hyponymic plug-in Inverse plug-in

19 GWC 2004 - Brno, January 20-23, 2004 Complex procedures Substitutive plug-in Integrative plug-in Hyponymic plug-in Inverse plug-in MWN

20 GWC 2004 - Brno, January 20-23, 2004 Complex procedures Substitutive plug-in Integrative plug-in Hyponymic plug-in Inverse plug-in AWN MWN

21 GWC 2004 - Brno, January 20-23, 2004 Complex procedures Substitutive plug-in Integrative plug-in Hyponymic plug-in Inverse plug-in MWN

22 GWC 2004 - Brno, January 20-23, 2004 Complex procedures Substitutive plug-in Integrative plug-in Hyponymic plug-in Inverse plug-in AWN MWN

23 GWC 2004 - Brno, January 20-23, 2004 Complex procedures Substitutive plug-in Integrative plug-in Hyponymic plug-in Inverse plug-in MWN

24 GWC 2004 - Brno, January 20-23, 2004 Complex procedures Substitutive plug-in Integrative plug-in Hyponymic plug-in Inverse plug-in MWN AWN

25 GWC 2004 - Brno, January 20-23, 2004 Complex procedures Substitutive plug-in Integrative plug-in Hyponymic plug-in Inverse plug-in AWN

26 GWC 2004 - Brno, January 20-23, 2004 Complex procedures Substitutive plug-in Integrative plug-in Hyponymic plug-in Inverse plug-in AWN MWN

27 GWC 2004 - Brno, January 20-23, 2004 Results 13 ArchiWN semantic areas plugged in 18 MultiWN synsets –11 ArchiWN semantic areas (12 hierarchies) directly plugged in MultiWN11 4 substitutive plug-ins 8 integrative plug-ins –2 ArchiWN semantic areas (6 hierarchies) required a reorganization of some MultiWN sub-hierarchies2 4 hyponymic plug-ins 2 inverse plug-ins large synset eclipsing

28 GWC 2004 - Brno, January 20-23, 2004 ArchiWN up to now “Single buildings and building complexes” sub- hierarchy –900 synsets –Italian and English synonyms –accurate definition Work done manually using the MultiWN graphical interface which allows the user –to modify existing synsets and relations –to create new synsets

29 GWC 2004 - Brno, January 20-23, 2004 Outline ArchiWordNet: a WordNet-like thesaurus Adopting and adapting the MultiWordNet model Integrating ArchiWordNet with MultiWordNet Conclusion and future work

30 GWC 2004 - Brno, January 20-23, 2004 Conclusions It is possible to integrate ArchiWN with MultiWN MultiWN itself can be widely exploited in the creation of ArchiWN hierarchies Advantages of interdisciplinary cooperation –wrt specialized thesauri formalized structure inheritance of linguistic-oriented information from the generic WordNet –wrt lexical resources many synsets will be associated with images

31 GWC 2004 - Brno, January 20-23, 2004 Future work Go on enriching the “Single buildings and building complexes” hierarchy and populating the remaining hierarchies Industrial applications: multilingual specialized lexicon of approximately 1,000 synsets for the window and curtain wall industry Agreement for the future usage of ArchiWN by the Piemonte region in the cataloguing of its architectural cultural heritage

32 GWC 2004 - Brno, January 20-23, 2004 Details

33 GWC 2004 - Brno, January 20-23, 2004 Direct plug-ins Architectural stylesarchitectural style/1Sub Materialsmaterial/1, substance/1Sub Construction productsbuilding material/1Sub Techniquestechnique/1Int Toolstool/1Int Physical propertiesphysical property/1Int Conditionscondition/1Int Disciplinesdiscipline/1Int Peopleperson/Int Documentsdocument/1Int Drawings and representationsdrawing/2,representation/2Int back

34 GWC 2004 - Brno, January 20-23, 2004 Reorganizations back Components of buildingsstructure/1 component/3 region/1 Hypo Single buildings and building complexes structure/AWN building/1 building complex/1 Hypo Inverse

35 GWC 2004 - Brno, January 20-23, 2004 Term overlapping ITC-irst provides the Polythecnic with lists of terms: -synsets tagged with the “architecture” label in WN-Domains -hyponyms of WordNet plug-in synsets WN-Domains: 2,595 Architecture = 155 synsets –Town planning = 444 synsets –Building industry = 1,541 synsets –Furniture = 455 synsets

36 GWC 2004 - Brno, January 20-23, 2004 Hyponyms of Plug-in synsets Architectural stylesarchitectural style/1S12 hyponyms Materials material/1 substance/1 S 1,266 hyponyms 6,054 hyponyms Construction productsbuilding material/1S 95 hyponyms Techniquestechnique/1I 3 hyponyms Toolstool/1I301 hyponyms Physical propertiesphysical property/1I 103 hyponyms Conditionscondition/1I 1,721 hyponyms Disciplinesdiscipline/1I464 hyponyms Peopleperson/I6,068 hyponyms Documentsdocument/1I328 hyponyms Drawings and representations drawing/2, representation/2 IIII 26 hyponyms 159 hyponyms back

37 GWC 2004 - Brno, January 20-23, 2004 building complex/1 room, area, building space building element open space entity/1 object/1 artifact/1 structure/1 architectural component part/4 location/1 structure (AWN) region/1 component/3 architectural space building/1 hypo inverse eclipsing Reorganization of: -Components of buildings -Single buildings and building complexes

38 GWC 2004 - Brno, January 20-23, 2004 Modifying MultiWN definition structural_wall bearing_wall ISA an architectural partition with a height and length greater than its thickness; used to divide or enclose an area support wall partition divider any wall supporting a floor or the roof of a building WordNet: {wall – “an architectural partition with a height and length greater than its thickness; used to divide or enclose an area or to support another structure”}


Download ppt "ArchiWordNet Integrating WordNet with Domain-Specific Knowledge Luisa Bentivogli 1, Andrea Bocco 2, Emanuele Pianta 1 1 ITC-irst Trento, Italy 2 Politecnico."

Similar presentations


Ads by Google