Presentation is loading. Please wait.

Presentation is loading. Please wait.

A Robust Approach to Aligning Heterogeneous Lexical Resources Mohammad Taher Pilehvar Roberto Navigli MultiJEDI ERC 259234.

Similar presentations


Presentation on theme: "A Robust Approach to Aligning Heterogeneous Lexical Resources Mohammad Taher Pilehvar Roberto Navigli MultiJEDI ERC 259234."— Presentation transcript:

1 A Robust Approach to Aligning Heterogeneous Lexical Resources Mohammad Taher Pilehvar Roberto Navigli MultiJEDI ERC 259234

2 Lexical Resource WordNet BabelNet UBY

3 Lexical Resource WordNet BabelNet UBY

4 Lexical Resource WordNet BabelNet UBY

5 Improved word and concept coverage  e.g., named entities, new senses Improved domain coverage Improved multilinguality  dozens of new languages Expert-made relations preserved  e.g., Hypernymy, meronymy, etc. Why combine resources?

6 Provides complementary knowledge Applications: Semantic Parsing Shi and Mihalcea, 2005 Semantic Role Labeling Palmer et al., 2010 WSD and entity linking Moro et al., TACL 2014 Why combine resources?

7 A synset from BabelNet

8 Why combine resources? A synset from BabelNet for plant (living organism) a case example BabelNet

9 Why combine resources? A synset from BabelNet for plant (living organism)

10 Why combine resources?

11 A synset from BabelNet for plant (living organism)

12 Why combine resources? A synset from BabelNet for plant (living organism)

13 Difficulty of resource alignment Fine granularity of lexical resources plant WordNet 4 senses15 senses

14 How resource alignment works? WordNet plant#n#1 Usually measures the similarity of two concepts WKT: WN:

15 How resource alignment works? Usually measures the similarity of two concepts And aligns two concepts if their similarity exceeds a certain threshold ? WordNet plant#n#1 WKT: WN:

16 How resource alignment works? Alignment approaches differ in the way they calculate this similarity ? WordNet plant#n#1 WKT: WN:

17 Gloss similarity WordNet Denfinitional similarity

18 Strong baseline Fall short when Different wordings are used for same concepts When two words lack quality glosses key -- Metal device shaped in such a way that when it is inserted into the appropriate lock the lock's mechanism can be rotated Key -- An object designed to open and close a lock Denfinitional similarity

19 Strong baseline Fall short when Different wordings are used for same concepts When two words lack quality glosses plant -- Buildings for carrying on industrial labor. plant -- The necessary infrastructure used in support and maintenance of a given facility. Denfinitional similarity

20 Contributions A novel concept similarity measure Denfinitional similarity A robust technique for alignment of resources A robust technique for alignment of heterogeneous resources An effective ontologization approach

21 Our approach: SemAlign

22 Definition similarity

23 Our approach: SemAlign Structural similarity

24 SemAlign: structural similarity WordNet

25 SemAlign: core Modeling concepts through Semantic Signatures

26 SemAlign: core Semantic Signatures of concepts

27 some Personalized PageRank Semantic Signature of a concept

28 Distributional representation over all concepts in the semantic network Personalized PageRank Semantic Signature of a concept

29 Distributional representation over all concepts in the semantic network...

30 Semantic Signature Example WordNet concept for ----- Importance of concept_3 for our concept...

31 Semantic Signature Importance of concept_3 for our concept... cooking#1 frying_pan#1 oil#4 kitchen#3 Example WordNet concept for -----

32 Semantic Signature Importance of concept_3 for our concept... table#3 carpet#2 sugar#2 natural_gas#2 Example WordNet concept for -----

33 Semantic Signature Importance of concept_3 for our concept... read#3 bank#2 Example WordNet concept for -----

34 SemAlign Definition similarity WordNet

35 SemAlign WordNet

36 SemAlign: signature unification

37 WordNet SemAlign: signature unification Find concepts associated with monosemous words

38 WordNet SemAlign: signature unification Truncate vectors to the overlapping concepts

39 SemAlign: signature unification WordNet Synsets containing at least one monosemous word ~ 60% (72,000) The reliability of leveraging monosemous words

40 Semantic Signature Comparison

41 Weighted Overlap (Pilehvar et al., ACL 2013) Semantic Signature Comparison

42 Comparing Semantic Signatures Weighted Overlap

43

44

45

46 SemAlign: score combination

47 Ontologization of lexical resources

48 WordNet Ontologization of lexical resources

49 WordNet Ontologization of lexical resources

50

51

52 WordNet Ontologization of lexical resources

53 WordNet Ontologization of lexical resources

54

55 Definition page for sail

56 WordNet Ontologization of lexical resources

57 WordNet Ontologization of lexical resources

58

59 WordNet Ontologization of lexical resources

60 Definition page for windmill 1.A machine which translates linear motion of wind to rotational motion by means of adjustable vanes called sails. Ontologization of lexical resources

61 Definition page for windmill 1.A machine which translates linear motion of wind to rotational motion by means of adjustable vanes called sails. windmill.n.1 linear motion.n.1 rotational motion.n.1 translate.v.3 wind.n.1 machine.n.1 sail.n.3 adjustable vanes.v.4 Ontologization of lexical resources

62 Ontologization : similarity-based disambiguation Definition page for windmill 1.A machine which translates linear motion of wind to rotational motion by means of adjustable vanes called sails.

63 Ontologization : similarity-based disambiguation 1.A trip in a boat, especially a sailboat. 2.A sailing vessel; a vessel of any kind; a craft. 3.The blade of a windmill. 4.A tower-like structure found on the dorsal (topside) surface of submarines. 1.A machine which translates linear motion of wind to rotational motion by means of adjustable vanes called sails. Definition page for sail Definition page for windmill

64 Ontologization : similarity-based disambiguation 1.A trip in a boat, especially a sailboat. 2.A sailing vessel; a vessel of any kind; a craft. 3.The blade of a windmill. 4.A tower-like structure found on the dorsal (topside) surface of submarines. 1.A machine which translates linear motion of wind to rotational motion by means of adjustable vanes called sails. Definition page for sail Definition page for windmill

65 Ontologization : similarity-based disambiguation 1.A trip in a boat, especially a sailboat. 2.A sailing vessel; a vessel of any kind; a craft. 3.The blade of a windmill. 4.A tower-like structure found on the dorsal (topside) surface of submarines. 1.A machine which translates linear motion of wind to rotational motion by means of adjustable vanes called sails. Definition page for sail Definition page for windmill

66 Ontologization : similarity-based disambiguation 1.A trip in a boat, especially a sailboat. 2.A sailing vessel; a vessel of any kind; a craft. 3.The blade of a windmill. 4.A tower-like structure found on the dorsal (topside) surface of submarines. 1.A machine which translates linear motion of wind to rotational motion by means of adjustable vanes called sails. Definition page for sail Definition page for windmill

67 Ontologization: evaluation (Meyer & Gurevych, 2012)

68 Alignment Experiments: Datasets (Matuschek & Gurevych, TACL 2013) WordNet WN synsets manually mapped to their corresponding concepts 320 484 315

69 Alignment Experiments: System configurations Parameter 1: score combination parameter

70 Alignment Experiments: System configurations Parameter 2: similarity threshold

71 Alignment Experiments: System configurations 01 0.5 Setting and a. Unsupervised system

72 Alignment Experiments: System configurations 01 a. Unsupervised system Setting and b. Tuning on subset 0.42 0.53

73 Alignment Experiments: System configurations a. Unsupervised system Setting and b. Tuning on subset c. Cross validation 01 0.44 0.51

74 Comparison system Dijkstra-WSA (Matuschek & Gurevych, TACL 2013)

75 Alignment Experiments SB SB+DWSA F1

76 Alignment Experiments F1 SB SB+DWSA

77 SemAlign: structural similarity

78 Alignment Experiments F1

79 Conclusions A novel approach for aligning lexical resources –Accurate even in the absence of training data –Robust across different resources An effective ontologization approach Experiments on aligning –WN to WP, WT, and OW

80 Future directions Integrating the approach into BabelNet for boosting the alignment accuracy Alignment across different languages Updating lexicons with novel terms

81 1. thanks -- an acknowledgment of appreciation 2. thanks -- with the help of or owing to; 1.thanks -- an expression of gratitude. 2. thanks -- because of; normally used with a positive connotation, though it can be used sarcastically. WordNet Thanks!


Download ppt "A Robust Approach to Aligning Heterogeneous Lexical Resources Mohammad Taher Pilehvar Roberto Navigli MultiJEDI ERC 259234."

Similar presentations


Ads by Google