Institut für Informatik Automatische Sprachverarbeitung The Impact of Semantic Handshakes TMRA 2006, Leipzig, 12.10.2006 Lutz Maicher, University of Leipzig.


1 Institut für Informatik Automatische Sprachverarbeitung The Impact of Semantic Handshakes TMRA 2006, Leipzig, 12.10.2006 Lutz Maicher, University of Leipzig maicher@informatik.uni-leipzig.de

2 Institut für Informatik The Impact of Semantic Handshakes Automatische Sprachverarbeitung Lutz Maicher
Agenda
–The Integration Model of the TMDM
–Semantic Handshakes and Interaction Protocols
–Simulations
–Result and Discussion

3 Preliminary Remark
This presentation only describes the impact of a phenomenon that is determined by the existence of
–the integration model of the TMDM (Topic Maps Data Model)
–Topic Maps communication protocols like TMRAP, TMIP, etc.
This presentation does not propose anything new
–neither methodologies, technologies, paradigms, nor anything else

4 The Integration Model of the TMDM

5 The Integration Model of the TMDM
Two Topic Items are equal, i.e. they represent the same Subject, if they have (TMDM 5.3.5):
–at least one equal string in their [subject identifiers] properties,
–at least one equal string in their [item identifiers] properties,
–at least one equal string in their [subject locators] properties,
–an equal string in the [subject identifiers] property of the one topic item and the [item identifiers] property of the other, or
–the same information item in their [reified] properties.
Equal Topic Items A and B have to be merged into C (TMDM 6.2):
–….
–Set C's [subject identifiers] property to the union of the values of A and B's [subject identifiers] properties.
–….
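The equality test and the merge step quoted above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `TopicItem` class, `topics_equal` and `merge` are hypothetical names, the topic item is reduced to the four properties named on the slide, and the union of [item identifiers] and [subject locators] during merging is assumed by analogy, since the slide quotes only the [subject identifiers] step.

```python
from dataclasses import dataclass, field


@dataclass
class TopicItem:
    """Hypothetical, reduced topic item: only the properties named on the slide."""
    subject_identifiers: set = field(default_factory=set)
    item_identifiers: set = field(default_factory=set)
    subject_locators: set = field(default_factory=set)
    reified: object = None


def topics_equal(a, b):
    """Return True if one of the equality conditions listed on the slide holds."""
    return bool(
        a.subject_identifiers & b.subject_identifiers
        or a.item_identifiers & b.item_identifiers
        or a.subject_locators & b.subject_locators
        # Cross check between subject identifiers and item identifiers, both directions.
        or a.subject_identifiers & b.item_identifiers
        or a.item_identifiers & b.subject_identifiers
        or (a.reified is not None and a.reified is b.reified)
    )


def merge(a, b):
    """Merge two equal topic items into a new item C."""
    if not topics_equal(a, b):
        raise ValueError("only equal topic items may be merged")
    return TopicItem(
        # Quoted on the slide: union of the [subject identifiers] values.
        subject_identifiers=a.subject_identifiers | b.subject_identifiers,
        # Assumption by analogy; these steps are elided on the slide.
        item_identifiers=a.item_identifiers | b.item_identifiers,
        subject_locators=a.subject_locators | b.subject_locators,
        reified=a.reified if a.reified is not None else b.reified,
    )
```

With `a = TopicItem({"ns1:LutzMaicher", "ns2:MaicherLutz"})` and `b = TopicItem({"ns2:MaicherLutz"})`, `topics_equal(a, b)` holds because both share `ns2:MaicherLutz`, and `merge(a, b)` carries both identifiers.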

6 The Integration Model of the TMDM in practice
In the case of terminological diversity …
A: [subject identifier] {ns1:LutzMaicher}
B: [subject identifier] {ns2:MaicherLutz}
Equality does not hold (according to the TMDM), so A and B remain separate:
A: [subject identifier] {ns1:LutzMaicher}
B: [subject identifier] {ns2:MaicherLutz}

7 The Integration Model of the TMDM in practice
In the case of terminological alignment … (the PSI case)
A, B: [subject identifier] {ns1:LutzMaicher}
Equality holds (according to the TMDM).
Merging (according to the TMDM) yields C: [subject identifier] {ns1:LutzMaicher}
But who can enforce universal vocabularies?

8 Semantic Handshakes and Interaction Protocols

9 Semantic Handshake
A: [subject identifier] {ns1:LutzMaicher, ns2:MaicherLutz}
B: [subject identifier] {ns2:MaicherLutz}
Equality holds (according to the TMDM).
Merging (according to the TMDM) yields C: [subject identifier] {ns1:LutzMaicher, ns2:MaicherLutz}
The author of A has decided that both terms can be used to indicate Lutz Maicher.

10 Local Semantic Handshakes and Interaction Protocols
A: [subject identifier] {ns1:LutzMaicher, ns2:MaicherLutz}
B: [subject identifier] {ns2:MaicherLutz, ns3:ML}
C: [subject identifier] {ns3:ML}
D: [subject identifier] {ns4:Lutz, ns3:ML}
Diagram labels: Local Semantic Handshake; TM1, TM3, TM2, TM4
All Topic Maps interact using the existing protocols like TMRAP, TMIP …

11 Local Semantic Handshakes and Interaction Protocols
A: [subject identifier] {ns1:LutzMaicher, ns2:MaicherLutz}
B: [subject identifier] {ns2:MaicherLutz, ns3:ML}
C: [subject identifier] {ns3:ML}
D: [subject identifier] {ns4:Lutz, ns3:ML}
Step 1
Request: Do you have a Topic Item with ns1:LutzMaicher or ns2:MaicherLutz in the property [subject identifier]? (Do you have information about the Subject Lutz Maicher?)

12 Local Semantic Handshakes and Interaction Protocols
A: [subject identifier] {ns1:LutzMaicher, ns2:MaicherLutz}
B: [subject identifier] {ns2:MaicherLutz, ns3:ML}
C: [subject identifier] {ns3:ML}
D: [subject identifier] {ns4:Lutz, ns3:ML}
Step 1
Request: Do you have a Topic Item with ns1:LutzMaicher or ns2:MaicherLutz in the property [subject identifier]? (Do you have information about the Subject Lutz Maicher?)
Answers shown in the diagram:
–NO
–ns2:MaicherLutz, ns3:ML
–ns2:MaicherLutz, ns1:LutzMaicher

13 Local Semantic Handshakes and Interaction Protocols
A, B: [subject identifier] {ns1:LutzMaicher, ns2:MaicherLutz, ns3:ML}
C: [subject identifier] {ns3:ML}
D: [subject identifier] {ns4:Lutz, ns3:ML}
Step 2
Request: Do you have a Topic Item with ns1:LutzMaicher, ns2:MaicherLutz or ns3:ML in the property [subject identifier]?

14 Local Semantic Handshakes and Interaction Protocols
A, B: [subject identifier] {ns1:LutzMaicher, ns2:MaicherLutz, ns3:ML}
C: [subject identifier] {ns3:ML}
D: [subject identifier] {ns4:Lutz, ns3:ML}
Step 2
Request: Do you have a Topic Item with ns1:LutzMaicher, ns2:MaicherLutz or ns3:ML in the property [subject identifier]?
Answers shown in the diagram:
–ns1:LutzMaicher, ns3:ML, ns2:MaicherLutz
–ns3:ML
–ns4:Lutz, ns3:ML
–ns1:LutzMaicher, ns3:ML, ns2:MaicherLutz

15 Local Semantic Handshakes lead to Global Integration
A: [subject identifier] {ns1:LutzMaicher, ns2:MaicherLutz, ns3:ML, ns4:Lutz}
B: [subject identifier] {ns1:LutzMaicher, ns2:MaicherLutz, ns3:ML}
C, D: [subject identifier] {ns1:LutzMaicher, ns2:MaicherLutz, ns3:ML, ns4:Lutz}
Diagram labels: TM1, TM3, TM2, TM4
Global Integration through Local Semantic Handshakes.
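The two request steps walked through on the preceding slides can be replayed with plain Python sets. This is a minimal sketch, not an implementation of TMRAP or TMIP: `answer` and `request_step` are hypothetical helpers, and it assumes that a peer answers with all of its subject identifiers whenever it shares at least one with the request, and that A is the requesting topic.

```python
def answer(request_ids, own_ids):
    """A peer answers with its identifiers if it shares one with the request, else None."""
    return set(own_ids) if request_ids & own_ids else None


def request_step(requester_ids, peers):
    """Send one request to all peers and merge every positive answer into the requester."""
    merged = set(requester_ids)
    for peer_ids in peers:
        # The request carries the identifiers the requester had before this step.
        reply = answer(requester_ids, peer_ids)
        if reply is not None:
            merged |= reply
    return merged


# Identifier sets of the four topics from the slides.
a = {"ns1:LutzMaicher", "ns2:MaicherLutz"}
b = {"ns2:MaicherLutz", "ns3:ML"}
c = {"ns3:ML"}
d = {"ns4:Lutz", "ns3:ML"}

a_after_step1 = request_step(a, [b, c, d])              # only B answers positively
a_after_step2 = request_step(a_after_step1, [b, c, d])  # B, C and D answer positively
```

After step 1, A knows ns3:ML through B's handshake; in step 2 this lets C and D answer, and D's handshake contributes ns4:Lutz. No topic map had to know all four terms in advance.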

16 Hypothesis and Simulation Design

17 Hypothesis
Due to the existence of the TMDM and interaction protocols, terminological diversity will be resolved to global integration if the majority of Topics disclose one local Semantic Handshake.
Simulations for testing the Hypothesis …

18 Simulation Design
Create Topics
–Create a number (cardE) of Topics that are assumed to exist in the world and that represent the same Subject by definition
–All Topics can always interact with each other
Add Subject Identifiers randomly
–Draw a number of Subject Identifiers (nbrOfDifferentII) to be assigned to the Topic according to a given distribution (distributionNbrOfII)
if the number is 1: no semantic handshake
if the number is greater than 1: semantic handshakes are done
–Draw for each Subject Identifier of a Topic an integer according to a given distribution (distributionII) in the range [1..nbrOfII]
Start Interaction between Topics
–If two Topics have an identical number in their sets of Subject Identifiers, they are merged (the sets of Subject Identifiers of both Topics become the union of the original sets)
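The simulation design above could be prototyped roughly as follows. This is a simplified sketch and not the original simulation code: `simulate` and `count_clouds` are hypothetical names, distributionNbrOfII is reduced to a single assumed parameter `handshake_prob` (two identifiers with that probability, otherwise one), and distributionII is assumed to be uniform.

```python
import random


def simulate(card_e, nbr_of_different_ii, handshake_prob, rng):
    """Return the identifier sets of all topics after merging to a fixed point."""
    topics = []
    for _ in range(card_e):
        # Simplified distributionNbrOfII: 2 identifiers with handshake_prob, else 1.
        k = 2 if rng.random() < handshake_prob else 1
        # Simplified distributionII: uniform over [1..nbr_of_different_ii].
        # Two draws may coincide, in which case the topic ends up with one identifier.
        topics.append({rng.randint(1, nbr_of_different_ii) for _ in range(k)})

    # All topics can interact with each other: merge pairwise until nothing changes.
    changed = True
    while changed:
        changed = False
        for i in range(card_e):
            for j in range(i + 1, card_e):
                if topics[i] & topics[j] and topics[i] != topics[j]:
                    union = topics[i] | topics[j]
                    topics[i], topics[j] = union, set(union)
                    changed = True
    return topics


def count_clouds(topics):
    """After the fixed point, topics in one integration cloud hold identical sets."""
    return len({frozenset(t) for t in topics})
```

A call such as `count_clouds(simulate(100, 100, 0.55, random.Random(42)))` gives the number of integration clouds for one random run; the result varies with the seed, so several runs should be averaged before comparing settings.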

19 Definition of a Distribution
Distributions are defined as follows:
is similar to the lottery
–that 1,2,3 is drawn with the probability 80%
–that 1,2,3 is drawn with the probability 20%
is similar to the lottery
–that a number in [1,25] is drawn with the probability 80%
–that a number in [26,50] is drawn with the probability 10%
–that a number in [51,75] is drawn with the probability 7%
–that a number in [76,100] is drawn with the probability 3%
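The second lottery above can be drawn from with the Python standard library. This is a sketch under one assumption that the slide does not state: within each range, every number is taken to be equally likely. `BUCKETS` and `draw` are hypothetical names.

```python
import random

# Ranges and probabilities of the second lottery: ((low, high), probability).
BUCKETS = [((1, 25), 0.80), ((26, 50), 0.10), ((51, 75), 0.07), ((76, 100), 0.03)]


def draw(buckets, rng):
    """Pick a range according to its probability, then a number uniformly inside it."""
    ranges = [r for r, _ in buckets]
    weights = [w for _, w in buckets]
    low, high = rng.choices(ranges, weights=weights, k=1)[0]
    return rng.randint(low, high)
```

Drawing many values, for example `[draw(BUCKETS, random.Random(7)) for _ in range(5)]` with a shared generator instead, shows the intended skew: most identifiers come from the 25 most prominent terms.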

20 Analysis - Measures
Measures of Interest (after some iterations)
–Number of independent clusters (integration clouds): clouds(E)
an integration cloud is a set of Topics which are equal
the lower the better; clouds(E) = 1 means global integration
–Average size of the integration clouds: card(T)
the higher the better; card(T) = card(E) means global integration
Examples from the figures:
clouds(E) = 3, card(T) = 33/9 = 3.7
clouds(E) = 2, card(T) = 41/9 = 4.6
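Both measures can be computed from the identifier sets of the topics. The sketch below rests on an interpretation rather than on a definition given in the slides: card(T) is read as the average, over all topics, of the size of the integration cloud a topic belongs to. That reading reproduces both worked figures if one assumes nine topics in clouds of sizes 4, 4, 1 (16 + 16 + 1 = 33) and 5, 4 (25 + 16 = 41); these sizes are inferred, not stated. `cloud_labels` and `measures` are hypothetical names.

```python
from collections import Counter


def cloud_labels(topics):
    """Label each topic with its integration cloud, using union-find over shared identifiers."""
    parent = list(range(len(topics)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    owner = {}  # identifier -> index of the first topic seen with it
    for i, ids in enumerate(topics):
        for ident in ids:
            if ident in owner:
                parent[find(i)] = find(owner[ident])
            else:
                owner[ident] = i
    return [find(i) for i in range(len(topics))]


def measures(topics):
    """Return (clouds(E), card(T)) under the interpretation described above."""
    sizes = Counter(cloud_labels(topics))
    clouds = len(sizes)
    # Each topic contributes the size of its own cloud, hence the sum of squares.
    card_t = sum(s * s for s in sizes.values()) / len(topics)
    return clouds, card_t
```

For nine topics of which four carry `{1}`, four carry `{2}` and one carries `{3}`, `measures` returns 3 clouds and 33/9; under global integration it returns 1 cloud and card(T) equal to the number of topics.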

21 Experiment Series

22 Simulation: Global Ontology (the PSI Case)
No simulation is necessary
–each Topic has the same, globally unique Subject Identifier
–clouds(E) = 1 (Global Integration)
–card(T) = card(E)
… but the enforcement of global ontologies is an overly optimistic premise!

23 Simulation: Heterogeneous World without Semantic Handshakes
Iteration of nbrOfDifferentII in [5,100]
general parameter: card(E)=100, distributionNbrOfII=
specific parameter exp01: distributionII=
specific parameter exp02: distributionII=
Plot annotations: some terms are more prominent; no Semantic Handshakes
100 different terms will be resolved to fewer than 40 integration clouds because some authors use the same term by chance (esp. the most prominent terms)

24 Simulation: The Impact of Semantic Handshakes
Iteration of a in distributionNbrOfII= in [0.0,1.0]
general parameters: card=100, nbrOfDifferentII=100
specific parameters exp03: distributionII=
specific parameters exp04: distributionII=
Plot annotations: no semantic handshakes; always a semantic handshake; some terms are more prominent; high terminological diversity
100 different terms will be resolved to ten integration clouds if only 55% of all Topics disclose a Semantic Handshake!

25 Simulation: The Impact of the terminological diversity
Iteration of nbrOfDifferentII in [2,100]
general parameters: cardE=100, distributionII=
specific parameter exp05: distributionNbrOfII=
specific parameter exp06: distributionNbrOfII=
Plot annotations: high terminological diversity; low terminological diversity; semantic handshake by the minority; semantic handshake by the majority
50 different terms will be resolved to global integration if 80% of all Topics disclose a Semantic Handshake!

26 Result and Discussion

27 Result
Hypothesis is proven: Global Integration will be reached if a significant number (majority) of Topics disclose one semantic handshake.
–Remarks
the effect only appears if interaction links exist between all topic maps
the point in time at which the effect appears depends on the interaction frequency
The more prominent the used terms are, the lower the global number of semantic handshakes necessary for global integration.
Design Recommendation:
–Assign two (prominent) Subject Identifiers to each Topic you create. (You don't have to be aware of all existing terms for your concept.)

28 Discussion
These findings include problems concerning
–Wrong Semantic Handshakes (by mistake, on purpose)
–Homonymy (= the same term for different concepts)
–Trust (Can I trust the local Semantic Handshakes?)
… but they are implied by the existence of the
–TMDM and
–Topic Maps Interaction Protocols

29 Questions?!

