Presentation is loading. Please wait.

Presentation is loading. Please wait.

Resolving Personal Names in Email Using Context Expansion Tamer Elsayed, Douglas W. Oard, and Galileo Namata ACL, Columbus, Ohio, June 2008 Human Language.

Similar presentations


Presentation on theme: "Resolving Personal Names in Email Using Context Expansion Tamer Elsayed, Douglas W. Oard, and Galileo Namata ACL, Columbus, Ohio, June 2008 Human Language."— Presentation transcript:

1 Resolving Personal Names in Email Using Context Expansion Tamer Elsayed, Douglas W. Oard, and Galileo Namata ACL, Columbus, Ohio, June 2008 Human Language Technology Center of Excellence UMIACS CLIP Lab

2 Resolving Personal Names in Email Using Context Expansion 2 Real Problem National Archives Clinton White House Tobacco Policy search request hired 25 persons ~~~~~~~~ ~~~~~~~~ ~~~~~~~~ ~~~~~~~~ ~~~~~~~~ 32 million emails 200,000 80,000 for 6 months …

3 Resolving Personal Names in Email Using Context Expansion 3 Date: Wed Dec 20 08:57:00 EST 2000 From: Kay Mann To: Suzanne Adams Subject: Re: GE Conference Call has be rescheduled Did Sheila want Scott to participate? Looks like the call will be too late for him. Sheila Identity Resolution in Email WHO?WHO?

4 Resolving Personal Names in Email Using Context Expansion 4 Enron Collection -----Original Message----- From: SStack@reliant.com@ENRON Sent: Monday, July 30, 2001 2:24 PM To: Sager, Elizabeth; Murphy, Harlan; jcrespo@hess.com; wfhenze@jonesday.com Cc: ntillett@reliant.com Subject:Shhhh.... it's a SURPRISE ! Message-ID: Date: Mon, 30 Jul 2001 12:40:48 -0700 (PDT) From: elizabeth.sager@enron.com To: sstack@reliant.com Subject: RE: Shhhh.... it's a SURPRISE ! X-From: Sager, Elizabeth X-To: 'SStack@reliant.com@ENRON' Hope all is well. Count me in for the group present. See ya next week if not earlier Please call me (713) 207-5233 Liza Elizabeth Sager 713-853-6349 Hi Shari Thanks! Shari 55 Sheila’s !! weisman pardo glover rich jones breeden huckaby tweed mcintyre chadwick birmingham kahanek foraker tasman fisher petitt Dombo Robbins chang jarnot kirby knudsen boehringer lutz glover wollam jortner neylon whanger nagel graves mclaughlin venville rappazzo miller swatek hollis maynes nacey ferrarini dey macleod howard darling watson perlick advani hester kenner lewis walton whitman berggren osowski kelly Rank Candidates

5 Resolving Personal Names in Email Using Context Expansion 5 Proposed Generative Model person 1.Choose “person” c to mention p(c)p(c) context 2.Choose appropriate “context” X to mention c p(X | c) mention 3.Choose a “mention” l p(l | X, c) “sheila” GE conference call

6 Resolving Personal Names in Email Using Context Expansion 6 3-Step Solution (1) Identity Modeling Posterior Distribution (3) Mention Resolution (2) Context Reconstruction

7 Resolving Personal Names in Email Using Context Expansion 7 Outline  Introduction and Approach Overview  Computational Model of Identity  Context Reconstruction  Mention Resolution  Evaluation  Conclusion and Future Work

8 Resolving Personal Names in Email Using Context Expansion 8 “Easy References” of Identity -----Original Message----- From: SStack@reliant.com@ENRON Sent: Monday, July 30, 2001 2:24 PM To: Sager, Elizabeth; Murphy, Harlan; jcrespo@hess.com; wfhenze@jonesday.com Cc: ntillett@reliant.com Subject:Shhhh.... it's a SURPRISE ! Message-ID: Date: Mon, 30 Jul 2001 12:40:48 -0700 (PDT) From: elizabeth.sager@enron.com To: sstack@reliant.com Subject: RE: Shhhh.... it's a SURPRISE ! X-From: Sager, Elizabeth X-To: 'SStack@reliant.com@ENRON' Hope all is well. Count me in for the group present. See ya next week if not earlier Please call me (713) 207-5233 Liza Elizabeth Sager 713-853-6349 Hi Shari Thanks! Shari Email Standards Email-Client Behavior User Regularities Elsayed and Oard, CEAS 2006

9 Resolving Personal Names in Email Using Context Expansion 9 Representational Model of Identity 77,240 “non-trivial” models sheila.glover@enron.com 14 (Quoted Headers) sheila glover 932 (Main Headers) sheila 19 (Salutation) 216 (Signature) sg 19 (Signature) sheila glover 1170 (User Name) Representational Model Elsayed and Oard, CEAS 2006

10 Resolving Personal Names in Email Using Context Expansion 10 Computational Model of Identity c m t identity observed mention name type

11 Resolving Personal Names in Email Using Context Expansion 11 Identity Models Candidates Likelihood: p ( “sheila” | c)

12 Resolving Personal Names in Email Using Context Expansion 12 Outline  Introduction and Approach Overview  Computational Model of Identity  Context Reconstruction  Mention Resolution  Evaluation  Conclusion and Future Work

13 Resolving Personal Names in Email Using Context Expansion 13 Date: Wed Dec 20 08:57:00 EST 2000 From: Kay Mann To: Suzanne Adams Subject: Re: GE Conference Call has be rescheduled Did Sheila want Scott to participate? Looks like the call will be too late for him. Sheila Who is that “Sheila”? ?

14 Resolving Personal Names in Email Using Context Expansion 14 Contextual Space Local Context Local Context Conversational Context Conversational Context Topical Context

15 Resolving Personal Names in Email Using Context Expansion 15 Topical Context Date: Fri Dec 15 05:33:00 EST 2000 From: david.oxley@enron.com To: vince j kaminski Cc: sheila walton Subject: Re: Grant Masson Great news. Lets get this moving along. Sheila, can you work out GE letter? Vince, I am in London Monday/Tuesday, back Weds late. I'll ask Sheila to fix this for you and if you need me call me on my cell phone. sheila.walton@enron.com Date: Wed Dec 20 08:57:00 EST 2000 From: Kay Mann To: Suzanne Adams Subject: Re: GE Conference Call has be rescheduled Did Sheila want Scott to participate? Looks like the call will be too late for him. Sheila call Sheila call GE

16 Resolving Personal Names in Email Using Context Expansion 16 Contextual Space Social Context Local Context Local Context Conversational Context Conversational Context Topical Context

17 Resolving Personal Names in Email Using Context Expansion 17 Date: Wed Dec 20 08:57:00 EST 2000 From: Kay Mann To: Suzanne Adams Subject: Re: GE Conference Call has be rescheduled Did Sheila want Scott to participate? Looks like the call will be too late for him. Social Context Date: Tue, 19 Dec 2000 07:07:00 -0800 (PST) From: rebecca.walker@enron.com To: kay.mann@enron.com Subject: ESA Option Execution Kay Can you initial the ESA assignment and assumption agreement or should I ask Sheila Tweed to do it? I believe she is currently en route from Portland. Thanks, Rebecca Sheila Tweed kay.mann@enron.com

18 Resolving Personal Names in Email Using Context Expansion 18 Formally probability distribution  A context of an email is a probability distribution over emails Probability estimated based on type of context  Contextual Space is a linear combination of 4 contexts

19 Resolving Personal Names in Email Using Context Expansion 19 Contextual Space (emails) Social Context Local Context Local Context Conversational Context Conversational Context Topical Context

20 Resolving Personal Names in Email Using Context Expansion 20 Contextual Space (mentions) “Sheila” social conversational social topical social topical “Sheila Tweed” “sheila” “jsheila@enron.com” “sg” “Sheila Walton” “Sheila”

21 Resolving Personal Names in Email Using Context Expansion 21 Outline  Introduction and Approach Overview  Computational Model of Identity  Context Reconstruction  Mention Resolution  Evaluation  Conclusion and Future Work

22 Resolving Personal Names in Email Using Context Expansion 22 Mention Resolution Candidates Likelihood: p ( “sheila” | c) Goal: estimate p(c|m, X(m)) and rank accordingly Date: Wed Dec 20 08:57:00 EST 2000 From: Kay Mann To: Suzanne Adams Subject: Re: GE Conference Call has be rescheduled Did Sheila want Scott to participate? Looks like the call will be too late for him. Sheila 1 2 3 ?

23 Resolving Personal Names in Email Using Context Expansion 23 Context-Free Resolution “Sheila” social conversational social topical social topical “Sheila Tweed” “sheila” “jsheila@enron.com” “sg” “Sheila Walton” “Sheila” X Context-Free Resolution

24 Resolving Personal Names in Email Using Context Expansion 24 Contextual Resolution “Sheila” social conversational social topical social topical “Sheila Tweed” “sheila” “jsheila@enron.com” “sg” “Sheila Walton” “Sheila” “Sheila Tweed” “sheila” “jsheila@enron.com” “sg” “Sheila Walton” “Sheila” Context-Free Resolution

25 Resolving Personal Names in Email Using Context Expansion 25 Outline  Introduction and Approach Overview  Computational Model of Identity  Context Reconstruction  Mention Resolution  Evaluation  Conclusion

26 Resolving Personal Names in Email Using Context Expansion 26 Test Collections CollectionEmailsIdentitiesMentionCandidates QueriesMin.Avg.Max. Sager1,628627511411 Shapiro974855491821 Enron-subset54,01827,340781152489 Enron-all248,451123,7837835181785 Sager Shapiro Enron-subset Enron-all

27 Resolving Personal Names in Email Using Context Expansion 27 Evaluation Measures Commonly used in “known-item” retrieval  Success @1 (i.e., Precision @1) One-best  MRR (Mean Reciprocal Rank) Inverse of the harmonic mean of the ranks of true answer r i

28 Resolving Personal Names in Email Using Context Expansion 28 Which Context is the best? SagerShapiroEnron-subEnron-all

29 Resolving Personal Names in Email Using Context Expansion 29 Mixture of Contexts

30 Resolving Personal Names in Email Using Context Expansion 30 Comparison w/Literature MRR Success @ 1 ContextLit.ContextLit. CollectionExpansionBestExpansionBest Sager0.9110.8890.8630.804 Shapiro0.9130.8790.8780.779 Enron-subset0.91-0.846(0.82) Enron-all0.89-0.821- Context Expansion Lit. Best Context Expansion Lit. Best

31 Resolving Personal Names in Email Using Context Expansion 31 Conclusion and Future Work  Social context is best individual context Highlights importance of social context  Scales well to large collections  Need for extending the test collection Queries for identities on long-tail  Iterative approach for “joint resolution”

32 Resolving Personal Names in Email Using Context Expansion 32 Thank You!

33 Resolving Personal Names in Email Using Context Expansion 33 Related Work  Diehl et al. (SIAM, 2006) Developed Enron-subset collection Temporal traffic models Candidates must have communicated with sender  Minkov et al. (SIGIR, 2006) Developed Sager and Shapiro collections Graphical framework Large collections?


Download ppt "Resolving Personal Names in Email Using Context Expansion Tamer Elsayed, Douglas W. Oard, and Galileo Namata ACL, Columbus, Ohio, June 2008 Human Language."

Similar presentations


Ads by Google