Information Extraction. IS202: Information Organization and Retrieval, 6 Nov 2001. Ray Larson & Warren Sack.


1 Information Extraction
6 Nov 2001, IS202: Information Organization and Retrieval
Ray Larson & Warren Sack
Fall 2001, UC Berkeley, SIMS
Lecture author: Warren Sack

2 Cognitive Science
10/30/01 – AI, knowledge representation and common sense
11/01/01 – Computational Linguistics, Cognitive Psychology and Lexical Knowledge
11/06/01 – AI and information extraction
11/08/01 – Linguistics, Philosophy, Psychology, categories, and cognition

3 Last Time
Lexical relations
  – Linguistics
    Two approaches to semantics:
    – Compositional
    – Relational
  – Psycholinguistics
WordNet
  – Description
  – Structure
  – Applications

4 Levels of Linguistic Analysis
Sentences
  – Phonological/Morphological analysis
  – Syntactic analysis
  – Semantic analysis
More than one sentence
  – Pragmatic analysis

5 Pragmatics
Deixis
  – E.g., “I’ll be back in an hour” depends upon the time of the utterance.
Conversational implicature
  – A: “Can you tell me the time?”
  – B: “Well, the milkman has come.” [I don’t know exactly, but perhaps you can deduce it from some extra information I give you.]
Presupposition
  – “Are you still such a bad driver?”
Speech acts
  – Constatives vs. performatives
  – E.g., “I second the motion.”
Conversational structure
  – E.g., turn-taking rules

6 Last Last Time
What is Cognitive Science?
What is Artificial Intelligence?
  – Knowledge Representation Languages
  – Representing Common Sense
Common Sense Interfaces
Story Understanding, Story Generation, and Common Sense

7 Today: Information Extraction
A short history: AI Story Understanding, SAM, and FRUMP
Basic Techniques: Lexical analysis, name recognition, syntax, scenario, coreference, inference, template
Evaluation: MUC-3 to MUC-7
What else can you do with an IE system? SpinDoctor and PLUM

8 History: Story Understanding
Roger Schank and Robert Abelson, Scripts, Plans, Goals and Understanding, 1977
Richard Cullingford, SAM, 1979
Robert Wilensky, PAM, 1978
Gerald DeJong, FRUMP, 1979

9 SAM: Script Applier Mechanism
#| restaurant script:
   (1) go to the restaurant; (2) order a meal; (3) eat the meal;
   (4) pay; (5) leave the restaurant |#
(events-script '$restaurant
  '((ptrans (actor ?diner) (object ?diner) (to ?restaurant))
    (mtrans (actor ?diner) (object (ingest (actor ?diner) (object ?meal))))
    (ingest (actor ?diner) (object ?meal))
    (atrans (actor ?diner) (object (money)) (from ?diner) (to ?restaurant))
    (ptrans (actor ?diner) (object ?diner) (from ?restaurant) (to ?elsewhere))))
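
To make the idea of applying a script concrete, here is a minimal Python sketch, not SAM itself. It assumes a simplified version of the $restaurant script above (the nested `mtrans` object is flattened to `?meal`), and the helper names `match`, `instantiate`, and `apply_script` are hypothetical. It matches the events a story mentions against the script in order, binds the `?variables`, and then fills in the events the story left unsaid.

```python
# Simplified, hypothetical rendering of the $restaurant script from the slide.
RESTAURANT_SCRIPT = [
    ("ptrans", {"actor": "?diner", "object": "?diner", "to": "?restaurant"}),
    ("mtrans", {"actor": "?diner", "object": "?meal"}),  # assumption: flattened "order a meal"
    ("ingest", {"actor": "?diner", "object": "?meal"}),
    ("atrans", {"actor": "?diner", "object": "money",
                "from": "?diner", "to": "?restaurant"}),
    ("ptrans", {"actor": "?diner", "object": "?diner",
                "from": "?restaurant", "to": "?elsewhere"}),
]


def is_var(value):
    """Script variables are strings that start with '?'."""
    return isinstance(value, str) and value.startswith("?")


def match(pattern, event, bindings):
    """Return extended bindings if the event fits the pattern, else None."""
    pattern_act, pattern_roles = pattern
    event_act, event_roles = event
    if pattern_act != event_act:
        return None
    new_bindings = dict(bindings)
    for role, value in event_roles.items():
        if role not in pattern_roles:
            return None
        slot = pattern_roles[role]
        if is_var(slot):
            # Bind the variable, or check consistency with an earlier binding.
            if new_bindings.setdefault(slot, value) != value:
                return None
        elif slot != value:
            return None
    return new_bindings


def instantiate(pattern, bindings):
    """Replace bound variables in a script event; unbound ones stay as-is."""
    act, roles = pattern
    return (act, {role: bindings.get(value, value) for role, value in roles.items()})


def apply_script(script, story):
    """Match story events to the script in order, then fill in the whole script."""
    bindings, position = {}, 0
    for event in story:
        for i in range(position, len(script)):
            result = match(script[i], event, bindings)
            if result is not None:
                bindings, position = result, i + 1
                break
    return [instantiate(pattern, bindings) for pattern in script], bindings


# "John went to Leone's. He ate lobster." (paying and leaving are never stated)
story = [
    ("ptrans", {"actor": "John", "object": "John", "to": "Leones"}),
    ("ingest", {"actor": "John", "object": "lobster"}),
]
filled, bindings = apply_script(RESTAURANT_SCRIPT, story)
for event in filled:
    print(event)
```

The filled-in script includes an `atrans` of money from John to Leones even though the story never says he paid, which is the kind of inference a script is meant to license.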

10 PAM: Plan and Goal Applier Mechanism?
#| restaurant plan:
   goal: you’re hungry and you want to eat;
   plan: go to a restaurant |#
(goal (planner ?x)
      (objective (is (actor ?x) (state (hunger (val 0)))))
      (do-restaurant-plan (planner ?x) (restaurant ?y)))
(subgoal (do-restaurant-plan (planner ?x) (restaurant ?y))
         (goal (planner ?x)
               (objective (proximity (actor ?x) (location ?y)))
               (isa restaurant ?y)))

11 FRUMP: Fast Reading Understanding and Memory Program
$demonstration script
  – The demonstrators arrive at the demonstration location.
  – The demonstrators march.
  – Police arrive on the scene.
  – The demonstrators communicate with the target of the demonstration.
  – The demonstrators attack the target of the demonstration.
  – The demonstrators attack the police.
(From DeJong, 1979; pp. 19-20)

12 FRUMP: I/O Example

16 Information Extraction: Basic Techniques
Lexical analysis
Name recognition
Syntax
Scenario
Coreference
Inference
Template

17 Levels of Linguistic Analysis
Sentences
  – Phonological/Morphological analysis
  – Syntactic analysis
  – Semantic analysis
More than one sentence
  – Pragmatic analysis

19 Lexical Analysis
Input: Sam Schwartz retired as executive vice president of the famous hot dog manufacturer, Hupplewhite Inc. He will be succeeded by Harry Himmelfarb.
Output:
  – Sam/name Schwartz/name retired/verb as/prep executive/adj vice/adj president/noun of/prep the/det famous/adj hot/adj dog/noun manufacturer/noun ,/comma Hupplewhite/name Inc/name ./period
  – He/pron will/verb be/verb succeeded/verb by/prep Harry/name Himmelfarb/name ./period
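
The input/output pair above can be approximated with a very small dictionary-lookup tagger. The following Python sketch is only an illustration under stated assumptions: the `LEXICON` is a hypothetical hand-built word list covering just this example, and the rule "an unknown capitalized word is a name" is a simplification, not the method used by any particular IE system.

```python
import re

# Hypothetical mini-lexicon covering only the words in the example sentence.
LEXICON = {
    "retired": "verb", "as": "prep", "executive": "adj", "vice": "adj",
    "president": "noun", "of": "prep", "the": "det", "famous": "adj",
    "hot": "adj", "dog": "noun", "manufacturer": "noun",
    "he": "pron", "will": "verb", "be": "verb", "succeeded": "verb",
    "by": "prep",
}
PUNCTUATION = {",": "comma", ".": "period"}

TEXT = ("Sam Schwartz retired as executive vice president of the famous "
        "hot dog manufacturer, Hupplewhite Inc. He will be succeeded by "
        "Harry Himmelfarb.")


def tokenize(text):
    """Split text into alphabetic words and the punctuation marks , and ."""
    return re.findall(r"[A-Za-z]+|[.,]", text)


def tag(token):
    """Look the token up; fall back to 'name' for unknown capitalized words."""
    if token in PUNCTUATION:
        return PUNCTUATION[token]
    lowered = token.lower()
    if lowered in LEXICON:
        return LEXICON[lowered]
    if token[0].isupper():
        return "name"  # assumption: unknown + capitalized means proper name
    return "unknown"


def lexical_analysis(text):
    """Return a list of (token, tag) pairs for the whole text."""
    return [(token, tag(token)) for token in tokenize(text)]


tagged = lexical_analysis(TEXT)
print(" ".join(f"{word}/{label}" for word, label in tagged))
```

Looking a word up in the lexicon before applying the capitalization rule is what lets "He" come out as a pronoun rather than a name; a real system would need a much larger dictionary and a better treatment of abbreviations such as "Inc.".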

20 Name Recognition

21 Syntactic Analysis

22 Syntactic Analysis (continued)

23 Scenario Matching

24 Scenario Matching (continued)

25 Coreference analysis

26 Inference and Event Matching

27 Event Template Matching

28 Evaluation
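
The body of this slide did not survive in the transcript. As background, MUC-style evaluations compare the slots a system fills against a human-prepared answer key and report recall and precision, often combined into an F-measure. The Python sketch below illustrates that arithmetic only; the slot names, the example key, and the exact-match rule (no partial credit) are assumptions made for illustration and are not taken from the lecture.

```python
def score(key, response):
    """Exact-match slot scoring: returns (precision, recall, f_measure)."""
    correct = sum(1 for slot, value in response.items() if key.get(slot) == value)
    precision = correct / len(response) if response else 0.0
    recall = correct / len(key) if key else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f_measure = 2 * precision * recall / (precision + recall)
    return precision, recall, f_measure


# Hypothetical answer key for the management-succession example sentence.
key = {
    "person": "Sam Schwartz",
    "position": "executive vice president",
    "company": "Hupplewhite Inc.",
    "successor": "Harry Himmelfarb",
}
# Hypothetical system output: one slot wrong, one slot missing.
response = {
    "person": "Sam Schwartz",
    "position": "vice president",
    "company": "Hupplewhite Inc.",
}

precision, recall, f_measure = score(key, response)
print(f"precision={precision:.2f} recall={recall:.2f} F={f_measure:.2f}")
```

Here two of the three filled slots are correct (precision 2/3) and two of the four key slots are recovered (recall 1/2), so the balanced F-measure is 4/7.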

29 MUC/Tipster

30 MUC 3 to MUC 7

34 What else can you do with an IE system?
SpinDoctor (Sack, 1994)
PLUM (Elo, 1995)

35 PLUM: Peace Love and Understanding Machine

38 SpinDoctor: Categorizing News Stories by Ideological Point of View

45 Next Time
Categories and Cognition according to George Lakoff

