Knowledge Entry as the Graphical Assembly of Components Peter Clark, John Thompson (Boeing) Ken Barker, Bruce Porter (Univ Texas at Austin) Vinay Chaudhri,

Slides:



Advertisements
Similar presentations
Knowledge-based Information Retrieval: A Work in Progress Knowledge-based Systems Research Group, University of Texas at Austin.
Advertisements

DARPA SHAKEN Virus-Invades-Cell Invade VirusCell Attach Penetrate Release Move invader thing invaded barrier Cell-membrane has-part subevent penetrator.
Math CAMPPP 2011 Plenary 1 What’s the Focus? An Introduction to Algebraic Reasoning Ruth Beatty and Cathy Bruce 1.
Chapter 10 Table of Contents Section 1 Discovery of DNA
Chapter 10 Table of Contents Section 1 Discovery of DNA
Replication, Transcription, and Translation Before a cell can divide, the DNA in the nucleus of the cell must be duplicated. Since the DNA molecule consists.
DNA Structure and Function Chapter 12. Discovery of DNA Nucleic Acids were discovered in 1869 by Friedrich Mieschner as a substance contained within nuclei.
Knowledge Representation
Part II What SHAKEN already knows. SHAKEN’s Components What SHAKEN already knows is organized in components We will look only at the components needed.
Building Knowledge Bases Compositionally Bruce Porter, Peter Clark Ken Barker, Art Souther, John Thompson James Fan, Dan Tecuci, Peter Yeh Marwan Elrakabawy,
User Interface Design Yonsei University 2 nd Semester, 2013 Sanghyun Park.
Chapter 10 Table of Contents Section 1 Discovery of DNA
A Library of Generic Concepts for Composing Knowledge Bases Ken Barker, Bruce UTAustin Peter
How an SME Might Assemble a KB from Components Bruce Porter (University of Texas) Peter Clark (Boeing) and Colleagues.
Chapter 6 DNA  Consists of Deoxyribose sugar Phosphate group A, T, C, G  Double stranded molecule (Double Helix) Two strands of DNA run antiparallel.
Bruce Porter (University of Texas) Peter Clark (Boeing) and Colleagues Building KB’s by Assembling Components: An early evaluation of the approach.
Use of Ontologies in the Life Sciences: BioPax Graciela Gonzalez, PhD (some slides adapted from presentations available at
User interface design Designing effective interfaces for software systems Objectives To suggest some general design principles for user interface design.
Chapter 3 Planning Your Solution
Intelligent Tutoring Systems Traditional CAI Fully specified presentation text Canned questions and associated answers Lack the ability to adapt to students.
Machine Reading as a Process of Partial Question-Answering Peter Clark and Phil Harrison Boeing Research & Technology June 2010.
1 BTEC HNC Systems Support Castle College 2007/8 Systems Analysis Lecture 9 Introduction to Design.
1 Artificial Intelligence Applications Institute Centre for Intelligent Systems and their Applications Stuart Aitken Artificial Intelligence Applications.
Part I Overview and Introduction to SHAKEN. Simplified Version of how a Virus Invades a Cell “A virus invades a cell in the following way. First, the.
DAY 2 Part I: Review Part II: Task Part III: Feedback and Suggestions.
Biology 10.2 Gene Regulation and Structure Gene Regulation and Structure.
Technologies to Enable Biologists to Build Large Knowledge Bases on Human Anatomy and Physiology Bruce Porter Ken Barker Art Souther Department of Computer.
Knowledge Base Content Bruce Porter, Peter Clark Ken Barker, Art Souther, John Thompson James Fan, Dan Tecuci, Peter Yeh Marwan Elrakabawy, Sarah Tierney.
Knowledge Entry as the Graphical Assembly of Components Peter Clark, John Thompson (Boeing) Ken Barker, Bruce Porter (Univ Texas at Austin) Vinay Chaudhri,
Biology 9.3 Replication of DNA
KM2SILK translator – Level 0 Design Peter Clark Boeing Research and Technology April 2010.
1 Extend is a simulation tool to create models quickly, with all the blocks you need and without even having to type an equation. You can use a series.
Data and information. Information and data By the end of this, you should be able to state the difference between DATE and INFORMAITON.
Software Architecture
2A. Distinguish between DNA and RNA.
DNA Replication IB Biology HL 1 Mrs. Peters Spring 2014.
Building a Knowledge Base by Telling a Story: An example A virus invades a cell in the following way. First, the virus attaches to the cell membrane. It.
Standardized Test Prep
DNA Replication Section 9-3. DNA is Copied with the Help of Many Enzymes We know that the two DNA strands have a complementary relationship (A pairs with.
Chapter 9.3 Grade 10 Biology Spring 2011 The Replication of DNA.
Guiding Inference with Conceptual Graphs Bruce Porter Univ Texas at Austin Peter Clark Boeing Research.
A Proposal for a Process Specification Language (Working Note 21) Peter Clark and John Thompson Knowledge Systems, Boeing Research inspired by comments.
Section 2: Replication of DNA
Enabling Domain Experts to Convey Questions to a Machine: A Modified, Template-Based Approach Peter Clark (Boeing Phantom Works) Ken Barker, Bruce Porter.
SME2Logic Translation - an initial approach Peter Clark Boeing Company.
Working Memory and Learning Underlying Website Structure
Building KB’s by Assembling Components Bruce Porter (UT Austin) Peter Clark (Boeing)
8.3 DNA Replication TEKS 3E, 5A, 9C The student is expected to: 3E evaluate models according to their limitations in representing biological objects or.
Viral Infectious Cycles – Lytic (pkt pg 12) AND Lysogenic (pkt pg 13)
Take me to NZQA Documents relating to this standard AS Demonstrate understanding of life processes at the cellular level The Replication of DNA.
We are starting our DNA Unit Today! Please do the following: pick up handouts have out your notebook
Section 2: Replication of DNA
How to Use This Presentation
Control of gene expression in eukaryotic cells
KANAL: Knowledge ANALysis
Action Editor Storyboard
KANAL: Knowledge ANALysis
DNA Replication.
2A. Distinguish between DNA and RNA.
Section 2: Replication of DNA
2A. Distinguish between DNA and RNA.
DNA Replication.
copyright cmassengale
Section 2: Replication of DNA
Bruce Porter Ken Barker Art Souther Department of Computer Science
copyright cmassengale
Chapter 10 Table of Contents Section 1 Discovery of DNA
= DNA Nucleotide Phosphate Nitrogen Base Pairs:
RNA Read the lesson title aloud to students..
Presentation transcript:

Knowledge Entry as the Graphical Assembly of Components Peter Clark, John Thompson (Boeing) Ken Barker, Bruce Porter (Univ Texas at Austin) Vinay Chaudhri, Andres Rodriguez, Jerome Thomere, Sunil Mishra (SRI International) Yolanda Gil (ISI) Pat Hayes, Thomas Reichherzer (Univ W Florida)

Goals and Context Problem: difficult for domain experts to enter knowledge into KBs directly Goal: Create tools supporting this Context: –Part of DARPA’s Rapid Knowledge Formation project –Focus on domain knowledge (cf. problem-solving) Full system (SHAKEN) includes tools for: Knowledge entry; testing, analysis, and debugging; question-answering; analogical reasoning. –Application domain: cell biology

Hypotheses and Approach Knowledge entry = “assembling pre-built representational components” (rather than “writing axioms”) –Complex axioms already pre-built in the KB Can present and manipulate these representations graphically –Presentation: dialog in terms of examples –Manipulation: only need support a small number of “connection” axiom types (rather than full FOL)

The Knowledge Entry Process User’s goal: Create/edit a representation of a concept User’s activities: –Locate and display relevant components from library –Connect & extend them to create new representation –Save the result –Test & ask questions about the new concept

Displaying axioms using examples To present axioms about a concept C, –user doesn’t see the raw axioms directly –Rather, user sees an example I of C Sees a graph of ground facts about I (computed from the axioms) ground facts are comprehensible and graphable User builds new concept by interacting with this and other examples

Displaying axioms using examples  x isa(x,Penetrate)   y,z isa(y,Traverse)  isa(z,Breach)  subevent(x,y)  subevent(x,z).  x,y isa(x,Penetrate)  agent(x,y)  isa(y,Tangible-Entity).  x,y,z isa(x,Penetrate)  subevent(x,y)  isa(y,Breach)  subevent(x,z)  isa(z,Traverse)  next-event(y,z).  w,x,y,z isa(x,Penetrate)  subevent(x,y)  isa(y,Breach)  agent(y,z)  agent(x,w)  w = z.  x,y,z isa(x,Penetrate)  subevent(x,y)  isa(y,Traverse)  path(y,z)  isa(z,Portal). …. Rules (logic) New concept: Virus-Invasion (a type of event) SME adds a Penetrate subevent

Displaying axioms using examples Rules as applied to an example New concept: Virus-Invasion (a type of event) SME adds a Penetrate subevent

Connecting and Extending the Model The user manipulates instances in the graph, using four types of graphical action –specialize, add, connect, unify Each action generates a rule –Initial rule applies just to the example being viewed –A generalization algorithm generalizes the rule to hold for all instances of the concept being built

Synthesizing the axiom: “Tangible entity 1 is a virus” Graphical Action 1: Specialize  “In this virus invasion, the thing penetrating is a virus”  “In all virus invasions, the thing penetrating is a virus”

Graphical Action 1: Specialize

Synthesizing the axiom: “In this virus invasion, there is a cell participant.” Graphical Action 2: Add  “In all virus invasions, there is a cell participant.”

Graphical Action 2: Add

Synthesizing the axiom: “In this virus invasion, the object is the cell participant.” Graphical Action 3: Connect  “In all virus invasions, the object is the cell participant.”

Graphical Action 4: Unify

Synthesizing the axiom: “Barrier 1 = Plasma Membrane 2”  “In all virus invasions, the object of the penetrate = the plasma membrane part of the cell.” Graphical Action 4: Unify  “In this virus invasion, the object of the penetrate…  “In this virus invasion, the object of the penetrate = the plasma membrane part of the cell.”

(Demonstration)

Evaluation and Lessons Learned Large-scale trials in June and July biology students used system for 4 weeks Their goals: –Encode 11-page subsection on cell biology –Create and debug representations –Test system on large set of test questions High-school level difficulty Generally “reading comprehension” style

Results All users able to grasp the basic approach Built representations for –~450 biological concepts –Size 1 to >100 (!) nodes –Axioms created: 1408, 567, 1296, 921

Example graph by end user

All users able to grasp the basic approach Built representations for –~450 biological concepts –Size 1 to >100 (!) nodes –Axioms created: 1408, 567, 1296, 921 Results Answer quality on test questions: –~2 (“mostly correct”) on scale 0-3 (1.74 on all questions, 2.24 on questions attempted) System rated “useful” and “easy” to use

Results (cont) A lot of knowledge encoded… But a lot of knowledge not encoded. –Pre/post conditions for actions –Richer process models (e.g., repetitive events) –Negative information (e.g., doesn’t happen) –Locational/spatial information (e.g., shape) –Changes with time (e.g., state at end of process) –Uncertainty (e.g., “typically”, “usually”, “mainly”, “most”)

Encoding: Example: “In bacteria, RNA polymerase molecules tend to stick weakly to the bacterial DNA when they make a random collision with it; the polymerase molecule then slides rapidly along the DNA…” Original:

Example: “In bacteria, RNA polymerase molecules tend to stick weakly to the bacterial DNA when they make a random collision with it; the polymerase molecule then slides rapidly along the DNA…” Encoding: Original:

User errors Hope: Pre-built representations guide users, reduce errors But: users still made mistakes, e.g.: –Indirect/incorrect reference “DNA” vs. “DNA strand” vs. “subsequence” –Missing coreferences “attach to RNA; remove nucleotide sequence [of that RNA]” –Overgenerality/missing context “All polymerases have a sigma factor” “Genes contain exons” –Misuse of case roles “polymerase is the instrument of copying”

Multiple Viewpoints System: assumes a single representation of concept But: Users sometimes created multiple representations –DNA as sequence of genes and non-genes Sequence of nucleotide pairs Pair of DNA strands –Multiple views of a process Which actions to include/ignore Need a better way of handling viewpoints

System’s Reasoning Users were sometimes annoyed/confused at SHAKEN’s own inferencing (!) Need better ways to –Regulate when system’s inferencing occurs –Explain why it is happening

Summary and Conclusion Key Points: –Knowledge entry = “component assembly” –Graphical interface based on dialog in terms of examples claim that a limited set of axiom types is adequate Key Results: –It (really!) works! –…but… Some knowledge not captured Some mistakes still made Viewpoints not well handled

(end)

(Very simple) example graph…

Example: “In bacteria, RNA polymerase molecules tend to stick weakly to the bacterial DNA when they make a random collision with it; the polymerase molecule then slides rapidly along the DNA…” Encoding: Original: “(In bacteria), RNA polymerase molecules (tend to) stick (weakly) to the bacterial DNA (when they make a random collision with it); the polymerase molecule then slides (rapidly) along the DNA…” make contact moves

Results (cont) A lot of knowledge encoded… But a lot of knowledge not encoded. –Simple attribute values (e.g., sizes) –Equational information (e.g., rates wrt time) –Temporal relations (e.g., simultaneous) –Pre/post conditions for actions –Richer process models (e.g., repetitive events) –Sequences (e.g., nucleotide sequences) –Negative information (e.g., doesn’t happen) –Locational/spatial information (e.g., shape) –Changes with time (e.g., state at end of process) –Uncertainty (e.g., “typically”, “usually”, “mainly”, “most”)