Knowledge Systems and Project Halo In collaboration with SRI (Vinay Chaudhri) and Boeing (Peter Clark)

Slides:



Advertisements
Similar presentations
E-Science Data Information and Knowledge Transformation Thoughts on Education and Training for E-Science Based on edikt project experience Dr. Denise Ecklund.
Advertisements

Modelling with expert systems. Expert systems Modelling with expert systems Coaching modelling with expert systems Advantages and limitations of modelling.
Kurt Hungerford CSCI The Knowledge Machine is a knowledge representation and reasoning system that allows users to store concepts and relationships.
RNA and Protein Synthesis
Knowledge-based Information Retrieval: A Work in Progress Knowledge-based Systems Research Group, University of Texas at Austin.
Progress on Building the Component Library Bruce Porter, Peter Clark Ken Barker, Art Souther, John Thompson James Fan, Dan Tecuci, Peter Yeh Charles Benton,
Ch 3 System Development Environment
Building Knowledge Bases Compositionally Bruce Porter, Peter Clark Ken Barker, Art Souther, John Thompson James Fan, Dan Tecuci, Peter Yeh Marwan Elrakabawy,
Knowledge Systems and Project Halo* University of Texas at Austin Ken Barker, Shaw Yi Chaw, James Fan, Bruce Porter, Dan Tecuci, Peter Yeh SRI International.
Knowledge Systems Bruce Porter Department of Computer Sciences The University of Texas at Austin.
A Library of Generic Concepts for Composing Knowledge Bases Ken Barker, Bruce UTAustin Peter
How an SME Might Assemble a KB from Components Bruce Porter (University of Texas) Peter Clark (Boeing) and Colleagues.
Bruce Porter (University of Texas) Peter Clark (Boeing) and Colleagues Building KB’s by Assembling Components: An early evaluation of the approach.
Knowledge Acquisitioning. Definition The transfer and transformation of potential problem solving expertise from some knowledge source to a program.
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 8 The Enhanced Entity- Relationship (EER) Model.
Represent the following sentences in first-order logic, using a consistent vocabulary
Protein Synthesis Ordinary Level. Lesson Objectives At the end of this lesson you should be able to 1.Outline the steps in protein synthesis 2.Understand.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
CASE Tools And Their Effect On Software Quality Peter Geddis – pxg07u.
Unit 7 Lesson 1 DNA Structure and Function
Copyright © 2007 Pearson Education, Inc., publishing as Pearson Addison-Wesley PowerPoint ® Lecture prepared by Richard Wolfson 1 Doing Physics Richard.
Machine Reading as a Process of Partial Question-Answering Peter Clark and Phil Harrison Boeing Research & Technology June 2010.
1 Artificial Intelligence Applications Institute Centre for Intelligent Systems and their Applications Stuart Aitken Artificial Intelligence Applications.
Part I Overview and Introduction to SHAKEN. Simplified Version of how a Virus Invades a Cell “A virus invades a cell in the following way. First, the.
Building Knowledge Bases from Reusable Components Peter Clark Boeing Applied Research and Technology.
Technologies to Enable Biologists to Build Large Knowledge Bases on Human Anatomy and Physiology Bruce Porter Ken Barker Art Souther Department of Computer.
 Knowledge Acquisition  Machine Learning. The transfer and transformation of potential problem solving expertise from some knowledge source to a program.
Knowledge Base Content Bruce Porter, Peter Clark Ken Barker, Art Souther, John Thompson James Fan, Dan Tecuci, Peter Yeh Marwan Elrakabawy, Sarah Tierney.
Knowledge Entry as the Graphical Assembly of Components Peter Clark, John Thompson (Boeing) Ken Barker, Bruce Porter (Univ Texas at Austin) Vinay Chaudhri,
The Reading to Learn Project Peter Clark Phil Harrison Tom Jenkins John Thompson Rick Wojcik (Boeing Phantom Works) David Israel (SRI)
Configuration Management (CM)
Knowledge Systems Knowledge Systems use formal representations of knowledge to answer unanticipated questions with coherent explanations Knowledge System.
Chapter 11 DNA Within the structure of DNA is the information for life- the complete instructions for manufacturing all the proteins for an organism. DNA.
2 pt 3 pt 4 pt 5pt 1 pt 2 pt 3 pt 4 pt 5 pt 1 pt 2pt 3 pt 4pt 5 pt 1pt 2pt 3 pt 4 pt 5 pt 1 pt 2 pt 3 pt 4pt 5 pt 1pt Chapters1-3Chapters 7-8Chapter 9Chapter.
Building Knowledge Bases from Reusable Components Bruce Porter Computer Science Dept, Univ of Texas at Austin Peter Clark Boeing Applied Research and Technology.
Domain-Independent Concepts Domain-Specific Concepts SlotEntityEventValueCliché ActionStateIntangible-EntityTangible-Entity ObjectSubstance…Place…Transfer…CreateMove.
DNA Deoxyribonucleic Acid Structure and Function.
RNA & DNA Compare RNA & DNA Contrast RNA & DNA
Christoph F. Eick University of Houston Organization 1. What are Ontologies? 2. What are they good for? 3. Ontologies and.
Object-Oriented Modeling: Static Models. Object-Oriented Modeling Model the system as interacting objects Model the system as interacting objects Match.
Christian Madu, Ph.D. Collin College Lecture Presentation Chapter 4-2 Chemical Quantities and Aqueous Reactions.
Generic Tasks by Ihab M. Amer Graduate Student Computer Science Dept. AUC, Cairo, Egypt.
Enabling Domain Experts to Convey Questions to a Machine: A Modified, Template-Based Approach Peter Clark (Boeing Phantom Works) Ken Barker, Bruce Porter.
Knowledge Entry as the Graphical Assembly of Components Peter Clark, John Thompson (Boeing) Ken Barker, Bruce Porter (Univ Texas at Austin) Vinay Chaudhri,
DNA Deoxyribonucleic Acid. DNA Structure What is DNA? The information that determines an organisms traits. DNA produces proteins which gives it “The.
Knowledge Representation
DNA.
Building KB’s by Assembling Components Bruce Porter (UT Austin) Peter Clark (Boeing)
Some Thoughts to Consider 8 How difficult is it to get a group of people, or a group of companies, or a group of nations to agree on a particular ontology?
1 Class exercise II: Use Case Implementation Deborah McGuinness and Peter Fox CSCI Week 8, October 20, 2008.
Topic 4 - Database Design Unit 1 – Database Analysis and Design Advanced Higher Information Systems St Kentigern’s Academy.
1 USC INFORMATION SCIENCES INSTITUTE EXPECT TEMPLE: TEMPLate Extension Through Knowledge Acquisition Yolanda Gil Jim Blythe Information Sciences Institute.
Teaching Bioinformatics Nevena Ackovska Ana Madevska - Bogdanova.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
Examples of Chemistry Laws L. 1) Concentration of a solute In a solution, the concentration of a solute i is given by the following formula: Concentration.
PPT #1 Chapter 2 Chemistry review, properties of water and pH
Lecture 8-2CS250: Intro to AI/Lisp What do you mean, “What do I mean?” Lecture 8-2 November 18 th, 1999 CS250.
+ Protein Synthesis. + REVIEW: DNA plays 2 essential roles in organisms: #1: Allows cells to reproduce. How? DNA replication allows cells to pass along.
Gene Expression DNA, RNA, and Protein Synthesis. Gene Expression Genes contain messages that determine traits. The process of expressing those genes includes.
Of 24 lecture 11: ontology – mediation, merging & aligning.
16 April 2011 Alan, Edison, etc, Saturday.. Knowledge, Planning and Robotics 1.Knowledge 2.Types of knowledge 3.Representation of knowledge 4.Planning.
Unit 2 Lesson 6 DNA Structure and Function
Attributes and Values Describing Entities. Metadata At the most basic level, metadata is just another term for description, or information about an entity.
Unit 7 Lesson 1 DNA Structure and Function
Unit 2 Lesson 6 DNA Structure and Function
Unit 2 Lesson 6 DNA Structure and Function
Attributes and Values Describing Entities.
Bruce Porter Ken Barker Art Souther Department of Computer Science
Cracking the Code What is DNA?
Bruce Porter Art Souther Department of Computer Science
Presentation transcript:

Knowledge Systems and Project Halo In collaboration with SRI (Vinay Chaudhri) and Boeing (Peter Clark)

Knowledge Systems Knowledge Systems are formal representations of knowledge capable of answering unanticipated questions with coherent explanations Knowledge System = KB + Q/A + Explanation Generator + Knowledge Acq. tools

Project Halo Funded and administered by Vulcan, Inc – a Paul Allen company Objective: to assess the state of the art of knowledge systems – computer programs that know a lot and answer tough questions with coherent explanations Method: administer an AP Chemistry exam to knowledge systems built by 4 teams of researchers

A Significant Advance over Expert Systems Coverage Reasoning Explanation Rapid construction

KM: A Logic Programming Language …able to represent: –classes, instances, prototypes –defaults, fluents, constraints –(hypothetical) situations –actions (pre-, post-, and during- conditions) …and reason about: –inheritance with exceptions –deductive and abductive inference (with constraints) –automatic classification (given a partial description of an instance, determine the classes to which it belongs) –temporal projection (“my car is where I left it”) –affects of actions

A Simple Example When 70 ml of 3.0-Molar Na 2 CO 3 is added to 30 ml of 1.0-Molar NaHCO 3 the resulting concentration of Na + is: a)2.0 M b)2.4 M c)4.0 M d)4.5 M e)7.0 M

Question Representation volume Mix Aqueous Solution Mixture Na + raw material Na 2 CO M0.07 lit NaHCO lit volume 1.0 M conc.base conc. result has-part conc. Question 26 context ?? output

Background Knowledge Chemistry laws: 1.Concentration of a solute 2.Composition of strong electrolyte solutions 3.Conservation of mass 4.Conservation of volume etc.

Law 1: Concentration of a Solute The concentration of a chemical in a mixture is the quantity of the chemical divided by the volume of the mixture. Divide the quantity by the volume: / = X *molar Therefore, the concentration of in = X *molar Explanation Template Mixture volume conc. Volume *liters Concentration *molar has-part Chemical Quantity *moles quantity Compute-Concentration Method context input output Note: when this law is applied, using Novak’s code, the quantities are automatically converted to the units- of-measurement specified here

Law 1’: Quantity of a Solute Law 1 (on the previous slide) computed: Concentration = quantity / volume Of course, a slight variant computes: Quantity = concentration * volume Currently, we code this variant as a separate law (call it 1’) because it has a slightly different explanation template

Law 2: Composition of Strong Electrolytes Strong Electrolyte Anion has-part Quantity *moles quantity Quantity *moles quantity Cation Quantity *moles quantity Compute-Ions-in-Strong-Electrolyte context input output

Law 3: Conservation of Mass Conservation of Mass context input output Mix Chemical 1 Chemical n Chemical raw-material result … Quantity *moles Quantity *moles quantity Chemical has-part ?? *moles quantity part-of By the Law of Conservation of Mass, the quantity of a chemical in a mixture is the sum of the quantities of that chemical in the parts of the mix. The quantity of in is X 1 *moles … The quantity of in is X n *moles Therefore, the quantity of = X *moles Explanation Template

Law 4: Conservation of Volume Mix Chemical 1 Chemical n Mixture raw-material result … Volume Volume volume ?? *liter volume Conservation of Volume context input output By the Law of Conservation of Volume, the volume of a mixture is the sum of the volumes of the parts mixed. The sum of X 1, … and X n = X *liter Therefore, the volume of = X *liter Explanation Template

Step 1: Reclassify Terms volume Mix Aqueous Solution Mixture Na + raw material Na 2 CO M0.07 lit NaHCO lit volume 1.0 M conc.base conc. result has-part Strong Electrolyte Solution superclass

Step 2: Use Law 1 to Compute Concentration Mixture volume conc. Volume *liters Concentration *molar has-part Chemical Quantity *moles quantity Law 1 conc. ?? *molar volume Mix Aqueous Solution Mixture Na + raw material Na 2 CO M0.07 lit NaHCO lit volume 1.0 M conc. base conc. result has-part ?? *liters volume ?? *moles quantity

The Search is non-deterministic Multiple laws might be used to compute a value for any property. For example, here’s another way to compute concentration:  pH = - log [H + ], where [H + ] is the concentration of H + Since this applies only to H +, this search path ends quickly

Step 3: Use Law 4 to Compute Volume Mix Chemical raw-material result … Volume *liter Volume *liter volume Volume *liter volume Law 4.1 conc. ?? *molar volume Mix Aqueous Solution Mixture Na + raw material Na 2 CO M0.07 lit NaHCO lit volume 1.0 M conc. base conc. result has-part ?? *liters volume ?? *moles quantity

Step 4: Use Law 3 to Compute Quantity volume Mix Aqueous Solution Mixture Na + raw material Na 2 CO M 0.07 liters 0.03 liters volume 1.0 M conc. base NaHCO 3 base conc. result has-part conc. ?? *molar.1 * liters volume ?? *moles quantity Mix Chemical raw-material result … Quantity *moles Quantity *moles quantity Chemical has-part ?? *moles quantity part-of Law 3 Na + ?? *moles ?? *moles has-part quantity

Step 5: Use Law 2 to Compute Quantity of Ionic Parts ?? *moles quantity Strong Electrolyte Anion has-part Quantity *moles quantity Quantity *moles quantity Cation Quantity *moles quantity Law 2 volume Mix Aqueous Solution Mixture Na + raw material Na 2 CO M 0.07 liters 0.03 liters volume 1.0 M conc. base NaHCO 3 base conc. result has-part conc. ?? *molar.1 * liters volume ?? *moles quantity Na + ?? *moles ?? *moles has-part quantity

Step 6: Use Law 1’ to Compute Quantity ?? *moles quantity Mixture volume conc. Volume *liters Concentration *molar has-part Chemical Quantity *moles quantity Law 1’.21 volume Mix Aqueous Solution Mixture Na + raw material Na 2 CO M 0.07 liters 0.03 liters volume 1.0 M conc. base NaHCO 3 base conc. result has-part conc. ?? *molar.1 * liters volume ?? *moles quantity Na + ?? *moles ?? *moles has-part quantity

Step 7: Wind out of Law 2 from step 5 Strong Electrolyte Anion has-part Quantity *moles quantity Quantity *moles quantity Cation Quantity *moles quantity Law *moles quantity volume Mix Aqueous Solution Mixture Na + raw material Na 2 CO M 0.07 liters 0.03 liters volume 1.0 M conc. base NaHCO 3 base conc. result has-part conc. ?? *molar.1 * liters volume ?? *moles quantity Na + ?? *moles ?? *moles has-part quantity

Step 8-10: Similar to steps *moles quantity volume Mix Aqueous Solution Mixture Na + raw material Na 2 CO M 0.07 liters 0.03 liters volume 1.0 M conc. base NaHCO 3 base conc. result has-part conc. ?? *molar.1 * liters volume ?? *moles quantity Na + ?? *moles.42 *moles has-part quantity

Step 11: Wind out of Law 3 from Step 4 Mix Chemical raw-material result … Quantity *moles Quantity *moles quantity Chemical has-part ?? *moles quantity part-of Law *moles quantity volume Mix Aqueous Solution Mixture Na + raw material Na 2 CO M 0.07 liters 0.03 liters volume 1.0 M conc. base NaHCO 3 base conc. result has-part conc. ?? *molar.1 * liters volume ?? *moles quantity Na +.03 *moles.42 *moles has-part quantity

Step 12: Wind out of Law 1 from Step 2 Mixture volume conc. Volume *liters Concentration *molar has-part Chemical Quantity *moles quantity Law 1.21 *moles quantity volume Mix Aqueous Solution Mixture Na + raw material Na 2 CO M 0.07 liters 0.03 liters volume 1.0 M conc. base NaHCO 3 base conc. result has-part conc. ?? *molar.1 * liters volume.45 *moles quantity Na +.03 *moles.42 *moles has-part quantity 4.5

Question 26 Answer When 70 ml of 3.0-Molar Na2CO3 is added to 30 ml of 1.0-Molar NaHCO3, what is the resulting concentration of Na+?. The concentration of a chemical in a mixture is the quantity of the chemical divided by the volume of the mixture. By the Law of Conservation of Mass, the quantity of a chemical in a mixture is the sum of the quantities of that chemical in the parts of the mix. In the na2co3 strong-electrolyte-solution and the nahco3 strong-electrolyte-solution : In the na-plus : Multiply the concentration and the volume: 3 molar * 70 milliliter = 0.21 mole. The quantity of na-plus in the na-plus is 0.42 mole. In the co3-2 : The quantity of na-plus in the co3-2 is 0 mole. Multiply the concentration and the volume: 1 molar * 30 milliliter = 0.03 mole. In the na-plus : The quantity of na-plus in the na-plus is 0.03 mole. In the hco3- : The quantity of na-plus in the hco3- is 0 mole. The quantity of na-plus in the na2co3 strong-electrolyte-solution and the nahco3 strong-electrolyte-solution is 0.45 mole. Therefore, the quantity of na-plus = 0.45 mole. By the Law of Conservation of Volume, the volume of a mixture is the sum of the volumes of the parts mixed. The sum of 70 milliliter and 30 milliliter = 0.10 liter. Therefore, the volume of the strong-electrolyte-solution strong-electrolyte-solution mixture = 0.10 liter. Divide the quantity by the volume: mole / 0.10 liter = 4.50 molar. Therefore, the concentration of na-plus in the strong-electrolyte-solution strong-electrolyte-solution mixture = 4.50 molar. When 70 ml of 3.0-Molar Na2CO3 is added to 30 ml of 1.0-Molar NaHCO3, the resulting concentration of Na+ is 4.50 molar

Results of Project Halo After 4 month development effort, the knowledge systems were sequestered and given a test: –165 novel questions: 50 multiple choice; 115 free form response –Questions translated from English to formal language by each team, then assessed for fidelity by an independent committee High likelihood of long term follow on

Correctness The SRI’s team correctness score corresponds to an AP score of 3 – high enough for credit at UCSD, UIUC, and many other universities. We’ve predicted scoring 85% after a 3 month follow-on project.

Explanation Quality

Our Long Term Goal to enable distributed communities of domain experts to build knowledge systems in their area of expertise … –without direct help from knowledge engineers –working with familiar concepts and without writing axioms –with little more effort than writing technical papers

Our Current Focus Insight: even domain-specific representations contain common abstractions Approach: we build a library consisting of –a small hierarchy of reusable, composable, domain- independent knowledge units (“components”) –a small vocabulary of relations to connect them then domain experts build representations by instantiating and composing these components

BioremediationAmount OilFertilizer GetApply Break Down Absorb MicrobesScript Bio- technologist Soil Rate environment contains Q+ I- Q- I- amount product absorbed then agent patientagent script pollutant se rate agent then product se patient remediator amount Building a Representation Compositionally

BioremediationAmount OilFertilizer GetApply Break Down Absorb MicrobesScript Bio- technologist Soil Rate environment contains Q+ I- Q- I- amount product absorbed then agent patientagent script pollutant se rate agent then product se patient remediator Conversion Amount Substance Rate Q+ I- Q- I- amountraw- materials rate product Substance amount An underlying abstraction...

BioremediationAmount OilFertilizer GetApply Break Down Absorb MicrobesScript Bio- technologist Soil Rate environment contains Q+ I- Q- I- amount product absorbed then agent patientagent script pollutant se rate agent then product se remediator amount Digest Substance Break Down Absorb AgentScript absorbed agent script food se then se patient eater agent Another abstraction... patient

BioremediationAmount Oil Fertilizer Break Down Absorb Bio- technologist Soil Rate environment contains Q+ I- Q- I- amount product absorbed then agent pollutant se rate agent GetApply MicrobesScript patient script then product se remediator amount TreatmentAgent Another abstraction... patient GetApply substanceScript patient script then substance patient se

Examples of Concepts Described Compositionally a Fuel-Cell is a Producer of Electricity a Bulb is an Electrical Resistor that Produces Light a Camera is an Image Recording Device a Wire is a Conduit of Electricity

small A Library of Components easy to learn easy to use broad semantic distinctions (easy to choose) allows detailed pre-engineering

Library Contents actions — things that happen, change states –Enter, Copy, Replace, Transfer, etc. states — relatively temporally stable events –Be-Closed, Be-Attached-To, Be-Confined, etc. entities — things that are –Substance, Place, Object, etc. roles — things that are, but only in the context of things that happen –Container, Catalyst, Barrier, Vehicle, etc.

Library Contents relations between events, entities, roles –agent, donor, object, recipient, result, etc. –content, part, material, possession, etc. –causes, defeats, enables, prevents, etc. –purpose, plays, etc. properties between events/entities and values –rate, frequency, intensity, direction, etc. –size, color, integrity, shape, etc.

Computational Semantics Knowledge about Enter: –instances of Enter inherit axioms from Move, such as: the action changes the location of the object of the Move –before the Enter, the object is outside some enclosure –after the Enter, the object is inside that enclosure and contained by it –during the Enter, the object passes through a portal of the enclosure –if the portal has a covering, it must be open; and unless it is known to be closed, assume that it’s open –etc.

Searching the Library browsing the hierarchy top-down WordNet-based search –all components have hooks to WordNet –climb the WordNet hypernym tree with search terms –assemble: Attach, Come-Together mend: Repair infiltrate: Enter, Traverse, Penetrate, Move-Into gum-up: Block, Obstruct busted: Be-Broken, Be-Ruined

First Challenge Problem To enable biologists to encode college- level textbook knowledge about cells A small example: mRNA-Transport “mRNA is transported out of the cell nucleus into the cytoplasm” Transport: Move-Out-Of

unify

location

Evaluation Can Domain Experts learn to use the library to encode domain knowledge? Can sophisticated knowledge be captured through composition of components?

Methodology train biologists (4 graduate students) for six days have them encode knowledge from a college textbook, Essential Cell Biology by Bruce Alberts supply end-of-the-chapter-style Biology questions have the biologists pose the questions to their knowledge bases and record the answers have another biologist evaluate the answers on a scale of 0-3 qualitatively evaluate their KBs

Some Example Questions What nucleotide base pairs with adenine in RNA? How is uracil in RNA like thymine in DNA? What is the relationship between thymine and uracil? For a given bacterial gene, how are bacterial RNA and DNA molecules different? Describe RNA as a kind of polymer. What are the four bases/nucleotides of RNA? What is the relationship between a DNA gene and its RNA transcription product?

Evaluation — Productivity

Evaluation — Question Answering

Summary Knowledge Systems offer significant benefits compared with expert systems Multi-functional knowledge bases can be built … by domain experts, almost … and they will be, with or without sound principles of ontological engineering … and ontologists can significantly improve the results

Discussion Will the idiosyncrasies of specific domains overshadow the commonalities coded in the component library? How can NLP be used to pull information from text to build knowledge systems? How can knowledge acquisition systems use machine learning?