Pat Langley School of Computing and Informatics Arizona State University Tempe, Arizona Institute for the Study of Learning and Expertise Palo Alto, California.

Slides:



Advertisements
Similar presentations
Computational Revision of Ecological Process Models
Advertisements

Pat Langley Dileep George Stephen Bay Computational Learning Laboratory Center for the Study of Language and Information Stanford University, Stanford,
Pat Langley Computational Learning Laboratory Center for the Study of Language and Information Stanford University, Stanford, California
Pat Langley Institute for the Study of Learning and Expertise Javier Sanchez CSLI / Stanford University Ljupco Todorovski Saso Dzeroski Jozef Stefan Institute.
Pat Langley Institute for the Study of Learning and Expertise Palo Alto, California and Center for the Study of Language and Information Stanford University,
Pat Langley Institute for the Study of Learning and Expertise and Center for the Study of Language and Information Stanford University, Stanford, California.
Pat Langley Institute for the Study of Learning and Expertise Palo Alto, California and Center for the Study of Language and Information Stanford University,
Pat Langley Computational Learning Laboratory Center for the Study of Language and Information Stanford University, Stanford, California
Pat Langley School of Computing and Informatics Arizona State University Tempe, Arizona USA Mental Simulation and Learning in the I CARUS Architecture.
Pat Langley School of Computing and Informatics Arizona State University Tempe, Arizona USA Modeling Social Cognition in a Unified Cognitive Architecture.
Pat Langley Center for the Study of Language and Information Stanford University, Stanford, California
Pat Langley Computational Learning Laboratory Center for the Study of Language and Information Stanford University, Stanford, California USA
Pat Langley School of Computing and Informatics Arizona State University Tempe, Arizona Institute for the Study of Learning and Expertise Palo Alto, California.
Pat Langley Institute for the Study of Learning and Expertise Palo Alto, California and Center for the Study of Language and Information Stanford University,
Pat Langley School of Computing and Informatics Arizona State University Tempe, Arizona A Cognitive Architecture for Integrated.
Pat Langley Institute for the Study of Learning and Expertise Palo Alto, California and Center for the Study of Language and Information Stanford University,
Pat Langley School of Computing and Informatics Arizona State University Tempe, Arizona Computational Discovery.
Pat Langley School of Computing and Informatics Arizona State University Tempe, Arizona USA A Unified Cognitive Architecture for Embodied Agents Thanks.
Pat Langley Center for the Study of Language and Information Stanford University, Stanford, California
Pat Langley School of Computing and Informatics Arizona State University Tempe, Arizona Computational Discovery of Explanatory Process Models Thanks to.
Javier Sanchez Pat Langley Computational Learning Laboratory Center for the Study of Language and Information Stanford University, Stanford, California.
Pat Langley Institute for the Study of Learning and Expertise Palo Alto, California and Center for the Study of Language and Information Stanford University,
SETTINGS AS COMPLEX ADAPTIVE SYSTEMS AN INTRODUCTION TO COMPLEXITY SCIENCE FOR HEALTH PROMOTION PROFESSIONALS Nastaran Keshavarz Mohammadi Don Nutbeam,
TASK: The comparison between basic and applied research.
The Role of Environmental Monitoring in the Green Economy Strategy K Nathan Hill March 2010.
1. Review- What is Science Explain- What kinds of understandings does science contribute about the natural world Form an Opinion- Do you think that scientists.
Paper Discussion: “Simultaneous Localization and Environmental Mapping with a Sensor Network”, Marinakis et. al. ICRA 2011.
Chapter 1 Conducting & Reading Research Baumgartner et al Chapter 1 Nature and Purpose of Research.
Scientific Thinking - 1 A. It is not what the man of science believes that distinguishes him, but how and why he believes it. B. A hypothesis is scientific.
Information Technology, Informatics, & Information Science How do they relate to each other? to each other?
Introduction to Communication Research
Introduction To Science
Srihari-CSE730-Spring 2003 CSE 730 Information Retrieval of Biomedical Text and Data Inroduction.
Inquiry Based Learning EDCP a July 7, 2015.
PROGRAMMING LANGUAGES The Study of Programming Languages.
9/30/2004TCSS588A Isabelle Bichindaritz1 Introduction to Bioinformatics.
It would be difficult to support any claim that problem solving is not an important goal in the teaching of science. Scientists by their very nature are.
Functional Genomic Hypothesis Generation and Experimentation by a Robot Scientist King et al, Nature : Presented by Monica C. Sleumer February.
Research !!.  Philosophy The foundation of human knowledge A search for a general understanding of values and reality by chiefly speculative rather thanobservational.
Methods of Science Section 1.1. Methods of Science 3 areas of science: Life, Earth, Physical –What is involved in each? Scientific Explanations- not always.
Pat Langley Adam Arvay Department of Computer Science University of Auckland Auckland, NZ Heuristic Induction of Rate-Based Process Models Thanks to W.
Chapter 2 Paradigms, Theory, And Research. Chapter Outline Some Social Science Paradigms Elements of Social Theory Two Logical Systems Revisited Deductive.
Scholarly communications Discussion group Linked Data Workshop May 2010.
Copyright © by Holt, Rinehart and Winston. All rights reserved. ResourcesChapter menu Copyright © by Holt, Rinehart and Winston. All rights reserved. ResourcesChapter.
The Field of Psychology Gaining Insight into Behavior Behavior results from physiological (physical) processes and cognitive (intellectual) processes.
GEOPARTY!. The Branches Of Earth Science Miscellaneous Identify the Pictured Science Modern Science Scientific Methods
Introduction to Science Informatics Lecture 1. What Is Science? a dependence on external verification; an expectation of reproducible results; a focus.
Discovering Descriptive Knowledge Lecture 18. Descriptive Knowledge in Science In an earlier lecture, we introduced the representation and use of taxonomies.
Cell Theory SC.912.L.14.1 Describe the scientific theory of cells (cell theory), and relate the history of its discovery to the process of science. To.
Lesson Overview Lesson Overview What Is Science? Lesson Overview 1.1 What Is Science?
Section 2 Scientific Methods Chapter 1 Bellringer Complete these two tasks: 1. Describe an advertisement that cites research results. 2. Answer this question:
Chapter 1 – Introducing Psychology Section 1 - Why Study Psychology Section 2 – A Brief History in Psychology Section 3 – Psychology as a Profession.
Contrasting views of science: Popper vs. Kuhn. Sir Karl Popper Sir Karl Popper was a member of the Vienna Circle in the earlier part of the 20th century.
1 CSCD 326 Data Structures I Software Design. 2 The Software Life Cycle 1. Specification 2. Design 3. Risk Analysis 4. Verification 5. Coding 6. Testing.
Theme 2: Data & Models One of the central processes of science is the interplay between models and data Data informs model generation and selection Models.
Three Critical Matters in Big Data Projects for e- Science Kerk F. Kee, Ph.D. Assistant Professor, Chapman University Orange, California
Requirements Engineering. Requirements engineering processes The processes used for RE vary widely depending on the application domain, the people involved.
Research for Nurses: Methods and Interpretation Chapter 1 What is research? What is nursing research? What are the goals of Nursing research?
Copyright © by Holt, Rinehart and Winston. All rights reserved. Resources Chapter menu Section 2 Scientific Methods Chapter 1 Bellringer Complete these.
Introduction to Life Science. Science is a way of learning about the natural world Scientific inquiry – all the diverse ways in which scientist study.
PIAGET Termed theory genetic epistemology. Asked what would structure of thought need to be to generate formal, scientific view of reality & how would.
Pat Langley School of Computing and Informatics Arizona State University Tempe, Arizona Computational Assistance for Systems Biology of Aging Thanks to.
Scientific Models.
Borrett et al Computational Discovery of Process Models for Aquatic Ecosystems August 2006 Ecological Society of America, Memphis, TN Natasa Atanasova.
How to Research Lynn W Zimmerman, PhD.
Section 2: Scientific Methods
Section 2: Scientific Methods
Ontology-Based Information Integration Using INDUS System
Section 2: Scientific Methods
Nature of Science Understandings for HS
Presentation transcript:

Pat Langley School of Computing and Informatics Arizona State University Tempe, Arizona Institute for the Study of Learning and Expertise Palo Alto, California Challenges for the Computational Discovery of Scientific Knowledge Thanks to K. Arrigo, D. Billman, M. Bravo, S. Borrett, W. Bridewell, S. Dzeroski, and L. Todorovski for their contributions to this research, which is funded by a grant from the National Science Foundation.

Drawbacks of Scientific Data Mining  generates models in forms inappropriate to most sciences  makes incorrect assumptions about the available inputs  focuses on convenient algorithmic issues, not scientists’ needs Because it borrows from work on commercial applications, most work on scientific data mining: We need to redirect attention toward a broader range of discovery tasks that actually arise in scientific fields. Data-mining researchers would benefit from looking at the older literature on computational scientific discovery.

Claim 1: Scientific Notations NPPc =  month max (E  ·  IPAR, 0) E = 0.56 · T1 · T2 · W T1 = · Topt – · Topt 2 T2 = 1.18 / [(1 + e 0.2 · (Topt – Tempc – 10) ) · (1 + e 0.3 · (Tempc – Topt – 10) )] W = · EET / PET PET = 1.6 · (10 · Tempc / AHI) A · PET-TW-M if Tempc > 0 PET = 0 if Tempc < 0 A = · AHI 3 – · AHI · AHI IPAR = 0.5 · FPAR-FAS · Monthly-Solar · Sol-Conver FPAR-FAS = min [(SR-FAS – 1.08) / SR (UMD-VEG), 0.95] SR-FAS = (Mon-FAS-NDVI ) / (Mon-FAS-NDVI – 1000) DFR NBLANBLR RRPhoto PBS Health psbA1 psbA2 cpcB Light + Traditional data-mining notations are not easily understood by or communicated to domain scientists. Most sciences state and communicate models in formalisms they have used for decades. We need more work on discovering scientific knowledge cast in communicable forms (Dzeroski & Todorovski, 2007). Ecosystem model Gene regulation model

Claim 2: Background Knowledge Scientists often have initial knowledge that should influence the discovery process. Ignoring this knowledge can produce models that scientists reject as nonsensical (Pazzani et al., 2001). ModelRevision Initial model DFR NBLANBLR RRPhoto PBS Health psbA1 psbA2 cpcB Light + Observations Revised model × DFR NBLANBLR RRPhoto PBS Health psbA1 psbA2 cpcB Light + ×

Claim 3: Small Data Sets Most data-mining work assumes that large data sets are available. But in many scientific domains, data are rare and hard to obtain. Discovering scientific knowledge from small data sets raises an entirely different set of challenges (Lee et al., 1998). We need more research on this important aspect of discovery. Ecosystem model Gene regulation model Number of variables Number of equations Number of parameters Number of samples 8 11  Number of variables Number of initial links Number of initial links Number of possible links Number of possible links Number of samples Number of samples 911  70 20

Claim 4: Scientific Explanation Most work on data mining finds models that, although accurate, merely describe the observations. However, scientists often want models that explain their data using familiar concepts. Explanatory models can include theoretical entities and processes that link back to domain knowledge (Langley et al., 2002). Ecosystem model Gene regulation model NPPc IPAR PET T1T2We_max E EET Tempc Topt NDVI SOLAR AHI A PETTWM SR FPAR VEG DFR NBLANBLR RRPhoto PBS Health psbA1 psbA2 cpcB Light +

Claim 5: Interactive Discovery Most data-mining work focused on entirely automated algorithms. But most scientists want computational aids rather than systems that would replace them. We need more work on interactive discovery (Bridewell et al., 2007). ModelRevision Initial model DFR NBLANBLR RRPhoto PBS Health psbA1 psbA2 cpcB Light + Observations Revised model × DFR NBLANBLR RRPhoto PBS Health psbA1 psbA2 cpcB Light + × Domain user

The P ROMETHEUS System (Bridewell et al., 2007)