Turing Tests with Turing Machines José Hernández Orallo DSIC, Universitat Politecnica de Valencia, Spain Javier Insa Cabrera DSIC,

Slides:

Advertisements

Similar presentations

Approaches, Tools, and Applications Islam A. El-Shaarawy Shoubra Faculty of Eng.

Advertisements

Lectures 6&7: Variance Reduction Techniques

AE1APS Algorithmic Problem Solving John Drake

LAST LECTURE. Functionalism Functionalism in philosophy of mind is the view that mental states should be identified with and differentiated in terms of.

Conceptual Clustering

Evolution and Repeated Games D. Fudenberg (Harvard) E. Maskin (IAS, Princeton)

Case Study: Genetic Algorithms GAs are an area of AI research –used to solve search problems with potentially better performance than traditional search.

Chapter 4 Design Approaches and Methods

Randomized Strategies and Temporal Difference Learning in Poker Michael Oder April 4, 2002 Advisor: Dr. David Mutchler.

M. Kathleen Kerr “Design Considerations for Efficient and Effective Microarray Studies” Biometrics 59, ; December 2003 Biostatistics Article Oncology.

Artificial intelligence COS 116, Spring 2012 Adam Finkelstein.

Games as Emergent Systems first schema on “rules”.

Artificial Intelligence in Game Design

© Curriculum Foundation1 Section 3 Assessing Skills Section 3 Assessing Skills There are three key questions here: How do we know whether or not a skill.

MATH 685/ CSI 700/ OR 682 Lecture Notes

Audiovisual Emotional Speech of Game Playing Children: Effects of Age and Culture By Shahid, Krahmer, & Swerts Presented by Alex Park

Minority Games A Complex Systems Project. Going to a concert… But which night to pick? Friday or Saturday? You want to go on the night with the least.

Integrating Bayesian Networks and Simpson’s Paradox in Data Mining Alex Freitas University of Kent Ken McGarry University of Sunderland.

1 Introduction to Computability Theory Lecture12: Reductions Prof. Amos Israeli.

Planning, Outlining, Drafting e.g. Formally Starting the Process.

Chapter Ten Artificial Intelligence I: Definitional Perspective.

Evolutionary Algorithms Simon M. Lucas. The basic idea Initialise a random population of individuals repeat { evaluate select vary (e.g. mutate or crossover)

An Introduction to Black-Box Complexity

D ESIGNING G AMES WITH A P URPOSE By Luis von Ahn and Laura Dabbish.

Lecture # 6 SCIENCE 1 ASSOCIATE DEGREE IN EDUCATION TEACHING OF SCIENCE AT ELEMENTARY LEVEL.

Machine Learning1 Machine Learning: Summary Greg Grudic CSCI-4830.

An Approach of Artificial Intelligence Application for Laboratory Tests Evaluation Ş.l.univ.dr.ing. Corina SĂVULESCU University of Piteşti.

1 CSI5388: Functional Elements of Statistics for Machine Learning Part I.

Senior Project – Computer Science – 2015 Modelling Opponents in Board Games Julian Jocque Advisor – Prof. Rieffel Abstract Modelling opponents in a game.

Exam 1 Median: 74 Quartiles: 68, 84 Interquartile range: 16 Mean: 74.9 Standard deviation: 12.5 z = -1: 62.4 z = -1: 87.4 z = -1z = +1 Worst Question:

BINF6201/8201 Hidden Markov Models for Sequence Analysis

Design Analysis Kristen Gardner CSE April 2007.

Cooperative Language Learning (CLL) Collaborative Learning (CL)

Vygotsky The zone of proximal development. The ZPD This was a term used by Vygotsky to refer to the distance between what a child can achieve alone, and.

Content-Area Writing Chapter 10 Writing for Tests and Assessments Darcey Helmick EIWP 2013.

Problems, Problem Spaces and Search

Siposs Arnold Konrad Artificial Intelligence Coordonator: Mr. Dr. Z. Pólkowski Turing Test.

Javier Insa-Cabrera José Hernández-Orallo Dep. de Sistemes Informàtics i Computació, Universitat Politècnica de València

Memory and Analogy in Game-Playing Agents Jonathan Rubin & Ian Watson University of Auckland Game AI Group

Artificial intelligence COS 116, Spring 2010 Adam Finkelstein.

1 On more realistic environment distributions for defining, evaluating and developing intelligence José Hernández Orallo 1, David L. Dowe 2, Sergio España-Cubillo.

Backtracking and Games Eric Roberts CS 106B January 28, 2013.

GAME PLAYING 1. There were two reasons that games appeared to be a good domain in which to explore machine intelligence: 1.They provide a structured task.

Basic Principles (continuation) 1. A Quantitative Measure of Information As we already have realized, when a statistical experiment has n eqiuprobable.

1 Turing Machines and Recursive Turing Tests José Hernández Orallo 1, Javier Insa-Cabrera 1, David L. Dowe 2, Bill Hibbard 3, 1.Departament de Sistemes.

Interaction settings for measuring (social) intelligence in multi-agent systems Javier Insa-Cabrera, José Hernández-Orallo Dep. de Sistemes Informàtics.

In seven-ish easy steps….  The thesis statement is essentially choosing a side.  Each side WANTS to win the argument  BUT THERE CAN BE ONLY ONE! (That.

Machine Learning in Practice Lecture 5 Carolyn Penstein Rosé Language Technologies Institute/ Human-Computer Interaction Institute.

Evaluation, Testing and Assessment June 9, Curriculum Evaluation Necessary to determine – How the program works – How successfully it works – Whether.

FOUNDATIONS OF ARTIFICIAL INTELLIGENCE

SP 2015 CP PROBABILITY & STATISTICS Observational Studies vs. Experiments Chapter 11.

1 Comparing Humans and AI Agents Javier Insa-Cabrera 1, David L. Dowe 2, Sergio España-Cubillo 1, M.Victoria Hernández-Lloreda 3, José Hernández Orallo.

The Standard Genetic Algorithm Start with a “population” of “individuals” Rank these individuals according to their “fitness” Select pairs of individuals.

Information Technology Michael Brand Joint work with David L. Dowe 8 February, 2016 Information Technology.

Organic Evolution and Problem Solving Je-Gun Joung.

Evolution of Cooperation in Mobile Ad Hoc Networks Jeff Hudack (working with some Italian guy)

Artificial Intelligence Hossaini Winter Outline book : Artificial intelligence a modern Approach by Stuart Russell, Peter Norvig. A Practical Guide.

CS 154 Formal Languages and Computability April 12 Class Meeting Department of Computer Science San Jose State University Spring 2016 Instructor: Ron Mak.

Artificial Intelligence in Game Design Board Games and the MinMax Algorithm.

Probability and statistics - overview Introduction Basics of probability theory Events, probability, different types of probability Random variable, probability.

Substitution Matrices and Alignment Statistics BMI/CS 776 Mark Craven February 2002.

Towards a Universal Test of Social Intelligence PhD Thesis Javier Insa Cabrera Valencia, SpainMay 16, 2016 Supervisor: José Hernández Orallo.

 Presented By: Abdul Aziz Ghazi  Roll No:  Presented to: Sir Harris.

Explanations of forgetting

A Conceptual Design of Multi-Agent based Personalized Quiz Game

Responses to the Design argument

Classroom Assessment Validity And Bias in Assessment.

Introduction Artificial Intelligent.

A Gentle introduction Richard P. Simpson

Cooperative Language Learning

Presentation transcript:

Turing Tests with Turing Machines José Hernández Orallo DSIC, Universitat Politecnica de Valencia, Spain Javier Insa Cabrera DSIC, Universitat Politecnica de Valencia, Spain David L. Dowe Monash University, Australia Bill Hibbard University of Wisconsin - Madison, USA

2 Intelligence Evaluation: Intelligence has been evaluated by humans in all periods of history. Only in the XXth century, this problem has been addressed scientifically: Human intelligence evaluation. Animal intelligence evaluation. What about machine intelligence evaluation? Turing Test: The imitation game was not really conceived by Turing as a test, but as a compelling argument. Problems of using the imitation game as a test of intelligence. Is there an alternative principled way of measuring intelligence? The comparative approach

3 During the past 15 years, there has been a discreet line of research advocating for a formal, computational approach to intelligence evaluation. Issues: Humans cannot be used as a reference. – No arbitrary reference is chosen. Otherwise, comparative approaches would become circular. Intelligence is a gradual (and most possibly factorial) thing. – It must be graded accordingly. Intelligence as performance on a diverse tasks and environments. – Need to define these tasks and environments. The difficulty of tasks/environments must be assessed. – Not on populations (psychometrics), but from computational principles. Computational measurement of intelligence

4 Problems this line of research is facing at the moment. Most approaches are based on tasks/environments which represent patterns that have to be discovered and correctly employed. These tasks/environments are not representative of what an intelligence being may face during its life. This idea prompted the definition of a different distribution of environments: Darwin-Wallace distribution (Hernandez-Orallo et al. 2011): environments with intelligent systems have higher probability. It is a recursive (but not circular) distribution. While resembles artificial evolution, it is guided and controlled by intelligence tests, rather than selection due to other kind of fitness. (Social) intelligence is the ability to perform well in an environment full of other agents of similar intelligence Computational measurement of intelligence

5 The setting of the Darwin-Wallace distribution suggests: Comparative approaches may not only be useful but necessary. The Turing Test might be more related to social intelligence than other kinds of intelligence. This motivates a reunion between the line of research based on computational, information-based approaches to intelligence measures with the Turing Test. However, this reunion has to be made without renouncing to one of the premises of our research: the elimination of the human reference. Use (Turing) machines, and not humans, as references. Make these references meaningful by recursion Reunion: bridging antagonistic views

6 Generalisation of the Turing Test

7 The Turing Test makes some particular choices: Takes the human reference from a distribution: adult homo sapiens. Takes the judges from a distribution (also adult homo sapiens) but they are also instructed on how to evaluate. But other choices can be made. Informally? A Turing Test for Nobel laureates, for children, for dogs or other populations? Formally? Generally? Nothing is more formal and general than a Turing Machine. Turing Test for Turing Machines

8 Distribution D Reference Subject A Evaluee B Judge J Interaction I The Turing Test for Turing Machines Distribution D Reference Subject A Evaluee B Judge J Interaction I

9 The simplest adversarial Turing Test: Symmetric roles: Evaluee B tries to imitate A. It plays the predictor role. Reference A tries to evade B. It plays the evader role. This setting is exactly the matching pennies problem. Predictors win when both coins are on the same side. Evaders win when both coins show different sides. The Turing Test for Turing Machines

10 Interestingly, Matching pennies was proposed as an intelligence test (adversarial games) (Hibbard 2008, 2011). The distribution of machines D is crucial. Machines with very low complexity (repetitive) are easy to identify. Machines with random outputs have very high complexity and are impossible to identify (a tie is the expected value). Can we derive a more realistic distribution? The Turing Test for Turing Machines

11 The Turing Test can start with a base distribution for the reference machines. Whenever we start giving scores to some machines, we can start updating the distribution. Machines which perform well will get higher probability. Machines which perform badly will get lower probability. By doing this process recursively: We get a distribution with different levels of difficulties. It is meaningful for some instances, e.g., matching pennies. Recursive TT for TMs

12 Recursive TT for TMs

13 Recursive TT for TMs The previous definition has many issues. Divergent? Intractable. But still useful conceptually. In practice, it can be substituted by a (sampling) ranking system: (e.g.) Elo’s rating system in chess. Given an original distribution, we can update the distribution by randomly choosing pairs and updating the probability.

14 Depending on the agents and the game where they are evaluated, the resulting distribution can be different. Possible resulting distributions

15 The notion of Turing Test with Turing Machines is introduced as a way: To get rid of the human reference in the tests. To see very simple social intelligence tests, mainly adversarial. The idea of making it recursive tries to: escape from the universal distribution. derive a different notion of difficulty. The setting is still too simple to make a feasible test, but it is already helpful to: Bridge the (until now) antagonistic views of intelligence testing using the Turing Test or using computational formal approaches using Kolmogorov Complexity, MML, etc. Link intelligence testing with (evolutionary) game theory. Conclusions