Cognitive Modelling Assignment Suzanne Cotter 09134328 March 2010.

Slides:



Advertisements
Similar presentations
How to Factor Quadratics of the Form
Advertisements

Setting Goals The difference between a goal and a dream is the written word. -Gene Donohue.
Here is how close you are to the knowledge or skills you are trying to develop, and heres what you need to do next.
YOU DONT KNOW CRAPS!!. An Investigation Using Probability, the Geometric Distribution, and Expected Returns for the Game of Craps By Otto Magdanz and.
Conceptual Clustering
Gallup Q12 Definitions Notes to Managers
Single category classification
Cognitive Modelling – An exemplar-based context model Benjamin Moloney Student No:
S3 Useful Expressions.
The Research Consumer Evaluates Measurement Reliability and Validity
Copyright © Cengage Learning. All rights reserved.
The Philosophy of Exotischism Listening To Your Heart 68 Happy Days and Magic Crystals Many years ago people would often talk about unlocking the secrets.
Probability Probability: what is the chance that a given event will occur? For us, what is the chance that a child, or a family of children, will have.
Visualizing The Direct Comparison Test for Infinite Series A Presentation by Pablito Delgado Sponsored by the Center for Academic Program Support (CAPS)
Cognitive Biases 2 Incomplete and Unrepresentative Data.
Mr F’s Maths Notes Statistics 8. Scatter Diagrams.
Naïve Bayes based Model Billy Doran “If the model does what people do, do people do what the model does?”
By Raza Shahid 8I. There are many questions about dreams, such as how do we have them? Why do they come? How to interpret your dreams and other similar.
Mr Barton’s Maths Notes
Single Category Classification Stage One Additive Weighted Prototype Model.
Cognitive Modelling Experiment Clodagh Collins. Clodagh Collins.
Cal State Northridge  320 Ainsworth Sampling Distributions and Hypothesis Testing.
MAE 552 – Heuristic Optimization Lecture 28 April 5, 2002 Topic:Chess Programs Utilizing Tree Searches.
Assessing cognitive models What is the aim of cognitive modelling? To try and reproduce, using equations or similar, the mechanism that people are using.
Physics and Measurements.
Traditional Method 2 means, σ’s known. The makers of a standardized exam have two versions of the exam: version A and version B. They believe the two.
Lesson 4: Percentage of Amounts.
Spelling Lists. Unit 1 Spelling List write family there yet would draw become grow try really ago almost always course less than words study then learned.
thinking hats Six of Prepared by Eman A. Al Abdullah ©
Use of spreadsheet Software!
Near East University Department of English Language Teaching Advanced Research Techniques Correlational Studies Abdalmonam H. Elkorbow.
Confirmation Bias. Critical Thinking Among our critical thinking questions were: Does the evidence really support the claim? Is there other evidence that.
Discriminant Function Analysis Basics Psy524 Andrew Ainsworth.
The Loan Welcome! So you’re looking to finance a car? Before you look at taking out loans make sure that you are financially able to pay for a vehicle.
Two Categories of Responders  Type 1 - Combinations of A and B treated as a fourth category (strategy evident in complete rejection of proposed categories.
AP Statistics Section 11.1 A Basics of Significance Tests
Lesson  Rebecca and Tova have math class right after lunch.  Rebecca always eats a hot lunch on days when she has an exam, because she has a theory.
Styles of Leadership LET II. Introduction Leadership styles are the pattern of behaviors that one uses to influence others. You can influence others in.
Polynomial Operations Hanna Jusufi Julia Ly Karl Bryan Ly Huynh Girl : I keep losing stuff when I try to multiply three binomials. Boy : I do, too, but.
Playing the Lottery 6/4/2016Copyright © 2010… REMTECH, inc … All Rights Reserved1 Probability Theory and the Lottery ● The Lottery picks numbers at random.
Advanced Quantitative Research ED 602. You know, Mary Stevens has really blossomed this year. She is doing much better. Actually, this whole fifth grade.
Multigroup Models Byrne Chapter 7 Brown Chapter 7.
ME AS A LEADER BLOCK 3. I am 18 years old, I have an older brother and a younger brother, I also work as a waitress at Rams Horn and I plan to go to college.
Data Analysis Econ 176, Fall Populations When we run an experiment, we are always measuring an outcome, x. We say that an outcome belongs to some.
Relationships If we are doing a study which involves more than one variable, how can we tell if there is a relationship between two (or more) of the.
TEACHER EFFECTIVENESS INITIATIVE VALUE-ADDED TRAINING Value-Added Research Center (VARC)
What is the HSPA???. HSPA - Overview The HSPA is the High School Proficiency Assessment that is given to juniors in New Jersey’s public schools. States.
STUDENT LED CONFERENCES SPRING 2015 BEN IVERSON. Cover Letter Science this year has covered a wide range of topics. We started the year with learning.
Model Classification Model by Barry Hennessy. Model Basis My model was based on the Prototype model: that previously seen examples are combined into a.
Review of statistical modeling and probability theory Alan Moses ML4bio.
1 Research Methods in Psychology AS Descriptive Statistics.
Omni-channel Maturity Analysis Lester Allan Lasrado Copenhagen Business School 28 th Jan 2016.
Statistics Tutorial.
The inference and accuracy We learned how to estimate the probability that the percentage of some subjects in the sample would be in a given interval by.
The accuracy of averages We learned how to make inference from the sample to the population: Counting the percentages. Here we begin to learn how to make.
King Faisal University جامعة الملك فيصل Deanship of E-Learning and Distance Education عمادة التعلم الإلكتروني والتعليم عن بعد [ ] 1 جامعة الملك فيصل عمادة.
The expected value The value of a variable one would “expect” to get. It is also called the (mathematical) expectation, or the mean.
FACTORS AFFECTING INSOMNIA… A01: SLEEP APNOEA…  A disorder whereby a person stops breathing whilst asleep.  These pauses can last anything between a.
Algebra The greatest mathematical tool of all!!. This is a course in basic introductory algebra. Essential Prerequisites: Ability to work with directed.
Personal Responsibility and Decision Making
DSMA 0393/1414 Comments of Students. Co-requisite Model Student Comments Students were given this request on their final examination: Write a statement.
Complex Numbers and Equation Solving 1. Simple Equations 2. Compound Equations 3. Systems of Equations 4. Quadratic Equations 5. Determining Quadratic.
The Fine Art of Knowing How Wrong You Might Be. To Err Is Human Humans make an infinitude of mistakes. I figure safety in numbers makes it a little more.
PeerWise Student Instructions
The greatest mathematical tool of all!!
Data Mining (and machine learning)
Individual Science project
Inferential statistics,
Using Statistical techniques in Geography
Presentation transcript:

Cognitive Modelling Assignment Suzanne Cotter March 2010

Initial Analysis The first approach I followed was to figure out how I would approach classifying each of the test items. Dim 1Dim 2Dim 3MeParticipants ABYABA CYBCC YACACAC XBBBC XXBBB I found that naturally, I looked at the probabilities of a symptom appearing on a certain dimension pointing to a specific disease. So A on dimension 1 was always A, Z on dimension 1 was always B and C on dimension 1 was always C...this made dimension 1 very powerful to me. Similarly, C on dimension 3 was almost always C and B on dimension 3 was almost always B...the only times these were overruled were when something more powerful occurred on dimension 1 (like A or C). This further enforced my theory that dimension 1 was very powerful.

I found dimension 2 very confusing and didn’t feel it helped a whole lot although it did lead me to classify ABY as AB because B on dimension 2 was always B and A on dimension 1 was always A...this didn’t apply to the participants. The reason dimension 2 was so confusing was because most of the symptoms were split across the 3 diseases...for example, X, Y and A were all symptoms of each of the diseases at some stage or another.

Exemplar V Prototype NOTE: in order to assess models, I used the correl() function in excel, the higher the correlation, the better the model. First model I tried was the exemplar one, counting AB as a category by itself...the results were very encouraging with a correlation of I then tried to improve matters as I didn’t believe people considered AB as a separate category (I’ll explain this later). The next model gave me a correlation of 0.86.

Exemplar V Prototype ctd’d Next I tried the prototype model and found that this didn’t class things as well as the exemplar model, with correlations of 0.75 and 0.74 depending on whether AB was classed as a separate disease or not. For the Exemplar model, I used attention parameters (S values) of 0.1 on Dimension 1, 0.8 on Dimension 2 and 0.5 on Dimension 3. This reproduced the effect of making Dimension 1 very powerful, Dimension 2 very weak, and Dimension 3 somewhere in the middle. However, I wanted to see if I could figure out how to model how people classified each test case myself, so I came up with a new model based on a mixture of probability and intuition!

My theory of how people classify With a list of data so large and complicated, people try and remember the main points of the list they have. They’ll remember best what’s “easy”. For example, in the current list of training data, whenever there is an A in dimension 1, the disease classification is always A (or A&B) so whenever they see A in dimension 1, they’re more inclined to classify anything in the test data with A in this dimension as disease A. Dim 1Dim 2Dim 3 AXCcategory A AYYcategory A AAXcategory A YAYcategory A XABcategories A and B ABXcategories A and B ZBBcategory B XBBcategory B YXBcategory B ZYBcategory B CAYcategory C CXBcategory C CYCcategory C CACcategory C CXCcategory C XYCcategory C

Dim1Dim2Dim3BCAIF AorBIF AorBorC D1ABY CYB01000 YAC XBC XXB00 0 Dim1Dim2Dim3BCCA D2ABY CYB YAC XBC XXB Dim1Dim2Dim3ACBAC D3ABY CYB 00 YAC XBC XXB

Dim1Dim2Dim3BCAIF AorBIF AorBorC D1ABY CYB01000 YAC XBC XXB00 0 Dim1Dim2Dim3BCCA D2ABY CYB YAC XBC XXB Dim1Dim2Dim3ACBAC D3ABY CYB 00 YAC XBC XXB If C on dim1, always classified as disease C, easiest to remember Chances of set of symptoms classified as B C on dim1 always C so assign it no weight for disease A Will be given weight for both disease A and B only if symptom matches If Dim1 is Y, people assign it a lower priority because it's too evenly split between disease A and B

Dim1Dim2Dim3BCAIF AorBIF AorBorC D1ABY CYB01000 YAC XBC XXB00 0 Dim1Dim2Dim3BCCA D2ABY CYB YAC XBC XXB Dim1Dim2Dim3ACBAC D3ABY CYB 00 YAC XBC XXB As on dim2, lower probability, people don't consider it as important Slightly higher chance of C than A or B so will only give weight to C if symtom matches

Dim1Dim2Dim3BCAIF AorBIF AorBorC D1ABY CYB01000 YAC XBC XXB00 0 Dim1Dim2Dim3BCCA D2ABY CYB YAC XBC XXB Dim1Dim2Dim3ACBAC D3ABY CYB 00 YAC XBC XXB instead of 0.2 don't think people would even consider there to be a 20% chance 5 times of 6 occurrences were B Slightly higher chance of Y being disease A than B so higher odds. Not as high as 2/3 though as on dim 3

ABCABMineACMineBCMine ABY CYB YAC XBC XXB ScaledABCABMineACMineBCMine ABY CYB YAC XBC XXB Equal to sum of the probabilities of anytime a symptom appeared under a disease category, e.g.. On dim1, ABY was given a 0.9 chance of being disease A PLUS on dim 3 it was given a 0.1 chance of being classified as A because Y wasn’t X, the most powerful symptom on dim3 PLUS 0.2 chance of being classified as A because Y is classified as A more often than C on dim3. It received no weighting on dim2 because at no stage did disease A ever stand out on dim2.

Dim1Dim2Dim3BCAIF AorBIF AorBorC D1ABY CYB01000 YAC XBC XXB00 0 Dim1Dim2Dim3BCCA D2ABY CYB YAC XBC XXB Dim1Dim2Dim3ACBAC D3ABY CYB 00 YAC XBC XXB Adding probabilities together equals 1.2

Analysis of Model The correlation works out at which is pretty good overall. When conjugating, I compared the Product and Average functions with my own function (explained later). I found overall that my own function correlated very slightly lower than the Average function which got a correlation of However, I chose my own function because I believe when people think there’s a very similar chance of a set of symptoms being disease A or being disease B, for example, then that’s when they conjugate and classify the symptoms as AB. With the average function, if 2 diseases have the same probabilities, A, B and AB will all be classified as having the same chance when I believe there’s a better chance people will conjugate.

Analysis cont’d When something is on dimension 2, even though the odds may say that there’s a 80% chance of a certain symptom being a certain disease, I believe people don’t rate it this highly because overall dimension 2 is spread very evenly across the 3 diseases. So an 80% chance becomes 60% or lower. When something is on dimension 1, it is often considered to be a higher probability than it actually is because people remember the first symptom in a group best (known as the primacy effect in psychology). The same might apply to dimension 3 because people remember the last thing in a list better (known as the recency effect). Dimension 3 loses some of it’s power though because there is nothing that is 100% classified as one disease here...C was classified as disease A once out of 5 occurrences, and B was classified as AB and C as well as disease B 4 times.

Analysis cont’d Another factor that reduces dimension 3’s power is the ability of dimension 1 to overrule it. With symptom C, for example, the reason it was classified as disease A in one case is because on dimension 1, the symptom A appeared which was always classified as disease A. This is the same reason that one instance of symptom B was classified as disease C...symptom C appeared on dimension 1. ABY and CYB were very easy to model as A on dimension 1 was 100% of the time disease A (or AB) and C on dimension 1 was 100% of the time disease C.

Analysis cont’d YAC was very difficult to model because though intuitively I understand why people classified this as AC (the C on dimension 3 was balanced out by the YA on the first 2 dimensions which was very similar to YAY which was classified as A...maybe YA was a very easy pattern to remember), I found it very difficult to conjugate this in the model XBC was also very difficult because I felt the disease should have been B based on the fact that B (or BC) on dimension 2 was always disease B and there was a training example XBB which was classed as disease B. C on dimension 3 was more powerful in people’s minds in this instance however. XXB was ok to model also because X on dimensions 1 and 2 was never a clear indicator of disease whereas B on dimension 3 was 83% of the time B.

Graph of Actual disease classification against Model disease classification

Issues - AB The classification of certain training examples into both categories A and B presented a problem when trying to classify. I found instead of simplifying things, it made it more complicated, especially when it came to modelling. For the conjugation of AC and BC, the usual methods of doing so applied, for example multiplying the probability of a disease being in A by the probability of a disease being in C or getting the average of the probability of a disease being in B and the probability of a disease being in C. However, as there were already 2 examples of a set of symptoms being classed as both A&B, it wasn’t as simple as just multiplying probabilities. From looking at the results of the classifications, it seemed that people were inclined to ignore the fact that 2 sets of symptoms were classed as A&B. This is demonstrated by test object ABY. ABY should have been classed as AB because there was an example already where ABX was classed as AB and 100% of cases where B was in dimension 2 were classed as B or AB. However, the majority classed it as A followed by AB. I feel this is because of the strength of symptom A in dimension 1. In order to replicate this in the exemplar model, I minimised the probability of the values being classed as AB and then used normal conjugation methods to classify items in AB.

Minimising values on AB

Issues - Conjugation In conjunctive classification, the model gives each item a score as a member of each conjunction or ANDED-PAIR of categories. So, what are the odds of someone classifying a certain set of symptoms as disease AC as opposed to A or C. It’s suggested to use the functions product, sum, min, normalised sum or average. In my models, average usually worked out as the best function overall. However, because of it’s nature, it will never classify a conjugated classification as the first choice. For example, if A=0.8, B=0.7, AB=( )/2=0.75. So even though the probabilities of A and B are very similar making this a good candidate for classification as AB, the model will always choose A over AB. To overcome this, I decided to do a check that if there was a difference of less than a certain amount(0.1 in this case) between 2 probabilities, the 2 diseases would be conjugated. I believe that’s the way people thing...if 2 things have very similar likelihoods of occurring, then people won’t choose one or the other, they will conjugate.

Correlation Function Aside Found the correlation function to be useful because it checks how similar the trend of the model against the trend of the participants average response is, though not very accurate if trying to predict how close each value is against each other For example: Actual values: 2,5,3,4,4. Model 1 Values: 4, 6, 6, 6,6. Model 2 Values: 3,6, 6, 4, 4 GRAPH: Correlation function gives for Actual values against Model1 values and against Model2 values though Model 2 looks closer than Model1. This is because it favours the general trend downwards of the Model1 values which matches that of the actual values.