Probability in Decline

Slides:



Advertisements
Similar presentations
Fundamentals of Probability
Advertisements

CS CS1512 Foundations of Computing Science 2 Lecture 23 Probability and statistics (4) © J.
CS1512 Foundations of Computing Science 2 Week 3 (CSD week 32) Probability © J R W Hunter, 2006, K van Deemter 2007.
Residual Income Valuation: Valuing Common Equity
Ricard V. Solè and Sergi Valverde Prepared by Amaç Herdağdelen
I. The Goal of Science. To describe the structure and function of the physical universe.
Multiple Regression and Model Building
Review and Preview and Random Variables
Experiments and Variables
Time Series and Forecasting
Myers’ PSYCHOLOGY (7th Ed)
Myers’ PSYCHOLOGY (7th Ed) Chapter 1 Thinking Critically with Psychological Science James A. McCubbin, PhD Clemson University Worth Publishers.
Part II – TIME SERIES ANALYSIS C5 ARIMA (Box-Jenkins) Models
Research Methods in Psychology
Critical Thinking.
CHAPTER 6 Statistical Analysis of Experimental Data
Scientific method - 1 Scientific method is a body of techniques for investigating phenomena and acquiring new knowledge, as well as for correcting and.
Slide 1 Statistics Workshop Tutorial 4 Probability Probability Distributions.
Lecture Slides Elementary Statistics Twelfth Edition
Time Series and Forecasting
© 2016 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.
Chapter 2: The Research Enterprise in Psychology
Chapter 4 Probability 4-1 Overview 4-2 Fundamentals 4-3 Addition Rule
Determining Sample Size
Chapter 2 The Research Enterprise in Psychology. n Basic assumption: events are governed by some lawful order  Goals: Measurement and description Understanding.
History & Research Research Methods Unit 1 / Learning Goal 2.
Chapter 1: The Research Enterprise in Psychology.
Theory of Probability Statistics for Business and Economics.
Chapter 4 Correlation and Regression Understanding Basic Statistics Fifth Edition By Brase and Brase Prepared by Jon Booze.
Chapter 8: Probability: The Mathematics of Chance Lesson Plan Probability Models and Rules Discrete Probability Models Equally Likely Outcomes Continuous.
Signal Detection Theory October 10, 2013 Some Psychometrics! Response data from a perception experiment is usually organized in the form of a confusion.
Population distribution VS Sampling distribution
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
1 1 Slide © 2016 Cengage Learning. All Rights Reserved. Probability is a numerical measure of the likelihood Probability is a numerical measure of the.
Mary Jones. Psychology: The Science of Behavior and Mental Processes Psychologists attempt to understand Observable behavior: Such as speech and physical.
Statistics Introduction 2. The word Probability derives from the Latin probabilitas, which can also mean probity, a measure of the authority of a witness.
Module 2 Research Strategies.
SNS neutron background measurements using a portable 3 He LPSD detector.
Conducting A Study Designing Sample Designing Experiments Simulating Experiments Designing Sample Designing Experiments Simulating Experiments.
1 Chapter 4 – Probability An Introduction. 2 Chapter Outline – Part 1  Experiments, Counting Rules, and Assigning Probabilities  Events and Their Probability.
© 2011 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.
© 2011 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.
Module 2 Research Strategies. Scientific Method A method of learning about the world through the application of critical thinking and tools such as observation,
Theories and Hypotheses. Assumptions of science A true physical universe exists Order through cause and effect, the connections can be discovered Knowledge.
Chapter 8: Probability: The Mathematics of Chance Lesson Plan Probability Models and Rules Discrete Probability Models Equally Likely Outcomes Continuous.
Module 2 Research Strategies. Scientific Method A method of learning about the world through the application of critical thinking and tools such as observation,
+ Chapter 5 Overview 5.1 Introducing Probability 5.2 Combining Events 5.3 Conditional Probability 5.4 Counting Methods 1.
1 Chapter 4, Part 1 Basic ideas of Probability Relative Frequency, Classical Probability Compound Events, The Addition Rule Disjoint Events.
Biology. Observations are a critical component of science.
Chapter 8: Probability: The Mathematics of Chance Probability Models and Rules 1 Probability Theory  The mathematical description of randomness.  Companies.
Chapter 8: Probability: The Mathematics of Chance Lesson Plan Probability Models and Rules Discrete Probability Models Equally Likely Outcomes Continuous.
Time Series and Forecasting Chapter 16 McGraw-Hill/Irwin Copyright © 2012 by The McGraw-Hill Companies, Inc. All rights reserved.
Copyright ©2004 The McGraw-Hill Companies, Inc. Permission required for reproduction or display. 4-1 Probability and Counting Rules CHAPTER 4.
Chapter 2: The Research Enterprise in Psychology.
Review Design of experiments, histograms, average and standard deviation, normal approximation, measurement error, and probability.
Definition Slides Unit 2: Scientific Research Methods.
Definition Slides Unit 1.2 Research Methods Terms.
Forecasting Quantitative Methods. READ FIRST Outline Define Forecasting The Three Time Frames of Forecasting Forms of Forecast Movement Forecasting Approaches.
Research Design. How do we know what we know? The way we make reasoning Deductive logic Begins with one or more premises, reasoning then proceeds logically.
Unit 1 Review. 1.1: representing data Types of data: 1. Quantitative – can be represented by a number Discrete Data Data where a fraction/decimal is not.
Chapter 2: The Research Enterprise in Psychology
Probability and Counting Rules
7 How to Decide Which Variables to Manipulate and Measure Marziyeh Rezaee.
Chapter 4. Probability
PROBABILITY AND STATISTICS
Simple Random Sample A simple random sample (SRS) of size n consists of n elements from the population chosen in such a way that every set of n elements.
Introduction.
Feifei Li, Ching Chang, George Kollios, Azer Bestavros
Hindsight Bias Tendency to believe, after learning an outcome, that one would have foreseen it. “I knew.
Thinking critically with psychological science
Presentation transcript:

Probability in Decline Dean Brooks Ekaros Analytical Inc. Vancouver, B.C. Brooks17@gmail.com 604-817-0734

What is nonlinear probability? Application of the maximum entropy principle to predict gradual decline in frequency of rare items with increasing set size

Maximum entropy principle A probability distribution whose entropy is at least as great as that of all other members of a specified class of distributions Combines information constraint (minimizing what we expect to know about a system) with physical tendency of nature (systems wind up in this configuration).

Maximum entropy applied to nonlinear cases If the set is incomplete and growing, neither average nor total entropy are necessarily constant Increasing entropy for the set as a whole implies an increase in entropy for later elements. Net entropy can be increased in this way if rare items become rarer. This behavior has been termed a ‘decline effect’.

An example from everyday life Decline in participation in submitting comments on video-sharing websites like YouTube or Vimeo Initially, for every 10 views, one comment on average Average falls in very predictable fashion as the total set of views grows

Audience response to the ‘Flagpole Sitta’ lip dub video Nearly perfect log-log relationship from 10,000 to 400,000 views Participation in comments gradually falls from 2.3 % to 0.11 % Behavior of visitors predictably different even from morning to afternoon of same day

Another practical application: H1N1 pandemic 2009 H1N1 pandemic mortality and transmission rates based on cases in May were forecast to be high, turned out to be very low Nonlinear forecast model showed in June what the rates would fall to by October, but WHO continued to use linear assumptions

Official WHO data The initial outbreak saw alarming transmission rates between 30 and 50 percent per day By summer 2009, transmission fell to 5-10 % per day, often less By the fall, transmission was well below 1 % and WHO had abandoned official case counts

The long search for a theory My initial hypothesis in the 1990’s was that decline required a certain minimum level of system complexity (it applied to people or biological organisms but not mechanical random-number devices). Experiments starting in 2003 showed otherwise: even very simple systems decline A lengthy literature search led to early ESP experiments and George Spencer-Brown

George Spencer-Brown (1923-) Wartime naval officer 1943-47 Oxford mathematician and philosopher Colleague of Bertrand Russell, studied under Ludwig Wittgenstein Most famous for Laws of Form (1969)

ESP in the 1950’s Still semi-respectable at that time Papers would appear in Nature Blind, well-supervised experiments with good controls showed a baffling and persistent “decline effect” Correlations would start out high, then gradually fall off to below chance levels

Spencer-Brown’s hypothesis in Probability and Scientific Inference (1957) ESP is not reproducible, but the decline effect is. Assume that there is no ESP. Then these are large-scale experiments in probability and nothing more. Spencer-Brown’s own experiments using ‘chance machines’ without a human subject produce the same odd pattern. His hypothesis: perhaps these results are evidence of a subtle defect in probability theory itself.

‘Atomic’ and ‘molecular’ events Spencer-Brown referred to ‘atomic’ and ‘molecular’ events, atomic being ‘A’ or ‘B’ in this sequence, and molecular being ‘AB’ or ‘AA’ or ‘BB’ : AABABBAABAB The method laid out by Jaynes in 1957 calls for multiple layers of interpretation, each independent of the others (rate of occurrence of ‘AAA’ can decline or be different even though rate of occurrence of ‘A’ has not changed). Fundamental contrast with the approach of classical probability in which all ‘molecular’ events are strict permutations

Spencer-Brown’s experimental method Identification of ‘organic’ boundaries between data sets (e.g. change of ESP subject, new day, new machine) Comparison of first and last half (good for small data sets) Analysis by quadrants: First 25 percent of data set compared to last 25 percent

Classical expectation vs. actual result If experiments are well-controlled, there will be no bias toward fewer “rares” and trend graphs will tend to be flat The actual result: in hundreds of experiments, 80-90 percent show bias Bias is stronger as p(rare) << 1 Bias persists longer as p(rare) << 1

Tools developed since Spencer-Brown Present the data as a cumulative log-log plot Show classical odds line if available Assess likelihood of deviation from classical odds line (most experiments have wound up in the range 0.1 > p > 0.01)

Experiment with flattened dice I threw a set of four slightly flattened dice 100,000 times to see if rares would decline The favored axis showed up 37.5 percent of the time, not 33.3 The favored axis increased its predominance over time

A large range of data sets Old ESP experiments Recent ESP experiments Spencer-Brown’s ‘chance machines’ Modern random-number generators Casino games Dice, coins, toy roulette wheels

Weaknesses in standard testing Conventional tests work with relatively short blocks of data, often 1 kbit A surplus of long, repetitive strings (say 1111111111) cannot be detected using this approach. A shortage cannot be detected either. The decline effect encompasses the entire historical output of the machine, often hundreds of thousands of items. Testing would be expensive and time-consuming.

Entropy as the governing principle The most important clue is the consistent logarithmic decline Changes in the proportion of rares are steepest for low N Similar amounts of change for N, 10N, 100N events

Every field that uses probability will be affected. Future prospects This has enormous practical and theoretical potential in economics, biology, epidemiology, Web traffic analysis, sociology, military science. Every field that uses probability will be affected.