Presentation is loading. Please wait.

Presentation is loading. Please wait.

First Things First Log Tables, Random Numbers, and Tax Fraud Ahbel, copyright 2001.

Similar presentations


Presentation on theme: "First Things First Log Tables, Random Numbers, and Tax Fraud Ahbel, copyright 2001."— Presentation transcript:

1 First Things First Log Tables, Random Numbers, and Tax Fraud Ahbel, copyright 2001

2 The First Digit Phenomenon “Everyday numbers obey a law so unexpected it is hard to believe it's true.” New Scientist - 10 July 1999 Ahbel, copyright 2001

3 Simon Newcomb Astronomer and mathematician Log tables 1881 American Journal of Mathematics “The law of probability of the occurrence of numbers is such that all mantissae of their logarithms are equally likely.” Newcomb’s Conjecture P=log 10 (1+1/d) http://cepa.newschool.edu/het/profiles/newcomb.htm

4 Ahbel, copyright 2001 Graph of the Distribution http://pass.maths.org.uk/issue9/features/benford/

5 Ahbel, copyright 2001 Frank Benford 57 years later (1938) G.E. Physicist p=log 10 (1+1/d) Proceedings of the American Philosophical Society 20,229 observations Areas of 335 rivers, American League baseball statistics, specific heats of 1389 chemical compounds, numbers appearing in reader’s digest articles and newspapers.

6 Ahbel, copyright 2001 Newspapers – Benford in 1938 1990 Census – Mark Nigrini Dow Jones (1990-93) – Eduardo Ley http://www.rexswain.com/benford.html

7 Ahbel, copyright 2001 An Intuitive Explanation Plant 1 tree per year Every tree grows 1 cm per year What happens to the distribution of first digits of heights over time?

8 Ahbel, copyright 2001 Roger Pinkham Rutgers 1961 Base invariance Scale invariance Benford’s law is the only way! http://www.seanet.com/~ksbrown/kmath302.htm

9 Ahbel, copyright 2001 Scale Invariant Numbers seem to conform to a first digit phenomenon described by Benford’s Law Benford’s Law seems to be Scale Invariant If there is a first digit phenomenon that is scale invariant, then multiplying numbers by a constant will not change the distribution of the first digits.

10 Ahbel, copyright 2001 We can express any number in scientific notation: x*10 n, 1  x<10 If the distribution of x is scale-invariant, then the distribution of y = log 10 x must be “translation invariant”: log 10 ax = log 10 a + log 10 x = log 10 a + y If multiplying x by a constant leaves the distribution of first digits unchanged then adding a constant to y will leave the distribution of first digits unchanged constant

11 Ahbel, copyright 2001 x is on 1  x < 10 y is on log 10 1  y < log 10 10 y is on 0  y < 1 Since the only distribution that is “translation invariant” is the uniform distribution, y must be distributed uniformly over its interval, 0  y < 1. This is the probability distribution function y=1.

12 Ahbel, copyright 2001 P(d 1 =1) = P(1  x < 2) = P(log 10 1  y < log 10 2) = P( 0  y < log 10 2) = log 10 2 - 0 log 10 2  30.1% y=log 10 x 1 0 01 P log 10 2 d 1 =1 log 10 2log 10 3 P 1 0 01 d 1 =2 log 10 3-log 10 2  17.6%

13 Ahbel, copyright 2001 In general, P(d 1 =d) = P( d  x < d+1 ) = P(log 10 d  y < log 10 (d+1)) = log 10 (d+1) - log 10 d

14 Ahbel, copyright 2001 Theodore Hill Georgia Institute of Technology 1996 All is outcome of process normal, logarithmic, oscillating, etc. distributions “Now imagine grabbing random handfuls of data from a hotchpotch of such distributions. Hill proved that as you grab ever more of such numbers, the digits of these numbers will conform ever closer to a single, very specific law. This law is a kind of ultimate distribution, the “Distribution of Distributions.” And he showed that its mathematical form is…Benford’s Law.” New Scientist 10 July 1999

15 Ahbel, copyright 2001 http://www.math.gatech.edu/~hill/publications/cv.dir/stat-der.pdf

16 Ahbel, copyright 2001 http://www.math.gatech.edu/~hill/publications/cv.dir/stat-der.pdf Ahbel, copyright 2001

17 http://www.math.gatech.edu/~hill/publications/cv.dir/stat-der.pdf Ahbel, copyright 2001

18 http://www.math.gatech.edu/~hill/publications/cv.dir/stat-der.pdf Ahbel, copyright 2001

19 http://www.math.gatech.edu/~hill/publications/cv.dir/stat-der.pdf Ahbel, copyright 2001

20 Probability Distribution?

21 Ahbel, copyright 2001 First Digit Law Second Digit Law General Significant-Digit Law

22 Ahbel, copyright 2001 Tax Fraud Detection True Tax Data - Mark Nigrini’s 169,662 IRS model files Fraudulent Data - 1995 King’s County, New York, Cash disbursement and payroll 743 freshmen’s responses to a request to write down a six-digit number at random

23 Ahbel, copyright 2001 Testing Mathematical Models Suppose past census data follows Benford’s Law closely A model that predicts the same census data should also follow Benford’s Law http://www.math.gatech.edu/~hill/publications/cv.dir/1st-dig.pdf

24 Ahbel, copyright 2001 Computer Design Donald Knuth, Stanford University If the numbers computers process are not uniformly distributed then computers can be re-designed to: Minimize storage space Maximize calculation rate http://www.math.gatech.edu/~hill/publications/cv.dir/1st-dig.pdf

25 Ahbel, copyright 2001 1973 43.5 10 19.4 15,584 17 “Everyday numbers obey a law so unexpected it is hard to believe it's true.” New Scientist - 10 July 1999 39 9 88687 44 1826 52.4 14010 538 1905 1.837 98,817 17 795 20 29 16 36 32,000 28,341 476,140,375 31 26 2.7 467 1.53 55,592 47,901 10,000 1005.2 3,332 6 36 8.70 120 38 4500 3.6 220 63,9000 6427 11.68 276.5 33,283 15,724 1353.42 292.9.0360


Download ppt "First Things First Log Tables, Random Numbers, and Tax Fraud Ahbel, copyright 2001."

Similar presentations


Ads by Google