Slide 1 Basis for Pattern Detection Analytical review Isolate the “significant few” Detection of errors Quantified approach Objective 2.

Slides:



Advertisements
Similar presentations
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the.
Advertisements

Anita M. Baker, Ed.D. Jamie Bassell Evaluation Services Program Evaluation Essentials Evaluation Support 2.0 Session 2 Bruner Foundation Rochester, New.
© Tan,Steinbach, Kumar Introduction to Data Mining 8/05/ Data Mining: Exploring Data Lecture Notes for Chapter 3 Introduction to Data Mining by Tan,
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Lecture Slides Elementary Statistics Eleventh Edition and the Triola.
Joint meeting of RDU IIA and ISACA November 11, 2008, Capitol Club, Raleigh, North Carolina Joint meeting of the RDU IIA and ISACA chapters November 11,
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Created by Tom Wegleitner, Centreville, Virginia Section 2-1.
Chapter 2 Presenting Data in Tables and Charts
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 2-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Managing the business risk of fraud using sampling and data mining EZ-R Stats, LLC Managing the Business Risk of Fraud using Sampling and Data Mining Fall.
Chapter 2 Graphs, Charts, and Tables – Describing Your Data
Chapter 2 Describing Data Sets
Crosstabs and Chi Squares Computer Applications in Psychology.
SADC Course in Statistics Producing Good Tables In Excel Module B2 Sessions 4 & 5.
2008 FRAUD FROUM MAY 14-16, 2008 / HYATT REGENCY COCONUT POINT RESORT & SPA / BONITA SPRINGS, FL, USA Detection ofFraud Detection of Fraud Patterns Using.
1 Excursions in Modern Mathematics Sixth Edition Peter Tannenbaum.
Data Mining Journal Entries for Fraud Detection: A Pilot Study by Roger S. Debreceny & Glen L. Gray Discussed by Severin Grabski.
1 1 Slide © 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
© EZ-R Stats, LLC Duplicate Payments Slide 1 Auditing for Duplicate Payments A better way … Presentation of
Lecture 2 Graphs, Charts, and Tables Describing Your Data
STAT 211 – 019 Dan Piett West Virginia University Lecture 1.
Chapter 8 Audit Sampling: An Overview and Application to Tests of Controls McGraw-Hill/IrwinCopyright © 2012 by The McGraw-Hill Companies, Inc. All rights.
Audit Sampling: An Overview and Application to Tests of Controls
BENFORDSlide Number 1 Using Benford on Expense Reports ACL Users Group Wednesday, June 17, 2009 Richmond, Virginia Charles R. Gauntt.
© EZ-R Stats, LLC Duplicate Payments Slide 1 Auditing for Duplicate Payments A better way … Web CAAT.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 2-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
 Statistics The Baaaasics. “For most biologists, statistics is just a useful tool, like a microscope, and knowing the detailed mathematical basis of.
Analyze Improve Define Measure Control L EAN S IX S IGMA L EAN S IX S IGMA Chi-Square Analysis Chi-Square Analysis Chi-Square Training for Attribute Data.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 2 Descriptive Statistics: Tabular and Graphical Methods.
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 2-1 Chapter 2 Presenting Data in Tables and Charts Statistics For Managers 4 th.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
Chap 2-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course in Business Statistics 4 th Edition Chapter 2 Graphs, Charts, and Tables.
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Statistical Testing of Differences CHAPTER fifteen.
Essential Statistics Chapter 161 Review Part III_A_Chi Z-procedure Vs t-procedure.
Testing hypothesis with two proportions and Chi-square testing.
1 Data Mining: Concepts and Techniques (3 rd ed.) — Chapter 12 — Jiawei Han, Micheline Kamber, and Jian Pei University of Illinois at Urbana-Champaign.
Copyright © 2006 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin 8-1 Chapter Eight Audit Sampling: An Overview and Application.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Chi Square Test for Goodness of Fit Determining if our sample fits the way it should be.
Slide 1 Copyright © 2004 Pearson Education, Inc. Chapter 11 Multinomial Experiments and Contingency Tables 11-1 Overview 11-2 Multinomial Experiments:
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
Class Seven Turn In: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 For Class Eight: Chapter 20: 18, 20, 24 Chapter 22: 34, 36 Read Chapters 23 &
11/12 9. Inference for Two-Way Tables. Cocaine addiction Cocaine produces short-term feelings of physical and mental well being. To maintain the effect,
Bivariate Association. Introduction This chapter is about measures of association This chapter is about measures of association These are designed to.
Chi Square Chi square is employed to test the difference between an actual sample and another hypothetical or previously established distribution such.
Warm up On slide.
Chi-square test.
Read page 41 Do the Academy Awards involve discrimination based on age? What do you think?
Audit Sampling: An Overview and Application
Lecture Slides Essentials of Statistics 5th Edition
Chapter 2 Summarizing and Graphing Data
Objectives (BPS chapter 23)
Exploring, Displaying, and Examining Data
Vocabulary Statistical Inference – provides methods for drawing conclusions about a population parameter from sample data Expected Values– row total *
Introduction The two-sample z procedures of Chapter 10 allow us to compare the proportions of successes in two populations or for two treatments. What.
Chapter 11 Chi-Square Tests.
i) Two way ANOVA without replication
Section 2.1 Review and Preview.
Lecture #27 Tuesday, November 29, 2016 Textbook: 15.1
Chi-Square Test Dr Kishor Bhanushali.
Lecture Slides Elementary Statistics Tenth Edition
Two Categorical Variables: The Chi-Square Test
Chapter 2 Summarizing and Graphing Data
Overview and Chi-Square
Chapter 11 Chi-Square Tests.
Chapter 13 Group Differences
Good Morning AP Stat! Day #2
Inference for Two Way Tables
Chapter 11 Chi-Square Tests.
MATH 2311 Section 8.5.
Presentation transcript:

Slide 1 Basis for Pattern Detection Analytical review Isolate the “significant few” Detection of errors Quantified approach Objective 2

Slide 2 Understanding the Basis Quantified Approach Population vs. Groups Measuring the Difference Stat 101 – Counts, Totals, Chi Square and K-S The metrics used Objective 2

Slide 3 Quantified Approach Based on measureable differences Population vs. Group “Shotgun” technique Objective 2a

Slide 4 Detection of Fraud Characteristics Something is different than expected Objective 2a

Slide 5 Fraud patterns Common theme – “something is different” Groups Group pattern is different than overall population Objective 2b

Slide 6 Measurement Basis Transaction counts Transaction amounts Objective 2c

Slide 7 A few words about statistics Detailed knowledge of statistics not necessary Software packages do the “number-crunching” Statistics used only to highlight potential errors/frauds Not used for quantification Objective 2d

Slide 8 How is digital analysis done? Comparison of group with population as a whole Can be based on either counts or amounts Difference is measured Groups can then be ranked using a selected measure High difference = possible error/fraud Objective 2d

Slide 9 Histograms Attributes tallied and categorized into “bins” Counts or sums of amounts Objective 2d

Slide 10 Two histograms obtained Population and group Objective 2d

Slide 11 Compute Cumulative Amount for each Objective 2d

Slide 12 Are the histograms different? Two statistical measures of difference Chi Squared (counts) K-S (distribution) Both yield a difference metric Objective 2d

Slide 13 Chi Squared Classic test on data in a table Answers the question – are the rows/columns different Some limitations on when it can be applied Objective 2d

Slide 14 Chi Squared Table of Counts Degrees of Freedom Chi Squared Value P-statistic Computationally intensive Objective 2d

Slide 15 Kolmogorov-Smirnov Two Russian mathematicians Comparison of distributions Metric is the “d-statistic” Objective 2d

Slide 16 How is K-S test done? Four step process 1.For each cluster element determine percentage 2.Then calculate cumulative percentage 3.Compare the differences in cumulative percentages 4.Identify the largest difference Objective 2d

Slide 17 Classification by metrics Stratification Day of week Happens on holiday Round numbers Variability Benford’s Law Trend lines Relationships (market basket) Gaps Duplicates Objective 2e

Slide 18 Kolmogorov-Smirnov Objective 2d - KS

Slide 19 Auditor’s “Top 10” Metrics 1.Outliers / Variability 2.Stratification 3.Day of Week 4.Round Numbers 5.Made Up Numbers 6.Market basket 7.Trends 8.Gaps 9.Duplicates 10.Dates Objective e

Slide 20 Understanding the Basis Quantified Approach Population vs. Groups Measuring the Difference Stat 101 – Counts, Totals, Chi Square and K-S The metrics used Objective 2

Slide 21 Objective 2 - Summarized 1.Understand why and how 2.Understand statistical basis for quantifying differences 3.Identify ten general tools and techniques 4.Understand examples done using Excel 5.How pattern detection fits in Next are the metrics …