REAL WORLD DATA Overall Long Beach Data burglaries: Total number of burglaries (2000-2005)

Slides:



Advertisements
Similar presentations
WHAT IS ELINK? Thermoflow, Inc.
Advertisements

Prepared by : Mahmoud A. Abu Hashish  Used to organize and analyze information  Made up of columns and rows  Columns and rows intersect.
A frequency distribution for two variables
Introduction to Statistics
San Jose State University Engineering 101 JKA & KY.
AP Stats Review. Assume that the probability that a baseball player will get a hit in any one at-bat is Give an expression for the probability.
Tinkerplots V Carryn Bellomo
Random Sampling. In the real world, most R.V.’s for practical applications are continuous, and have no generalized formula for f X (x) and F X (x). We.
Introduction to Spreadsheets Microsoft Excel. What is a spreadsheet? Enter data. Analyze data. Make graphs.
 The Law of Large Numbers – Read the preface to Chapter 7 on page 388 and be prepared to summarize the Law of Large Numbers.
Other Sampling Methods
12.3 – Measures of Dispersion
Random Sampling  In the real world, most R.V.’s for practical applications are continuous, and have no generalized formula for f X (x) and F X (x). 
Chapter 2 Presenting Data in Tables and Charts. Note: Sections 2.1 & examining data from 1 numerical variable. Section examining data from.
7 th Grade Chapter 11 Displaying and Analyzing Data Chapter 12 Using Probability.
Grade 8 Math. Minds On Please define histogram:  A graph with bars that show frequencies of data organized into intervals; the intervals line up side.
Enter these data into your calculator!!!
Intro Stats Lesson 1.3 B Objectives: SSBAT classify different ways to collect data. SSBAT distinguish between different sampling techniques. Standards:
Spreadsheets and Microsoft Excel. Introduction n A spreadsheet (called a worksheet in Excel) is a two-dimensional array of cells containing data to be.
6.1 What is Statistics? Definition: Statistics – science of collecting, analyzing, and interpreting data in such a way that the conclusions can be objectively.
Measures of Central Tendency Sixth Grade Mathematics
© A Very Good Teacher th Grade TAKS Review 2008 Objective 5 Day 1.
Quantitative Skills 1: Graphing
Confidence Interval Proportions.
The introduction to SPSS Ⅱ.Tables and Graphs for one variable ---Descriptive Statistics & Graphs.
Business Statistics: Communicating with Numbers By Sanjiv Jaggia and Alison Kelly McGraw-Hill/Irwin Copyright © 2013 by The McGraw-Hill Companies, Inc.
BPS - 5TH ED.CHAPTER 6 1 An important measure of the performance of a locomotive is its "adhesion," which is the locomotive's pulling force as a multiple.
Math Across the Curriculum: Statistics and Probability Paraprofessional Training August 24 th – August 28th.
PKSS Community Survey – Analysis and Conclusions Sep 11 th, 2009.
Supplemental Figure 1A. A small fraction of genes were mapped to >=20 SNPs. Supplemental Figure 1B. The density of distance from the position of an associated.
Measuring Repeat and Near- Repeat Burglary Effects.
TYPES OF STATISTICAL METHODS USED IN PSYCHOLOGY Statistics.
The Central Tendency is the center of the distribution of a data set. You can think of this value as where the middle of a distribution lies. Measure.
Random Sampling Approximations of E(X), p.m.f, and p.d.f.
Working with one variable data. Measures of Central Tendency In statistics, the three most commonly used measures of central tendency are: Mean Median.
Normal distributions The most important continuous probability distribution in the entire filed of statistics is the normal distributions. All normal distributions.
2 pt 3 pt 4 pt 5pt 1 pt 2 pt 3 pt 4 pt 5 pt 1 pt 2pt 3 pt 4pt 5 pt 1pt 2pt 3 pt 4 pt 5 pt 1 pt 2 pt 3 pt 4pt 5 pt 1pt Bar Graphs Circle Graphs Line Plots.
Data Analysis. I. Mean, Median, Mode A. central tendency is a value that describes a A. central tendency is a value that describes a data set. data set.
MnSGC Ballooning Team Techniques: APRS tracking-data processing James Flaten Summer 2010.
Statistics in IB Biology Error bars, standard deviation, t-test and more.
Barnett/Ziegler/Byleen Finite Mathematics 11e1 Chapter 11 Review Important Terms, Symbols, Concepts Sect Graphing Data Bar graphs, broken-line graphs,
Section 8.3 ~ Estimating Population Proportions Introduction to Probability and Statistics Ms. Young.
Bar Graph. A single bar graph uses the same color or shade of bar to compare amounts, such as number of students per class STUDENTS PER CLASS.
Statistics. Our objective today: Learn about statistics and why they are important Explore how we can gain information about a population by examining.
Introduction to Statistics Chapter 1. § 1.1 An Overview of Statistics.
Copyright © 2014, 2010, and 2006 Pearson Education, Inc. Chapter 3 Introduction to Graphing.
LEARNING TARGET Tuesday 5/27 – Complete a Frequency Chart.
Section 1.3 Each arrangement (ordering) of n distinguishable objects is called a permutation, and the number of permutations of n distinguishable objects.
FTCE 5-9 Test Prep Center for Teaching and Learning.
Probability and Statistics 12/11/2015. Statistics Review/ Excel: Objectives Be able to find the mean, median, mode and standard deviation for a set of.
USING GRAPHING SKILLS. Axis While drawing graphs, we have two axis. X-axis: for consistent variables Y-axis: for other variable.
1-8 Scatter Plots Course 2 Warm Up Warm Up Problem of the Day Problem of the Day Lesson Presentation Lesson Presentation.
Chapter 14 Statistics and Data Analysis. Data Analysis Chart Types Frequency Distribution.
Chapter 8 Descriptive Statistics. Dot Plots Dot plot:Dot plot: Horizontal axis represents the data values. Horizontal axis represents the data values.
Ms. Drake 7th grade Math Measures of Central Tendency Lesson 2 Mean, Median, Mode and Range.
Statistics Vocabulary. 1. STATISTICS Definition The study of collecting, organizing, and interpreting data Example Statistics are used to determine car.
Cell Diameters and Normal Distribution. Frequency Distributions a frequency distribution is an arrangement of the values that one or more variables take.
Descriptive Statistics
AP Biology: Normal Distribution
The Structure of Common Genetic Variation in United States Populations
Measuring Repeat and Near-Repeat Burglary Effects
Collecting & Displaying Data
Representing Quantitative Data
Random Variables Binomial Distributions
Sampling.
Vincent B. McGinty, Antonio Rangel, William T. Newsome  Neuron 
Chapter 13 - Confidence Intervals - The Basics
Part I Review Highlights, Chap 1, 2
Samples and Populations
Presentation transcript:

REAL WORLD DATA Overall Long Beach Data burglaries: Total number of burglaries ( )

Multiple Robberies distinguished by color

LEGEND Blue – 1 Cyan – 2 Red – 3 Yellow – 4 Long Beach Burglaries from the Year 2000, Number of Times Victimized

AVERAGE FREQUENCY OF ROBBERIES The following Excel file shows Frequency of the houses burgled with respect to weeks between robberiesExcel file

OVERALL DATA ANALYSIS The following Frequency graph plotted using the average data More burglaries occur in rapid succession rather than long intervals.

AVERAGE NUMBER OF ROBBERIES The following Excel File shows the Average number of robberies for all five years.Excel File

AVERAGE NUMBER OF ROBBERIES PLOT

MY MODEL  Virtual robbers are placed on a line of length L meant to represent houses  robber can do 4 things at each step, essentially: stay put where he/she is, move left or right, or rob the house where he/she is at  The probability of moving to a neighboring location is calculated based on the “attractiveness” of the neighbor houses as follows:  After all the robbers have robbed and moved, the houses will update their b values according to this formula:

COMPARISON OF MY MODEL AND REAL WORLD DATA Robbers=800;Blue  Model Houses=13000;Black  Data Time=One year; η=.5 =0.5; =.01; b0=.01

COMPARISON OF MY MODEL AND REAL WORLD DATA Number of Robberies per site for the same parameters as before Binning number time robbed: 1  2046, 2  296, 3  75, 4  19, 5  2, 6  1

What is a Hot Spot?

Hot Spot Definition 1: Areas with high percentages of multiple burglaries

Hot Spot 1  100x100 grid  Each cell is approximately 485x500 ft  Each cell represents the number of multiple burglaries divided by the total number of burglaries

Hot Spot Definition 2: Clusters of burglaries where one burglary is within 500 ft of another

Clustering Algorithm

1.Chooses a random point 2.Finds all points within 500 ft

Clustering Algorithm

1.Chooses a random point NOT already within a cluster 2.Finds all points within 500 ft

Clustering Algorithm

Eventually, all points are in clustered in some groups

Clustering Algorithm Re-checks that points within 500 ft are in the same cluster

Clustering Algorithm Some different clusters might be within 500 ft of each other

Clustering Algorithm Clusters within 500 ft of each other are combined into one cluster

Total of 521 Clusters, top 10 are displayed The biggest cluster consisted of 76 housing units 500 ft is about the size of a block Are these hot spots? Long Beach Burglary Clusters for the year 2000

Apologies….

Possible Correlations?

Possible Correlations: 2000: Total Housing Units

Possible Correlations: 2000: Total People

Possible Correlations: 2000: Mean Earnings Based on sampling

Possible Correlations: 2000: Median Age

Possible Correlations: 2000: Percentage of Those 65 and Older

Possible Uncorrelations: 2000: Percentage of Households with 1 Person

Possible Correlations: 2000: Percentage of those with at most a 9 th grade education level Based on Sampling

Possible Correlations: 2000: Percentage of those 25 years or older with at least a Bachelors Based on Sampling

Possible Correlations: 2000: Race Percentage of those who are …. ?

Possible Correlations: 2000: Race Percentage of those who are Caucasian

Possible Correlations: 2000: Race Percentage of those who are African-American

Possible Correlations: 2000: Race Percentage of those who are Asian American

Possible Correlations: 2000: Race Percentage of those who are Hispanic/Latino