PROBABILITY AND STATISTICS

Slides:



Advertisements
Similar presentations
Displaying Data Objectives: Students should know the typical graphical displays for the different types of variables. Students should understand how frequency.
Advertisements

2- 1 Chapter Two McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved.
Key Definitions Quantitative Data: One reason for constructing a graph of quantitative data is to examine the distribution - is the data compact, spread.
Introduction to Educational Statistics
B a c kn e x t h o m e Classification of Variables Discrete Numerical Variable A variable that produces a response that comes from a counting process.
Levels of Measurement Nominal measurement Involves assigning numbers to classify characteristics into categories Ordinal measurement Involves sorting objects.
SECTION 12-1 Visual Displays of Data Slide
Chapter 3: Central Tendency
Summarizing Scores With Measures of Central Tendency
With Statistics Workshop with Statistics Workshop FunFunFunFun.
Chapter 13 Statistics © 2008 Pearson Addison-Wesley. All rights reserved.
© 2008 Pearson Addison-Wesley. All rights reserved Chapter 1 Section 13-1 Visual Displays of Data.
12.1 – Visual Displays of Data In statistics: A population includes all of the items of interest. A sample includes some of the items in the population.
Thinking Mathematically
2- 1 Chapter Two McGraw-Hill/Irwin © 2006 The McGraw-Hill Companies, Inc., All Rights Reserved.
Basic Definitions  Statistics Collect Organize Analyze Summarize Interpret  Information - Data Draw conclusions.
Census A survey to collect data on the entire population.   Data The facts and figures collected, analyzed, and summarized for presentation and.
Chapter 3: Central Tendency. Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately.
Chapter 9 Statistics Section 9.1 Frequency Distributions; Measures of Central Tendency.
Statistics Chapter 9. Statistics Statistics, the collection, tabulation, analysis, interpretation, and presentation of numerical data, provide a viable.
Smith/Davis (c) 2005 Prentice Hall Chapter Four Basic Statistical Concepts, Frequency Tables, Graphs, Frequency Distributions, and Measures of Central.
STAT 211 – 019 Dan Piett West Virginia University Lecture 1.
Chapter 2 Describing Data.
STATISTICS. Statistics * Statistics is the area of science that deals with collection, organization, analysis, and interpretation of data. * A collection.
Probability & Statistics
Subbulakshmi Murugappan H/P:
McGraw-Hill/ Irwin © The McGraw-Hill Companies, Inc., 2003 All Rights Reserved. 2-1 Chapter Two Describing Data: Frequency Distributions and Graphic Presentation.
Chapter Eight: Using Statistics to Answer Questions.
By: Asma Al-Oneazi Supervised by… Dr. Amal Fatani.
PROBABILITY AND STATISTICS WEEK 1 Onur Doğan. What is Statistics? Onur Doğan.
Chapter 3: Central Tendency. Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately.
1 ES Chapter 1 & 2 ~ Descriptive Analysis & Presentation of Single-Variable Data Mean: inches Median: inches Range: 42 inches Variance:
 2012 Pearson Education, Inc. Slide Chapter 12 Statistics.
Slide 1 Copyright © 2004 Pearson Education, Inc.  Descriptive Statistics summarize or describe the important characteristics of a known set of population.
How to change bad news to good one
Descriptive Statistics
Basic Statistics Statistics in Engineering (collect, organize, analyze, interpret) Collecting Engineering Data Data Presentation and Summary Types of.
Chapter 12 Statistics 2012 Pearson Education, Inc.
Chapter 12 Statistics.
Chapter(2) Frequency Distributions and Graphs
Measurements Statistics
CHAPTER 12 Statistics.
Chapter 2: Methods for Describing Data Sets
Chapter 2 Frequency Distribution and Graph
ORGANIZING AND GRAPHING DATA
Lesson 8 Introduction to Statistics
Elementary Applied Statistics
CHAPTER 5 Basic Statistics
Chapter 5 STATISTICS (PART 1).
Summarizing Scores With Measures of Central Tendency
Frequency Distributions and Graphs
An Introduction to Statistics
The facts or numbers that describe the results of an experiment.
THE STAGES FOR STATISTICAL THINKING ARE:
10.2 Statistics Part 1.
Paf 203 Data Analysis and Modeling for Public Affairs
Sexual Activity and the Lifespan of Male Fruitflies
LESSON 3: CENTRAL TENDENCY
THE STAGES FOR STATISTICAL THINKING ARE:
CHAPTER 12 Statistics.
The facts or numbers that describe the results of an experiment.
Constructing and Interpreting Visual Displays of Data
Educational statistics
Chapter Nine: Using Statistics to Answer Questions
Experimental Design Experiments Observational Studies
CHAPTER 12 Statistics.
Chapter 3: Central Tendency
CHAPTER 12 Statistics.
Frequency Distribution and Graphs
Presentation transcript:

PROBABILITY AND STATISTICS WEEK 1 Onur Doğan-2016/2017

What is Statistics? The science of collecting, describing, analyzing and interpreting data. Onur Doğan

What is Statistics? Descriptive Statistics: collection, presentation, and description of sample data. Inferential Statistics: making decisions and drawing conclusions about populations. Onur Doğan

Basic Terms Population: A collection, or set, of individuals or objects or events whose properties are to be analyzed. Two kinds of populations: finite or infinite. Sample: A subset of the population. Variable: A characteristic about each individual element of a population or sample. Data (singular): The value of the variable associated with one element of a population or sample. This value may be a number, a word, or a symbol. Data (plural): The set of values collected for the variable from each of the elements belonging to the sample. Experiment: A planned activity whose results yield a set of data. Parameter: A numerical value summarizing all the data of an entire population. Statistic: A numerical value summarizing the sample data. Onur Doğan

Example: A college dean is interested in learning about the average age of faculty. Identify the basic terms in this situation. The population is the age of all faculty members at the college. A sample is any subset of that population. For example, we might select 10 faculty members and determine their age. The variable is the “age” of each faculty member. One data would be the age of a specific faculty member. The data would be the set of values in the sample. The experiment would be the method used to select the ages forming the sample and determining the actual age of each faculty member in the sample. The parameter of interest is the “average” age of all faculty at the college. The statistic is the “average” age for all faculty in the sample.

Onur Doğan

Level of Measurement Nominal Scale Ordinal Scale Interval Scale In this scale, we attempt to sort elements with respect to a certain characteristic, making decisons about which elements are most similar and which most different. Ordinal Scale In this scale we are able to not only to group units into seperate categories but also to order the categories as well. Interval Scale Is it possible to indicate the exact distance between variables. Ratio Scale Zero means absence. Onur Doğan

Level of Measurement Nominal Scale (Classifications, Set memberships, etc.) Birth place, sex, etc Ordinal Scale (Ordinal data) Education level, etc. Interval Scale (Equal distances but no zero point) Temparature, etc. Ratio Scale (Absolute zero) Age, income, etc. Onur Doğan

Data Presentation Basic Presentation Frequency Distributions Relative Frequency Distributions Presentations by Classes Stem and Leaf Display Graphical Presentations Histograms, Frequency Polygons, Pie Charts, etc. Onur Doğan

Example Example: The hemoglobin test, a blood test given to diabetics during their periodic checkups, indicates the level of control of blood sugar during the past two to three months. The data in the table below was obtained for 40 different diabetics at a university clinic that treats diabetic patients: 6.5 5.0 5.6 7.6 4.8 8.0 7.5 7.9 8.0 9.2 6.4 6.0 5.6 6.0 5.7 9.2 8.1 8.0 6.5 6.6 5.0 8.0 6.5 6.1 6.4 6.6 7.2 5.9 4.0 5.7 7.9 6.0 5.6 6.0 6.2 7.7 6.7 7.7 8.2 9.0 1) Construct a grouped frequency distribution using the classes 3.7 - <4.7, 4.7 - <5.7, 5.7 - <6.7, etc. 2) Which class has the highest frequency?

Solutions Class Frequency Relative Cumulative Class 1) Class Frequency Relative Cumulative Class Boundaries f Frequency Rel. Frequency Midpoint, x --------------------------------------------------------------------------------------- 3.7 - <4.7 1 0.025 0.025 4.2 4.7 - <5.7 6 0.150 0.175 5.2 5.7 - <6.7 16 0.400 0.575 6.2 6.7 - <7.7 4 0.100 0.675 7.2 7.7 - <8.7 10 0.250 0.925 8.2 8.7 - <9.7 3 0.075 1.000 9.2 2) The class 5.7 - <6.7 has the highest frequency. The frequency is 16 and the relative frequency is 0.40

Stem & Leaf Display The stem-and-leaf display has become very popular for summarizing numerical data It is a combination of graphing and sorting The actual data is part of the graph Well-suited for computers Stem-and-Leaf Display: Pictures the data of a sample using the actual digits that make up the data values. Each numerical data is divided into two parts: The leading digit(s) becomes the stem, and the trailing digit(s) becomes the leaf. The stems are located along the main axis, and a leaf for each piece of data is located so as to display the distribution of the data.

Example Example: A city police officer, using radar, checked the speed of cars as they were traveling down the main street in town. Construct a stem-and-leaf plot for this data: 41 31 33 35 36 37 39 49 33 19 26 27 24 32 40 39 16 55 38 36 Solution: All the speeds are in the 10s, 20s, 30s, 40s, and 50s. Use the first digit of each speed as the stem and the second digit as the leaf. Draw a vertical line and list the stems, in order to the left of the line. Place each leaf on its stem: place the trailing digit on the right side of the vertical line opposite its corresponding leading digit.

Example --------------------------------------- 1 | 6 9 2 | 4 6 7 1 | 6 9 2 | 4 6 7 3 | 1 2 3 3 5 6 6 7 8 9 9 4 | 0 1 9 5 | 5 ---------------------------------------- The speeds are centered around the 30s Note: The display could be constructed so that only five possible values (instead of ten) could fall in each stem. What would the stems look like? Would there be a difference in appearance?

Histogram Histogram: A bar graph representing a frequency distribution of a quantitative variable. A histogram is made up of the following components: 1. A title, which identifies the population of interest 2. A vertical scale, which identifies the frequencies in the various classes 3. A horizontal scale, which identifies the variable x. Values for the class boundaries or class midpoints may be labeled along the x-axis. Use whichever method of labeling the axis best presents the variable. Notes: The relative frequency is sometimes used on the vertical scale. It is possible to create a histogram based on class midpoints.

Example Age Frequency Class Midpoint Example: A recent survey on people’s age in a spesific village summarized and given in the table below. Construct a histogram for this age data: Age Frequency Class Midpoint ------------------------------------------------------------ 20 up to 30 34 25 30 up to 40 58 35 40 up to 50 76 45 50 up to 60 187 55 60 up to 70 254 65 70 up to 80 241 75 80 up to 90 147 85

Solution 8 5 7 6 4 3 2 1 Frequency Age

Frequency Polygon An example for frequency polygon Onur Doğan

Example Example: The table below lists the number of automobiles sold last week by day for a local dealership. Describe the data using a circle graph and a bar graph: Day Number Sold Monday 15 Tuesday 23 Wednesday 35 Thursday 11 Friday 12 Saturday 42

Circle Graph Solution Automobiles Sold Last Week

Measures of Central Tendency Mean (Arithmetic, Geometric, Squared, etc.) Median Mode Onur Doğan

Mean Onur Doğan

Mean Since frequency shows the value of the occurence of a variable mean formula for frequency distributions becomes; Note: We should write midpoints of the intervals to find the mean of grouped data. Onur Doğan

Median Median: The value of the data that occupies the middle position when the data are ranked in order according to size To find the median: 1. Rank the data 2. Determine the depth of the median: 3. Determine the value of the median Onur Doğan

Median Median formula for grouped data; Onur Doğan

Mode Mode: The mode is the value of x that occurs most frequently Note: If two or more values in a sample are tied for the highest frequency (number of occurrences), there is no mode Onur Doğan

Mode Mode formula for grouped data; Onur Doğan