Statistics and Data

4 major parts of STATISTICS
[Diagram: Population and Sample (Data), connected by sampling, probability, statistics and inference. Chapter labels: Ch 2, Ch 4, Ch 6; Ch 5, Ch 9, Ch 21; Ch 7, Ch 8; Ch 3, Ch 23; Ch 10-16]
We will cover ch. 1-15, 16, 21 and 23 of SVV.

Sample & Population
- population: the whole group of objects we want to know about.
- sample: the part of the population that we observe (or observed).

data: (a set of) numbers and symbols that record a fact about reality. A sample is also represented as data.
The word DATA is the plural form of DATUM, although by convention people often treat DATA as singular. To avoid the grammatical confusion, the word DATASET is often used as a singular form of DATA; others use DATASET to mean a group of DATA.
Data obtained by observing (a part of) a population are a sample.

- rows: observations (records, cases)
- columns: variables (attributes)
In a data table, each column is a variable and each row corresponds to an observation. Database technicians use the terms attribute and record instead of variable and observation. A data table is data shown in tabular form.
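The row and column reading of a data table can be sketched in Python as follows. The variable names and values here are hypothetical and serve only to illustrate the terminology.

```python
# Hypothetical data table: each dict is one observation (row, record, case),
# and each key is one variable (column, attribute).
table = [
    {"id": 1, "sex": "male", "weight_kg": 60.0},
    {"id": 2, "sex": "female", "weight_kg": 52.5},
    {"id": 3, "sex": "female", "weight_kg": 48.0},
]


def column(rows, variable):
    """Return the values of one variable, one value per observation."""
    return [row[variable] for row in rows]


n_observations = len(table)        # number of rows
variables = list(table[0].keys())  # names of the columns
weights = column(table, "weight_kg")

print(n_observations, variables, weights)
```

Reading along a row gives everything recorded for one case; reading down a column gives one variable across all cases.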

A database (DB) is a collection of closely related data, managed by Database Management System (DBMS) software. A relational DB uses a tabular representation of data.

Data and Statistics
Statistics is a way to read data and interpret the story the data tell. To understand the story, it is important to delve into the CONTEXT of the data as well as the dataset itself.

Context of data
Whatever the data tell us, whether it is represented explicitly or only implicitly:
- Meaning of the variables and values of the data
- Who collected the data, and when
- How & Why ….
- All the history & background of the data

Types of variables
- identifier
- quantitative variable
- categorical variable (qualitative variable)
categorical or quantitative: decided by the scale of measurement

Categorical variable
- nominal scale: sex - male (1), female (2)
- ordinal (ordered) scale: grade of score - A, B, C, D, F
Quantitative variable
- interval scale: year of birth - BC300, AD
- ratio scale: price, weight, ….

Interval scale or Ratio scale
Is the quotient meaningful? Then it is a ratio scale. Test with the cases of -30kg, -50Won, BC1000, -10 °C.
- 60kg is 2 times heavier than 30kg.
- AD2000 is 2 times older than AD1000.
- 20 °C is 2 times warmer than 10 °C.
- 100Won is 2 times more than 50Won.
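One way to make the quotient test concrete is to check whether a ratio survives a change of unit. The sketch below is only an illustration and not part of the original slides: it compares weight (kg converted to g) with temperature (°C converted to °F), using standard unit conversions and helper names chosen for this example.

```python
def kg_to_gram(kg):
    """Convert kilograms to grams (pure rescaling, the zero point stays at zero)."""
    return kg * 1000


def celsius_to_fahrenheit(c):
    """Convert Celsius to Fahrenheit (rescaling plus a shift of the zero point)."""
    return c * 9 / 5 + 32


# Weight: the quotient is the same in either unit, so "2 times heavier" is meaningful.
weight_ratio_kg = 60 / 30
weight_ratio_g = kg_to_gram(60) / kg_to_gram(30)

# Temperature: the quotient changes with the unit, so "2 times warmer" is not meaningful.
temp_ratio_c = 20 / 10
temp_ratio_f = celsius_to_fahrenheit(20) / celsius_to_fahrenheit(10)

print(weight_ratio_kg, weight_ratio_g)  # both 2.0
print(temp_ratio_c, temp_ratio_f)       # 2.0 versus 1.36
```

A quantity whose quotient depends on the arbitrary choice of zero point, such as degrees Celsius or calendar years, is on an interval scale rather than a ratio scale.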

Categorical or Quantitative
eg) number of phone calls

Customer   Daily call   Yearly call
Clinton    2            954
Ford       1            450
David      0            320
Gates      2            795

- Daily call: rather likely to be categorical
- Yearly call: rather likely to be quantitative

Cross-Sectional Data, Longitudinal Data and Time Series
When several variables are all measured at the same time point, the data are called cross-sectional data. For example, sales revenue, number of customers, and expenses for the last month of business.
Variables measured at several time points are called longitudinal data. For example, the number of earthquake victims.
Longitudinal data measured at many regularly spaced time points are called a time series. For example, monthly recorded discount rates of US Treasury bonds.

How to get data

               No Treatment                                        Treatment
Non-designed   Sampling by Voluntary Response, Convenience, etc.   Observational Study
Designed       Sampling (Chapter 3)                                Experiment (Chapter 23)

Using data collected from a computer DB is basically considered a non-designed study, for example using customer data from a DB. Data mining, dealt with in chapter 24, analyzes data collected from a DB.

Sampling & Experiment
If patients are selected by design and their blood pressures are measured without any treatment, that is data collection by sampling. If treatments are given before the blood pressures are measured, e.g. dosing different types of medicine in order to compare them, that is data collection by experiment.

Designed Study & Non-designed Study
A designed study aims to generalize the results obtained from the data to an assumed population. For example, the result of a well-performed political poll for an election can be interpreted as the opinion of all voters. To generalize, the sample (or the experimental units) must be selected according to a well-designed plan.
A non-designed study does not aim to generalize the results obtained from the data to a population; it aims only at tentative results. For example, a poll run on an internet website cannot be generalized to all citizens.

Sampling schemes
- cluster sampling
- systematic sampling
- simple random sampling (SRS)
- stratified sampling

Assume we want to measure the math ability of the 5th-grade students of a school. There are 300 students, divided into 10 classes. Examples of the sampling schemes, each selecting 60 students, are as follows.
- Simple random sampling (SRS): after numbering the students from 1 to 300, select 60 students using a random number generator (see ...).
- Stratified sampling: select 6 students by SRS from each class.
- Cluster sampling: select 2 classes, and measure all students in those classes.
- Systematic sampling: select the 60 students whose numbers are multiples of 5.
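The four schemes for this school can be sketched in Python as below. This is a minimal illustration under stated assumptions, not a prescribed procedure: it assumes the 10 classes have 30 students each, that class k holds student numbers 30(k-1)+1 to 30k, and it uses a fixed seed so the run is repeatable. All names are hypothetical.

```python
import random

N_STUDENTS, N_CLASSES, SAMPLE_SIZE = 300, 10, 60
CLASS_SIZE = N_STUDENTS // N_CLASSES  # assumption: 30 students in every class

# Assumption: students are numbered 1..300 and class k holds a consecutive block.
students = list(range(1, N_STUDENTS + 1))
classes = {
    k: students[(k - 1) * CLASS_SIZE : k * CLASS_SIZE]
    for k in range(1, N_CLASSES + 1)
}

rng = random.Random(0)  # fixed seed, only to make the example repeatable


def simple_random_sample():
    """SRS: draw 60 of the 300 numbers without replacement."""
    return sorted(rng.sample(students, SAMPLE_SIZE))


def stratified_sample():
    """Stratified: draw 6 students by SRS within each of the 10 classes."""
    per_class = SAMPLE_SIZE // N_CLASSES
    return sorted(s for members in classes.values()
                  for s in rng.sample(members, per_class))


def cluster_sample():
    """Cluster: draw 2 whole classes and take every student in them."""
    chosen = rng.sample(sorted(classes), 2)
    return sorted(s for k in chosen for s in classes[k])


def systematic_sample():
    """Systematic: take every 5th number, as in the slide (no random start)."""
    return [s for s in students if s % 5 == 0]


srs = simple_random_sample()
strat = stratified_sample()
clus = cluster_sample()
syst = systematic_sample()
print(len(srs), len(strat), len(clus), len(syst))  # each scheme selects 60 students
```

Textbook systematic sampling usually starts from a randomly chosen position; the version here follows the slide and takes the multiples of 5 directly.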

Which sampling scheme will give the most precise result if the school has streamed the students into classes by math ability beforehand? Which sampling scheme is the most convenient for measuring the math ability of the 5th-grade students of the school?

Voluntary Response Sampling and Convenience Sampling
In a voluntary response sample, a large group of individuals is invited to respond, and all who do respond are counted. Voluntary response samples are almost always biased, and so conclusions drawn from them are almost always wrong.
In convenience sampling we simply include the individuals who are convenient. Unfortunately, this group may not be representative of the population.
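A small simulation can illustrate why a voluntary response sample tends to be biased. Everything in this sketch is hypothetical: the population size, the 30% share of people who agree with some statement, and the assumption that people who agree are five times as likely to respond.

```python
import random

rng = random.Random(1)  # fixed seed, only to make the example repeatable

# Hypothetical population: 10,000 people, 30% of whom agree (1) with a statement.
population = [1] * 3000 + [0] * 7000
true_share = sum(population) / len(population)

# Assumed response behaviour: those who agree respond far more often.
RESPONSE_PROB = {1: 0.5, 0: 0.1}
volunteers = [x for x in population if rng.random() < RESPONSE_PROB[x]]
voluntary_share = sum(volunteers) / len(volunteers)

# For comparison: a simple random sample of 500 people from the same population.
srs = rng.sample(population, 500)
srs_share = sum(srs) / len(srs)

print(f"true {true_share:.2f}, voluntary {voluntary_share:.2f}, SRS {srs_share:.2f}")
```

Under these assumptions the voluntary sample is larger than the random sample, yet its estimate lies much further from the true share, because who responds depends on the opinion being measured.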

Thank you !!