Elementary Statistical Concepts


Stats 244.3 Elementary Statistical Concepts

Instructor: W.H. Laverty
Office: 235 McLean Hall
Phone: 966-6096
Lectures: M Tu W Th F 10:30am - 11:50am, Arts 200
Lab: Tu W Th 12:00 - 12:50, Arts 200
Evaluation: Assignments, Labs, Term tests - 40% (a term test each Friday); Final Examination - 60%

Text: Moore, The Basic Practice of Statistics. I will provide lecture notes (PowerPoint slides) and tables. The assignments will not come from the textbook, so purchasing the text is optional.

Course Outline

Introduction (Chapter 1): Populations, samples; Variables; Data Collection

Data Presentation, Exploratory Statistics (Chapters 2, 3 and 4): Organizing and displaying data; Numerical measures of Central Tendency and Variability; Describing Bivariate Data

Probability Theory (Chapters 9, 10, 11 and 12): Concepts of Probability; Random variables and their distributions; Binomial distribution, Normal distribution

Inferential Statistics (Chapters 13 - 23): Estimation, Hypothesis testing; Comparing Samples; Analyzing count data, Contingency Tables; Regression and Correlation; Multiple Regression

Introduction

The circular process of research:
1. Questions arise about a phenomenon.
2. A decision is made to collect data.
3. A decision is made as to how to collect the data.
4. The data is collected.
5. The data is summarized and analyzed.
6. Conclusions are drawn from the analysis.

What is Statistics? It is the major mathematical tool of scientific inference (research): the art of drawing conclusions from data, where the data is to some extent corrupted by a component of random variation (random noise).

Random variation (or random noise) can be defined as the variation in the data that is not accounted for by the factors considered in the analysis.

Example: Suppose we are collecting data on Blood Pressure, Height, Weight and Age.

Suppose we are interested in how Blood Pressure is influenced by the following factors: Height, Weight and Age.

Blood Pressure will not be perfectly predictable from Height, Weight and Age. There will be departures (random variation) from a perfect prediction because of other factors that could affect Blood Pressure (diet, exercise, hereditary factors).
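The idea of departures from a perfect prediction can be sketched with a small simulation. This is only an illustration: the coefficients and the size of the noise below are made-up assumptions, not values from the course, and the function name simulate_blood_pressure is hypothetical.

```python
import random

def simulate_blood_pressure(height_cm, weight_kg, age_years, rng):
    """Return (predicted, observed) blood pressure for one made-up case."""
    # Systematic part: what Height, Weight and Age account for.
    # The coefficients are hypothetical, chosen only for illustration.
    predicted = 70.0 + 0.1 * height_cm + 0.3 * weight_kg + 0.4 * age_years
    # Random variation: everything not accounted for by the three factors
    # (diet, exercise, hereditary factors, ...).
    noise = rng.gauss(0.0, 8.0)
    return predicted, predicted + noise

rng = random.Random(244)  # fixed seed so the run is repeatable
for _ in range(3):
    predicted, observed = simulate_blood_pressure(170, 70, 40, rng)
    print(f"predicted {predicted:.1f}, observed {observed:.1f}, "
          f"departure {observed - predicted:+.1f}")
```

Three cases with identical Height, Weight and Age get the same prediction but different observed values; the differences are the random variation.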

Another Example: In this example we are interested in the use of antidepressants, mood stabilizing medication, anxiety medication, stimulants and sleeping pills. The data were collected for n = 16383 cases.

In addition we are interested in how the use of these medications is affected by:
Age: 20-29, 30-39, 40-49, 50-59, 60-69, 70+
Gender: male, female
Education: < Secondary, Secondary Grad., some Post-Sec., Post-Sec. Grad.
Income: Low, Low Mid, Up Mid, High
Role: parent, partner, worker; parent, worker; partner, worker; worker only; parent only; partner only; no roles

Some questions of interest:
How are the dependent variables (antidepressant use, mood stabilizing medication use, anxiety medication use, stimulant use, sleeping pill use) interrelated?
How are the dependent variables (drug use) related to the independent variables (age, gender, income, education and role)?
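One simple way to start looking at such a relationship is to count cases in a two-way table. The sketch below uses six made-up records, not the course data file; the variable names and values are assumptions for illustration only.

```python
from collections import Counter

# Made-up records, NOT the course data: (gender, uses sleeping pills)
records = [
    ("male", "no"), ("female", "yes"), ("female", "no"),
    ("male", "no"), ("female", "yes"), ("male", "yes"),
]

# Count how many cases fall into each (gender, answer) combination.
table = Counter(records)

print("gender  yes  no")
for gender in ("male", "female"):
    print(gender, table[(gender, "yes")], table[(gender, "no")])
```

Counts like these only describe the cases observed; whether a pattern reflects more than random variation is a question for the inferential methods later in the course.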

Again the relationships will not be perfect, because of the effects of other factors (variables) that have not been considered in the experiment. If the data is recollected, the patterns observed at the second collection will not be exactly the same as those observed at the first collection.

The data appears in the following Excel file drug data.xls

In Statistics: Questions -> Data -> Answers
Questions: About some scientific, sociological, medical or economic phenomena.
Data: The purpose of the data is to find answers to the questions.
Answers: Because of the random variation in the data (the noise), conclusions based on the data will be subject to error.

The circular process of research: In what part of this process does statistics play a role?
[Diagram: the cycle of six research steps listed earlier, annotated with the labels "Experimental Design" and "Statistics" (twice)]

Statistical Theory is interested in:
1. The design of the data collection procedures (experimental designs, survey designs). The experiment can be totally lost if it is not designed correctly.
2. The techniques for analyzing the data.

In any statistical analysis it is important to assess the magnitude of the error made by the conclusions of the analysis.

Consider the following statement: You can prove anything with Statistics.

In fact: One is unable to “prove” anything with Statistics.

At the end of any statistical analysis there always is a possibility of an error in any of the decisions that it makes.

The success of a research project does not depend on its conclusions. The success of a research project depends on the accuracy of its conclusions.

If one is testing the effectiveness of a drug, there are two possible conclusions:
1. The drug is effective.
2. The drug is not effective.

The success of this project does not depend on its conclusions. The success depends on the accuracy of its conclusions.

For this reason: It is extremely important in any study to assess the accuracy of its conclusions

Some definitions important to Statistics

A population: the complete collection of subjects (objects) that are of interest in the study. There may be (and frequently is) more than one population, in which case a major objective is comparison.

A case (elementary sampling unit): This is an individual unit (subject) of the population.

A variable: a measurement or type of measurement that is made on each individual case in the population.

Types of variables: Some variables may be measured on a numerical scale while others are measured on a categorical scale. The nature of the variables has a great influence on which analysis will be used.

For variables measured on a numerical scale the measurements will be numbers. Ex: Age, Weight, Systolic Blood Pressure
For variables measured on a categorical scale the measurements will be categories. Ex: Sex, Religion, Heart Disease

Note: Sometimes variables can be measured on both a numerical scale and a categorical scale. In fact, variables measured on a numerical scale can always be converted to measurements on a categorical scale.
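Converting a numerical measurement to a categorical one amounts to grouping values into intervals. A minimal sketch, assuming the age groups from the medication example (20-29 up to 70+); the function name age_group and the "<20" catch-all category are additions for illustration.

```python
def age_group(age):
    """Map a numerical age in years to a categorical age group."""
    if age < 20:
        return "<20"   # not one of the listed groups; added as a catch-all
    if age >= 70:
        return "70+"
    lower = int(age) // 10 * 10   # 34 -> 30, 59 -> 50
    return f"{lower}-{lower + 9}"

ages = [23, 34, 59, 60, 71]
print([age_group(a) for a in ages])
```

The conversion goes one way only: once an age of 34 has been recorded as 30-39, the original number cannot be recovered from the category.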

Example: The following variables were evaluated for a study of individuals receiving head injuries in Saskatchewan.
Cause of the injury (categorical): motor vehicle accident, fall, violence, other
Time of year (date) (numerical or categorical): summer, fall, winter, spring
Sex of injured individual (categorical): male, female
Age (numerical or categorical): < 10, 10-19, 20-29, 30-49, 50-65, 65+
Mortality (categorical): died from injury, alive

Types of variables: In addition, some variables are labeled as dependent variables and some variables are labeled as independent variables.

This usually depends on the objectives of the analysis. Dependent variables are output or response variables while the independent variables are the input variables or factors.

Usually one is interested in determining equations that describe how the dependent variables are affected by the independent variables

Example: Suppose we are collecting data on Blood Pressure, Height, Weight and Age.

Suppose we are interested in how Blood Pressure is influenced by the following factors: Height, Weight and Age.

Then Blood Pressure is the dependent variable, and Height, Weight and Age are the independent variables.

Example – Head Injury study: Suppose we are interested in how Mortality is influenced by the following factors: Cause of head injury, Time of year, Sex and Age.

Then Mortality is the dependent variable, and Cause of head injury, Time of year, Sex and Age are the independent variables.

Dependent variable = response variable. Independent variable = predictor variable.

A sample: a subset of the population.

In statistics: One draws conclusions about the population based on data collected from a sample

Reasons:
1. Cost: It is less costly to collect data from a sample than from the entire population.
2. Accuracy: Data from a sample sometimes leads to more accurate conclusions than data from the entire population. Costs saved from using a sample can be directed to obtaining more accurate observations on each case in the sample.

Types of Samples: Different types of samples are determined by how the sample is selected.

Convenience Samples: In a convenience sample the subjects that are most convenient to the researcher are selected as objects in the sample. This is not a very good procedure for inferential statistical analysis but is useful for exploratory preliminary work.

Quota Samples: In quota samples subjects are chosen conveniently until quotas are met for different subgroups of the population. This also is useful for exploratory preliminary work.

Random Samples: Random samples of a given size are selected in such a way that all possible samples of that size have the same probability of being selected.
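The definition can be made concrete with a tiny population. The sketch below assumes a made-up population of five cases labeled A to E and samples of size 2; it lists every possible sample and then draws one with Python's random.sample, which selects without replacement.

```python
import itertools
import random

population = ["A", "B", "C", "D", "E"]   # hypothetical cases
n = 2                                    # sample size

# Every possible sample of size n (the order of selection is ignored).
all_samples = list(itertools.combinations(population, n))
print(len(all_samples), "possible samples, each with probability",
      1 / len(all_samples))

# Draw one random sample: n distinct cases, chosen without replacement.
rng = random.Random(0)   # fixed seed so the run is repeatable
drawn = rng.sample(population, n)
print("selected sample:", sorted(drawn))
```

With 5 cases and samples of size 2 there are 10 possible samples, so under random sampling each one is selected with probability 1/10.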

Convenience samples and quota samples are useful for preliminary studies. It is, however, difficult to assess the accuracy of estimates based on this type of sampling scheme. Sometimes, however, one has to be satisfied with a convenience sample and assume that it is equivalent to a random sampling procedure.

[Diagram: Population, Case, Sample, and Variables X, Y, Z]

Some other definitions

A population statistic (parameter): Any quantity computed from the values of variables for the entire population.

A sample statistic: Any quantity computed from the values of variables for the cases in the sample.

Since only cases from the sample are observed, only sample statistics are computed. These are used to make inferences about population statistics. It is important to be able to assess the accuracy of these inferences.
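The distinction between a population statistic and a sample statistic can be sketched as follows. The population here is simulated (1000 made-up ages), which is an assumption for illustration; in a real study the population value would not be available.

```python
import random
import statistics

rng = random.Random(244)  # fixed seed so the run is repeatable

# Hypothetical population: the ages of 1000 cases (made-up values).
population = [rng.randint(20, 79) for _ in range(1000)]

# Population statistic (parameter): computed from every case.
population_mean = statistics.mean(population)

# Sample statistics: each random sample of 50 cases gives a somewhat
# different value, which is why the accuracy of an inference must be assessed.
sample_means = [statistics.mean(rng.sample(population, 50)) for _ in range(3)]

print("population mean:", population_mean)
print("sample means:", sample_means)
```

Each sample mean is an estimate of the population mean; how far such estimates typically fall from the true value is what the inferential part of the course sets out to measure.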

To download lectures: Go to the Stats 244 web site, either through PAWS or by going to the website of the Department of Mathematics and Statistics -> people -> faculty -> W.H. Laverty -> Stats 244 -> Lectures. Then select the lecture, right click and choose Save as.

To print lectures: Open the lecture using MS PowerPoint. Select the menu item File -> Print.

The following dialogue box appears.

In the Print what box, select handouts

Set Slides per page to 6 or 3.

6 slides per page will result in the least amount of paper being printed.

3 slides per page leaves room for notes.