Data Collection.

Slides:



Advertisements
Similar presentations
Chapter 1 Getting Started Understandable Statistics Ninth Edition
Advertisements

Data Collection Methods
Data Collection Methods Mrunalini JS. 2 ‘It is a capital mistake to theorize before one has data.’ -Sir Arthur Conan Doyle as Sherlock Holmes.
Chapter 3 Producing Data 1. During most of this semester we go about statistics as if we already have data to work with. This is okay, but a little misleading.
© 2004 Prentice-Hall, Inc.Chap 1-1 Basic Business Statistics (9 th Edition) Chapter 1 Introduction and Data Collection.
Chapter 1 The Where, Why, and How of Data Collection
Chapter 3 Goals After completing this chapter, you should be able to: Describe key data collection methods Know key definitions:  Population vs. Sample.
Quantitative vs. Categorical Data
MATH1342 S08 – 7:00A-8:15A T/R BB218 SPRING 2014 Daryl Rupp.
Chapter 1 Getting Started
COLLECTING QUANTITATIVE DATA: Sampling and Data collection
Chapter 1: Introduction to Statistics
Chapter 2: Statistics of One Variable
INTRODUCTION TO STATISTICS MATH0102 Prepared by: Nurazrin Jupri.
Understanding Statistics Eighth Edition By Brase and Brase Prepared by: Joe Kupresanin Ohio State University Chapter One Getting Started.
Chapter 1 Introduction and Data Collection
Do Now: 1.Be sure to have picked up three papers upon entry. 2.Work with a partner to complete “The White House is not a Metronome Questions”
Statistics: Basic Concepts. Overview Survey objective: – Collect data from a smaller part of a larger group to learn something about the larger group.
Chapter 1 DATA AND PROBLEM SOLVING. Section 1.1 GETTING STARTED.
 Collecting Quantitative  Data  By: Zainab Aidroos.
Chapter 1: The Nature of Statistics
Surveys and Questionnaires Government agencies, news organizations, and marketing companies often conduct surveys. The results can be factual or subjective.
Collecting Samples Chapter 2.3 – In Search of Good Data Mathematics of Data Management (Nelson) MDM 4U.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics Seventh Edition By Brase and Brase Prepared by: Lynn Smith.
Prob and Stats, Aug 26 Unit 1 Review - Fundamental Terms and Definitions Book Sections: N/A Essential Questions: What are the building blocks of Statistics,
1  Specific number numerical measurement determined by a set of data Example: Twenty-three percent of people polled believed that there are too many polls.
Chapter 2 Review MDM 4U Mr. Lieff. 2.2 – In Search of Good Data What are the variables in a study? The information that is collected What types of variables.
Introduction Biostatistics Analysis: Lecture 1 Definitions and Data Collection.
MDM4U - Collecting Samples Chapter 5.2,5.3. Why Sampling? sampling is done because a census is too expensive or time consuming the challenge is being.
Chapter 1 Getting Started 1.1 What is Statistics?.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 1-1 Statistics for Managers Using Microsoft ® Excel 4 th Edition Chapter.
Statistics is concerned with the proper methods used to collect, analyse, present and interpret DATA There are two types of Statistics: Descriptive – any.
Chapter 1 Getting Started Understanding Basic Statistics Fifth Edition By Brase and Brase Prepared by Jon Booze.
An Overview of Statistics Section 1.1. Ch1 Larson/Farber 2 Statistics is the science of collecting, organizing, analyzing, and interpreting data in order.
C HAPTER 5: P RODUCING D ATA Section 5.1 – Designing Samples.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Understanding Basic Statistics Chapter One Organizing Data.
The hypothesis that most people already think is true. Ex. Eating a good breakfast before a test will help you focus Notation  NULL HYPOTHESIS HoHo.
AP STATISTICS LESSON AP STATISTICS LESSON DESIGNING DATA.
1 Statistics Statistics can be found in all aspects of life:
POPULATION The set of all things or people being studied A group of people you want information about Examples – All the students of Fairwind – All the.
1 Introduction to Statistics. 2 What is Statistics? The gathering, organization, analysis, and presentation of numerical information.
Chapter 3 Surveys and Sampling © 2010 Pearson Education 1.
CHAPTER 34 Collection and Organisation of Data. PRIMARY AND SECONDARY DATA PRIMARY DATA is collected by an individual or organisation to use for a particular.
Chapter 34 Organisation & Collection of Data. Primary & Secondary Data PRIMARY DATA is collected for a particular purpose. PRIMARY DATA is obtained from.
1. What is one method of data collection? 2. What is a truly random way to survey/sample people?
Chapter 1: Section 2-4 Variables and types of Data.
Designing Studies In order to produce data that will truly answer the questions about a large group, the way a study is designed is important. 1)Decide.
Chapter 1 Getting Started What is Statistics?. Individuals vs. Variables Individuals People or objects included in the study Variables Characteristic.
Warm up Which type of study is best for the following situations: cross-sectional or longitudinal? a) Determining what percent of high school students.
1 Chapter 11 Understanding Randomness. 2 Why Random? What is it about chance outcomes being random that makes random selection seem fair? Two things:
Sampling & Simulation Chapter – Common Sampling Techniques  For researchers to make valid inferences about population characteristics, samples.
Chapter 1-2 Review MDM 4U Mr. Lieff. Ch1 Learning Goals Classify data as Quantitative (and continous or discrete) or Qualitatitive Identify the population,
1.5 Conducting start-up market research. Candidates should be able to: define market research explain the difference between primary and secondary research.
Chapter 1 Getting Started Understanding Basic Statistics Fifth Edition By Brase and Brase Prepared by Jon Booze.
Introduction/ Section 5.1 Designing Samples.  We know how to describe data in various ways ◦ Visually, Numerically, etc  Now, we’ll focus on producing.
Data Collection Methods Pros and Cons of Primary and Secondary Data.
DATA COLLECTION What is Data? Data is a collection of facts, such as numbers, words, measurements, observations or even just descriptions of things Data.
Statistics Terminology. What is statistics? The science of conducting studies to collect, organize, summarize, analyze, and draw conclusions from data.
Sampling and Experimentation
Part III – Gathering Data
Understandable Statistics
Chapter 1 Getting Started Understandable Statistics Ninth Edition
The Nature of Probability and Statistics
Use your Chapter 1 notes to complete the following warm-up.
Chapter 5: Producing Data
Chapter 5: Producing Data
Understanding Basic Statistics
Understanding Basic Statistics
Presentation transcript:

Data Collection

What is Data? Data is facts and statistics that are collected together We use data to be able to gather information for reference or analysis We translate the information we find into a form that is more convenient to understand using charts and graphs

Types of Data Numerical Data Categorical Data

Numerical Data Numerical data is quantitative This means it can be measured using numbers and these numbers can be placed in ascending or descending order We use scatter plots and line graphs to represent numerical data There are two types of numerical data – discrete and continuous

Discrete Numerical Data Discrete means the numbers used to measure the data have to be whole numbers The numbers must be distinct and separate Examples for discrete numerical data would be age, number of kittens, number of people, etc

Continuous Numerical Data Continuous means the numbers used to measure the data can be any number including decimals Examples of continuous data would be temperature, time, and height

Categorical Data Categorical data is data that can be sorted into groups or categories Categorical data is qualitative meaning it describes something We use bar graphs and pie charts to sort categorical data There are two different types of categorical data – nominal data and ordinal data

Nominal Categorical Data Nominal data can be counted but not put in ascending or descending order (sorted) Nominal data makes sense regardless of the order it is presented Examples of nominal data include gender, eye colour, hair colour, etc

Ordinal Categorical Data Values or observations that are ordinal can be ranked or have a scale attached You can count and order ordinal data, but it cannot be measured like numerical data Examples of ordinal data include house numbers, dates, swimming level, etc

Data Collection Data collection is separated into two types: primary data and secondary data Primary data is collected first hand Secondary data is data that was collected by somebody else

Primary Data - Examples Surveys Focus groups Questionnaires Personal interviews Experiments and observational study

Primary Data - Limitations Do you have the time and money for: Designing your collection instrument? Selecting your population or sample? Pretesting/piloting the instrument to work out sources of bias? Administration of the instrument? Entry/collation of data? Uniqueness May not be able to compare to other populations Researcher error Sample bias Other confounding factors

Secondary Data – Examples of Sources County health departments Vital Statistics – birth, death certificates Hospital, clinic, school nurse records City and county governments Surveillance data from state government programs Federal agency statistics - Census, NIH, etc.

Secondary Data – Limitations When was it collected? For how long? May be out of date for what you want to analyze. May not have been collected for a long enough time Is the data set complete? There may be missing information on some observations Unless such missing information is caught and corrected for, analysis will be biased. Is the data consistent/reliable? Did variables drop out over time? Did variables change in definition over time? E.g. number of years of education versus highest degree obtained.

Secondary Data – Advantages No need to reinvent the wheel. If someone has already found the data, take advantage of it. It will save you money. Even if you have to pay for access, often it is cheaper in terms of money than collecting your own data. (more on this later.) It will save you time. Primary data collection is very time consuming. (More on this later, too!) It may be very accurate. When especially a government agency has collected the data, incredible amounts of time and money went into it. It’s probably highly accurate.

Data Collection When collecting data from a group, we can do it two ways Observational data or Experimental data

Observational Data Observational data is collected by grouping people into different categories and observing how something affects them An example of observational data collection would be to separate a group into adults vs children and compare the effects of sunlight on them

Experimental Data Experimental data is collected by creating our own groups and imposing our own treatment on the groups to see the effects An example for experimental data would be administering a placebo drug to one group

Data Collection We use data collection to be able to obtain information on a smaller group and extend it to a larger population The most important thing to remember is that the group we select must represent the population as a whole It is very difficult to ensure this happens

Population vs Sample Population – the entire group being studied. Example: How many families in Canada have internet? Sample – the part of the population that is being studied. Example: We would not be able to ask every family in Canada if they have internet. But we would select smaller groups from each province and territory and extend it to the entire country We select a sample from an entire population so that it is easier to get the information we need We use various sampling techniques to select our sample. Example: Our survey would not be very valid if we selected only families in southern parts of Canada where internet is more easily accessible.

Characteristics of a Good Sample Each person must have an equal chance of being selected into the sample. The sample must be large enough to represent the population We use various sampling techniques to ensure this happens

Simple Random Sample Every member of the population has an equal chance of being picked Example: Putting names in a hat and drawing at random

Systematic Random Sample To go through a population sequentially and select at even intervals Example: Going through a phone book and selecting every 50th person

Stratified Sample A strata is a group of subjects that share a common characteristic It keeps proportionate samples of each strata to the population Example: If the population has both men and women, you ensure men and women are in the sample

Cluster Sample One representative group of the population chosen at random Example: Picking one floor of an office building and surveying them

Multi-Stage Sampling Using a combination of stages to obtain the sample

Convenience Sample A type of sampling technique that is based on how easy responses are to obtain Example: Surveying people stranded at an airport during a snowstorm about air travel

Voluntary Response Sampling Inviting subjects to voluntarily be a part of the sample Example: Receiving a survey in the mail and being asked to complete it, random phone surveys from businesses

Problems with Data Collection Questions must be simple, clear, specific, ethical, free from bias, allow for honest response, and not infringe on anyone’s privacy Questions must not contain slang, abbreviations, negatives, leading questions, and insensitivities Good questions are often anonymous and require the subject to select from a list of possible responses Survey bias can be unintentional, but can cause the data collected to be invalid. There are many different types of bias

Sampling Bias The chosen sample does not accurately reflect the population Example: Asking basketball players about issues with the math curriculum

Non-Response Bias Particular groups are under-represented in the sample because they choose not to participate When responders don’t respond, the surveyor is forced to draw their own conclusions about the sample

Measurement Bias When the data collection method consistently under- or overestimates a characteristic of the population Leading questions can also cause measurement bias Example: Police radar gun measuring for average speed on a particular road

Response Bias When participants in a survey give false or misleading answers Question quality or topic might lead to response bias Example: Teacher asks the class to raise hands if they completed their homework

Tally Charts A tally chart is a table used to record values by hand as the data is collected. One tally mark is used for each occurrence of a value Tally marks are usually grouped into sets of five to allow for easier counting

Number of days with rain Frequency Tables Tally charts are helpful during the collection of data Once the data is collected, it is more useful to summarize the data into what we call a frequency table. A frequency table shows the data numerically Number of days with rain Number of weeks 0 1 2 3 4 5 6 7 2 5 5 5 19 6 6 4 Total 52