Presentation on theme: "Created by Tom Wegleitner, Centreville, Virginia"— Presentation transcript:
1Created by Tom Wegleitner, Centreville, Virginia Section 1-1OverviewCreated by Tom Wegleitner, Centreville, Virginia
2OverviewA common goal of surveys and other data collecting tools is to collect data from a smaller part of a larger group so we can learn something about the larger group.In this section we will look at some of ways to describe data.
3DefinitionsDataobservations (such as measurements, genders, survey responses) that have been collected.
4Definitions Statistics a collection of methods for planning experiments, obtaining data, and then then organizing, summarizing, presenting, analyzing, interpreting, and drawing conclusions based on the data.
5Definitions Population the complete collection of all elements (scores, people, measurements, and so on) to be studied. The collection is complete in the sense that it includes all subjects to be studied.
6Definitions Census Sample the collection of data from every member of the population.Samplea sub-collection of elements drawn from a population.
7A researcher wants to study the effects of smoking on cholesterol level in Jackson County. What would be his population?All adults in Jackson County who smoke at least one pack per day.What could possibly be his sample?Some reasonable number of smokers in Jackson County who smoke one pack per day.
8A sociologist hypothesizes that the average annual income of households in Marianna is less than $25,000 per year. To test her hypothesis, she samples 500 households in the city and determines the income of each.Describe the population.The set of all households in Marianna.
9Describe the sample.The sample must be a subset of the population. In this case, it is the 500 households selected by the sociologist.
10“Cola War” is the popular term for the intense competition between Coca-Cola and Pepsi displayed in their marketing campaigns. Their campaigns have featured movie and television stars, rock videos, athletic endorsements, and claims of consumer preference based on taste tests. Suppose, as part of a Pepsi marketing campaign, 1,000 cola consumers are given a blind taste test. Each consumer is asked to state a preference for Brand A or Brand B.
11What is the population?The population of interest is the set of all consumers of “cola” products.What is the sample?The sample is the 1,000 cola consumers selected from the population of all cola consumers.
12Created by Tom Wegleitner, Centreville, Virginia Section 1-2Types of DataCreated by Tom Wegleitner, Centreville, Virginia
13Definitions Parameter a numerical measurement describing some characteristic of a populationpopulationparameter
14Definitions Statistic sample statistic a numerical measurement describing some characteristic of a sample.samplestatistic
15Definitions Quantitative data numbers representing counts or measurements.Example: weights of supermodels.
16Definitions Qualitative (or categorical or attribute) data can be separated into different categories that are distinguished by some nonnumeric characteristics.Example: genders (male/female) of professional athletes.
17Classify each variable as qualitative or quantitative. Colors of automobiles in a dealer’s showroom.Number of seats in movie theaters.Classification of patients based on nursing care needed(complete,partial, or self care)Lengths of newborn cats of a certain species.Number of complaint letters received by an airline per month.
18Working withQuantitative DataQuantitative data can further be distinguished between discrete and continuous types.
19Definitions Discrete Example: The number of eggs that hens lay. data result when the number of possible values is either a finite number or a ‘countable’ number of possible values.0, 1, 2, 3, . . .Example: The number of eggs that hens lay.
20Definitions Continuous (numerical) data result from infinitely many possible values that correspond to some continuous scale that covers a range of values without gaps, interruptions, or jumps.23Example: The amount of milk that a cow produces; e.g gallons per day.
21Classify each variable as discrete or continuous. Number of cartons of milk manufactured each day.Temperatures of airplane interiors at a given airport.Incomes of college students on work study programs.Weights of newborn calfs.Number of tomatoes on each plant in a field.
22Levels of MeasurementAnother way to classify data is to use use levels of measurement. Four of these levels are discussed in the following slides.
23Definitions nominal level of measurement characterized by data that consist of names, labels, or categories only. The data cannot be arranged in an ordering scheme (such as low to high)Example: survey responses yes, no, undecided
24Definitions ordinal level of measurement involves data that may be arranged in some order, but differences between data values either cannot be determined or are meaninglessExample: Course grades A, B, C, D, or F
25Definitions interval level of measurement like the ordinal level, with the additional property that the difference between any two data values is meaningful. However, there is no natural zero starting point (where none of the quantity is present)Example: Years 1000, 2000, 1776, and 1492
26Definitions ratio level of measurement the interval level modified to include the natural zero starting point (where zero indicates that none of the quantity is present). For values at this level, differences and ratios are meaningful.Example: Prices of college textbooks ($0 represents no cost)
27Nominal - categories only Ordinal - categories with some order Summary -Levels of MeasurementNominal - categories onlyOrdinal - categories with some orderInterval - differences but no natural starting pointRatio - differences and a natural starting point
28Classify each as nominal, ordinal, interval, or ratio level data. Horsepower of motorcycle engines.Ratings of newscasts in Houston(poor, fair,good, excellent)Temperature of automatic popcorn poppersTime required be drivers to complete a courseMarital status of respondents to a survey o savings accounts.
29Recap In Sections 1-1 and 1-2 we have looked at: Basic definitions and terms describing dataParameters versus statisticsTypes of data (quantitative and qualitative)Levels of measurement
30Key ConceptsSample data must be collected in an appropriate way, such as through a process of random selection.If sample data are not collected in an appropriate way, the data may be so completely useless that no amount of statistical torturing can salvage them.