MGS 9920 Data and Statistics.

Slides:



Advertisements
Similar presentations
1 Slide AQA - Business Statistics, Quantitative Analysis Peter Matthews FDA B&M
Advertisements

Introduction to Statistics
Chapter 1 A First Look at Statistics and Data Collection.
1/71 Statistics Data 2/71 Contents Applications in Business and Economics Data Data Sources Descriptive Statistics Statistical Inference Computers and.
1 1 Slide © 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
I need help! Applications in Business and Economics Data Data Sources Descriptive Statistics Statistical Inference Computers & Statistical Analysis.
1 Pertemuan 01 Pendahuluan Matakuliah: I0272 – Statistik Probabilitas Tahun: 2005 Versi: Revisi.
1 1 Slide © 2001 South-Western /Thomson Learning  Anderson  Sweeney  Williams Anderson  Sweeney  Williams  Slides Prepared by JOHN LOUCKS  CONTEMPORARYBUSINESSSTATISTICS.
1 1 Slide © 2006 Thomson/South-Western Chapter 1 Data and Statistics I need help! Applications in Business and Economics Data Data Sources Descriptive.
Welcome to QM Business Statistics. Course Objectives: Again 1.To gain an understanding of descriptive statistics, probability, sampling, interval.
Chapter 1 Data and Statistics
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
Statistics - Descriptive statistics 2013/09/23. Data and statistics Statistics is the art of collecting, analyzing, presenting, and interpreting data.
STA 2023 Chapter 1 Notes. Terminology  Data: consists of information coming from observations, counts, measurements, or responses.  Statistics: the.
Census A survey to collect data on the entire population.   Data The facts and figures collected, analyzed, and summarized for presentation and.
Econ 3790: Business and Economics Statistics Instructor: Yogesh Uppal
BUSINESS STATISTICS BQT 173
1 1 Slide © 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
The Nature of Probability and Statistics
The Nature of Probability and Statistics
© Copyright McGraw-Hill CHAPTER 1 The Nature of Probability and Statistics.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Chapter 1 An Introduction to Business Statistics.
Statistics Ⅰ Teacher :刘 伟 公共经济管理学院 统计系 : Office : 2507.
Chapter 1 Data and Statistics I need help! Applications in Economics Data Data Sources Descriptive Statistics Statistical Inference Computers and Statistical.
BUSINESS STATISTICS BQT 173. CHAPTER 1 : DATA & STATISTICS.
© 2006 by Thomson Learning, a division of Thomson Asia Pte Ltd.. 1 Slide Slide Slides Prepared by Juei-Chao Chen Fu Jen Catholic University Slides Prepared.
What is Statistics? Chapter GOALS 1. Understand why we study statistics. 2. Explain what is meant by descriptive statistics and inferential statistics.
1 1 Slide 統計學 Fall 2003 授課教師:統計系余清祥 日期: 2003 年 9 月 16 日 第一週:什麼是統計?
Slides by John Loucks St. Edward’s University. Statistics n The term statistics can refer to numerical facts such as averages, medians, percents, and.
1 1 Slide Data and Data Sets n Data are the facts and figures collected, analyzed, and summarized for presentation and interpretation. and summarized.
MADAM SITI AISYAH BINTI ZAKARIA INSTITUT MATEMATIK KEJURUTERAAN UNIVERSITI MALAYSIA PERLIS.
ECON 3790 Statistics for Business and Economics
Statistics, Data, and Statistical Thinking
1 1 Slide Tuesday August 28 Class 2 Text problems for August 30: Chapter 2 - 2,6 & 10 Aplia Graded Assignment: “Introduction” due September 4, 9:00 am.
Probability & Statistics – Bell Ringer  Make a list of all the possible places where you encounter probability or statistics in your everyday life. 1.
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin What is Statistics Chapter 1.
© 2016 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.
BIA 2610 – Statistical Methods Chapter 1 – Data and Statistics.
Chapter 1 Introduction to Statistics. Statistical Methods Were developed to serve a purpose Were developed to serve a purpose The purpose for each statistical.
Chapter 1 Data and Statistics Applications in Business and Economics Data Data Sources Descriptive Statistics Statistical Inference.
Basic Business Statistics
1 1 Slide Chapter 1 Data and Statistics n Applications in Business and Economics n Data n Data Sources n Descriptive Statistics n Statistical Inference.
1 1 Slide © 2002 South-Western /Thomson Learning.
Overview and Types of Data
Introduction to Statistics Chapter 1. § 1.1 An Overview of Statistics.
Econ 3790: Business and Economics Statistics
1 1 Slide STATISTICS FOR BUSINESS AND ECONOMICS Seventh Edition AndersonSweeneyWilliams Slides Prepared by John Loucks © 1999 ITP/South-Western College.
1-1 Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
1 PAUF 610 TA 1 st Discussion. 2 3 Population & Sample Population includes all members of a specified group. (total collection of objects/people studied)
1.  The practice or science of collecting and analyzing numerical data in large quantities, especially for the purpose of inferring* proportions in a.
1 1 Slide Slides Prepared by JOHN S. LOUCKS St. Edward’s University © 2002 South-Western /Thomson Learning.
What is Statistics? Chapter 1 McGraw-Hill/Irwin Copyright © 2012 by The McGraw-Hill Companies, Inc. All rights reserved.
Elin Driana, Ph.D.  “a systematic attempt to provide answers to questions” (Tuckman, 1999, p. 4)  “the more formal, systematic, and intensive process.
1-1 What is Statistics? Introduction. 1-2 What is Meant by Statistics? In the more common usage, statistics refers to numerical information Examples:
Chapter 1 Introduction to Statistics 1-1 Overview 1-2 Types of Data 1-3 Critical Thinking 1-4 Design of Experiments.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Data and Statistics Data and Statistics I need help! n Applications in Business and Economics.
Business Information Analysis, Chapter 1 Business & Commerce Discipline, IVE 1-1 Chapter One What is Statistics? GOALS When you have completed this chapter,
What is Statistics? Introduction 1.
Statistics Introduction to Data.
St. Edward’s University
The Nature of Probability and Statistics
Introduction and Data Collection
The Nature of Probability and Statistics
Quantitative Methods for Business Studies
The Nature of Probability and Statistics
Essentials of Statistics for Business and Economics (8e)
What is Statistics? Chapter 1.
Chapter 1 Data and Statistics
INTRODUCTION TO STATISTICS
Presentation transcript:

MGS 9920 Data and Statistics

Outlines What is Statistics? Data Data Sources Descriptive Statistics Statistical Inference Computers and Statistical Analysis

What is Statistics? Main purpose of statistics, among others, is to develop and apply methodology for extracting useful knowledge from data. (Fisher 1990) Major activities in statistics involve: exploration and visualization of sample data summary description of sample data hypothesis testing and statistical inference design of experiments and surveys to test hypotheses stochastic modeling of uncertainty (e.g. flipped coin) forecasting based on suitable models development of new statistical theory and methods Reference http://www.gfi.uib.no/~nilsg/kurs/notes/node5.html Bullets with bold font are the topics that will be covered in the class First two are about descriptive statistics. Last one is about inferential statistics.

Statistical data analysis Starts with data Nominal, Ordinal, Interval, and Ratio Descriptive statistics Exploring, visualizing, and summarizing data without fitting the data to any models Inferential statistics Identification of a suitable model Testing either predictions or hypotheses of the model Much attention is given to inferential statistics. However, descriptive statistics are also very important in that it can reveal many interesting features in the data.

Data and Data Sets Data are the facts and figures collected, summarized, analyzed, and interpreted. The data collected in a particular study are referred to as the data set.

Data, Data Sets, Elements, Variables, and Observations Names Stock Annual Earn/ Exchange Sales($M) Share($) Company Dataram EnergySouth Keystone LandCare Psychemedics AMEX 73.10 0.86 OTC 74.00 1.67 NYSE 365.70 0.86 NYSE 111.40 0.33 AMEX 17.60 0.13 The elements are the entities on which data are collected. A variable is a characteristic of interest for the elements. The set of measurements collected for a particular element is called an observation. The total number of data values in a data set is the number of elements multiplied by the number of variables. Data Set

Scales of Measurement Scales of measurement include: Nominal Interval Ordinal Ratio The scale determines the amount of information contained in the data. The scale indicates the data summarization and statistical analyses that are most appropriate.

Scales of Measurement Nominal Data are labels or names used to identify an attribute of the element. A nonnumeric label or numeric code may be used.

Scales of Measurement Nominal Example: Students of a university are classified by the school in which they are enrolled using a nonnumeric label such as Business, Humanities, Education, and so on. Alternatively, a numeric code could be used for the school variable (e.g. 1 denotes Business, 2 denotes Humanities, 3 denotes Education, and so on). Even if numeric codes are used, order is not meaningful among them.

Scales of Measurement Ordinal The data have the properties of nominal data and the order or rank of the data is meaningful. A nonnumeric label or numeric code may be used.

Scales of Measurement Ordinal Example: Students of a university are classified by their class standing using a nonnumeric label such as Freshman, Sophomore, Junior, or Senior. Alternatively, a numeric code could be used for the class standing variable (e.g. 1 denotes Freshman, 2 denotes Sophomore, and so on). Even though the order of numeric codes is meaningful, it is not right to use numerical measures, such as mean and variance, for these codes. For example, what is the meaning of 1.4 in the example above?

Scales of Measurement Interval The data have the properties of ordinal data, and the interval between observations is expressed in terms of a fixed unit of measure. Interval data are always numeric.

Scales of Measurement Interval Example: Melissa has an SAT score of 1205, while Kevin has an SAT score of 1090. Melissa scored 115 points more than Kevin.

Scales of Measurement Ratio The data have all the properties of interval data and the ratio of two values is meaningful. Variables such as distance, height, weight, and time use the ratio scale. This scale must contain a zero value that indicates that nothing exists for the variable at the zero point.

Scales of Measurement Ratio Example: Melissa’s college record shows 36 credit hours earned, while Kevin’s record shows 72 credit hours earned. Kevin has twice as many credit hours earned as Melissa.

In-class Exercise Consider items 1.1, 1.3, 1.4, 1.6, S.4, 3.1, and 3.3 in the handout of an example questionnaire. Comment on what scale of measurement the item uses. Comment on any potential special attention needed when these items will be statistically analyzed. 1.1 Nominal / 1.3 Ordinal / 1.4 Interval / 1.6 Ordinal (could be interval) / S.4 Ordinal / 3.1 Nominal / 3.3 Ordinal Nominal data: To incorporate nominal data in the regression analysis, dummy variable (p.661) must be used. ANOVA can include nominal data. Mean, standard deviation, and other numerical descriptive measures are not applicable. Frequency table is the most common way of analyzing nominal data. Ordinal data: Analyzing ordinal data requires attention in that even though the values have a natural and clear ordering, intervals between values are not (necessarily) meaningful. Therefore, considering ordinal data as interval data is not correct. There are special regression models for ordinal data, which will not be covered in this class. Interval and Ratio data: Most statistical techniques can be used. For nominal and ordinal data, crosstabulation is commonly used to analyze the data. For the ordinal data, Pearson correlation coefficient should not be used, instead Spearman's Rank Order Correlation should be used. Source: http://www.andrews.edu/~calkins/math/edrm611/edrm13.htm

Qualitative and Quantitative Data Data can be further classified as being qualitative or quantitative. The statistical analysis that is appropriate depends on whether the data for the variable are qualitative or quantitative. In general, there are more alternatives for statistical analysis when the data are quantitative.

Qualitative Data Labels or names used to identify an attribute of each element Often referred to as categorical data Use either the nominal or ordinal scale of measurement Can be either numeric or nonnumeric Appropriate statistical analyses are rather limited

Quantitative Data Quantitative data indicate how many or how much: discrete, if measuring how many continuous, if measuring how much Quantitative data are always numeric. Ordinary arithmetic operations are meaningful for quantitative data.

Scales of Measurement Data Qualitative Quantitative Numerical Nonnumerical Numerical Nominal Ordinal Nominal Ordinal Interval Ratio

In-class exercise Q10 (old book: p20; new book: p23)

Cross-Sectional Data Cross-sectional data are collected at the same or approximately the same point in time. Example: data detailing the number of building permits issued in June 2003 in each of the counties of Ohio

Time Series Data Time series data are collected over several time periods. Example: data detailing the number of building permits issued in Lucas County, Ohio in each of the last 36 months Time series data requires different technique to analyze the data compare to cross-sectional data. Again, types of data dictate the choice of statistical analysis method.

Data Sources Existing Sources (often called secondary data) Within a firm – almost any department Business database services – Dow Jones & Co. Government agencies - U.S. Department of Labor Industry associations – Travel Industry Association of America Special-interest organizations – Graduate Management Admission Council Secondary data: Data originally collected for a different study, used again for a new research question Internet – more and more firms

Data Sources Statistical Studies (often called primary data) In experimental studies the variables of interest are first identified. Then one or more factors are controlled so that data can be obtained about how the factors influence the variables. In observational (nonexperimental) studies no attempt is made to control or influence the variables of interest. Primary data: Original data collected for a specific research goal a survey is a good example

Data Acquisition Considerations Time Requirement Searching for information can be time consuming. Information may no longer be useful by the time it is available. Cost of Acquisition Organizations often charge for information even when it is not their primary business activity. Data Errors Using any data that happens to be available or that were acquired with little care can lead to poor and misleading information.

Descriptive Statistics Descriptive statistics are the tabular, graphical, and numerical methods used to summarize data. Examples Frequency table Histogram Mean Variance Example: After exam, students may want to know and see the followings: Numbers of A, B, C, D and F. Histogram of this frequency distribution What is the average score – mean? What is the variance?

Statistical Inference Population - the set of all elements of interest in a particular study Sample - a subset of the population Statistical inference - the process of using data obtained from a sample to make estimates and test hypotheses about the characteristics of a population Census - collecting data for a population Sample survey - collecting data for a sample

Process of Statistical Inference: example 1. Population consists of heights of all GSU students. 2. A sample of 25 students are randomly selected and measured. 3. The sample data provide a sample average height of 5’ 5’’. 4. The sample average is used to estimate the population average.

In-class exercise Q21 (old book: p24; new book: p26) Q22 (old book: p24 or see below) Q22. In the fall of 2003, Arnold Schwarzenegger challenged Governor Gray Davis for the governorship of California. A Policy Institute of California survey of registered voters reported Arnold Schwarzenegger in the lead with an estimated 54% over the vote (Newsweek, September 8, 2003). What was the population of this survey? What was the sample for this survey? Why was a sample used in this situation? Explain.

Computers and Statistical Analysis

Short comparison between Excel and SPSS Good at data manipulation, such as transpose, transformation, etc. Powerful graph Easy to use Not for serious statistical use (data limit, lack of statistical functions, etc.) SPSS Widely used statistical software in research community More comprehensive statistical package than Excel Often Excel and SPSS are used together. Data can be shared between Excel and SPSS easily. Excel is often used due to its flexible graphic ability.

End of Chapter 1