Chapter 2: Presenting Data in Charts and Tables. Why use charts and graphs? They visually present information that can't easily be read from a data table. Many details can be shown in a small area. Readers can immediately see major similarities and differences without having to compare and interpret figures.
Computer software can be used to create charts and graphs: SPSS, MINITAB, MS Excel, MS Visio, and others.
How to present categorical data? Categorical data can be tabulated (summary table) or graphed (bar charts, pie charts).
Bar chart: Bar charts and pie charts are often used for qualitative (categorical) data. The height of each bar shows the frequency for its category. Bar graphs compare the values of different items in specific categories or at discrete points in time.
Bar chart example:
Pie chart: The size of each pie slice shows the percentage for its category. It is suitable for illustrating percentage distributions of qualitative data. It displays the contribution of each value to a total. It should not contain too many sectors: 5 or 6 at most.
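Bar heights are category frequencies and pie slices are category percentages, so both charts start from the same summary table. A minimal Python sketch of that table follows; the function name `summary_table` and the survey answers are invented for illustration and are not part of the slides.

```python
from collections import Counter


def summary_table(values):
    """Return (category, frequency, percentage) rows, most frequent first."""
    counts = Counter(values)
    total = sum(counts.values())
    return [(cat, freq, 100.0 * freq / total)
            for cat, freq in counts.most_common()]


# Hypothetical survey answers, invented for illustration only.
answers = ["bank", "cash", "bank", "card", "bank", "cash", "card", "bank"]

for category, freq, pct in summary_table(answers):
    # freq would be the bar height, pct the size of the pie slice.
    print(f"{category:<6} {freq:>3} {pct:6.1f}%")
```

The frequency column feeds a bar chart and the percentage column feeds a pie chart.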
Pie chart example:
How to present numerical data? Numerical data can be presented as an ordered array, a stem-and-leaf display, a frequency distribution (histogram, polygon), or a cumulative distribution (ogive).
The ordered array: the sequence of data in rank order. It shows the range (min to max), gives some indication of variability within the range, and helps identify outliers. It is useful for small data sets. Example: Data in raw form: Data in ordered array: (min to max)
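As a rough illustration of the ordered array, the sketch below sorts a small data set and reads off the minimum, maximum, and range. The raw values and the helper name `ordered_array` are hypothetical, since the slide's own example data is not shown here.

```python
def ordered_array(raw):
    """Sort raw data from min to max and report (ordered, min, max, range)."""
    ordered = sorted(raw)
    lowest, highest = ordered[0], ordered[-1]
    return ordered, lowest, highest, highest - lowest


# Hypothetical raw observations, invented for illustration only.
raw = [24, 26, 24, 21, 27, 27, 30, 41, 32, 38]

ordered, lowest, highest, data_range = ordered_array(raw)
print("Ordered array:", ordered)
print("Min:", lowest, "Max:", highest, "Range:", data_range)
```

Once the data is sorted, values that sit far from their neighbours (possible outliers) are easy to spot at either end.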
Tabulating Numerical Data: Frequency Distribution. A frequency distribution is a list or a table that contains class groups and the corresponding frequencies with which data fall within each group or category. Why use a frequency distribution? To summarize numerical data. To condense the raw data into a more useful form. To allow quick visual interpretation of the data.
Organizing a data set into a frequency distribution table: Determine the number of classes. The number of classes can be determined by using the formula 2^k > n, where k is the number of classes and n is the number of data points. Example: Prices of laptops sold last month at PSC: 299, 336, 450, 480, 520, 570, 650, 680, , 800, 850, 900, 920, 990, 1050, 1300, 1500
In this example, the number of data points is n = 18. If we try k = 4, which means we would use 4 classes, then 2^4 = 16, which is less than 18. With k = 5, 2^5 = 32, which is greater than 18, so the recommended number of classes is 5. Determine the class interval or width: - The class interval should be the same for all classes. - Class boundaries never overlap.
- The class interval can be expressed in a formula: i >= (H - L) / k, where i is the class interval, H is the highest value in the data set, L is the lowest value in the data set, and k is the number of classes. In the example above, H is 1500 and L is 299, so the class interval should be at least (1500 - 299) / 5 = 240.2. The class interval used in this data set is 250. Determine class boundaries and tally the laptop selling prices into the classes. Classes: 260 up to 510, 510 up to 760, 760 up to 1010, 1010 up to 1260, 1260 up to 1510.
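The three decisions so far (number of classes, class interval, class boundaries) can be sketched in Python. This assumes the rule "smallest k with 2^k > n" and the usual form of the interval formula, i >= (H - L) / k. The starting point 260 and the width 250 are the choices made on the slide; the function names are hypothetical.

```python
def number_of_classes(n):
    """Smallest k such that 2**k > n."""
    k = 1
    while 2 ** k <= n:
        k += 1
    return k


def minimum_interval(highest, lowest, k):
    """Lower limit for the class interval: (H - L) / k."""
    return (highest - lowest) / k


def class_boundaries(start, width, k):
    """k equal-width classes need k + 1 boundaries, beginning at start."""
    return [start + j * width for j in range(k + 1)]


k = number_of_classes(18)               # n = 18 data points
i_min = minimum_interval(1500, 299, k)  # H = 1500, L = 299
bounds = class_boundaries(260, 250, k)  # start and width taken from the slide

print(k)       # 5
print(i_min)   # 240.2
print(bounds)  # [260, 510, 760, 1010, 1260, 1510]
```

Rounding the minimum interval 240.2 up to a convenient 250 keeps the boundaries easy to read while still covering the whole range from 299 to 1500.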
Compute class midpoints: midpoint = (lower bound + upper bound) / 2. Count the number of items in each class. The number of items observed in each class is called the class frequency. The resulting table has the columns Laptop selling price ($), Frequency, and Cumulative frequency, with one row per class from 260 up to 510 through 1260 up to 1510.
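A minimal sketch of the tally, midpoint, and cumulative-frequency steps follows. It uses only the 16 laptop prices that are legible in the list above, so its counts are illustrative and need not match a table built from all 18 data points. The class boundaries assume a width of 250 starting at 260, and `frequency_table` is a hypothetical helper name.

```python
from bisect import bisect_right

# Only the prices legible in the list above (16 of the stated 18 values).
prices = [299, 336, 450, 480, 520, 570, 650, 680,
          800, 850, 900, 920, 990, 1050, 1300, 1500]
# Assumed boundaries: width 250 starting at 260.
bounds = [260, 510, 760, 1010, 1260, 1510]


def frequency_table(data, bounds):
    """Return (lower, upper, midpoint, frequency, cumulative) per class.

    A class "lower up to upper" includes lower and excludes upper.
    """
    freqs = [0] * (len(bounds) - 1)
    for x in data:
        if not bounds[0] <= x < bounds[-1]:
            raise ValueError(f"{x} falls outside the class boundaries")
        freqs[bisect_right(bounds, x) - 1] += 1

    rows, cumulative = [], 0
    for lower, upper, f in zip(bounds, bounds[1:], freqs):
        cumulative += f
        rows.append((lower, upper, (lower + upper) / 2, f, cumulative))
    return rows


for lower, upper, mid, f, cum in frequency_table(prices, bounds):
    print(f"{lower:>5} up to {upper:<5} midpoint {mid:>7.1f}  f={f:<2} cum={cum}")
```

The last cumulative frequency always equals the number of data points tallied, which is a quick check that no value was dropped or counted twice.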
Stem-and-leaf: A statistical technique for presenting a set of data. Each numerical value is divided into two parts: the stem (leading digits) and the leaf (trailing digit). The stems are located along the y-axis and the leaves along the x-axis.
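The split into stem and leaf can be sketched as follows, assuming non-negative whole numbers whose last digit is the leaf. The data values and the function name `stem_and_leaf` are hypothetical.

```python
from collections import defaultdict


def stem_and_leaf(values):
    """Map each stem (leading digits) to its sorted leaves (trailing digit)."""
    display = defaultdict(list)
    for v in sorted(values):
        stem, leaf = divmod(v, 10)  # e.g. 117 -> stem 11, leaf 7
        display[stem].append(leaf)
    return dict(display)


# Hypothetical observations, invented for illustration only.
data = [96, 93, 88, 117, 127, 95, 113, 96, 108, 94]

for stem, leaves in stem_and_leaf(data).items():
    # Stems run down the page, leaves run across.
    print(f"{stem:>3} | {' '.join(str(leaf) for leaf in leaves)}")
```

This sketch lists only stems that actually occur; a full display would normally also show empty stems so that gaps in the data stay visible.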
Histogram: A graph of the data in a frequency distribution. It uses adjoining columns to represent the number of observations (frequency) for each class interval in the distribution. The area of each column is proportional to the number of observations in that interval.
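To see the idea without any charting software, the sketch below draws a rough text histogram, rotated so that each class is a row and each observation is one symbol. With equal class widths, bar length is proportional to frequency. The boundaries, frequencies, and the function name `text_histogram` are hypothetical.

```python
def text_histogram(bounds, freqs, symbol="#"):
    """Return one text bar per class; bar length equals the class frequency."""
    lines = []
    for lower, upper, f in zip(bounds, bounds[1:], freqs):
        lines.append(f"{lower:>5} up to {upper:<5} | {symbol * f} ({f})")
    return lines


# Hypothetical frequency distribution, invented for illustration only.
bounds = [0, 10, 20, 30, 40]
freqs = [3, 7, 5, 1]

for line in text_histogram(bounds, freqs):
    print(line)
```

A real histogram places the columns side by side with no gaps, which is what distinguishes it from a bar chart of categories.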
Example of histogram:
How can you construct the histogram in SPSS?
Polygon: A frequency polygon, like a histogram, is the graph of a frequency distribution. In a frequency polygon, we mark the number of observations within an interval with a single point placed at the midpoint of the interval, and then connect adjacent points with straight lines.
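The points of a frequency polygon can be computed directly from a frequency distribution, as in this minimal sketch. The boundaries, frequencies, and the function name `polygon_points` are hypothetical.

```python
def polygon_points(bounds, freqs):
    """Return one (class midpoint, frequency) point per class."""
    return [((lower + upper) / 2, f)
            for lower, upper, f in zip(bounds, bounds[1:], freqs)]


# Hypothetical frequency distribution, invented for illustration only.
bounds = [0, 10, 20, 30, 40]
freqs = [3, 7, 5, 1]

points = polygon_points(bounds, freqs)
# Each consecutive pair of points is joined by one straight line segment.
segments = list(zip(points, points[1:]))

print(points)  # [(5.0, 3), (15.0, 7), (25.0, 5), (35.0, 1)]
print(len(segments), "line segments")
```

Plotting these points and joining them in order gives the polygon; any plotting tool that draws a line through (x, y) pairs will do.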
How can you construct the polygon in SPSS?
Ogive: a graph of cumulative frequency. Ogive example:
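The points of an ogive can be sketched as running totals of the class frequencies. This sketch assumes one common convention, not stated on the slide: each cumulative frequency is plotted at the upper boundary of its class, starting from zero at the lowest boundary. The data and the function name `ogive_points` are hypothetical.

```python
from itertools import accumulate


def ogive_points(bounds, freqs):
    """Return (boundary, cumulative frequency) points, starting at zero."""
    cumulative = [0] + list(accumulate(freqs))
    return list(zip(bounds, cumulative))


# Hypothetical frequency distribution, invented for illustration only.
bounds = [0, 10, 20, 30, 40]
freqs = [3, 7, 5, 1]

print(ogive_points(bounds, freqs))
# [(0, 0), (10, 3), (20, 10), (30, 15), (40, 16)]
```

Reading the curve at a boundary answers questions of the form "how many values are less than x?", for example 10 values are below 20 in this data.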
How can you construct the Ogive in SPSS?
Exercises 1. The price-earnings ratios for 24 stocks in the retail store are: a. Organize this data set into a stem-and-leaf display. b. How many values are less than 10.0? c. What are the smallest and largest values?
Exercises 2. The following stem-and-leaf chart shows the number of units produced per day in a factory
a. How many days were studied? b. How many values are in the first class? c. What are the smallest and the largest values? d. How many values are less than 70? e. How many values are between 50 and 70?
3. The following frequency distribution represents the number of days during a year that employees at GDNT were absent from work due to illness.
Number of days absent    Employees
0 up to 4                5
4 up to 8
8 up to 12
12 up to 16
16 up to 20              2
a. What is the midpoint of the first class? b. Construct a histogram. c. Construct a frequency polygon. d. Interpret the rate of employee absenteeism using the two charts.