Business Forecasting Chapter 4 Data Collection and Analysis in Forecasting
Chapter Topics Preliminary Adjustments to Data Data Transformation Patterns in Time Series Data The Classical Decomposition Method
Preliminary Data Adjustments Trading Day Adjustments Price Change Adjustments Population Change Adjustments
Trading Day Adjustments Yr JFMAMJJASOND
Trading Day Adjustment Number of Computers Sold (in Millions) in 2005, 2006, and 2007 Month January345 February445 March356 April536 May536 June547 July656 August646 September657 October554 November564 December665 Yearly unit sales595467
Trading Day Adjustment
Trading Day Adjustments
Price Change Adjustments
Having computed the price index, we are now able to deflate the sales revenue with the weighted price index in the following way:
Price Change Adjustments To see the impact of separating the effect of price level changes, we graph the price of computers in constant and current dollars.
Population Change Adjustments Disposable Personal Income and Per Capita Income for the U.S and 2005 Disposable IncomePopulationPer Capita Disposable YearBillions of Dollars(in Millions)Income ($)
Data Transformation Most appropriate remedial measure for variance heterogeneity. Original data are converted into a new scale, resulting in a new data set that is expected to satisfy the condition of homogeneity of variance. Several transformation techniques are available.
Data Transformation Linear Transformation: An important assumption in using the regression model for forecasting is that the pattern of observation is linear. Obviously, there are many situations in which this is not a valid assumption. For example, if we were forecasting monthly sales and it was believed that those sales varied according to the season of the year, then the assumption of linearity would not hold.
Linear Transformation A forecasting equation may be of the form: The above could easily be transformed into a linear form for estimation purposes:
Logarithmic Transformation
Time Operating Revenue 0 2,000 4,000 6,000 8,000 10, Log of Operating Revenue ActualLog Figure 4.2 Actual and Logarithmically Transformed Operating Revenue for Southwest Airlines
Square Root Transformation
Square Root Transformation ( Scaled Square Root Data ) 0 2,000 4,000 6,000 8,000 10, Time Operating RevenueSquare Root Operating Revenue
Square Root Transformation ( Unscaled Square Root Data )
Classical Time Series Model Secular Trend (T ) Seasonal Variation (S ) Cyclical Variation (C ) Random or Irregular Variation
Trend Linear Trend Non-linear Trend
Trend Computing the Linear Trend The Freehand Method The Semi-average Method The Least Squares Method
Freehand Method
Since a linear trend by this method is simply an approximation of a straight line equation, we have to determine the intercept and the slope of the line. Based on our data, we have:
Freehand Method
Now we can use this equation to make a forecast of the trend. For example, the forecast for 2006 would be:
Freehand Method Based on your understanding, what are the pitfalls of using the freehand method? Simple method but not objective. Why not objective?
The Semi-average Method Simple but objective method in fitting a trend line. Divides the data into two equal parts and computes the average for each part. The computed averages for each part provide two points on a straight line. The slope of the line is computed by taking the difference between the averages and dividing it by half of the total number of observations.
The Semi-average Method Fitting a Straight Line by the Semi-Average Method to Income from the Export of Durable Goods, 1996–2005 YearIncomeSemi-totalSemi-averageCoded Time − − , ,
The Semi-average Method We see that the intercept of the line is: The fitted equation is: The slope is:
The Semi-average Method For the year 2005, the forecast revenue from export of durable goods is:
The Least Squares Method Provides the best method of fitting a trend. The intercept and the slope are computed as follows:
The Least Squares Method Using the data from the previous example, we have:
The Least Squares Method The fitted trend line equation is: Note: Since x is measured in a half year, we have to multiply it by two to get the full year. x = 0 in 2000 ½ 1 x = ½ year Y = Billions of Dollars
The Least Squares Method To compare the two methods, we note: Least squares: Semi-average:
Nonlinear Trend In many business and economic environments we observe that the time series does not follow a constant rate of increase or decrease, but follows an increasing or decreasing pattern. Whenever there is dramatic change in production technology, we expect the trend line not to follow a constant linear pattern.
Nonlinear Trend A polynomial function best exemplifies business conditions. A second-degree parabola provides a good historical description of an increase or decrease per time period.
Nonlinear Trend To solve for the constants a, b, and c in the previous equation, we use the following simultaneous equations:
Nonlinear Trend World Carbon Emissions from Fossil Fuel Burning 1982–1994 YearMillion tonnes 19824,960−6−29,760178,560361, ,947−5−24,735123, ,109−4−20,436 81, ,282−3−15,846 47, ,464−2−10,928 21, ,584−1−5,584 5, , , ,912 5, , ,882 23, , ,078 54, , ,640 94, , ,465147, , ,550213,300361,296 72, ,238998, ,550 XYxxY
Nonlinear Trend The data from the table is used to compute the following: x = 0 in x = one year Y = million tonnes
Logarithmic Trend When we wish to fit a trend line to percentage rates of change, we use the logarithmic trend line. This is more prevalent when dealing with economic growth in an environment. The logarithmic trend equation is:
Logarithmic Trend The least squares trend is computed as:
Logarithmic Trend Example − 15 − − 13 − − 11 − − 9 − , − 7 − , − 5 − , − 3 − , − 1 − , , , , , , , , Chinese Exports Year ( $ 100 Million)log Yxx log Y
Logarithmic Trend Example (continued)
The estimated trend line equation is:
Logarithmic Trend (continued) Check the goodness of fit by substituting two data points such as 1992 and 2003, into the fitted equation. For 1992, we will have:
Logarithmic Trend (continued) For 2003, we will have:
Logarithmic Trend Interpretation of the estimated trend line would be similar to a linear trend. However, before we can interpret the estimated values, we have to convert the log values into actual values of the data points. This is done by taking the antilog.
Logarithmic Trend The results are: And
Logarithmic Trend To determine the rate of change or the slope of the line we have: R = antilog = Since the rate of change (r) was defined as R −1, then r = −1 = r = 7.9 percent per half-year Therefore the growth rate is 15.8% or 16% per year.
Other Approaches to Trend Line Two more sophisticated methods of determining whether there is a trend in the data: Differencing Autocorrelation (Box–Jenkins Methodology) Allows the analyst to see whether a linear equation, a second-degree polynomial, or a higher-degree equation should be used to determine a trend.
Differencing First Difference Second Difference
Differencing Method Example First and Second Difference of Hypothetical Data YtFirst DifferenceSecond Difference 20,000 22,000 2,000 24,300 2, ,900 2, ,800 2, ,000 3,200300
Seasonal Analysis Seasonal variation is defined as a predictable and repetitive movement observed around a trend line within a period of 1 year or less. There are several reasons for measuring seasonal variations. When analyzing the data from a time series, it is important to be able to know how much of the variation in the data is due to the seasonal factors.
Seasonal Variation (Continued) We may use seasonal variation patterns in making projections or forecasts of a short- term nature. By eliminating the seasonal variation from a time series, we may discover the cyclical pattern of the time series.
Seasonal Variation Computation of Ratio of Original Data to Moving Average
Seasonal Variation To compute a seasonal index, we do the following: sum the modified means
Seasonal Variation
If production is full, we expect the index to equal 100 for each month. If not, we have to adjust it by computing a correction factor. Compute the seasonal index.
Seasonal Variation
Cyclical Variation Similar to seasonal variation except that it occurs every 5 to 10 years. There is a systematic pattern in the data that mirrors what is happening in the economy. Movements from a recession to a depression or recession to recovery follow a cycle. Every time series data has a random component. If there were no random components, we would have perfect prediction of future values. However, this is not the case with real-world conditions. The cyclical component is measured as a proportion of the trend.
Cyclical Variation
Cyclical Variation Example
Chapter Summary Preliminary Adjustments to Data: Trading Day Adjustment Price Adjustment Population Adjustment Data Transformation Patterns in Time Series Data The Classical Decomposition Method