Comparing Automatic Modeling Procedures of TRAMO and X-12-ARIMA, an Update Kathy McDonald-Johnson, U.S. Census Bureau Catherine Hood, Catherine Hood Consulting.

Slides:

Advertisements

Similar presentations

EcoTherm Plus WGB-K 20 E 4,5 – 20 kW.

Advertisements

AGVISE Laboratories %Zone or Grid Samples – Northwood laboratory

Trend for Precision Soil Testing % Zone or Grid Samples Tested compared to Total Samples.

5.1 Rules for Exponents Review of Bases and Exponents Zero Exponents

AP STUDY SESSION 2.

EuroCondens SGB E.

RWTÜV Fahrzeug Gmbh, Institute for Vehicle TechnologyTÜV Mitte Group 1 GRB Working Group Acceleration Pattern Results of pass-by noise measurements carried.

Slide 1Fig 25-CO, p.762. Slide 2Fig 25-1, p.765 Slide 3Fig 25-2, p.765.

& dding ubtracting ractions.

STATISTICS INTERVAL ESTIMATION Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University.

Addition and Subtraction Equations

Issues in Modeling and Adjusting Calendar Effects in Economic Time Series Brian C. Monsell U. S. Census Bureau ICES III June 2007 U S C E N S U S B U R.

David Burdett May 11, 2004 Package Binding for WS CDL.

Create an Application Title 1Y - Youth Chapter 5.

Add Governors Discretionary (1G) Grants Chapter 6.

CHAPTER 18 The Ankle and Lower Leg

The 5S numbers game..

A Fractional Order (Proportional and Derivative) Motion Controller Design for A Class of Second-order Systems Center for Self-Organizing Intelligent.

Media-Monitoring Final Report April - May 2010 News.

Break Time Remaining 10:00.

The basics for simulations

Factoring Quadratics — ax² + bx + c Topic

A sample problem. The cash in bank account for J. B. Lindsay Co. at May 31 of the current year indicated a balance of $14, after both the cash receipts.

PP Test Review Sections 6-1 to 6-6

1 IMDS Tutorial Integrated Microarray Database System.

Regression with Panel Data

TCCI Barometer March “Establishing a reliable tool for monitoring the financial, business and social activity in the Prefecture of Thessaloniki”

TCCI Barometer March “Establishing a reliable tool for monitoring the financial, business and social activity in the Prefecture of Thessaloniki”

Copyright © 2012, Elsevier Inc. All rights Reserved. 1 Chapter 7 Modeling Structure with Blocks.

Progressive Aerobic Cardiovascular Endurance Run

Biology 2 Plant Kingdom Identification Test Review.

Adding Up In Chunks.

MaK_Full ahead loaded 1 Alarm Page Directory (F11)

Facebook Pages 101: Your Organization’s Foothold on the Social Web A Volunteer Leader Webinar Sponsored by CACO December 1, 2010 Andrew Gossen, Senior.

TCCI Barometer September “Establishing a reliable tool for monitoring the financial, business and social activity in the Prefecture of Thessaloniki”

1 Termination and shape-shifting heaps Byron Cook Microsoft Research, Cambridge Joint work with Josh Berdine, Dino Distefano, and.

2011 WINNISQUAM COMMUNITY SURVEY YOUTH RISK BEHAVIOR GRADES 9-12 STUDENTS=1021.

Before Between After.

2011 FRANKLIN COMMUNITY SURVEY YOUTH RISK BEHAVIOR GRADES 9-12 STUDENTS=332.

Slide R - 1 Copyright © 2009 Pearson Education, Inc. Publishing as Pearson Prentice Hall Active Learning Lecture Slides For use with Classroom Response.

12 October, 2014 St Joseph's College ADVANCED HIGHER REVISION 1 ADVANCED HIGHER MATHS REVISION AND FORMULAE UNIT 2.

Subtraction: Adding UP

1 Non Deterministic Automata. 2 Alphabet = Nondeterministic Finite Accepter (NFA)

Static Equilibrium; Elasticity and Fracture

FIGURE 12-1 Op-amp symbols and packages.

Converting a Fraction to %

Resistência dos Materiais, 5ª ed.

Clock will move after 1 minute

Copyright © 2013 Pearson Education, Inc. All rights reserved Chapter 11 Simple Linear Regression.

Lial/Hungerford/Holcomb/Mullins: Mathematics with Applications 11e Finite Mathematics with Applications 11e Copyright ©2015 Pearson Education, Inc. All.

Select a time to count down from the clock above

16. Mean Square Estimation

Copyright Tim Morris/St Stephen's School

1.step PMIT start + initial project data input Concept Concept.

9. Two Functions of Two Random Variables

A Data Warehouse Mining Tool Stephen Turner Chris Frala

1 Dr. Scott Schaefer Least Squares Curves, Rational Representations, Splines and Continuity.

Chart Deception Main Source: How to Lie with Charts, by Gerald E. Jones Dr. Michael R. Hyman, NMSU.

1 Non Deterministic Automata. 2 Alphabet = Nondeterministic Finite Accepter (NFA)

Schutzvermerk nach DIN 34 beachten 05/04/15 Seite 1 Training EPAM and CANopen Basic Solution: Password * * Level 1 Level 2 * Level 3 Password2 IP-Adr.

Presentation transcript:

Comparing Automatic Modeling Procedures of TRAMO and X-12-ARIMA, an Update Kathy McDonald-Johnson, U.S. Census Bureau Catherine Hood, Catherine Hood Consulting Brian Monsell, U.S. Census Bureau Chak Li, U.S. Census Bureau ICES III June 2007

2 Acknowledgments Agustín Maravall Víctor Gómez Rita Petroni James Gomish

3 Update Similar comparisons in the past, especially Farooque, Hood, and Findley (2001) »X-12-ARIMA chose models of similar quality to TRAMO models »X-12-ARIMA perhaps better at identifying trading day effects than TRAMO

4 Update, Similar Approach We used a similar approach to that of Farooque, Hood, and Findley (2001), but we used improved versions of both programs

5 Outline Background on Automatic Modeling Methods Results »Actual time series »Simulated time series Conclusions

Background

7 RegARIMA Model = Regression + ARIMA Autoregressive Integrated Moving Average

8 ARIMA Model (p d q)(P D Q) φ (B) Φ (B s )(1 – B) d (1 – B s ) D z t = θ (B) Θ (B s )a t

9 Automatic ARIMA Modeling X-11-ARIMA from Statistics Canada, Dagum »Picks best model from list TRAMO (Time series Regression with ARIMA noise, Missing observations and Outliers) from the Bank of Spain, Gómez and Maravall »Multiple steps to obtain a model

10 TRAMO Automatic Modeling Gómez and Maravall (2000) gives description FORTRAN code from Gómez and Maravall provides additional detail »Generously provided to U. S. Census Bureau for X-12-ARIMA Version 0.3 development

11 X-12-ARIMA Version 0.3 Retains pick model method » Pickmdl specification Adds step-through method based on the TRAMO method » Automdl specification

12 X-12-ARIMA Comparisons Dent, Hood, McDonald-Johnson, and Feldpausch (2005) compared the step- through method to the pick model method »Models of similar quality »Step-through method more flexible

13 X-12-ARIMA's Automatic Transformation Selection Identification with the transform specification Fit a default model with the log transformation and with no transformation »Usually the airline model (0 1 1)(0 1 1) from Box and Jenkins (1976) The model is chosen whose maximum likelihood value is larger »Likelihood of the log data is adjusted to be a likelihood of the untransformed data

14 By default, slight bias toward the log transformation

15 X-12-ARIMA's Automatic Regression Selection Trend Constant »Identification with the step-through method, automdl specification Outliers »Identification with the outlier specification

16 X-12-ARIMA's Automatic Regression Selection Trading-Day Effect Easter Effect Identification with the regression specification, aictest argument Test uses the AICC »No bias (user can set bias)

17 AICC AIC Corrected (for sample size) Note: As N gets larger, AICC approaches the AIC

18 Trading-Day Effects User specifies type »Flow (cumulative) »Stock (inventory) X-12-ARIMA compares AICC with and without the effect »No bias (user can set bias)

19 Easter Effects Default tests for Easter effects of length 1, 8, and 15 days »User can specify length X-12-ARIMA compares four AICC values »No effect vs. each of the three different length effects »No bias (user can set bias)

20 Modeling Diagnostics Ljung-Box Q »Goodness-of-fit diagnostics (Ljung and Box 1978) Spectrum of the model residuals »Diagnostic indicating seasonal or trading day effects remaining in the model residuals »Trading day frequencies defined in Cleveland and Devlin (1980)

21 Ljung-Box Q Based on sample autocorrelation of the regARIMA model residuals Residuals should behave like white noise Each Ljung-Box Q statistic of positive degrees of freedom has a corresponding p value An individual lag fails if the p value for the Q statistic for the lag is less than 0.05

22 Ljung-Box Q Failure For this study If seven or more of the first 12 lags fail or If 13 or more of the first 24 lags fail or If lag 12 fails Then the model fails according to this diagnostic

23 Spectrum of the Model Residuals Diagnostic indicating strength at frequencies of interest Visually significant peaks at seasonal or trading day frequencies indicate possible model problems

24 Significant Spectrum Peaks A spectral peak is considered to be significant if it »Reaches a height beyond the median height of all the frequency measures »Are taller than nearest neighbors by a visually significant amount

25 Significant Spectrum Peaks For this study, any significant peak at »seasonal frequencies one, two, three, for or five cycles per year and »At either of the two trading-day frequencies indicates model failure according to this diagnostic

27 Spectrum Diagnostic Information Graphical form »Output file line printer graph »Higher resolution graph Text form »Log file »Diagnostics file Failure warnings listed onscreen when X- 12-ARIMA runs

Methods

29 Automatic Modeling Settings Test for log transformation Automatic regARIMA model identification Automatic outlier detection Test for »Usual trading day »Leap year »Easter effects

30 Settings for X-12-ARIMA We expected some quarterly effects (higher autocorrelation three months apart), so we chose the maximum nonseasonal model order (maximum p, q) to be three »Default is two We chose to prefer balanced models to have an approach more like the TRAMO procedure »Default is not to prefer balanced models

31 Model Choices Ran TRAMO, X-12-ARIMA to identify transformation, model Hard-coded results into X-12-ARIMA input specification files Compared diagnostics from X-12-ARIMA

32 Clarification "TRAMO model" results are from X- 12-ARIMA runs »Initial TRAMO runs determined the transformation and model choices

33 Changes to Models Used X-12-ARIMA outlier set If any Easter regressor chosen, used X- 12-ARIMA Easter effect of eight days

Actual Time Series

U. S. Census Bureau Series U. S. Building Permits Manufacturing Retail Sales Import/Export data Descriptions available at

36 Transformation Choice TRAMO and X-12-ARIMA agreed for 91% (417) of the series 40 series differed »85% (34/40) TRAMO chose log and X-12- ARIMA chose no transformation »15% (6/40) X-12-ARIMA chose log and TRAMO chose no transformation

37 Transformation Choice Transformation choice is fundamental We did not want to favor one programs transformation over the other We dropped the 40 series of disagreement from further comparisons

38 Full Model Agreement (of 417 Series) 30% (124) of the regARIMA models agreed »Any length Easter considered match 293 series to compare diagnostics

39 ARIMA Model Agreement (of 293 Series) 24% (70) of the ARIMA models agreed, showing differences only in the chosen regression effects

40 Easter Effects 76% (222) Easter effect agreement »13% (37) both programs chose an Easter effect »63% (185) neither program chose an Easter effect 24% (71) Easter effect disagreement »24% (70) X-12-ARIMA chose Easter and TRAMO did not »0.3% (1) TRAMO chose Easter and X-12-ARIMA did not

41 Why Does X-12-ARIMA Include Easter Effects More Often? TRAMO checks for an Easter effect of one length X-12-ARIMA checks for three different lengths Do more possible regressors raise the chance of including an Easter effect?

42 Are the Easter Effects Appropriate? These economic series could indeed have Easter effects, but the results for X-12-ARIMA show Easter effects to be more prevalent than we would have expected

43 Trading-day Effects 57% (166) trading day agreement »24% (70) both programs chose trading-day effects »33% (96) neither program chose trading-day effects 43% (127) trading day disagreement »35% (104) X-12-ARIMA chose trading-day effects and TRAMO did not »8% (23) TRAMO chose trading-day effects and X- 12-ARIMA did not

44 Appropriate Trading-day Effects Under specific conditions, we can evaluate whether a trading-day effect was missed »One model includes a trading-day effect but the other does not »The model with a trading-day effect has no spectrum peak at either trading-day frequency, but the model without a trading-day effect results in a peak at one or both of the trading-day frequencies

45 Trading Day Omitted 22% (64) of the series had this omission problem »20% (60) TRAMO omission »1% (4) X-12-ARIMA omission Using a binomial distribution, the probability of seeing 60 out of 64 failures for one method if the probability of failure were equally 0.5 for each method is less than 0.01

46 Spectrum Frequency Choice Some statisticians prefer to use only the first trading-day frequency of the spectrum, so we calculated those omissions 21% (61) omissions »20% (58) TRAMO omissions »1% (3) X-12-ARIMA omissions

47 Ljung-Box Q Model Failures 24% (69) one model passed and the other model failed »17% (50) TRAMO model failed »6% (19) X-12-ARIMA model failed Binomial probability that 50 of 69 failures would be from one method is less than 0.01

48 Seasonal Spectrum Model Failures 14% (41) one model passed and the other model failed »8% (24) TRAMO model failed »6% (17) X-12-ARIMA model failed Binomial probability of 24 of 41 failures being from one method is not significant at the 10% level, so there was no significant difference in the seasonal spectrum results

49 Combined Diagnostic Failures 30% (87) one model passed and the other failed 21% (61) TRAMO model failed 9% (26) X-12-ARIMA model failed Binomial probability of 61 of 87 failures being from one method is less than 0.01

Simulated Time Series

51 Airline Model Series (0 1 1)(0 1 1) 3,500 monthly series 15 years long Nonseasonal moving average coefficient 0.6 Seasonal moving average coefficient 0.9 Start date 1980 (arbitrary choice)

52 No Model 0.6% (21) X-12-ARIMA did not choose a model TRAMO identified a model for each series

53 Fully Correct Model Identification Airline model with no trading day or Easter effects 66% (2,305) TRAMO correct 72% (2,516) X-12-ARIMA correct

54 Correct ARIMA Identification Also identified trading day or Easter effects 85% (2,978) TRAMO correct ARIMA 90% (3,159) X-12-ARIMA correct ARIMA

55 Nonseasonal Differencing 99% (3,480) TRAMO chose nonseasonal difference of order 1 99% (3,466) X-12-ARIMA chose nonseasonal difference of order 1

56 Seasonal Differencing 97% (3,378) TRAMO chose seasonal differencing of order 1 99% (3,470) X-12-ARIMA chose seasonal differencing of order 1

57 Easter Effect Identification 4% (148) TRAMO chose Easter effect 11% (392) X-12-ARIMA chose Easter effect No Easter effect present Binomial probability is less than 0.01 that we would see such a difference assuming equal probabilities of selection

58 Trading-day Effect Identification 13% (460) TRAMO chose trading-day effect 4% (138) X-12-ARIMA chose trading-day effect Binomial probability is less than 0.01 that we would see such a difference assuming equal probabilities of selection

59 Conclusions X-12-ARIMA mistakenly chooses an Easter effect more often than TRAMO As noted in Farooque, Hood, and Findley (2001), X-12-ARIMA still seems to choose trading-day effects more appropriately than TRAMO

60 Conclusions For known airline model simulations, X- 12-ARIMA performed as well as TRAMO in identifying the ARIMA model X-12-ARIMA models performed as well as TRAMO when measured by the standard model diagnostics »Ljung-Box Q »Spectrum of the model residuals

61 Newer Version of X-12-ARIMA We now have an improved version of X-12-ARIMA and hope to rerun the model identification to see if there are any changes to these results

62 Future Work Expand the study of simulated series to perform a more thorough evaluation of X-12-ARIMAs new automatic modeling procedure using more varied models, model coefficients, regression effects, and series lengths Investigate how to improve the selection of the Easter effect

63 Disclaimer This report is released to inform interested parties of ongoing research and to encourage discussion of work in progress. Any views expressed on statistical, methodological, technical, or operational issues are those of the authors and not necessarily those of the U.S. Census Bureau.

64 Much of the data analysis for this paper was generated using Base SAS® software, SAS/AF® software, and SAS/GRAPH® software, Versions 8 and 9 of the SAS System for Windows. Copyright © SAS Institute Inc. SAS and all other SAS Institute Inc. product or service names are registered trademarks or trademarks of SAS Institute Inc., Cary, NC, USA.

65 We used R to simulate the airline model time series. Additional analysis was performed using Microsoft® Excel Copyright © Microsoft Corporation. We checked our own calculations of the binomial probabilities involving the actual data using the Binomial Calculator at onlinestatbook.com/java/binomialProb.html (home page at onlinestatbook.com), and we used it alone for the comparisons involving the simulated data.