Sébastien CHAMI 5 May, 2010 Reengineering French structural business statistics An extended use of administrative data.

Slides:



Advertisements
Similar presentations
Statistics NZs experience in using Administrative Data in an Integrated Programme of Economic Vince Galvin General Manager Strategy & Communications.
Advertisements

Paul Smith Office for National Statistics
Monitoring year-to-year variation in structural business statistics Contribution to Q2008 – Rome, 9 July 2008 Session: Editing and Imputation I
United Nations Statistics Division/DESA
Using Business Taxation Data as Auxiliary Variables and as Substitution Variables in the Australian Bureau of Statistics Frank Yu, Robert Clark and Gabriele.
Christine PINEL Insee, France July, 9th 2008 Implementation of the NACE Rev.2 in the French structural business statistics and data back-casting method.
Editing and Imputing VAT Data for the Purpose of Producing Mixed- Source Turnover Estimates Hannah Finselbach and Daniel Lewis Office for National Statistics,
The Many Ways of Improving the Industrial Coding for Statistics Canada’s Business Register Yanick Beaucage ICES III June 2007.
Chapter 10 Section 2 Hypothesis Tests for a Population Mean
BPT2423 – STATISTICAL PROCESS CONTROL
Analysis of Individual Variables Descriptive – –Measures of Central Tendency Mean – Average score of distribution (1 st moment) Median – Middle score (50.
Sébastien FAIVRE INSEE Workshop on scanner data, Stockholm, /06/2012 Would scanner data improve the French CPI?
Overview of CSO Business Demography release Workshop on Business Demography and Job Churn statistics Dublin Castle, May 12 th 2011 Jillian Delaney.
United Nations Statistics Division/DESA International Recommendations for the Index of Industrial Production (IIP)
United Nations Economic Commission for Europe Statistical Division Applying the GSBPM to Business Register Management Steven Vale UNECE
Role of editing and imputation in integration of sources for structural business statistics Svein Gåsemyr, Statistics Norway Svein Nordbotten, University.
Eurostat Repeated surveys. Presented by Eva Elvers Statistics Sweden.
Eurostat Statistical Data Editing and Imputation.
Administrative Data at Statistics Canada – Current Uses and the Way Forward 27 th Voorburg Group Meeting Warsaw, Poland André Loranger October 4, 2012.
Chapter 13: Inference in Regression
Carmela Pascucci – Istat - Italy Meeting of the Working Party on International Trade in Goods and Trade in Services Statistics (WPTGS) Linking business.
Electronic reporting in Poland 27th Voorburg Group Meeting Warsaw, Poland October 1st to October 5th, 2012 Central Statistical Office of Poland.
Rudi Seljak, Metka Zaletel Statistical Office of the Republic of Slovenia TAX DATA AS A MEANS FOR THE ESSENTIAL REDUCTION OF THE SHORT-TERM SURVEYS RESPONSE.
12th Meeting of the Group of Experts on Business Registers
Integrating administrative and survey data in the new Italian system for SBS: quality issues O. Luzi, F. Oropallo, A. Puggioni, M. Di Zio, R. Sanzo Nurnberg,
Improving the Design of UK Business Surveys Gareth James Methodology Directorate UK Office for National Statistics.
Quality issues on the way from survey to administrative data: the case of SBS statistics of microenterprises in Slovakia Andrej Vallo, Andrea Bielakova.
European Conference on Quality in Official Statistics Roma, July 8-11, 2008 New Sampling Design of INSEE’s Labour Force Survey Sébastien Hallépée Vincent.
Implementation of quality indicators in the Finnish statistics production process Kari Djerf Statistics Finland Q2008, Rome Italy.
The application of selective editing to the ONS Monthly Business Survey Emma Hooper Office for National Statistics
United Nations Statistics Division/DESA International Recommendations for the Index of Industrial Production (IIP)
Statistics Portugal « (Quality Rome, 10 July 2008) « Simplified Business Information: « Improving quality by using administrative data in Portugal.
AEG recommendations on Non-life insurance services (Issue 5) Workshop on National Accounts December 2006, Cairo 1 Gulab Singh UN STATISTICS DIVISION.
15 may 2008 The validation of the french population census Olivier Lefebvre, Insee.
Comp. Genomics Recitation 3 The statistics of database searching.
Data Triangulation Applications in Population and Health Programs- -The Global Experience.
1 Chapter 7 Sampling Distributions. 2 Chapter Outline  Selecting A Sample  Point Estimation  Introduction to Sampling Distributions  Sampling Distribution.
New sources – administrative registers Genovefa RUŽIĆ.
Methodology of Allocating Generic Field to its Details Jessica Andrews Nathalie Hamel François Brisebois ICESIII - June 19, 2007.
Philippe Brion Johara Khélif Insee 28/04/2014 Questions raised by the implementation of the data editing device for French structural business statistics.
Outlining a Process Model for Editing With Quality Indicators Pauli Ollila (part 1) Outi Ahti-Miettinen (part 2) Statistics Finland.
Olivier Haag Helsinki - Q2010 Reengineering French structural business statistics: redesign of the annual survey.
Data sources of the EuroGroups Register Presentation by Eurostat
Copyright 2010, The World Bank Group. All Rights Reserved. Business demography, part 1 Dynamics of the population of businesses 1 Business statistics and.
Module Three: Identifying your Patient in SIS. Introduction – Search for 1 st T Specimen The Search for 1 st T Specimen screen is used to access your.
2.1 Factorial Notation (Textbook Section 4.6). Warm – Up Question  How many four-digit numbers can be made using the numbers 1, 2, 3, & 4?  (all numbers.
Study of Editing and Imputation Practices at Statistics Finland Janika Konnu and Pauli Ollila Statistics Finland Q2010: Editing session Wednesday 5 th.
Evaluating the benefits of using VAT data to improve the efficiency of editing in a multivariate annual business survey Daniel Lewis.
Ph. Brion Insee Redesigning French structural business statistics: first methodological studies Bonn, september 2006.
Administrative Data at Statistics Canada – Current Uses and the Way Forward Wesley Yung and Peter Lys, Statistics Canada.
FDI - Imputation. Overview Introduction Overview of Imputation Methods Overview of Outliering methods Overview of Estimation methods Aggregation Disclosure.
F6 Taxation (UK). Section A: The UK tax system Section B: Income tax liabilities Section C: Chargeable gains Section D: Corporation tax liabilities Section.
Redesigning French structural business statistics, using more administrative data ICESIII, Montréal, june 2007.
Rudi Seljak, Aleš Krajnc
Re-engineering the French Business Register ( )
Chapter 1 The Role Of Accounting.
Assessment of National Accounts Compilation in the GCC Countries Giovanni Savio, Statistics Division, UN-ESCWA High level seminar on the implementation.
A new fantastic source for updating the Statistical Business Register
Estimation techniques for missing intra-EU trade
A New Business Statistics in Finland - Quarterly Investments
Issues in Administrative Data
The French longitudinal database of enterprise groups
PRODCOM SURVEY IN THE UNITED KINGDOM
Prodcom ESTP course October 2010
DEVELOPMENT OF IMPUTATION MODEL FOR SMALL ENTERPRISES
RENATA TUMĖNIENĖ Chief specialist Enterprises statistics division
The Swedish survey on turnover in the service sector
Task Force on Small and Medium Sized Enterprise Data (SMED)
Methodological questions raised by the combined use of administrative and survey data for the French structural business statistics Work session on statistical.
Presentation transcript:

Sébastien CHAMI 5 May, 2010 Reengineering French structural business statistics An extended use of administrative data

Page 2 An extended use of administrative data Sébastien ChamiHelsinki-Q May, 2010 Outlines › 1- Presentation of administrative tax data › 2- Links between the statistical register and tax data › 3- Tax data micro-editing › 4- Selective data editing and manual review of tax data

Page 3 An extended use of administrative data Sébastien ChamiHelsinki-Q May, 2010 The administrative tax data › Businesses are taxed on their profit › To determine this tax, they send annually a return to the administration based on their accounts › The tax administration cedes back to Insee this collected tax data from which structural business statistics are derived

Page 4 An extended use of administrative data Sébastien ChamiHelsinki-Q May, 2010 The diversity of tax forms › There are several types of tax forms depending on the sector of activity and the size of the company › The sector of activity determines the type of profits subject to taxation –Agricultural profits (out of the scope of SBS) –Industrial and business profits –Non-commercial profits › The size determines the system of taxation –Normal system for big units –Simplified system for small units –Extremely simplified system for very small units

Page 5 An extended use of administrative data Sébastien ChamiHelsinki-Q May, 2010 A detailed set of data › Tax returns are very detailed forms with lots of characteristics : more than 1000 overall › Some are common in every forms, some are specific › 250 characteristics of interest have been chosen within these 1000 to meet our statistical purpose › Note : the return for micro-businesses consists in a simple declaration of turnover => No tax form and no data for them

Page 6 An extended use of administrative data Sébastien ChamiHelsinki-Q May, 2010 An highly prescriptive accounting › As explained before, tax returns are taken from the companies accounts › The French accounting standard is very prescriptive › Consequences : the information provided by the tax form is, with few exceptions, very consistent : –One characteristic represents the same concept in every return – The value of this characteristic has been determined with the same accounting method for everyone

Page 7 An extended use of administrative data Sébastien ChamiHelsinki-Q May, 2010 Flexible rules for accounting periods › A major problem for the homogeneity of our statistics is the accounting period on which is based the tax form › Two constraints : –At least one accounting period-ends each calendar year –Continuity of accounting periods : neither overlap nor gap

Page 8 An extended use of administrative data Sébastien ChamiHelsinki-Q May, 2010 From the accounting period to the reference period › Our statistics are based on the calendar year thus we need a restatement to obtain an homogeneous period for the tax returns –Choice of the accounting period with the most common months with the calendar year –Estimating to 12 months the tax forms with an accounting period different from 12 months (births and deaths excluded)

Page 9 An extended use of administrative data Sébastien ChamiHelsinki-Q May, 2010 From the accounting period to the reference period N-1NN+1 Year N Birth No change Year N+1 Year N-1 Year N and 12 months rectifying Death No change

Page 10 An extended use of administrative data Sébastien ChamiHelsinki-Q May, 2010 The statistical register (Ocsane) › The scope covered by our statistics is defined by a statistical register (Ocsane) › Before editing tax data, it is essential to match the tax forms sent to us with the units of our register in order to have the fundamental rule satisfied at the end of our process : 1 unit of the register = 1 return (either collected, derived from collected or imputed) › This is done in 3 steps : –Matching the administrative returns with our register –Dealing with multiple returns –Imputing a return for units that do not have one

Page 11 An extended use of administrative data Sébastien ChamiHelsinki-Q May, 2010 The identification of returns › Tax administration uses a different id number (IFRP) than ours (“Siren” number) but it has the “Siren” number in its own database as a simple characteristic › They have made a strong effort for several years to improve the quality of this “Siren” number  about 98% of tax forms have a correct Siren number that can be found in our register › With the remaining 2% : –If turnover >= 50M € : manual review to find a matching unit in the register (1000 matches per year this way) –If turnover< 50M € : return is discarded

Page 12 An extended use of administrative data Sébastien ChamiHelsinki-Q May, 2010 Multiple tax returns Year N “Paste” 2 nd return Consecutive periods 1 st return merger Correction 2 nd return 1 st return 2 nd return 1 st return 2 nd return : non-com. profits 1 st return : I&B profits Multiple types of profit Consolidation

Page 13 An extended use of administrative data Sébastien ChamiHelsinki-Q May, 2010 Imputed data › units of our register have no tax return : – units because the administration did not send us their forms (tax audits for example) or did it too late – micro-businesses › They represent about 20% of the units of the register but only 3% of the total turnover

Page 14 An extended use of administrative data Sébastien ChamiHelsinki-Q May, 2010 Imputed data Three methods are used to impute data : › If a non-imputed return is available from the year before, the new return is the previous one inflated by a median evolution of turnover of the company sector › Otherwise, the return for the current year is imputed as an average return of its sector and size class › Micro-businesses are imputed in a similar way as the second method but with a specific structure of accounts

Page 15 An extended use of administrative data Sébastien ChamiHelsinki-Q May, 2010 Data are strongly constrained by mathematical relationships › Accounting data is very redundant › The 250 characteristics of interest are linked with nearly 100 relations of types : X = X1+X2+ …. +Xn or X = Y - Z

Page 16 An extended use of administrative data Sébastien ChamiHelsinki-Q May, 2010 Micro-editing methods › Micro-editing is mainly based on the constraints exposed before. › There are two methods : X X1 X2 Unsatisfied relation X=X1+X2 Error X Micro editing 1 : Shaping the breakdown X1 X2 Micro editing 2 : Recalculation of the total X X1 X2

Page 17 An extended use of administrative data Sébastien ChamiHelsinki-Q May, 2010 Selective editing process › A selective editing process is then implemented to determine the most influential companies for our statistic aggregates › The main principle is to calculate an aggregate ratio (i.e. the ratio of 2 aggregates) with and without each company : influential data corresponds to the highest difference between the two ratios › The influences on the different characteristics are synthesized in one score. By setting thresholds for this score, one defines which companies have to be manually reviewed

Page 18 An extended use of administrative data Sébastien ChamiHelsinki-Q May, 2010 Human reviewing of tax data › The most influential selected tax returns are then submitted to a staff of clerks for reviewing › A specific software has been developed in order to achieve this review › This software presents to the clerks for each return they must review : –The list of the characteristics that need to be reviewed (i.e. the characteristics that this return influences the most) –The values before editing and after editing of these characteristics –The values of year N, N-1, N-2 of these characteristics –The errors before micro-editing involving these characteristics › With this information the clerks then recall the company to validate or modify the values of the selected characteristics