Eurostat Web activity evidence to increase timeliness of official statistics IAOS 2014 8 – 10 October.

Slides:



Advertisements
Similar presentations
Establishing a New Accreditation Program in the U.S.
Advertisements

8 th OG Meeting, BAKU Chapter 9: Data Dissemination Mr. Robert Maluta Kwinda Deputy Director.
Will ‘big data’ transform official statistics?
Data lifecycle Data Management & Survey Conception
Google Analytics Tool for the Future. Web Analytics Web analytics are the cornerstone of online marketing efforts and campaigns. The efficient utilization.
WILL BIG DATA CHANGE EVERYTHING IN ACCOUNTING AND AUDITING? Miklos A. Vasarhelyi Rutgers University.
Barteld Braaksma and Kees Zeelenberg “Re-make / Re-model”: Should big data change the modelling paradigm in official statistics?
Big Data and Predictive Analytics in Health Care Presented by: Mehadi Sayed President and CEO, Clinisys EMR Inc.
WebDataNet Conference 2015 Salamanca, 26th – 28th May 2015
Big Data at Eurostat and the ESS
Google Flu Trends Terminology –Influenza = flu –ILI = influenza like illness CDC ILI time series –Weekly –1-2 week publication lag Predicting it using.
OECD Short-Term Economic Statistics Working PartyJune Analysis of revisions for short-term economic statistics Richard McKenzie OECD OECD Short.
United Nations Economic Commission for Europe Statistical Division Big Data International Cooperation Steven Vale UNECE
ESTAT International Seminar on Modernizing Official Statistics: Meeting Productivity and New Data Challenges Tianjin, People’s Republic of China
1 Vision Infrastructure Project (VIP) Enhanced Dissemination Chain 2 nd SISAI meeting, June 2012 B4 – IT for statistical production Unit B5 – Management.
Introducing NICE... Gateshead Council Gillian Mathews Implementation consultant - north.
Basic Marketing Research Customer Insights and Managerial Action
Influenza Surveillance at IRID Immunization and Respiratory Infections Division Centre for Infectious Disease Prevention & Control Public Health Agency.
STC - Statistics Accessibility and Presentation Group (STAP) Communication of Statistics Joint ECB NBRM Seminar on Statistics Skopje, 4 October 2013 ECB-RESTRICTED.
Susan McHenry NEMSIS, Research Agenda National Association of State EMS Officials.
Cognitive Interviewing for Question Evaluation Kristen Miller, Ph.D. National Center for Health Statistics
Google Apps for Education WCPS Summer Institute 2011.
Session 5: Flash Estimates of Gross Domestic Product Introduction.
Big Data Activities at Eurostat Workshop on Statistical Data Collection, 29 Apr – 1 May 2015, Washington D.C, USA
4 May 2010 Towards a common revision for European statistics By Gian Luigi Mazzi and Rosa Ruggeri Cannata Q2010 European Conference on Quality in Official.
Sore throat? Sniffles?Sore throat? Sniffles?  Google it! Duh!  During flu season, more people enter search queries concerning the flu.  Each year 90.
1 DG Enterprise & Industry European Commission Conference on Better Regulation: Practical Steps Forward Reykjavík 6 June 2006 OVERVIEW OF THE BETTER REGULATION.
Query trends CS 349 Presentation December 2 nd, 2008 Catherine Grevet.
1 1 Resources and Funding of Official Statistics Olav Ljones SADC Work Shop 2- 6 Dec 2006, Luanda.
Introduction for Basic Epidemiological Analysis for Surveillance Data National Center for Immunization & Respiratory Diseases Influenza Division.
Eurostat WebDataNet Conference 2015 Salamanca, 26 th – 28 th May 2015 Fernando Reis, Big Data Task-Force European Commission (Eurostat) Web activity evidence.
Putting the Public Back into Public Health. FNY ORIENTATION Introduction to FNY Dashboard Analytics Reporting Requirements FNY Toolkit & Resources Questions.
Implementation of the European Statistics Code of Practice Yalta September 2009 Pieter Everaers, Eurostat.
Quality requirements EMOS RESEARCH TO COLLECT EVIDENCE BEFORE ANYTHING Benchmark against good existing model, e.g. European Masters in Translation, European.
© 2009 IBM Corporation Smarter Decisions for Optimized Performance IBM Global Executive Forum Panel Discussion Business Analytics and Optimization Fred.
Bringing Together the Social and Technical in Big Data Analytics: Why You Can't Predict the Flu from Twitter, and Here's How David A. Broniatowski Asst.
SEO for Google in Hello I'm Dave Taylor from Webmedia.
WEEK 4 Job Search e-Portfolio: An Art of Self-Promotion.
Some Final Material. GOOGLE FLU TRENDS Sore throat? Sniffles? Google it! Duh! During flu season, more people enter search queries concerning the flu.
Big Data activities at SURS Statistical Office of the Republic of Slovenia DIME/ITDG meeting, February 2016.
By, CA K RAGHU, PAST PRESIDENT – INSTITUTE OF CHARTERED ACCOUNTANTS OF INDIA.
June 2009 Regulation on pesticide statistics Pierre NADIN ESTAT E1- Farms, agro-environment and rural development
Dr. Silvia Bidart Coordinator Session 9 Consulting Services and Research Studies Libon, Portugal March 16th 2014.
Marketing Research. Good marketing requires much more than just creativity and technical tools. It requires research! Who needs it? Who wants it? Where.
DIRECTORATE GENERAL ECONOMICS, RESEARCH AND STATISTICS Forecasting Tourist Inflows Through Google use Concha Artola Economic Analysis and Forecasting General.
Life circumstances and service delivery Community survey Finalise pilot survey (June 2006) List of dwellings completed (September 2006) Processes, systems.
1 Recent developments in quality related matters in the ESS High level seminar for Eastern Europe, Caucasus and Central Asia countries Claudia Junker,
1 1 Accounting The Business Process Dr Clive Vlieland-Boddy.
Introducing Precictive Analytics
Big Data, Analytics, and Modeling at Pitt Public Health
CALIFORNIA STATE UNIVERSITY, SACRAMENTO
Big Data ESSNet: Web Scraping for Job Vacancy Statistics Nigel Swier UK Office for National Statistics.
GDP growth estimates for Europe at 30 days
Dissemination of experimental statistics
SAEG 7th June 2016 Item 5.2 Eurostat migration to JD+: state of the art By Dario Buono.
New ways to get the data Multiple mode and big data
Experimental statistics
The ESS.VIP Programme: an update
GDP growth estimates for Europe at 30 days
SAEG 15th March 2018 Item 2.1 Use of By Dario Buono.
Item 8 Cost assessment survey of production of statistics in the ESS
Use of Wikipedia for Statistics on Culture
Sub-Regional Workshop on International Merchandise Trade Statistics Compilation and Export and Import Unit Value Indices 21 – 25 November Guam.
Cost accounting in the ESS
Cost assessment survey of production of statistics in the MSs and EFTA countries Daiva Norkevičienė, Directorate A ItItem 10 June 2016 ESS RDG em 10.
Analyzing social media data to monitor public health trends
Indicators for EU policy making
Ethical Implications of using Big Data for Official Statistics
Business architecture
Presentation transcript:

Eurostat Web activity evidence to increase timeliness of official statistics IAOS – 10 October

Eurostat My definition of big data Data deluge Larger, faster, more (a.k.a. Volume, Velocity, Variety) Everything is data Text, sound, images, video Analytics Predictive analytics Ex: Google translate, voice recognition, suggestions systems, health applications The new data product by excellence Official stat: chances of getting a new job An emergent market

Eurostat ESS Big Data action plan Scheveningen memorandum Action plan adopted by European Statistical System Committee Strategy Pilots, three time horizons roadmap, review as needed Areas Policy, Communication, Big data sources, Applications / pilots, Methods, Quality, IT infrastructure, Skills, Experience sharing, Legislation, Governance action-plan-and-roadmap-10

Eurostat Past experiences 2005: Association between web activity and unemployment identified 2006: Google Trends 2008: Google Flu Trends (GFT) 2009: GFT underestimated official figures 1 st revision of GFT model 2013: GFT overestimated flu peak values 2 nd revision of GFT model 2014: Backlash against big data

Eurostat Data Source: Google Trends (

Eurostat Weekly influenza-like illness (ILI) surveillance and Google Flu Trends (GFT) search query estimates, June 2003–March 2013 Olson DR, Konty KJ, Paladini M, Viboud C, et al. (2013) Reassessing Google Flu Trends Data for Detection of Seasonal and Pandemic Influenza: A Comparative Epidemiological Study at Three Geographic Scales. PLoS Comput Biol 9(10) License: Creative Commons CC0 public domain dedication

Eurostat Weekly influenza-like illness (ILI) surveillance and Google Flu Trends (GFT) search query estimates, June 2003–March 2013 Olson DR, Konty KJ, Paladini M, Viboud C, et al. (2013) Reassessing Google Flu Trends Data for Detection of Seasonal and Pandemic Influenza: A Comparative Epidemiological Study at Three Geographic Scales. PLoS Comput Biol 9(10) License: Creative Commons CC0 public domain dedication

Eurostat Weekly influenza-like illness (ILI) surveillance and Google Flu Trends (GFT) search query estimates, June 2003–March 2013 Olson DR, Konty KJ, Paladini M, Viboud C, et al. (2013) Reassessing Google Flu Trends Data for Detection of Seasonal and Pandemic Influenza: A Comparative Epidemiological Study at Three Geographic Scales. PLoS Comput Biol 9(10) License: Creative Commons CC0 public domain dedication

Eurostat Weekly influenza-like illness (ILI) surveillance and Google Flu Trends (GFT) search query estimates, June 2003–March 2013 Olson DR, Konty KJ, Paladini M, Viboud C, et al. (2013) Reassessing Google Flu Trends Data for Detection of Seasonal and Pandemic Influenza: A Comparative Epidemiological Study at Three Geographic Scales. PLoS Comput Biol 9(10) License: Creative Commons CC0 public domain dedication

Eurostat Source: Financial Times Magazine (2014).

Eurostat Lessons from GFT Premature release of statistical product can harm its reputation Avoid big data hubris Google search algorithms frequent changes impacts validity of models We need transparency and replicability GFT search terms unknown GT is based on a sample which sampling methodology is unknown

Eurostat Other sources of web activity Wikipedia page views Flu Twitter International and internal migration flows Possibly other Visits to particular websites

Eurostat How to introduce web activity data in official flash estimates? Launch a larger scale balanced study Negative results normally are not published Purpose: guide decision on investment

Eurostat How to introduce web activity data in official flash estimates? Diversification and assessment of the web activity data sources NSI lack control of the source Black box Inability to guarantee that there was no manipulation Breaks in series Lack of continuity Diversify the sources Revision of prediction models Accreditation and certification

Eurostat How to introduce web activity data in official flash estimates? Integration of web activity data with traditional official statistics sources Official statistics should not simply reproduce what others can do, but instead do it making use of its specific comparative advantages We are the original producers, we know its details Use more detail than what is published Traditional methods (surveys)

Eurostat How to introduce web activity data in official flash estimates? Research on relation between web activity and the phenomena being predicted Remember lesson from GFT Do not confuse web activity with the phenomenon itself

Eurostat How to introduce web activity data in official flash estimates? Joint effort on the development of appropriate prediction models Learn from each other Transparency International comparability