The Web-based Data Collection in the Italian Population and Housing Census Leonardo Tininini and Antonino Virgillito ISTAT Meeting on the Management of.

Presentation transcript:

The Web-based Data Collection in the Italian Population and Housing Census Leonardo Tininini and Antonino Virgillito ISTAT Meeting on the Management of Statistical Information Systems (MSIS 2012) Washington DC - May 21-23, 2012

The Census Web-based Information System
SGR: the Census management system
– assignment of households to enumerators
– monitoring of collection activities, particularly of questionnaires collected in the various possible ways (online, municipal collection centers, post offices, enumerators)
– visualization of some key indicators (a kind of data warehouse on the collection process)
– Census to Local Population Registries comparison and re-alignment
– …
RETE: the online documentation for operators
QPOP: the online questionnaire
– the main topic of this presentation

QPOP: the main requirements
To be used by both citizens (self-compilation) and operators (online data entry)
Tight integration with the SGR Census Management System, in particular with its workflow
Easy to use, fast and scalable
Assisting users in following the correct compilation rules (without bothering them)
Multi-language (Italian, German and Slovenian)
Immediate coding of open questions (textual in the paper version)
Consequence: re-using already available applications is almost impossible

The application design
GUI: JSP pages implementing the graphical user interface. They can be forms for sending data to the server (processed by an action) and/or the results of an action execution.
Actions: Java classes whose execution is triggered by an HTTP call, activated by a form submission on the GUI. They receive data from the HTTP request and execute some server-side processing by calling Services.
Services: Java classes that implement database transactions, realized through sequences of calls to DAOs.
Data Access Objects (DAOs): Java classes that implement the so-called CRUD (Create-Read-Update-Delete) database operations related to one or more domain objects.
Entities: Java classes, each representing the records of one database table.
[Diagram: layers GUI, Actions, Services, DAOs, Entities; frameworks Struts2, Spring, Hibernate]
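The layering described on this slide can be illustrated with a minimal sketch. It is not the QPOP code: every class and method name below is hypothetical, and plain Java with an in-memory map stands in for Struts2, Spring and Hibernate, so that the example stays self-contained.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.Optional;

// All names are hypothetical; plain Java stands in for Struts2, Spring and Hibernate.
final class LayeredDesignSketch {

    /** Entity: one record of a hypothetical "answer" table. */
    static final class AnswerEntity {
        final String questionId;
        final String value;

        AnswerEntity(String questionId, String value) {
            this.questionId = questionId;
            this.value = value;
        }
    }

    /** DAO: CRUD operations on AnswerEntity; an in-memory map replaces the database. */
    static final class AnswerDao {
        private final Map<String, AnswerEntity> table = new HashMap<>();

        void createOrUpdate(AnswerEntity entity) { table.put(entity.questionId, entity); }
        Optional<AnswerEntity> read(String questionId) { return Optional.ofNullable(table.get(questionId)); }
        void delete(String questionId) { table.remove(questionId); }
    }

    /** Service: a sequence of DAO calls that would form one database transaction. */
    static final class QuestionnaireService {
        private final AnswerDao dao;

        QuestionnaireService(AnswerDao dao) { this.dao = dao; }

        void saveAnswers(Map<String, String> answers) {
            // In a real stack the transaction boundaries would wrap this loop.
            for (Map.Entry<String, String> answer : answers.entrySet()) {
                dao.createOrUpdate(new AnswerEntity(answer.getKey(), answer.getValue()));
            }
        }

        Optional<String> answerTo(String questionId) {
            return dao.read(questionId).map(entity -> entity.value);
        }
    }

    /** Action: triggered by a form submission; receives the request data and calls the service. */
    static final class SaveAnswersAction {
        private final QuestionnaireService service;

        SaveAnswersAction(QuestionnaireService service) { this.service = service; }

        String execute(Map<String, String> requestParameters) {
            service.saveAnswers(requestParameters);
            return "success"; // logical name of the page to render next
        }
    }

    public static void main(String[] args) {
        QuestionnaireService service = new QuestionnaireService(new AnswerDao());
        SaveAnswersAction action = new SaveAnswersAction(service);
        String result = action.execute(Map.of("Q1", "yes"));
        System.out.println(result + ": Q1=" + service.answerTo("Q1").orElse("(none)"));
    }
}
```

Each layer only talks to the one directly below it, which is what allows the GUI, the transaction logic and the persistence code to change independently.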

A metadata-driven application
The leading principle: write more metadata, write less (and more generalized) programming code
Metadata to specify the type (single choice, multi-response, textual input, data, etc.) of a question
– Questions sharing the same type are handled by the same pieces of Java code (templates)
– The whole processing chain from HTML forms down to DB records (and vice versa) is handled automatically
Metadata to specify (multi-language) texts in all GUI fragments
BUT ALSO
Metadata to specify question routing
– Based on the concept of Questionnaire Graph
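As an illustration of "one template per question type", the sketch below dispatches validation on a type stored in question metadata. The names (`QuestionMeta`, `Template`, `validate`) and the validation rules are assumptions made for this example; the slides do not show how QPOP templates are actually coded.

```java
import java.util.EnumMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch: question metadata selects the generic code (template) to run.
final class MetadataTemplatesSketch {

    enum QuestionType { SINGLE_CHOICE, MULTI_RESPONSE, TEXT }

    /** Metadata describing one question: its id, its type and its allowed options. */
    static final class QuestionMeta {
        final String id;
        final QuestionType type;
        final List<String> options;

        QuestionMeta(String id, QuestionType type, List<String> options) {
            this.id = id;
            this.type = type;
            this.options = options;
        }
    }

    /** A template: code shared by every question of the same type. */
    interface Template {
        boolean isValid(QuestionMeta question, List<String> answer);
    }

    private static final Map<QuestionType, Template> TEMPLATES = new EnumMap<>(QuestionType.class);

    static {
        // Exactly one of the listed options.
        TEMPLATES.put(QuestionType.SINGLE_CHOICE,
                (q, a) -> a.size() == 1 && q.options.containsAll(a));
        // At least one option, all of them listed.
        TEMPLATES.put(QuestionType.MULTI_RESPONSE,
                (q, a) -> !a.isEmpty() && q.options.containsAll(a));
        // One non-blank free-text value.
        TEMPLATES.put(QuestionType.TEXT,
                (q, a) -> a.size() == 1 && !a.get(0).trim().isEmpty());
    }

    /** No per-question code: the metadata alone decides which template runs. */
    static boolean validate(QuestionMeta question, List<String> answer) {
        return TEMPLATES.get(question.type).isValid(question, answer);
    }

    public static void main(String[] args) {
        QuestionMeta gender = new QuestionMeta("Q1", QuestionType.SINGLE_CHOICE, List.of("1", "2"));
        System.out.println(validate(gender, List.of("2")));
        System.out.println(validate(gender, List.of("1", "2")));
    }
}
```

Adding a new question then means adding a metadata row, not new Java code; new code is needed only when a new question type appears.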

The Questionnaire Graph (QG)
The basic idea: formally modeling the structure of the questionnaire and the correct set and sequence of questions to be filled in by respondents
A Questionnaire Graph (QG) in QPOP is a Directed Acyclic Graph (DAG), such that:
– Nodes N_i are in 1-1 correspondence with the questionnaire fragments (mainly questions, but not only)
– Node types correspond to templates (which in turn determine appearance and behavior)
– Edge labels represent conditions on questions (e.g. “Has the respondent checked option 2 of question X?”)
– A (directed) labeled edge from node (question) N_i to node (question) N_j means that the user has to respond to question N_j after having responded to N_i, if the condition expressed on the edge label is true
The QG is used by the application (on both client and server side) to enable and disable questions on the web page and to validate the user’s input before saving the answers in the microdata tables, i.e. to enforce consistency
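The routing idea can be sketched as a small graph traversal. This is an assumption-laden illustration, not the QPOP implementation: the class name, the representation of edge labels as predicates over the current answers, and the rule "a question is enabled if it is reachable from the start node through edges whose condition is true" are all choices made for the example.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Deque;
import java.util.LinkedHashMap;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.function.Predicate;

// Hypothetical sketch of a Questionnaire Graph with conditions on its edges.
final class QuestionnaireGraphSketch {

    /** Labeled edge: 'target' has to be answered when 'condition' holds on the answers. */
    static final class Edge {
        final String target;
        final Predicate<Map<String, String>> condition;

        Edge(String target, Predicate<Map<String, String>> condition) {
            this.target = target;
            this.condition = condition;
        }
    }

    private final Map<String, List<Edge>> outgoing = new LinkedHashMap<>();

    void addEdge(String from, String to, Predicate<Map<String, String>> condition) {
        outgoing.computeIfAbsent(from, k -> new ArrayList<>()).add(new Edge(to, condition));
        outgoing.computeIfAbsent(to, k -> new ArrayList<>());
    }

    /** Breadth-first visit that follows only the edges whose condition is true. */
    Set<String> enabledNodes(String start, Map<String, String> answers) {
        Set<String> enabled = new LinkedHashSet<>();
        Deque<String> queue = new ArrayDeque<>();
        enabled.add(start);
        queue.add(start);
        while (!queue.isEmpty()) {
            String node = queue.remove();
            for (Edge edge : outgoing.getOrDefault(node, List.of())) {
                if (edge.condition.test(answers) && enabled.add(edge.target)) {
                    queue.add(edge.target);
                }
            }
        }
        return enabled;
    }

    /** Validation before saving: only enabled questions may carry an answer. */
    boolean isConsistent(String start, Map<String, String> answers) {
        return enabledNodes(start, answers).containsAll(answers.keySet());
    }

    /** Toy questionnaire: Q1 routes to Q2 or Q3, and both lead to Q4. */
    static QuestionnaireGraphSketch sample() {
        QuestionnaireGraphSketch graph = new QuestionnaireGraphSketch();
        graph.addEdge("Q1", "Q2", a -> "yes".equals(a.get("Q1")));
        graph.addEdge("Q1", "Q3", a -> "no".equals(a.get("Q1")));
        graph.addEdge("Q2", "Q4", a -> true);
        graph.addEdge("Q3", "Q4", a -> true);
        return graph;
    }

    public static void main(String[] args) {
        System.out.println(sample().enabledNodes("Q1", Map.of("Q1", "yes")));
        System.out.println(sample().isConsistent("Q1", Map.of("Q1", "yes", "Q3", "x")));
    }
}
```

The same traversal can serve both purposes named on the slide: on the client it decides which questions to enable, and on the server it rejects submissions that answer a question the routing has switched off.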

From the questionnaire to the QG

En(Dis-)abling questions by updating QG node states (1)

En(Dis-)abling questions by updating QG node states (2)

En(Dis-)abling questions by updating QG node states (3)

The search engine for assisted coding
Example search string (containing a typing error): MATHEAMTICS DEGREE
Reference dictionary:
– Degree in Astronomy
– Degree in Chemistry
– Degree in Mathematics
– Degree in Physics
Which dictionary item should be proposed?

Reference dictionary pre-processing
1. Character normalization: accented letters are replaced with the corresponding unaccented version, uppercase letters with lowercase ones, and other characters like punctuation marks are removed.
2. Stopword removal: “useless” words are removed from the character-normalized version of the items produced in the previous step. Both “general” stopwords (like conjunctions, articles, etc.) and “context-specific” ones (e.g. the word “degree”, when considering a list of academic degrees) are removed.
3. Search terms extraction and weighting: the single terms (words) constituting the normalized items produced by the previous two steps are extracted and stored in the search engine DB tables. A weight is also assigned to each term, depending on its relative frequency inside the dictionary.
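The three pre-processing steps can be sketched as follows. The method names are hypothetical, punctuation is replaced by a space rather than deleted outright, and the weighting formula `log(1 + N / df)` (N dictionary items, df items containing the term) is an assumption: the slides only say that the weight depends on the term's relative frequency, and this sketch assumes that rarer terms should weigh more.

```java
import java.text.Normalizer;
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;
import java.util.TreeMap;

// Hypothetical sketch of the dictionary pre-processing steps 1-3.
final class DictionaryPreprocessingSketch {

    /** Step 1: strip accents, lowercase, turn punctuation into single spaces. */
    static String normalize(String text) {
        String decomposed = Normalizer.normalize(text, Normalizer.Form.NFD);
        return decomposed.replaceAll("\\p{M}", "")        // drop combining accent marks
                .toLowerCase(Locale.ROOT)
                .replaceAll("[^\\p{L}\\p{Nd}]+", " ")     // anything but letters and digits
                .trim();
    }

    /** Steps 1-2: normalized words of an item, without stopwords. */
    static List<String> terms(String item, Set<String> stopwords) {
        List<String> result = new ArrayList<>();
        for (String word : normalize(item).split(" ")) {
            if (!word.isEmpty() && !stopwords.contains(word)) {
                result.add(word);
            }
        }
        return result;
    }

    /** Step 3: one weight per term; assumed formula log(1 + N / df). */
    static Map<String, Double> termWeights(List<String> items, Set<String> stopwords) {
        Map<String, Integer> itemsContaining = new TreeMap<>();
        for (String item : items) {
            // A set, so that a term repeated inside one item is counted once.
            for (String term : new HashSet<>(terms(item, stopwords))) {
                itemsContaining.merge(term, 1, Integer::sum);
            }
        }
        Map<String, Double> weights = new TreeMap<>();
        for (Map.Entry<String, Integer> entry : itemsContaining.entrySet()) {
            weights.put(entry.getKey(), Math.log(1.0 + (double) items.size() / entry.getValue()));
        }
        return weights;
    }

    public static void main(String[] args) {
        Set<String> stopwords = Set.of("degree", "in");
        List<String> dictionary = List.of(
                "Degree in Mathematics", "Degree in Physics", "Degree in Mathematical Physics");
        System.out.println(terms("Degree in Mathematics", stopwords));
        System.out.println(termWeights(dictionary, stopwords));
    }
}
```

In the system described on the slide the extracted terms and weights are stored in database tables; the maps above only stand in for that storage.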

Search string processing
4. Similarity search: each (normalized) term to be searched is compared with those in the database; the terms that produce a similarity above a given (relatively high) threshold are passed to the following step.
5. Extraction of the dictionary items: the dictionary items containing one or more terms obtained in the previous step are extracted from the DB. At the same time, for each item extracted, some values are either read or computed, which will be used in the following step.
6. Dictionary item sorting: by using the values extracted/computed in the previous step, the score of each item in the result set is computed and the list is sorted accordingly in descending order. This sorted list is proposed to the respondent.
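A minimal sketch of steps 4 to 6 follows. The slides name neither the similarity measure nor the scoring formula, so two assumptions are made here and should be read as such: similarity is `1 - levenshtein / longerLength`, and an item's score is the sum of the best similarities of the query terms that pass the threshold. For brevity the sketch scans the dictionary items directly instead of querying term tables, and all names are hypothetical.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;
import java.util.stream.Collectors;

// Hypothetical sketch of similarity search, item extraction and sorting.
final class AssistedCodingSearchSketch {

    /** Classic dynamic-programming Levenshtein edit distance. */
    static int levenshtein(String a, String b) {
        int[] previous = new int[b.length() + 1];
        for (int j = 0; j <= b.length(); j++) previous[j] = j;
        for (int i = 1; i <= a.length(); i++) {
            int[] current = new int[b.length() + 1];
            current[0] = i;
            for (int j = 1; j <= b.length(); j++) {
                int cost = a.charAt(i - 1) == b.charAt(j - 1) ? 0 : 1;
                current[j] = Math.min(Math.min(current[j - 1] + 1, previous[j] + 1),
                        previous[j - 1] + cost);
            }
            previous = current;
        }
        return previous[b.length()];
    }

    /** Assumed similarity measure: 1.0 for equal strings, lower as edits increase. */
    static double similarity(String a, String b) {
        int longest = Math.max(a.length(), b.length());
        return longest == 0 ? 1.0 : 1.0 - (double) levenshtein(a, b) / longest;
    }

    /** Lowercased words of a text, without stopwords (accent stripping omitted here). */
    static List<String> terms(String text, Set<String> stopwords) {
        return Arrays.stream(text.toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{Nd}]+"))
                .filter(t -> !t.isEmpty() && !stopwords.contains(t))
                .collect(Collectors.toList());
    }

    /** Items with at least one term similar enough to a query term, best score first. */
    static List<String> search(String query, List<String> dictionary,
                               Set<String> stopwords, double threshold) {
        Map<String, Double> scores = new LinkedHashMap<>();
        for (String item : dictionary) {
            double score = 0.0;
            for (String queryTerm : terms(query, stopwords)) {
                double best = 0.0;
                for (String itemTerm : terms(item, stopwords)) {
                    best = Math.max(best, similarity(queryTerm, itemTerm));
                }
                if (best >= threshold) score += best;   // step 4: keep only close matches
            }
            if (score > 0.0) scores.put(item, score);    // step 5: extract matching items
        }
        List<String> result = new ArrayList<>(scores.keySet());
        result.sort(Comparator.comparingDouble((String item) -> scores.get(item)).reversed());
        return result;                                   // step 6: descending score
    }

    public static void main(String[] args) {
        List<String> dictionary = List.of("Degree in Astronomy", "Degree in Chemistry",
                "Degree in Mathematics", "Degree in Physics");
        System.out.println(search("MATHEAMTICS DEGREE", dictionary, Set.of("degree", "in"), 0.8));
    }
}
```

With the misspelled string from the earlier slide, "matheamtics" is two edits away from "mathematics" (similarity 9/11), so only "Degree in Mathematics" clears a threshold of 0.8.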

QPOP in (a few) figures

Future (current) work
Questionnaires for the Industry and Services Census comprising:
– Businesses (2 “fairly similar” questionnaires implemented as one with “special” routing conditions)
– Non-profit institutions
More general question templates
More general checks and routing conditions (support for existential and universal quantifications, as well as counting)