Creating a National Remote Access System for Register-based Research Marianne Johnson, Statistics Finland Statistical Data Confidentiality Work Session.

Slides:



Advertisements
Similar presentations
15 Maintaining a Web Site Section 15.1 Identify Webmastering tasks Identify Web server maintenance techniques Describe the importance of backups Section.
Advertisements

Meganet Corporation VME Office Meganet Corporation Meganet Corporation is a leading worldwide provider of data security to Governments, Military,
Identification and Disposition of Official University Records University of Texas at Arlington Records Management.
Implementation of the CoP in SLOVENIA Cooperation with data users Genovefa RUŽIĆ Deputy Director-General.
15.1 © 2004 Pearson Education, Inc. Exam Managing and Maintaining a Microsoft® Windows® Server 2003 Environment Lesson 15: Configuring a Windows.
The Social Statistics Database: Invaluable source of micro-data for socio-economic statistics Johan van Rooijen.
SESSION 9 THE INTERNET AND THE NEW INFORMATION NEW INFORMATIONTECHNOLOGYINFRASTRUCTURE.
Designing Security In Web Applications Andrew Tomkowiak 10/8/2013 UW-Platteville Software Engineering Department
Sharepoint Portal Server Basics. Introduction Sharepoint server belongs to Microsoft family of servers Integrated suite of server capabilities Hosted.
Development of Remote Access Systems Tanvi Desai LSE Research Laboratory Data Manager Research Laboratory IASSIST 2008: Stanford.
Luxembourg Income Study (LIS) asbl 17, rue des Pommiers L-2343 Luxembourg –City Tél : +(352) Fax: +(352)
Regional Seminar on Census Data Archiving for Africa, Addis Ababa, Ethiopia, September 2011 Overview of Archiving of Microdata Session 4 United Nations.
RARECARENet project High-resolution study in the Finnish Cancer Registry Maarit Leinonen Chief Medical Officer Finnish Cancer Registry, Helsinki.
Documenting Register Data for Research Purposes Finnish Information Centre for Register Research Marianne Johnson Irma-Leena Notkola
Trimble Connected Community
USE OF LITHUANIAN CLASSIFICATION OF OCCUPATIONS ISCO 88, ISCO 2008 and the Development of the ESeC Regional Meeting, Oslo, 7 June 2005 Violeta Skamarociene.
14 Publishing a Web Site Section 14.1 Identify the technical needs of a Web server Evaluate Web hosts Compare and contrast internal and external Web hosting.
15 Maintaining a Web Site Section 15.1 Identify Webmastering tasks Identify Web server maintenance techniques Describe the importance of backups Section.
Section 15.1 Identify Webmastering tasks Identify Web server maintenance techniques Describe the importance of backups Section 15.2 Identify guidelines.
Statistics Canada’s Real Time Remote Access Solution 2011 MSIS Meeting – Karen Doherty May 2011.
Administrative Registers & Register Based Population Census Sonia Jackson CARICOM Census Symposium Radisson Grenada Beach Resort May 27, 2014 CARICOM Census.
Web Page Design I Basic Computer Terms “How the Internet & the World Wide Web (www) Works”
Digital Filing A Simple Way to Digitally Centralize and Distribute Documents.
What is MediaCAST. MediaCAST is an on-demand learning platform purchased by the CCSD to enhance the delivery of lessons in the classroom. The system provides.
Access to official statistical micro data at the Statistical Office of the Republic of Slovenia and cooperation with the Slovenian Social Science Data.
Frameworks for the Access and Use of Administrative Data, With the Example of Current Practice in the UK Steven Vale Office for National Statistics UK.
Automated (meta)data collection – problems and solutions Grete Christina Lingjærde and Andora Sjøgren USIT, University of Oslo.
Administrative procedures for microdata access at SURS October 2013.
Editing of linked micro files for statistics and research.
The experience of a National Statistical Institute after a law change: Estonia First Regional Workshop Microdata Access in European Countries ― Cooperation.
An Introduction to Networking
Nordic platform for sensitive biomedical data The Tryggve project Antti Pursula
Security fundamentals Topic 5 Using a Public Key Infrastructure.
Coding Compliance Components Writing Custom Policies for Auditing, Expiration and More Jason Morrill Program Manager Windows SharePoint Services.
User Management. User Registration Policy The issues of creation and management often clash in distributed organisations Central creation and management.
DLI and EQUINOX Question 1 How do I find out what survey datasets are available from Statistics Canada ?
Overview and challenges in the use of administrative data in official statistics IAOS Conference Shanghai, October 2008 Heli Jeskanen-Sundström Statistics.
File Transfer And Access (FTP, TFTP, NFS). Remote File Access, Transfer and Storage Networks For different goals variety of approaches to remote file.
19-20 October 2010IT Directors’ Group Meeting 1 Item 3.3.g of the agenda Vision Infrastructure Project on Secure Infrastructure for CONfidential data access.
The overview How the open market works. Players and Bodies  The main players are –The component supplier  Document  Binary –The authorized supplier.
VPN. CONFIDENTIAL Agenda Introduction Types of VPN What are VPN Tokens Types of VPN Tokens RSA How tokens Work How does a user login to VPN using VPN.
1 Development of Cash Benefits Management Information System-CBMIS Sanja Andovska, Conditional Cash Transfers Project.
Researchers’ Usage of Microdata The example of Statistics Finland Advanced presentation – Some additional details Consultation Mission on Promoting the.
Open Science and Research – Services for Research Data Management © 2014 OKM ATT 2014–2017 initiative Licenced under.
Researchers’ Usage of Microdata The example of Statistics Finland Advanced presentation Consultation Mission on Promoting the activity and Creating a positive.
1 (c) 2013 FabSoft. MOST Cloud Service What is a Cloud Service? A cloud service is internet-based, meaning that MOST is hosted on a server farm on the.
Researchers’ Usage of Microdata The example of Statistics Finland Basic presentation Consultation Mission on Promoting the activity and Creating a positive.
Supporting the NHS to deliver better, safer, quality care NHS Connecting for Health.
Store and exchange data with colleagues and team Synchronize multiple versions of data Ensure automatic desktop synchronization of large files B2DROP is.
Audit Trail LIS 4776 Advanced Health Informatics Week 14
3.02H Publishing a Website 3.02 Develop webpages..
WHAT IS A NETWORK TYPES OF NETWORK NETWORK HARDWARE
Chapter 2: System Structures
Karen Dennison Collections Development Manager
Section 15.1 Section 15.2 Identify Webmastering tasks
Unit 27: Network Operating Systems
Marianne Johnson, Researcher. Services
Torben Søborg Theme 4.2 Remote access to Microdata Files from Researchers IT Directors Group 24 and 25 October 2005 Torben Søborg
An Introduction to Computer Networking
Development roadmap of Suomi.fi-services
Other Sources of Information
Microsoft Office Access 2003
Unit 11- Computer Networks
Development roadmap of Suomi.fi-services
Unit# 5: Internet and Worldwide Web
4.02 Develop web pages using various layouts and technologies.
Using advanced IT tools to collect data of small units
Data Security Awareness
Development roadmap of Suomi.fi-services
Presentation transcript:

Creating a National Remote Access System for Register-based Research Marianne Johnson, Statistics Finland Statistical Data Confidentiality Work Session Oct 2015

Finnish administrative registers several comprehensive national registers contain unit level data on individuals, families, housing, enterprises compiled and maintained for administrative or statistical purposes, e.g. –Population Register Centre (VRK) –Population information system –Social Insurance Institution (KELA) –Registers on obtained social benefits –National Institute for Health and Welfare (THL) –Medical Birth Register, –Care Registers for Social Welfare and Health Care (HILMO), –Finnish Cancer Register –Ministry of Labour (TEM) –Register over job seekers – Statistics Finland (Tilastokeskus) Statitics Finland /Researcher Services2

Secondary usage of administrative registers Production of official statistics is to a large extent based on registers in Finland - the population and housing census has been based totally on register sources since Handbook: Use of Registers and Administrative Data Sources for Statistical Purposes – Best Practices of Statistics Finland Register-based research –20 % of doctoral thesis’ within medicine in Finland include data from national registers Statitics Finland /Researcher Services3

Statitics Finland /Researcher Services4

Prerequisites for register-based research Common personal identification number in all registers –first used in 1964 ( between two different systems) –since 1971 a digital population register –all Finns have a PIN  data from different registers can be linked by PIN e.g. for research purposes Legislation that allows the use of confidential personal data for scientific research Trust in register keepers and researchers Comprehensive, well documented registers Statitics Finland /Researcher Services5

Legislative basis for research use of data from Statistics Finland - Statistics Act (280/2004) - In 2013 the Statistics Act was amended to better facilitate the use of data gathered at Statistics Finland for research purposes. - New objective of the Act –To extend the use of the data collected for statistical purposes in scientific studies and statistical surveys on social conditions. - Possibility for researchers to gain access to confidential data from which only the direct identifiers have been removed. –Before 2013 statistical authorities could not give permission to such confidential data from which the statistical unit could be indirectly identified. –Gain access = see and analyze data by a remote access - system Statitics Finland /Researcher Services6

Remote access system (FIONA) - In use at Statistics Finland since 2009, development project - Model taken from Sweden, Denmark and the Netherlands - Researchers use data on Statistics Finland’s server at their own workplace via a secured Internet connection, data remains at SF - Researchers use a Windows remote desktop, and have access to the data they have obtained permission to as well as to metadata - The researchers have access to wide range of statistical programs : STATA, SPSS, R, SAS, Python Anaconda, … - Each research project has its dedicated folders and storage space in the system - Technical maintenance of the FIONA-system transferred to CSC-It Centre for Science in 2015 - Number of users and data sets in the remote access system is growing steadily, currently about 150 active users Statitics Finland /Researcher Services7

Confidentiality - Research data sets are stored on Statistics Finland’s /CSC’s servers - Only mouse, keyboard and graphic signals are transferred - Access to the system only from preapproved IP-addresses - A disposable SMS password is sent each time the researcher logs in to FIONA - All data transfers from and to FIONA are handled by personnel at the Researcher Services of SF –Outputs are checked so that direct or indirect identification is not possible and files are saved for possible future reference - Access to data is terminated when the permit for the project expires - FIONA environment is separated from the production network - The system will be audited in fall 2015 after being transferred to CSC Statitics Finland /Researcher Services8

A typical process in applying for sensitive research data A researcher applies for a licence to access data for a research project The application must include a research plan and a pledge of secrecy The Ethics Committee is consulted in cases involving large datasets with confidential data If the data can be given out the licence is granted (possibly with modifications) A contract is signed specifying the dataset and the fee as well as the date of delivery The data is put together, edited and uploaded to the remote access system The researcher uses a remote connection to analyse the data and sends the results to Research Services The results are checked to make sure that no units (persons, companies) can be identified The results are sent to the researcher and they can be used in publications Statitics Finland /Researcher Services9

Present process for obtaining register data for research RESEARCHER Authority Statistics Finland Authority § § § § Handling permit applications Control and specification Compiling data-sets Researcher responsible of data security and disposal of data sets Searching for data sets and applying for permits from several different authorities, with varying practices Delivering data using varying practices § Possible corrections and re-sending Data protection Authority Statitics Finland /Researcher Services10 Internet

FMAS Remote access system Services that require permit Remote desktop for analysing data (programs and tools) Separated server space for data and metadata Output service for results, Input service for researcher’s data Services that require permit Remote desktop for analysing data (programs and tools) Separated server space for data and metadata Output service for results, Input service for researcher’s data Services that require registration Centralized digital permit application service Services that require registration Centralized digital permit application service Public services Data catalogue Helpdesk for research and tuition Public services Data catalogue Helpdesk for research and tuition Interface service for data and meta data, Pseudonymization Administration services for user rights Organiza- tion A Organiza- tion C Organiza- tion E - Commonly agreed metadata standards – Data warehouse - Archive of multiple user files Researcher Organiza- tion B Organiza- tion D Statitics Finland /Researcher Services11

Linking data from different sources - Present method –Register keepers send the data requested by the researcher over a secure connection, by recommended mail, with courier services etc. to Statistics Finland –The data includes the Finnish PIN or BIN ( or a pseudocode created by the register keeper and the key is sent separately) –Statistics Finland creates a project specific pseudocode, changes the PIN (BIN) in the research data sets and uploads the data in the remote access system - Aim –Pseudocodes should be used in all data deliveries –Register keepers should be able to upload their data direct to the remote access system using a standard pseudonymization method Statitics Finland /Researcher Services12

Pseudonymization –project specific Statitics Finland /Researcher Services Project 211 Statistics Finland FIONA Other registerkeeper Common9843 Project A, woman C, man nvaoepanwzl, woman bleokldawgs, man A, age C, age 44 nvaoepanwzl, age 15 bleokldawgs, age 44 Common984 3 Project 211 De-identification nvaoepanwzl, age 15 bleokldawgs, age 44 nvaoepanwzl, woman bleokldawgs, man

To be developed…. - We see a problem with the set pseudocodes of the ’ready-made’ data files Solution 1: Create project specific pseudocode also for projects that use the ’ready made’ –Problem: A copy of ’ready made’ data sets has to be made for each project -> much excessive disc space is needed Solution 2: Send the seed code that has been used for the ’ready made’ files to the other register keepers –Problem: The key PIN /BIN - pseudocode used by Statistics Finland will be widely known Statitics Finland /Researcher Services14