Data quality control, Data formats and preservation, Versioning and authenticity, Data storage Managing research data well workshop London, 30 June 2009.

Slides:



Advertisements
Similar presentations
Multiple Indicator Cluster Surveys Data Entry and Processing.
Advertisements

Depositing Data for Archiving Libby Bishop ESDS Qualidata, University of Essex Changing Families, Changing Food Meeting University of Sheffield 15 March.
Preserving for the Future Mike King Systems Manager UK Data Archive (University of Essex)
Digital Storage Solutions John Southall ESDS Qualidata, University of Essex Sounds Good Improving Sound Archives in the East of England 19th November 2007.
New Services for Data Creators and Providers Louise Corti, Head ESDS Qualidata/ Outreach & Training Alasdair Crockett, ESDS Data Services Manager.
MANAGING YOUR DATA WELL …………………………………………
Digital Collections: Storage and Access Jon Dunn Assistant Director for Technology IU Digital Library Program
Protect Your Data: How to Store and Back up your Data Securely Open Access and Data Curation Team With thanks to the UKDA for allowing us to reuse and.
                      Digital Audio 1.
Backup Strategy. An Exam question will ask you to describe a backup strategy. Be able to explain: Safe, secure place in different location. Why? – For.
Data Storage and Security Best Practices for storing and securing your data The goal of data storage is to ensure that your research data are in a safe.
11 BACKING UP AND RESTORING DATA Chapter 4. Chapter 4: BACKING UP AND RESTORING DATA2 CHAPTER OVERVIEW Describe the various types of hardware used to.
DATA MANAGEMENT – WHAT DOES IT MEAN FOR RESEARCHERS? …………………………………………
Data format translation and migration Future possibilities Alasdair Crockett, Data Standards Manager UK Data Archive.
Managing Information Systems Information Systems Security and Control Part 2 Dr. Stephania Loizidou Himona ACSC 345.
Open Exeter Project Team
Guide to Linux Installation and Administration, 2e1 Chapter 13 Backing Up System Data.
Data Preservation Best Practices for preserving your research data for future reuse The goal of data preservation is to ensure that your data is in a sustainable.
STORING YOUR DATA ……………………………………………………………………………………………………………………………….…………………………….. ……………………………………………………………......…... RESEARCH DATA MANAGEMENT TEAM UK DATA.
1 The Vietnam Center and Archive Stephen Maxner, Ph.D.
Instructions and forms
Managing Your Own Data (…if you have to) Kathryn A. Carson, Sc.M. Senior Research Associate Department of Epidemiology Johns Hopkins Bloomberg School of.
REDCap Overview Institute for Clinical and Translational Science Heath Davis Fred McClurg Brian Finley.
Biostatistics Analysis Center Center for Clinical Epidemiology and Biostatistics University of Pennsylvania School of Medicine Minimum Documentation Requirements.
MANAGING YOUR RESEARCH DATA: PLANNING TO SHARE ……………………………………………………………………………………………………………………………….…………………………….. ……………………………………………………………......…... RESEARCH.
1 Data Management (1) Data Management (1) “Application of Information and Communication Technology to Production and Dissemination of Official statistics”
How to Organise your Files and Folders Gareth Cole. Data Curation Officer. 6 th October 2014.
BACKUP AND ARCHIVING DATA BACKUP AND RECOVERY OF DATA.
Data management in the field Ari Haukijärvi 2nd EHES training seminar.
Recordkeeping for Good Governance Toolkit Digital Recordkeeping Guidance Funafuti, Tuvalu – June 2013.
DIGITAL IMAGING What Every Archivist and Records Officer Should Know DIGITAL IMAGING What Every Archivist and Records Officer Should Know Presented by.
1 Maintain System Integrity Maintain Equipment and Consumables ICAS2017B_ICAU2007B Using Computer Operating system ICAU2231B Caring for Technology Backup.
Managing Disks and Drives Chapter 13 powered by dj.
Mark A. Magumba Storage Management. What is storage An electronic place where computer may store data and instructions for retrieval The objective of.
Guide to Computer Forensics and Investigations Fourth Edition
Managing Your Data: Backing Up Your Data Robert Cook Oak Ridge National Laboratory Section: Local Data Management Version 1.0 October 2012.
Backup & Restore The purpose of backup is to protect data from loss. The purpose of restore is to recover data that is temporarily unavailable due to some.
INFORMATION MANAGEMENT Unit 2 SO 4 Explain the advantages of using a database approach compared to using traditional file processing; Advantages including.
E.Soundararajan R.Baskaran & M.Sai Baba Indira Gandhi Centre for Atomic Research, Kalpakkam.
REDCap Overview Institute for Clinical and Translational Science Heath Davis Fred McClurg Brian Finley.
MCSE Guide to Microsoft Windows Vista Professional Chapter 5 Managing File Systems.
Digital Preservation 8/7/2012 Karen Estlund Head, Digital Library Services
General Purpose Packages DATA TYPES. Data Types Computer store information in the form of data. Information has meaning. Eg 23 May 2005 Data has no meaning.
How Not to Lose Track of Your Research Organization and Planning Resources at Brandeis Melanie Radik and Raphael Fennimore Library & Technology Services.
Data Management in Clinical Research Rosanne M. Pogash, MPA Manager, PHS Data Management Unit January 12,
Verification & Validation
Thanapoom Boondee M.2/2 No.22. Pattawan Tangpattananon M.2/2No.5 Tuchatham Tosakul M.2/2No.13 Thanapoom Boondee M.2/2No.22 Suvit Pathomthanasarn M.2/2No.30.
New & Improved Meteorological Data Archives Kenneth G. Wastrack Jennifer M. Call D. Sherea Burns Tennessee Valley Authority.
Information Systems Design and Development Technical Implications (Storage) Computing Science.
A Beginner’s Guide to Preserving Digital Resources in Historic Environment Records Catherine Hardman and Kieron Niven Archaeology Data Service.
Digital Stewardship Lee Dotson Digital Initiatives Librarian University of Central Florida John C. Hitt Library Presentation available at
Enw / Name. What is a on-line / paper based data capture form Can you give an example where each are used? Automated data capture systems are used around.
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
First Tuna Data Workshop (TDW-1) October 2006, Noumea, New Caledonia Oceanic Fisheries Programme (OFP) Secretariat of the Pacific Community (SPC)
Storing and securing your research data lib.uts.edu.au utslibrary.
( ) 1 Chapter # 8 How Data is stored DATABASE.
Research Data Management in the Humanities: an Introduction to the Basics Open Exeter Project Team.
RECORDS MANAGEMENT Judith Read and Mary Lea Ginn Chapter 12 Electronic Media and Image Records 1 © 2016 Cengage Learning ®. May not be scanned, copied.
BACKUP AND RESTORE. The main area to be consider when designing a backup strategy Which information should be backed up Which technology should be backed.
Documenting and organising your data For an easier life lib.uts.edu.au utslibrary.
Preservation Planning Bojana Tasić FORS SEEDS Workshop I Belgrade, October.
KEEPS – a system for UELMA preservation and security
Water quality data - providing information for management
DOCUMENT AND DATA CONTROL
Slide Template for Module 4 Data Storage, Backup, and Security
KEEPS – a system for UELMA preservation and security
Use It or Lose It! Preserving Your Digital Documents
Storage Basic recommendations:
Research Data Management
The Office Procedures and Technology
Presentation transcript:

Data quality control, Data formats and preservation, Versioning and authenticity, Data storage Managing research data well workshop London, 30 June 2009 Manchester, 1 July 2009

2 Good data management good research high quality data needs to be planned specific for purpose data can be understood and used now and in future data can then be shared and re-used

3 Can you understand / use these data? SrvMthdDraft.doc SrvMthdFinal.doc SrvMthdLastOne.doc SrvMthdRealVersion.doc

4 Quality control Data quality control at various stages: data collection –e.g. instrument calibration; expert opinion; multiple measurements; computer assisted interviews data entry, digitisation, transcription and coding - standardised and consistent procedures –e.g. set up validation rules for data entry; use input masks; detailed variable labelling; missing value coding; use controlled vocabularies or choice lists; best structure to organise data and data files data checking and verifying - automated and/or manual –e.g. double entry; check for out-of-range values; apply random sample validation; statistical analyses (descriptives, frequencies, means, range, clustering) to detect errors or find anomalous values; verify data completeness

5 Data formats choice of software format for digital data: –planned data analyses –software availability –hardware used –discipline specific standards and customs digital data software dependent digital data endangered by obsolescence of software/hardware best formats for long-term preservation - standard formats, interchangeable formats, open formats –e.g. tab-delimited; comma-delimited (CSV); ASCII; OpenDocument format; SPSS portable; XML

6 Data format conversions convert data for preservation or back-up, e.g. export, save as beware of conversion errors: –loss of internal metadata >e.g. convert MS Access to tab-delimited tables –loss of editing, formatting, formulae >e.g. convert MS Word to RTF –truncation or loss of data >e.g. string variables lost in SPSS – STATA conversion check for errors and changes after conversion Example 1: MS Excel to tab-delimited Example 2: Word to XML Example 3: Proprietary audio file (DVF) to WAV

7 MS Excel format Tab–delimited text format

8 Version control keep track of different copies or versions of data files which methods: › single site vs. across locations › single vs. multiple users › different versions to be stored vs. files to be synchronised single user of data files: › file naming – unique file names with date or version number (avoid spaces!) e.g. FoodInterview_1_draft; FoodInterview_1_final; HealthTests_ ; BGHSurveyProcedures_00_04 › version control table or file history within or alongside data file › version control facility within software, e.g. MS Windows software multiple users of data files › same as above › control rights to file editing: read/write permissions, e.g. Windows Explorer › versioning/file sharing software: check files out/in, e.g. SVN, VSS, Google Docs, Amazon S3 › manual merging of multiple entries/edits synchronise files, e.g. MS SyncToy software

9 Authenticity of data master files assign responsibility for master files record changes to master files

10 Data storage digital storage media unreliable file formats and physical storage media ultimately become obsolete optical (CD, DVD) and magnetic media (hard drive, tapes) vulnerable and subject to physical degradation Best practice: use data formats with long-term readability storage strategy with at least two different forms of storage copy/migrate data files to new media between two and five years after first created check data integrity of stored data files at regular intervals (checksum) know your back-up strategy: institutional/personal; network server/PC/laptop maintain original copy, external local copy and external remote copy test file recovery Data Protection Act and data back-up – may require minimal data copies for personal data; secure storage

11 Example: data storage and preservation at UKDA  preservation copy (UKDA)  shadow copy (UKDA)  dissemination copy to reduce load on main system  near-site online copy (on campus)  off-site online copy  tape-based offline copy (UKDA) Multi-copy, multi-storage media and multi version resilience: scheduled nightly robotic 3-monthly

12 Good data management practice plan data management early assign roles and responsibilities design data management according to needs and purpose of research data management throughout research

13 Resources ESDS (2008). Guide to good practice: micro data handling and security. Finch, L. & Webster, J. (2008). Caring for CDs and DVDs. NPO Preservation Guidance. Preservation in Practice Series. London, National Preservation Office. Available at UK Data Archive (2009). Manage and Share Data. archive.ac.uk/sharing/ archive.ac.uk/sharing/ See: