Research Databases for NRES London 29th Feb 2012


JHC roles
1. Research chair at UoN: epidemiology, risk prediction and drug safety
2. Member of the ECC NIGB
3. Developed and runs the not-for-profit QResearch database with EMIS
4. Inner city GP

Outline
- Background
- Key ethical issues
  - Scientific
  - Confidentiality
- Example of QResearch
- Data linkage and pseudonymisation
- Discussion/questions

Background
- Large volumes of electronic data now collected in the NHS
- Huge potential for useful research
- Technology exists to extract data and assemble it into databases
- Databases popular with academics and DH
- Large numbers for studies
- Relative efficiency
- Increasing potential for data linkages

Definition of a research database in NRES SOP
- “a structured collection of individual level personal information, which is stored for potential research purposes beyond the life of a specific research project with defined end points”
- Includes databases set up for research
- Re-use of databases established for
  - audit
  - disease registers

Research databases
- Included in NRES SOPs
- Specific section within IRAS form
- Approvals generally for 5 years, renewable
- Can include generic approval
- Can include providing data to third parties as part of a research service
- Detailed protocol required on purpose, operation, methods, policies, governance

New research databases
- What is the purpose?
- Do we need a new one or can an existing database be used?
- Who will ‘own’ it and be responsible for it?
- What data will it contain and how will it be accessed?
- What is the governance framework?
- Will it contain identifiable data +/- consent?
- S251 support required?

Key objectives for safe data sharing
- Patient and their data
- Minimise risk to privacy
- Maximise public benefit
- Maintain public trust

Three main options for data access
- Consent
- Pseudonymisation
- s251
Each option is set against the same objectives for the patient and their data: minimise risk to privacy, maximise public benefit, maintain public trust.

De-identification
- Various methods to reduce identifiability of data
- Pseudonymisation
- Use of samples and limited data items rather than whole database
- Conversion of date of birth to year of birth or age
- Contracts/data sharing agreements with clear liabilities and penalties
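Two of the methods above (limited data items, and date of birth reduced to year of birth) can be sketched in a few lines. This is only an illustration: the field names and the `ALLOWED_ITEMS` whitelist are hypothetical, since the slides do not describe a record layout.

```python
from datetime import date

# Hypothetical whitelist of data items agreed for a project (an assumption,
# not a list taken from the slides).
ALLOWED_ITEMS = {"year_of_birth", "sex", "diagnosis"}


def deidentify(record: dict, allowed=ALLOWED_ITEMS) -> dict:
    """Return a copy of the record with reduced identifiability."""
    out = dict(record)
    dob = out.pop("dob", None)
    if dob is not None:
        # Keep only the year of birth rather than the full date.
        out["year_of_birth"] = dob.year
    # Drop every item that is not on the agreed whitelist.
    return {k: v for k, v in out.items() if k in allowed}


record = {
    "nhs_number": "9434765919",   # made-up example value
    "dob": date(1970, 5, 17),
    "sex": "F",
    "postcode": "ZZ1 1ZZ",        # made-up example value
    "diagnosis": "asthma",
}
print(deidentify(record))
```

A whitelist is used rather than a blacklist so that a newly added field is excluded by default.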

Example for QResearch
- Established in 2002 to support ethical medical research
- Largest of three UK databases & expanding
- Management board: UoN and EMIS
- Advisory board: professional and lay representation. Advises on policy, strategy etc.
- Scientific Board: reviews science and risk assessment

QResearch key facts
- Large pseudonymised database
- >700 GP practices, 14 million patients
- Patient and event level data
- Demographics: year of birth, sex, ethnicity
- Diagnoses, lab results, clinical values
- Medication, referrals
- No free text. No strong identifiers
- All research peer reviewed & published

QResearch uploads
- Informed consent from practice
- Practice displays notice in waiting room
- Practice activates upload software
- Data pseudonymised BEFORE data leaves practice
- Patients can be opted out of upload
- Secure upload to server at EMIS with full NHS security clearance
- Backups delivered to University

QResearch - security
- Full database stored on offline server
- Full encryption of hard drive
- Keypad-locked server room with limited access
- 24 hour CCTV with monitoring
- Confidentiality clauses in staff contracts
- Full log of all data accesses
- Log of all uses of data
- No data losses or breaches in 10 years

QResearch policy
- Whilst all data are pseudonymised, we apply the same safeguards as if they were identifiable
- To minimise any risk of re-identification of patients (and practices)
- To maintain public and professional trust
- Explicit policy to ensure all results of research studies are widely and freely available for public benefit

Researcher access
- University based academics
- One must be GMC registered
- Standard application form
- Clarify research question and methods
- Independent scientific review
- Provided with sample size and data items needed to answer question
- Data only used for agreed purpose
- Data destroyed after project completed

Why is it important to ensure robust scientific methods?
- Published research must give valid results which don’t mislead or misinform doctors, patients, policy makers
- Equally need to avoid unpublished research, eg a good study with important results
- Avoid duplication of effort
- Avoid publication bias
- Avoid suppression of unpopular results (eg side effects of medicines)

Ensuring scientific quality
1. Is there a clear research question?
2. Can the data answer the question?
3. Are the methods scientifically valid?
4. Are the results likely to be generalisable?
5. Does the team have the skills to do the project?
6. Is the researcher free to publish?
Some databases with generic REC agreement will organise independent scientific review to answer the above.

Risk to confidentiality
- Each study needs risk assessment even if pseudonymised
- Could the study lead to identification of the patients because of
  - other data that the researcher might have
  - small numbers/rare events
- Minimise risk by de-identification of data
- Data sharing agreement & sanctions for misconduct
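The small numbers/rare events risk can be screened for mechanically before a dataset is released. The sketch below counts how many records share each combination of potentially identifying items and flags the rare ones. The threshold of 5 and the field names are assumptions for illustration; the slides do not give a rule.

```python
from collections import Counter


def small_cells(rows, keys, threshold=5):
    """Return combinations of `keys` that occur fewer than `threshold` times."""
    counts = Counter(tuple(row[k] for k in keys) for row in rows)
    return {combo: n for combo, n in counts.items() if n < threshold}


# Six patients share one combination; one patient is alone in another.
rows = [{"year_of_birth": 1950, "sex": "M"} for _ in range(6)]
rows.append({"year_of_birth": 1920, "sex": "F"})

print(small_cells(rows, keys=("year_of_birth", "sex")))
```

Any flagged combination is a candidate for further de-identification, for example by grouping years of birth into bands.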

QResearch data linkage study
- Linked to deprivation in 2002
- Linked to ONS cause of death in 2007
- Currently being linked to HES and cancer registry
- Testing out new method of data linkage using pseudonymised data
- Exceptionally high levels of valid, complete NHS numbers for ONS data, HES, GP data
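Whether an NHS number is valid can be checked with the standard Modulus 11 check digit calculation. The sketch below is one way such a completeness figure might be computed; it is not taken from the project's own software, and the example number is used only because it satisfies the check.

```python
def is_valid_nhs_number(value: str) -> bool:
    """Check a 10-digit NHS number against its Modulus 11 check digit."""
    digits = value.replace(" ", "")
    if len(digits) != 10 or not (digits.isascii() and digits.isdigit()):
        return False
    # Weight the first nine digits 10, 9, ..., 2 and sum the products.
    total = sum(int(d) * w for d, w in zip(digits[:9], range(10, 1, -1)))
    check = 11 - (total % 11)
    if check == 11:
        check = 0
    # A computed check digit of 10 means the number is invalid.
    return check != 10 and check == int(digits[9])


def share_valid(values) -> float:
    """Proportion of values that are complete, valid NHS numbers."""
    values = list(values)
    if not values:
        return 0.0
    return sum(is_valid_nhs_number(v) for v in values) / len(values)


print(is_valid_nhs_number("943 476 5919"))
print(share_valid(["943 476 5919", "9434765918", ""]))
```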

Open pseudonymiser project
- Need approach which doesn’t extract identifiable data but still allows linkage
- Legal, ethical and NIGB approvals
- Secure, scalable
- Reliable, affordable
- Generates IDs which are unique to the project
- Can be used by any set of organisations wishing to share data
- Pseudonymisation applied as close as possible to identifiable data, ie within clinical systems

Pseudonymisation: method
- Scrambles NHS number BEFORE extraction from clinical system
- Takes NHS number + project specific encrypted ‘salt code’
- One way hashing algorithm (SHA2-256): no known collisions and US standard from 2010
- Applied twice: before leaving clinical system & on receipt by next organisation
- Apply identical software to second dataset
- Allows two pseudonymised datasets to be linked
- Cannot be reverse engineered
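A minimal sketch of the salted one-way hash, using Python's standard `hashlib`. The function names are hypothetical, and the form of the second pass (re-hashing the first digest with a salt held by the receiving organisation) is an assumption: the slides say only that the hash is applied twice.

```python
import hashlib


def sha256_hex(text: str) -> str:
    """SHA-256 digest of the text, as 64 hexadecimal characters."""
    return hashlib.sha256(text.encode("utf-8")).hexdigest()


def pseudonymise_at_source(nhs_number: str, project_salt: str) -> str:
    # First pass, inside the clinical system: the project salt is
    # concatenated to the NHS number and the result is hashed.
    return sha256_hex(nhs_number.replace(" ", "") + project_salt)


def pseudonymise_on_receipt(digest: str, receiver_salt: str) -> str:
    # Second pass (assumed form): the receiving organisation hashes the
    # incoming digest again with its own salt.
    return sha256_hex(digest + receiver_salt)


first = pseudonymise_at_source("943 476 5919", "project-salt-example")
final = pseudonymise_on_receipt(first, "receiver-salt-example")
print(first)
print(final)
```

Because the same inputs always give the same digest, two datasets processed with the same salts can be linked, while a different project salt gives an unrelated set of IDs.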

Web tool to create encrypted salt: proof of concept
- Web site private key used to encrypt user defined project specific salt
- Encrypted salt distributed to relevant data supplier with identifiable data
- Public key in supplier’s software to decrypt salt at run time and concatenate to NHS number (or equivalent)
- Hash then applied
- Resulting ID then unique to patient within project
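The flow above can be illustrated with a toy key pair. The sketch uses textbook RSA with deliberately tiny numbers so that it runs with the standard library alone; it is not secure, it is not the scheme used by the web tool, and all names and key values are assumptions for illustration.

```python
import hashlib
from typing import List

# Toy textbook RSA key pair built from primes 61 and 53.
# Far too small for real use; illustration only.
MODULUS, PUBLIC_EXP, PRIVATE_EXP = 3233, 17, 2753


def encrypt_salt(salt: str) -> List[int]:
    # Web site side: transform each byte of the salt with the private key.
    return [pow(b, PRIVATE_EXP, MODULUS) for b in salt.encode("utf-8")]


def decrypt_salt(blocks: List[int]) -> str:
    # Supplier side: recover the salt at run time with the public key.
    return bytes(pow(c, PUBLIC_EXP, MODULUS) for c in blocks).decode("utf-8")


def project_id(nhs_number: str, encrypted_salt: List[int]) -> str:
    # Concatenate the decrypted salt to the NHS number, then hash.
    salted = nhs_number + decrypt_salt(encrypted_salt)
    return hashlib.sha256(salted.encode("utf-8")).hexdigest()


encrypted = encrypt_salt("project-42")
print(encrypted)
print(project_id("9434765919", encrypted))
```

In this arrangement the data supplier handles only the encrypted salt and decrypts it at run time inside the software.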

Openpseudonymiser.org
- Website
- Desktop application
- Software for integration
- Test data
- Documentation
- Utility to generate encrypted salt codes
- Source code GNU GPL

Progress so far
- Pseudonymised entire
  - HES database since 1997
  - Cause of death data since 1993
  - Cancer registrations since 1990
- Linked all three datasets based only on pseudo NHS number: >99% complete
- Due to link GP data in Spring 2012
- Implementing into major GP computer systems
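Linking on the pseudonymised NHS number alone amounts to a join on the hashed ID. The sketch below uses made-up records, a hypothetical salt and hypothetical field names; it shows only the principle, not the project's linkage software.

```python
import hashlib


def pseudo_id(nhs_number: str, salt: str) -> str:
    """Salted SHA-256 digest standing in for the pseudo NHS number."""
    return hashlib.sha256((nhs_number + salt).encode("utf-8")).hexdigest()


def link(left, right, key="pseudo_id"):
    """Inner join of two lists of dicts on the pseudonymised ID."""
    index = {row[key]: row for row in right}
    return [{**row, **index[row[key]]} for row in left if row[key] in index]


SALT = "example-project-salt"  # hypothetical project salt

# Each supplier pseudonymises its own data with identical software and salt.
hospital = [
    {"pseudo_id": pseudo_id("9434765919", SALT), "admissions": 2},
    {"pseudo_id": pseudo_id("1111111111", SALT), "admissions": 1},
]
deaths = [
    {"pseudo_id": pseudo_id("9434765919", SALT), "cause": "I21"},
]

linked = link(hospital, deaths)
print(linked)
```

Neither dataset carries the NHS number itself, yet rows for the same patient still match.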

Key points
- Pseudonymisation at source
- Instead of extracting identifiers and storing lookup tables/keys centrally, the technology to generate the key is stored within the clinical systems
- Use of project specific encrypted salted hash ensures secure sets of IDs unique to the project
- Data controller retains full control
- Can work in addition to existing approaches
- Open source technology, so transparent & free

Definition of clinical care team
- Important as determines whether s251 required
- Tendency by research community to adopt a very broad definition to justify access
- Definition is tricky; as a guide:
  - Individual has a duty of care to patient
  - Has duty of confidence
  - Would be recognised in that role by a reasonable patient

Implications of Open Data
- VERY difficult to see how patient level data can be suitably de-identified so that it can be published online to meet Cameron’s promises
- Current work on de-identification standard by IC/DH to help custodians decide when data can be published