Data Migration- Two Separate Databases Michael Odawa.

Slides:



Advertisements
Similar presentations
T2 PowerPark and SCT Banner
Advertisements

Module 13: Performance Tuning. Overview Performance tuning methodologies Instance level Database level Application level Overview of tools and techniques.
Database Management Systems and Enterprise Software
Database management system (DBMS)  a DBMS allows users and other software to store and retrieve data in a structured way  controls the organization,
GOOGLE SUMMER OF CODE Enhancing the OpenMRS Patient Matching Module Demo Mentored By Shaun Grannis James Egg Gauthami Pingili.
Wisconsin Department of Health Services Richard Miller Research Scientist Wisconsin Office of Health Informatics October 28, 2014 Matching Traffic Crash.
The Comprehensive Resource Protection Management System: Field Observation Recording Tool Developed by Neal D. Labrie Backcountry Supervior Acadia National.
Geodatabase basic. The geodatabase The geodatabase is a collection of geographic datasets of various types used in ArcGIS and managed in either a file.
Progress Report 11/1/01 Matt Bridges. Overview Data collection and analysis tool for web site traffic Lets website administrators know who is on their.
Unit 4: Monitoring Data Quality For HIV Case Surveillance Systems #6-0-1.
Introduction to databases Developed by Anna Feldman for the Association for Progressive Communications (APC)
Air Quality Data Analysis Using Open Source Tools
Database Structure Basics National Center for Immunization & Respiratory Diseases Influenza Division Pam Kennedy Analyst, McKing Consulting Regional Training.
Integrate your people maximize your knowledge Tel SalesBase Customer.
Office of Research and Development National Exposure Research Laboratory, Atmospheric Modeling Division, Applied Modeling Research Branch October 8, 2008.
INTERGRATING TB/HIV DATABASES INTERGRATING TB/HIV DATABASES Presenter: DR. LAMECK DIERO.
Francis Muma, BSc.N, MPH. Fellow, University of Nairobi Institute of Tropical and Infectious Diseases (UNITID). HIV Programme Management and Policy Track.
HIV Patient ART Monitoring Meeting: International Conference Centre, Geneva March 2004 Defining the variables.
How to process data from clinical trials and their open label extensions PhUSE, Berlin, October 2010 Thomas Grupe and Stephanie Bartsch, Clinical Data.
Unit 1: Overview of HIV/AIDS Case Reporting #6-0-1.
GIS Concepts ‣ What is a table? What is a table? ‣ Queries on tables Queries on tables ‣ Joining and relating tables Joining and relating tables ‣ Summary.
Attribute Data in GIS Data in GIS are stored as features AND tabular info Tabular information can be associated with features OR Tabular data may NOT be.
Canada’s NDS Experience Office of Controlled Substances An Overview May 8, 2007.
Assessing & Improving Quality of Care Newly Developed Quality of Care Tools President’s Emergency Plan for AIDS Relief Track 1.0 ART Program Meeting Seema.
McGraw-Hill/Irwin Copyright © 2005 by The McGraw-Hill Companies, Inc. All rights reserved. ENTERPRISE INFORMATION SYSTEMS A PATTERN BASED APPROACH Chapter.
Microsoft ® Office Access ™ 2007 Training Choose between Access and Excel ICT Staff Development presents:
TURKISH STATISTICAL INSTITUTE Social Sector Statistics Department Tourism Statistics Group
Advisory Board: Judith E. Tintinalli, MD, MS, FACEP Stephen H. Thomas, MD, MPH David M. Cline, MD, FACEP Mary.
M1G Introduction to Database Development 6. Building Applications.
PopMedNet in Mini-Sentinel Tiffany Siu Woodworth PopMedNet User Group Conference July 27, 2015.
Classifier Evaluation Vasileios Hatzivassiloglou University of Texas at Dallas.
Lecturer: Gareth Jones. How does a relational database organise data? What are the principles of a database management system? What are the principal.
Medical Eligibility: Forms & Notices of Action (NOAs) 1.
GOLAN SALMAN RAZI MUKATREN PRIVACY IN A DEMOGRAPHIC DATABASE PROJECT PLAN.
Use of the JAC electronic prescribing system to facilitate audit data collection Dr. Richard Hughes, Anaesthetic Research Fellow Dr. Nicola Edwards, Anaesthetic.
1 McGraw-Hill Professional Learn More. Do More. 1 Advisory Board: Judith E. Tintinalli, MD, MS, FACEP Stephen H. Thomas, MD, MPH David M. Cline, MD, FACEP.
Microsoft Access Designing and creating tables and populating data.
DECREASING LOST TO FOLLOW-UP : AN INFORMATICS ASSISTED APPROACH (AND HOW TO BUILD A REPORTING APP) Daniel Ochieng Jonathan Dick.
Course 6: IHS MPI Overview and Tool Usage February 2, 2011.
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
Disciplined Software Engineering Lecture #2 Software Engineering Institute Carnegie Mellon University Pittsburgh, PA Sponsored by the U.S. Department.
Copyright © 1994 Carnegie Mellon University Disciplined Software Engineering - Lecture 1 1 Disciplined Software Engineering Lecture #2 Software Engineering.
26 Mar 04 1 Application Software Practical 5/6 MS Access.
Infection Adjudication on REDCAP Module 1 Version: Feb 1st, 2012.
Database Concepts Track 3: Managing Information using Database.
PHS / Department of General Practice Royal College of Surgeons in Ireland Coláiste Ríoga na Máinleá in Éirinn Knowledge representation in TRANSFoRm AMIA.
1 MS Access. 2 Database – collection of related data Relational Database Management System (RDBMS) – software that uses related data stored in different.
Module 9 Planning and Implementing Monitoring and Maintenance.
CS5604: Final Presentation ProjOpenDSA: Log Support Victoria Suwardiman Anand Swaminathan Shiyi Wei Department of Computer Science, Virginia Tech December.
Unit 7: Data Management, Analysis and Interpretation #3-7-1.
Following the need to reduce the TAT and taking testing closer to patients, some lab tests have been devolved to the facility level. Below is a list of.
Database Overview What is a database? What types of databases are there? How are databases more powerful than spreadsheets?
Session 6: Data Flow, Data Management, and Data Quality.
Introduction to FFI: Why and how FFI was developed Introduction to FFI: Why and how FFI was developed 04/02/2013.
Leveraging Open-Source Matching Tools and Health Information Exchange to Improve Newborn Screening Follow-up Shaun Grannis, MD MS Medical Informatics Research.
Microsoft Access 2013 ®® Case Study Creating a Database.
A consolidated review of multiple analyses using JMP Clinical
IMPAACT 2010 Screening Visits
IMPAACT 2010 Screening Visits
Advantages of sas for reporting
View Integration and Implementation Compromises
CPAD Data Migration Demo SI Meeting, Kisumu 11 September, 2017.
Can Statistical monitoring really improve data integrity?
Unit 4: Monitoring Data Quality For HIV Case Surveillance Systems
SEER Case Consolidation Study: Design & Objective
JRA2 Pisa, Tuesday, 25 October 2005
Database Management  .
Databases A brief introduction….
Spreadsheets, Modelling & Databases
Storing and Processing Sensor Networks Data in Public Clouds
Presentation transcript:

Data Migration- Two Separate Databases Michael Odawa.

Overview Migration is the movement of data from one database to one master database. What happens if there are instances where there are some people present in both databases? Do you continue with the migration? How do you identify these individuals? Merging (in this context) refers to the process of identifying matching records in two separate databases and consolidating the information for these matching records into one record.

Importance of Merging Data. The data contained in both databases are different and they may belong to the same patient. By merging the data, it becomes easier to analyse the patient level data since they will be in one consolidated file. Clinically, it is advisable to have all patient records in one file. Fragmented data storage of patient records results in easier tracking of the patient’s medical history. This assists in programme management since the total number of patients who need medical assistance will be centralized and as such this reduces the possibility of duplicated efforts in making drug orders and other program interventions for these patients for those patients who exist in both databases. To monitor and evaluate the efficiency of linkage to care for patients who are identified as HIV positive by the HCT Research Assistants in the field into the HIV Care system offered by AMPATH.

Summary of Activities Involved. FLOWCHART SHOWING THE MIGRATION PROCESS Set up the server to contain the Port Victoria HCT data Run a dedup query in the HCT Port Victoria server and in the AMRS Merge the matching records in the HCT Port Victoria Rerun the dedup report to generate the Port Victoria Import table Run queries to obtain the AMRS and Port import tables Create a flat file for analysis and run RecMatch on this Conduct manual review on the true matches Identify the uuids for the true and false matches Migrate demographics for the false matches Merge the demographics for the true matches Classify the gray area patients into true matches and false matches Migrate the encounters for all the patients Migrate the observations for all the patients

RecMatch This is a process by which the demographics from the two databases are joined together and then processed to get potential true matches, true matches and false matches. The tool that does this disaggregation is a stand-alone Java patient matching file. org.regenstrief.linkage.gui.RecMatch.jar

How does it work? The process involves measuring the agreement rates of specific fields between true and false matches, then calculating the likelihood of a given pair being a true match, and assigns a score based on how well the fields match. Various algorithms can be used to do the matching, eg the Jaro-Winkler Estimation, Levenshtein Least Common Subsequence or the Exact Matching algorithm.

When not in Use

When in use

Summary of Softwares Used FLOWCHART SHOWING THE MIGRATION PROCESS Set up the server to contain the Port Victoria HCT data Run a dedup query in the HCT Port Victoria server and in the AMRS Merge the matching records in the HCT Port Victoria Rerun the dedup report to generate the Port Victoria Import table Run queries to obtain the AMRS and Port import tables Create a flat file for analysis and run RecMatch on this Conduct manual review on the true matches Identify the uuids for the true and false matches Migrate demographics for the false matches Merge the demographics for the true matches Classify the gray area patients into true matches and false matches Migrate the encounters for all the patients Migrate the observations for all the patients OpenMRS MySQL SAS, R Statistical Programming Language, STATA, Excel OpenMRS- Yank Module

Blocker to completion of exercise. FLOWCHART SHOWING THE MIGRATION PROCESS Set up the server to contain the Port Victoria HCT data Run a dedup query in the HCT Port Victoria server and in the AMRS Merge the matching records in the HCT Port Victoria Rerun the dedup report to generate the Port Victoria Import table Run queries to obtain the AMRS and Port import tables Create a flat file for analysis and run RecMatch on this Conduct manual review on the true matches Identify the uuids for the true and false matches Migrate demographics for the false matches Merge the demographics for the true matches Classify the gray area patients into true matches and false matches Migrate the encounters for all the patients Migrate the observations for all the patients Help Required……

Sites to be reviewed Port Victoria- HCT-AMRS (Done apart from obstacles mentioned earlier) Port Victoria – HCT-PHCT-AMRS (Ongoing- Currently doing the Manual Data Review- hopefully to be done in 2-3 Weeks) Chulaimbo- HCT-PHCT-AMRS (Next to be tackled) Teso- HCT-PHCT-AMRS All other HCT-PHCT Data stored in stand alone databases.

Going Forward This process needs to be completed for the baseline HCT Data collected from the various sites from 2009 – Going forward, this data should ideally be sent directly to the AMRS but this can only be done successfully if the baseline data is already in the AMRS (for reasons mentioned in the importance section).

Thanks!! Questions??