Amar K. Das, MD, PhD Associate Professor of Biomedical Data Science, Psychiatry and Health Policy & Clinical Practice Geisel School of Medicine at Dartmouth.

Slides:



Advertisements
Similar presentations
PROFESSIONAL NURSING PRACTICE
Advertisements

Sentient Medical Systems Health Tech Net Meeting December 14,
PROVIDER CONSOLIDATION FTC-DOJ Public Workshop on U.S. Health Care Competition Washington, D.C. February 25, 2015 Kenneth W. Kizer, MD, MPH Distinguished.
Actionable Barriers and Gaps: Nursing Initiatives Bonnie L. Westra, PhD, RN, FAAN Associate Professor, University of Minnesota School of Nursing Director,
Computerization of the practice Grzegorz Margas, M.D., Ph.D. Department of Family Medicine Jagiellonian University Medical College.
Overview Clinical Documentation & Revenue Management: Capturing the Services Prepared and Presented by Linda Hagen and Mae Regalado.
Draft manuscript: “Implementing Point-of-Care Newborn Screening” From the SACHDNC Follow-up & Treatment Sub-committee 1/27/2012 Nancy S. Green, MD Associate.
© 2014 | AMERICAN COLLEGE OF RADIOLOGY | IMAGING 3.0 TM | ALL RIGHTS RESERVED. Reasons Radiologists Should Embrace Point Of Care Clinical Decision Support.
Division of Biomedical Informatics Beyond Interoperability: What Ontology Can Do for the EHR William R. Hogan, MD, MS July 30 th, 2011 International Conference.
John Wieler Management Information Systems In a Healthcare Setting.
Connecting the Dots Creating a learning health system linking clinical quality improvement, Maintenance of Certification, and research Maureen Smith, MD,
University of Pittsburgh Department of Biomedical Informatics Healthcare institutions have established local clinical data research repositories to enable.
Introduction to Genomics, Bioinformatics & Proteomics Brian Rybarczyk, PhD PMABS Department of Biology University of North Carolina Chapel Hill.
Chapter 5. Describe the purpose, use, key attributes, and functions of major types of clinical information systems used in health care. Define the key.
What is “Biomedical Informatics”?. Biomedical Informatics Biomedical informatics (BMI) is the interdisciplinary field that studies and pursues.
NCCN and NCCN Clinical Practice Guidelines in Oncology™
Theresa Tsosie-Robledo MS RN-BC February 15, 2012
Chapter 2 Electronic Health Records
Component 1: Introduction to Health Care and Public Health in the U.S. 1.5: Unit 5: Financing Health Care (Part 2) 1.5d: Controlling Medical Expenses.
Inter-institutional Data Sharing, Standards and Legal Arthur Davidson, MD, MSPH Agency for Healthcare Research and Quality, Washington, DC June 9, 2005.
Medical Informatics Basics
IDR Snapshot: Quantitative Assessment Methodology Evaluating Size and Comprehensiveness of an Integrated Data Repository Vojtech Huser, MD, PhD a James.
The influence of Breast Cancer Pay for Performance Initiatives on breast cancer survival and performance measures: a pilot study in Taiwan Raymond NC Kuo,
Technology Council of Maryland Health IT Forum “Big Data” and the Real World.
by Joint Commission International (JCI)
Taming EHR Data Using Semantic Similarity to Reduce Dimensionality
NAPHSIS Annual Meeting 2009 Electronic Health Records: Why are they important? Linette T Scott, MD, MPH Deputy Director Health Information and Strategic.
Bioinformatics Education: Training a New Generation of Professionals Patricia Dombrowski May 2005.
Registries: Clinical Perspective J. Marc Overhage, MD, PhD, FACP, FACMI Chief Medical Informatics Officer Siemens Health Services.
ACOVE 4: Continuity and Coordination of Care in Vulnerable Elders Continuity is ‘‘care over time by a single individual or team of healthcare professionals’’
Confidentiality and Security Issues in ART & MTCT Clinical Monitoring Systems Meade Morgan and Xen Santas Informatics Team Surveillance and Infrastructure.
Computers in Healthcare Jinbo Bi Department of Computer Science and Engineering Connecticut Institute for Clinical and Translational Research University.
Interprofessional Education M. David Stockton, MD, MPH Professor Department of Family Medicine UT Graduate School of Medicine Sept. 4, 2013.
July 31, 2009Prepared by the Maine Health Information Center Overview of All Payer Claims Data Suanne Singer, Senior Consultant Maine Health Information.
CAS Seminar on Health and Managed Care Benchmarking Measures and Quality Control October 18-19, 1999 page CAS Seminar on Health and Managed Care.
“Doing Well to Do Well” – Financial Health Vital Signs MGMA Data Experience 2013.
1 How Informatics Can Drive Your Research Barry Smith
Leveraging Big Data and Electronic Health Records to Create a Data Fluent Culture for Cancer Medicine Mia Levy, MD, PhD October 8, 2015 Komen Big Data.
Chapter 7: Indexes, Registers, and Health Data Collection
Phenotype generation from EMR by tensor factorization SEDI Durham Cohort James Lu M.D. Ph.D. Department of Electrical and Computer Engineering Department.
Component 6 - Health Management Information Systems Unit 9-2 Administrative, Billing, and Financial Systems This material was developed by Duke University,
Blindfolded Record Linkage Presented by Gautam Sanka Susan C. Weber, Henry Lowe, Amar Das, Todd Ferris.
Integrated Population Health Bill Young Product Strategist/Principal Thomas Hohner Senior Report Developer.
Uses of the NIH Collaboratory Distributed Research Network Jeffrey Brown, PhD for the DRN Team Harvard Pilgrim Health Care Institute and Harvard Medical.
N VISUAL ANALYTICS FOR HEALTHCARE: BIG DATA, BIG DECISIONS David Gotz Healthcare Analytics Research Group IBM T.J. Watson Research Center.
The Cancer Registry of Norway Jan F Nygård Head of the IT-department.
From Qualitative to Quality Impact Heather Bryant, MD, PhD Health System Use Summit February, 2016.
Linking Electronic Health Records Across Institutions to Understand Why Women Seek Care at Multiple Sites for Breast Cancer Caroline A. Thompson, PhD,
NURSING INFORMATICS IN U.S.A AND CANADA By: Ms. Gladys D. Papa, RN.
Clinical Department of Psychiatry University of Michigan Medical School Ann Arbor, June 28, 2002 Why Medicine Should be an Information Science Bruce R.
Henry M. Sondheimer, MD Association of American Medical Colleges 7 August 2013 A Common Taxonomy of Competency Domains for the Health Professions and Competencies.
TEACHING BIOMEDICAL INFORMATICS LIBRARY AS PARTNER Evelyn B. Morgen, Director UConn Health Center Library
© 2016 Chapter 6 Data Management Health Information Management Technology: An Applied Approach.
Definition and Use of Clinical Pathways and Case Definition Templates
IT Solutions – Improving Timely Access to Health Care
Presented by Cynthia Wilson BSN, RN-BC
Definition and Use of Clinical Pathways and Case Definition Templates
S117: Acute Setting Predictive Analytics Sharon E. Davis, MS
Community Oriented Approach to Population Health
Enhancement of study programs in Public Health Law, Health
The Learning Healthcare System: Create, Implement and Maintain
What is “Biomedical Informatics”?
Medical Insurance Coding
Component 1: Introduction to Health Care and Public Health in the U.S.
April 18th 2018 Moderator: Matthew Rioth
Megan Eguchi, MPh Sana karam, md, phd
COMPUTERIZED PHYSICIAN ORDER ENTRY (CPOE)
Digital Biomarkers – Patient data mining & precision medicine Stefan Schulz, Medical University of Graz Donausymposium Vienna, March 14, 2018.
Registered Nurse’s Use of HIT, 2006: Findings from a National Survey
REACHnet: Research Action for Health Network
Presentation transcript:

Amar K. Das, MD, PhD Associate Professor of Biomedical Data Science, Psychiatry and Health Policy & Clinical Practice Geisel School of Medicine at Dartmouth Mining Big Healthcare Data: Tales from an Informatics Odyssey

Disclosure No relationship of any of the authors or their life partners with commercial interests 1

Sources of Big Healthcare Data

An Era of Big Healthcare Data Traditional Vs of Big Data –Volume –Variety –Velocity –Veracity Other Vs relevant to healthcare –Value –Viscosity –Visualization –Variability

Handling Big Healthcare Data Data quality and complexity matters most. Data is structured in a way that limits direct clinical interpretation Data sources have a degree of error and are missing critical information Data exploration is the first step in understanding the hidden complexity

Oncoshare Project iProject initiated with the support of the Richard and Susan Levy Gift Fund A shared informatics resource that collects, integrates and links clinical data from multiple institutions Data structure reflects patterns of breast cancer care and measures factors driving treatment decisions

Overlapping Patient Populations Stanford Hospital and Clinics Palo Alto Medical Foundation 1 mile

Oncoshare Resource Longitudinal data on 18,000-plus patients who have received breast cancer treatment at either setting since 2000 Includes over 400 data elements such as demographics, pathology, labs, imaging tests, procedures and medications Contains over 200,000 full-text clinical, procedure and imaging notes

Data Quality Source: StraightStatistics

All Data Sources Stanford Cancer Registry Stanford Cancer Registry PAMF Cancer Registry PAMF Cancer Registry CPIC Registry CPIC Registry PAMF EHR PAMF EHR Stanford EHR Stanford EHR 10,593 7,996 4,290 2,847 5,996

Defining the Analytic Cohort Registry source –Systematically captures incident cases –Gathers limited data on treatment EHR source –Provides coded billing data and clinic notes –Can indicate visits for consultations Need uniform criteria for cohort inclusion

2% 4% Count Cohort Definition

Data Integration Model Source: AMIA (2012)

Data Sharing Infrastructure Weber et al, manuscript submitted Source: AMIA (2012)

Rates of Treatments before Linking Treatment Stanford (n = 8210) PAMF (n = 5770) Mastectomy43% 38% Billing 22% 17% Registry 41% 36% Chemotherapy42% 35% Billing 10% 19% Registry 39% 30% Radiotherapy52% 46% Billing 25% Registry 47% 41% Source: Cancer (2014)

Rates of Treatments after Linking Treatment Stanford Only (n = 6321) PAMF Only (n = 3886) Both (n = 1902) Mastectomy40% 31% 56% Billing 18% 13% 48% Registry 38% 29% 52% Chemotherapy42% 30% 47% Billing 10% 17% 31% Registry 39% 24% 41% Radiotherapy53% 45% 54% Billing 26% 42% Registry 47% 40% 46% Source: Cancer (2014)

Rate of Diagnostic MRI after Linking Source: Cancer (2014)

21-Gene Recurrence Score NCCN guideline (2011)

Big Data Analysis with Sequence Alignment Wikimedia

Transactional Data as Sequences Sequence of events across time Many sources of such sequence data Time ABDE ED saves-costs--Stock-Photo.jpg C C

Transactional Data as Long Data Long Data: a specific type of Big Data that has an essential temporal component, including the temporal distance between transactions Application need: Find known templates (such as treatment patterns) in long data Research approach: Extend sequence alignment to measure temporal similarity between templates and long data

Convert Long Data into Sequences t CD Time ABCD FE t AB t BC t DE t EF... 0.A t AB.B t BC.C t CD.D t DE.E t EF.F Sequences Raw Long Data Encoded temporal distance

Convert Regimens into Sequences AC P P 14 7 P … AC P P 14 P … FEC H H … … P P … ZZ … Regimen 1 Regimen 2 Regimen AC 14.AC 14.AC 14.AC 14.P 7.P 7.P Sequence for Regimen 1 Encoded temporal distance 14

Using Sequence Alignment on Long Data Sequence alignment approach –Widely used approach in Bioinformatics –Aligns sequences for maximal overlap Needleman-Wunsch algorithm –Global alignment approach –Guarantees an optimal alignment for a given scoring scheme and gap penalty –Does not account for temporal distance between sequence elements

A B C D A D Aligned Sequences: _ _ 1 + -g + -g + 1 = 2 - 2g Needleman-Wunsch Sequence 1 Sequence 2

ABCD A0 D0 M[i-1, j-1] + S[A[i], B[j]] M[i-1, j] – gap_penalty M[i, j-1] – gap_penalty M[i, j] = max Value from Scoring matrix C -C - B-B- AAAA DDDD Optimal Alignment A B C D A D Align: Needleman-Wunsch

1 Aligned Sequences: _ _ 1 + -g + -g + 1 – f(t 4,t 4 ) A B C D A D 1-f(t 1 +t 2 +t 3, t 4 ) -g Temporal Needleman-Wunsch Sequence 1 Sequence 2

Results and Comparison of Methods # correctly identified regimen (top match) # correctly identified regimen (top 2 matches) # correctly identified regimen (top match) # correctly identified regimen (top 2 matches) 83 (91%)89 (98%)107 (93%)113 (98%) *Results for 91 patients (24 patients could not be resolved because they matched more than one encoded regimen) Needleman-Wunsch* Temporal Needleman-Wunsch Source: DSAA (2015) Study: Match 115 patients who were manually annotated to a treatment regimen to 44 regimen templates using sequence alignment

Big Data Analysis with Network Science Wikimedia

Understanding Patterns of Care How are physicians linked across sites and specialty in providing care? Solution: Create a ‘social network’ of physicians linked by patients they have co-treated

146 physicians 331 links Provider Network of Care Source: AMIA (2011)

Provider Network of Care

Learning Health System

Lessons from an Informatics Odyssey Understand the sources of data and their limitations in structure, scope, and quality Get more data (more variety of data) if possible Create new methods to explore hidden patterns in long data