Overcoming DNA Stochastic Effects 2010 NEAFS & NEDIAI Meeting November, 2010 Manchester, VT Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA Cybergenetics.

Slides:



Advertisements
Similar presentations
DNA Identification: Mixture Weight & Inference
Advertisements

Sherlock Holmes and the DNA Likelihood Ratio American Academy of Forensic Sciences February, 2011 Chicago, IL Mark W Perlin, PhD, MD, PhD Cybergenetics,
Forensic DNA Inference ICFIS 2008 Lausanne, Switzerland Mark W Perlin, PhD, MD, PhD Joseph B Kadane, PhD Robin W Cotton, PhD Cybergenetics ©
DNA Identification as an Information Science
DNA Mixture Interpretations and Statistics – To Include or Exclude Cybergenetics © Prescription for Criminal Justice Forensics ABA Criminal Justice.
New York State Police TrueAllele ® Casework Developmental Validation Cybergenetics © New York State DNA Subcommittee March, 2010.
Combining DNA Evidence for Greater Match Information
The Way Forward in the UK
Finding Truth in DNA Mixture Evidence Innocence Network Conference April, 2013 Charlotte, NC Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA.
TrueAllele ® Challenges in Court and Culture Cybergenetics © th Annual DNA Technology Educational Seminar Centre of Forensic Sciences and the.
How Inclusion Interpretation of DNA Mixture Evidence Reduces Identification Information American Academy of Forensic Sciences February, 2013 Washington,
Creating informative DNA libraries using computer reinterpretation of existing data Northeastern Association of Forensic Scientists November, 2011 Newport,
Preventing rape in the military through effective DNA computing Forensics Europe Expo Forensics Seminar Theatre April, 2014 London, UK Mark W Perlin, PhD,
Coding a Safer Society through Computer Interpretation of DNA Evidence Cybergenetics © MATLAB Virtual Conference March, 2014 Mark W Perlin, PhD,
TrueAllele ® Interpretation of DNA Mixture Evidence 9 th International Conference on Forensic Inference and Statistics August, 2014 Leiden University,
How TrueAllele® Works (Part 1)
TrueAllele ® Casework Validation on PowerPlex ® 21 Mixture Data Australian and New Zealand Forensic Science Society September, 2014 Adelaide, South Australia.
Using TrueAllele ® Casework to Separate DNA Mixtures of Relatives California Association of Criminalists October, 2014 San Francisco, CA Jennifer Hornyak,
Revolutionising DNA analysis in major crime investigations The Investigator Conferences Green Park Conference Centre May, 2014 Aylesbury, Buckinghamshire.
Revolutionising DNA analysis in major crime investigations The Investigator Conferences Green Park Conference Centre May, 2014 Aylesbury, Buckinghamshire.
No DNA Left Behind: When "inconclusive" really means "informative" Schenectady County District Attorney’s Office January, 2014 Mark W Perlin, PhD, MD,
Revolutionising DNA analysis in major crime investigations The Investigator Conferences Green Park Conference Centre May, 2014 Aylesbury, Buckinghamshire.
TrueAllele ® Mixture Interpretation Cybergenetics © th Annual DNA Technology Educational Seminar Centre of Forensic Sciences and the Promega.
DNA Mixture Statistics Cybergenetics © Spring Institute Commonwealth's Attorney's Services Council Richmond, Virginia March, 2013 Mark W.
More informative DNA identification: Computer reinterpretation of existing data Ria David, PhD Cybergenetics, Pittsburgh, PA Cybergenetics ©
Scientific Validation of Mixture Interpretation Methods 17th International Symposium on Human Identification Sponsored by the Promega Corporation October,
DNA Identification Science: The Search for Truth Cybergenetics © Summer Research Symposium Duquesne University July, 2010 Mark W Perlin, PhD,
Computer Interpretation of Uncertain DNA Evidence National Institute of Justice Computer v. Human June, 2011 Arlington, VA Mark W Perlin, PhD, MD, PhD.
TrueAllele ® Modeling of DNA Mixture Genotypes California Association of Crime Laboratory Directors October, 2014 San Francisco, CA Mark W Perlin, PhD,
How TrueAllele ® Works (Part 2) Degraded DNA and Allele Dropout Cybergenetics Webinar November, 2014 Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh,
TrueAllele ® Genetic Calculator: Implementation in the NYSP Crime Laboratory NYS DNA Subcommittee May 19, 2010 Barry Duceman, Ph.D New York State Police.
Separating Familial Mixtures, One Genotype at a Time Northeastern Association of Forensic Scientists November, 2014 Hershey, PA Ria David, PhD, Martin.
Cybergenetics Webinar January, 2015 Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA Cybergenetics © How TrueAllele ® Works (Part 4)
Cracking the DNA mixture code – computer analysis of UK crime cases Forensics Europe Expo Forensic Innovation Conference April, 2014 London, UK Mark W.
Unleashing Forensic DNA through Computer Intelligence Forensics Europe Expo Forensic Innovation Conference April, 2013 London, UK Mark W Perlin, PhD, MD,
Rapid DNA Response: On the Wings of TrueAllele Mid-Atlantic Association of Forensic Scientists May, 2015 Cambridge, Maryland Martin Bowkley, Matthew Legler,
Getting Past First Bayes with DNA Mixtures American Academy of Forensic Sciences February, 2014 Seattle, WA Mark W Perlin, PhD, MD, PhD Cybergenetics,
Compute first, ask questions later: an efficient TrueAllele ® workflow Midwestern Association of Forensic Scientists October, 2014 St. Paul, MN Martin.
TrueAllele ® interpretation of Allegheny County DNA mixtures Cybergenetics © Continuing Legal Education Allegheny County Courthouse February,
Virginia TrueAllele ® Validation Study: Casework Comparison Presented at AAFS, February, 2013 Published in PLOS ONE, March, 2014 Mark W Perlin, PhD, MD,
TrueAllele ® Computing: All the DNA, all the time Continuing Professional Development Sydney, Australia March, 2014 Mark W Perlin, PhD, MD, PhD Cybergenetics,
Murder in McKeesport October 25, 2008 Tamir Thomas.
When Good DNA Goes Bad International Conference on Forensic Research & Technology October, 2012 Chicago, Illinois Mark W Perlin, PhD, MD, PhD Cybergenetics,
DNA Mapping the Crime Scene: Do Computers Dream of Electric Peaks? 23rd International Symposium on Human Identification October, 2012 Nashville, TN Mark.
Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA Cybergenetics © Duquesne University October, 2015 Pittsburgh, PA What’s in a Match?
Open Access DNA Database Duquesne University March, 2013 Pittsburgh, PA Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA Cybergenetics ©
Jack Ballantyne and Mark Perlin International Conference on Inference and Statistics, July , Seattle, WA.
Exploring Forensic Scenarios with TrueAllele ® Mixture Automation 59th Annual Meeting American Academy of Forensic Sciences February, 2007 Mark W Perlin,
Objective DNA Mixture Information in the Courtroom: Relevance, Reliability & Acceptance NIST International Symposium on Forensic Science Error Management:
DNA Identification: Quantitative Data Modeling Cybergenetics © Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA TrueAllele ® Lectures.
DNA-led investigation through computer interpretation of evidence Pennsylvania State Police Training Seminar Hershey, PA April, 2014 Mark W Perlin, PhD,
Data summary – “alleles” Threshold Over threshold, peaks are labeled as allele events All-or-none allele peaks, each given equal status Allele Pair 8,
DNA: TrueAllele® Statistical Analysis, Probabilistic Genotyping
A Match Likelihood Ratio for DNA Comparison
Validating TrueAllele® genotyping on ten contributor DNA mixtures
Explaining the Likelihood Ratio in DNA Mixture Interpretation
On the threshold of injustice: manipulating DNA evidence
Virginia TrueAllele® Validation Study: Casework Comparison
Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA, USA
Suffolk County TrueAllele® Validation
Solving Crimes using MCMC to Analyze Previously Unusable DNA Evidence
Investigative DNA Databases that Preserve Identification Information
DNA Identification: Inclusion Genotype and LR
DNA Identification: Stochastic Effects
Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA, USA
severed carotid artery
DNA Identification: Mixture Interpretation
David W. Bauer1, PhD Nasir Butt2, PhD Jeffrey Oblock2
Using probabilistic genotyping to distinguish family members
Presentation transcript:

Overcoming DNA Stochastic Effects 2010 NEAFS & NEDIAI Meeting November, 2010 Manchester, VT Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA Cybergenetics ©

Coping with uncertainty Reproducible DNA data exhibit stochastic variation Probability models can capture stochastic effects Genotypes inferred from uncertain data are probability distributions (e.g., CPI) Science expects us to account for stochastic effects The law expects us to testify within our certainty The likelihood ratio (LR) meets both demands The LR expresses genotype uncertainty, reflecting the underlying data uncertainty

PCR is a random process p1 - p PCR efficiency is not 100% efficient. A strand copies with probability p, and doesn't copy with probability 1-p. COPY NO COPY

STR peak is a random variable p = 80%, n = 6 1,298 2,900 One amplification Another amplification

STR peak height measurement reflects probability distribution mean = 2,145 stdev = 284

Relative peak certainty: coefficient of variation stdev = 12 mean = 85 CV = 14% CV = standard deviation mean value

Four times the peak height, gives twice the peak certainty stdev = 25 mean = 350 CV = 7% CV = standard deviation mean value

STR peaks are random variables

To interpret quantitative data, some use a qualitative threshold Over threshold, peaks are treated as allele events. Under threshold, alleles do not exist. list of included alleles

Thresholds introduce error

False allele exclusions

2010 SWGDAM Guidelines When every peak is under threshold, all the data disappears. No genotype, no match score, nothing left to say. Raising the stochastic threshold.

Higher false exclusion rate

SWGDAM 2010 – Mixtures If a stochastic threshold based on peak height is not used in the evaluation of DNA typing results, the laboratory must establish alternative criteria (e.g., quantitation values or use of a probabilistic genotype approach) for addressing potential stochastic amplification. The criteria must be supported by empirical data and internal validation and must be documented in the standard operating procedures. higher peak threshold discards information probability modeling preserves information

Probability modeling Bayes Theorem: addresses scientific uncertainty uses likelihood function to update probability joint likelihood combines independent evidence likelihood: how well parameters explain the data STR data: must explain every peak (all rfu) likelihood gives probability at one peak: genotype allele prediction vs. peak height joint likelihood at all peaks: multiply together the individual peak likelihoods

Compare genotype pattern vs. data

Overcome stochastic effects Computers can solve for genotype probabilities and other random variables, like peak variation By modeling peak variation as just another parameter, computers can overcome DNA stochastic effects

TrueAllele ® Casework quantitative computer interpretation statistical search of probability model preserves all identification information objectively infer genotype, then match any number of mixture contributors stutter, imbalance, degraded DNA calculates uncertainty of every peak created in 1999, now in version 25 used on 100,000 evidence samples available as product, service or both

Commonwealth v. Foley ScoreMethod 13 thousandinclusion 23 millionobligate allele 189 billionTrueAllele peak threshold discards information probability modeling preserves information

Likelihood Comparison Quantitative TrueAllele likelihood focuses probability on true genotype Inclusion method likelihood disperses probability across incorrect allele pairs

Preserve vs. Discard MW Perlin, MM Legler, CE Spencer, JL Smith, WP Allan, JL Belrose, BW Duceman. Validating Trueallele DNA Mixture Interpretation. Journal of Forensic Sciences, Quantitative TrueAllele Inclusion method

SWGDAM 2010 – Mixtures If composite profiles (i.e., generated by combining typing results obtained from multiple amplifications and/or injections) are used, the laboratory should establish guidelines for the generation of the composite result. When separate extracts from different locations on a given evidentiary item are combined prior to amplification, the resultant DNA profile is not considered a composite profile. Unless there is a reasonable expectation of sample(s) originating from a common source (e.g., duplicate vaginal swabs or a bone), allelic data from separate extractions from different locations on a given evidentiary item should not be combined into a composite profile. The laboratory should establish guidelines for determining the suitability of developing composite profiles from such samples. joint likelihood function combines evidence probability modeling preserves information

Regina v. Broughton low template mixture three DNA contributors triplicate amplification post-PCR enhancement vWA locus data inconclusive human result TrueAllele interpretation

Regina v. Broughton Genoype probabilility (vWA) 6x ScoreMethod nothinghuman inclusion 3.6 millionTrueAllele computer joint likelihood function combines evidence probability modeling preserves information

Preserve vs. Discard peak threshold discards information - 70% of the time quantitative likelihood preserves information - every time

Conclusions quantitative data has stochastic effects model data with joint likelihood function probability modeling preserves information and can statistically combine DNA evidence exact modeling of peak variation can replace inexact thresholds to scientifically overcome stochastic effects