Establishing parameters for objective interpretation of DNA profile evidence Forensic Bioinformatics ( Dan E. Krane, Wright State University, Dayton, OH Steelman Visiting Scientist Lecture Series, Lenoir-Rhyne University, April 9, 2010
The science of DNA profiling is sound. But, not all of DNA profiling is science.
The science of DNA profiling is sound. There is plenty of opportunity for improvement.
The science of DNA profiling is sound. There is plenty of opportunity for improvement. I. Interpretation II. Statistical weighting
Doesnt someone either match or not?
Opportunities for subjective interpretation? Who can be excluded? SuspectD3vWAFGA Tom17, 1715, 1725, 25 Dick12, 1715, 1720, 25 Harry14, 1715, 1720, 25 Sally12, 1715, 1520, 22
Signal Measure μbμb μ b + 3σ b μ b + 10σ b Mean background Signal Detection limit Quantification limit Measured signal (In Volts/RFUS/etc) Saturation 0
Many opportunities to measure baseline
Measurement of baseline in control samples: Negative controls: 5,932 data collection points (DCPs) per run ( = 131 DCPs) Reagent blanks: 5,946 DCPs per run ( = 87 DCPs) Positive controls: 2,415 DCP per run ( = 198 DCPs)
RFU levels at all non-masked data collection points
Variation in baseline noise levels Average ( b ) and standard deviation ( b ) values with corresponding LODs and LOQs from positive, negative and reagent blank controls in 50 different runs. BatchExtract: ftp://ftp.ncbi.nlm.nih.gov/pub/forensics/
Lines in the sand: a two-person mix? Two reference samples in a 1:10 ratio (male:female). Three different thresholds are shown: 150 RFU (red); LOQ at 77 RFU (blue); and LOD at 29 RFU (green).
Mixed DNA samples
How many contributors to a mixture if analysts can discard a locus? How many contributors to a mixture? Maximum # of alleles observed in a 3-person mixture # of occurrencesPercent of cases ,967, ,037, ,532, There are 146,536,159 possible different 3-person mixtures of the 959 individuals in the FB I database (Paoletti et al., November 2005 JFS). 3,398 7,274, ,469,398 26,788,
How many contributors to a mixture if analysts can discard a locus? How many contributors to a mixture? Maximum # of alleles observed in a 3-person mixture # of occurrencesPercent of cases ,498, ,938, ,702, There are 45,139,896 possible different 3-person mixtures of the 648 individuals in the MN BCI database (genotyped at only 12 loci). 8,151 1,526,550 32,078,976 11,526,
How many contributors to a mixture? Maximum # of alleles observed in a 4-person mixture # of occurrencesPercent of cases 413, ,596, ,068, ,637, , There are 57,211,376 possible different 4-way mixtures of the 194 individuals in the FB I Caucasian database (Paoletti et al., November 2005 JFS). (35,022,142,001 4-person mixtures with 959 individuals.)
The science of DNA profiling is sound. There is plenty of opportunity for improvement. I. Interpretation II. Statistical weighting
What weight should be given to DNA evidence? Statistics do not lie. But, you have to pay close attention to the questions they are addressing.
What weight should be given to DNA evidence? Statistics do not lie. But, you have to pay close attention to the questions they are addressing. RMP: The chance that a randomly chosen, unrelated individual from a given population would have the same DNA profile observed in a sample.
What weight should be given to DNA evidence? Statistics do not lie. But, you have to pay close attention to the questions they are addressing. RMP: The chance that a randomly chosen, unrelated individual from a given population would have the same DNA profile observed in a sample.
Consider cold hits CODIS (Combined Offender DNA Index System) Maintained by the FBI Contains 7,940,321 profiles as of February, Assisted in 109,900 investigations
What weight should be given to DNA evidence? Probable Case –Suspect is first identified by non-DNA evidence –DNA evidence is used to corroborate traditional police investigation Cold Hit Case –Suspect is first identified by search of DNA database –Traditional police work is no longer focus Which is more damning evidence?
What weight should be given to DNA evidence? Probable Case –Suspect is first identified by non-DNA evidence –DNA evidence is used to corroborate traditional police investigation –RMP = 1 in 10 million Cold Hit Case –Suspect is first identified by search of DNA database –Traditional police work is no longer focus –RMP = 1 in 10 million Which is more damning evidence?
What weight should be given to DNA evidence? Probable Case –Suspect is first identified by non-DNA evidence –DNA evidence is used to corroborate traditional police investigation –RMP = 1 in 10 million Cold Hit Case –Suspect is first identified by search of DNA database –Traditional police work is no longer focus –RMP = 1 in 10 million –DMP = roughly 4 in 5 Which is more damning evidence?
What weight should be given to DNA evidence? Statistics do not lie. But, you have to pay close attention to the questions they are addressing. RMP: The chance that a randomly chosen, unrelated individual from a given population would have the same DNA profile observed in a sample.
What is the relevant population? 1 in 80 quadrillion
Popular vote in 2008 by county. McCain won red counties, Obama won blue counties. How would you determine the frequency of Obama supporters in North Carolina? Obama N.C. 50.2%
Popular vote in 2008 by county. McCain won red counties, Obama won blue counties. How would you determine the frequency of Obama supporters in North Carolina? Obama N.C. 50.2% Region 59.6%
Popular vote in 2008 by county. McCain won red counties, Obama won blue counties. How would you determine the frequency of Obama supporters in North Carolina? Obama N.C. 50.2% Region 59.6% U.S. 52.9%
Popular vote in 2008 by county. McCain won red counties, Obama won blue counties. How would you determine the frequency of Obama supporters in North Carolina? Obama N.C. 50.2% Region 59.6% U.S. 52.9% Utah? 35.5% ?
What is the relevant population? Errors are multiplicative
What weight should be given to DNA evidence? Statistics do not lie. But, you have to pay close attention to the questions they are addressing. RMP: The chance that a randomly chosen, unrelated individual from a given population would have the same DNA profile observed in a sample.
Combined probability of inclusion (CPI): a cousin of the RMP What fraction of a population cannot be excluded as a possible contributor to a given mixture?
Combined probability of inclusion (CPI): a cousin of the RMP What fraction of a population cannot be excluded as a possible contributor to a given mixture? What fraction of the population cannot be excluded from contributing their DNA to a given mixture if they are allowed to not match at two loci?
Matching profiles in NIST database PopulationSample IDD21 1D21 2D7 1D7 2CSF 1CSF 2D13 1D13 2D16 1D16 2D2 1D2 2D18 1D18 2FGA 1FGA 2Matching Loci African AmericanGT African AmericanOT African AmericanPT African AmericanPT African AmericanGT African AmericanMT African AmericanZT CaucasianGT CaucasianGT CaucasianMT CaucasianWT n = 257 African Americans, 302 Caucasians (1 in 21 African Americans; 1 in 38 Caucasians)
The science of DNA profiling is sound. There is plenty of opportunity for improvement. (particularly in the areas of interpretation and attaching statistical weights)