What is necessary (and unnecessary) for analyses of offender databases Forensic Bioinformatics (www.bioforensics.com) Jason R.

Slides:



Advertisements
Similar presentations
DNA Mixture Interpretations and Statistics – To Include or Exclude Cybergenetics © Prescription for Criminal Justice Forensics ABA Criminal Justice.
Advertisements

Finding The Unknown Number In A Number Sentence! NCSCOS 3 rd grade 5.04 By: Stephanie Irizarry Click arrow to go to next question.
Advanced SQL Topics Edward Wu.
Familial searches and cold hit statistics Forensic Bioinformatics ( Dan Krane Wright State University, Dayton, OH
Attaching statistical weight to DNA test results 1.Single source samples 2.Relatives 3.Substructure 4.Error rates 5.Mixtures/allelic drop out 6.Database.
Database Searches Non-random samples of N individuals Typically individuals convicted of some crime Maryland, people arrested but not convicted.
Database Searches Evidence profiles are compared to a list of up to 3,528,903 profiles (FBI CODIS) References Balding, Weight-of- evidence for forensic.
Cold Hit Statistics NRC I How does it work? Searches are done with a subset of available loci. A second set of different genetic markers are used to confirm.
Inferring the Number of Contributors to Mixed DNA Profiles David Paoletti.
1 Copyright © 2013 Elsevier Inc. All rights reserved. Appendix 01.
1 Copyright © 2010, Elsevier Inc. All rights Reserved Fig 2.1 Chapter 2.
1 Copyright © 2013 Elsevier Inc. All rights reserved. Chapter 38.
Factors, Primes & Composite Numbers
Business Transaction Management Software for Application Coordination 1 Business Processes and Coordination.
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Title Subtitle.
Multiplying binomials You will have 20 seconds to answer each of the following multiplication problems. If you get hung up, go to the next problem when.
0 - 0.
DIVIDING INTEGERS 1. IF THE SIGNS ARE THE SAME THE ANSWER IS POSITIVE 2. IF THE SIGNS ARE DIFFERENT THE ANSWER IS NEGATIVE.
MULTIPLICATION EQUATIONS 1. SOLVE FOR X 3. WHAT EVER YOU DO TO ONE SIDE YOU HAVE TO DO TO THE OTHER 2. DIVIDE BY THE NUMBER IN FRONT OF THE VARIABLE.
SUBTRACTING INTEGERS 1. CHANGE THE SUBTRACTION SIGN TO ADDITION
MULT. INTEGERS 1. IF THE SIGNS ARE THE SAME THE ANSWER IS POSITIVE 2. IF THE SIGNS ARE DIFFERENT THE ANSWER IS NEGATIVE.
FACTORING Think Distributive property backwards Work down, Show all steps ax + ay = a(x + y)
FACTORING ax2 + bx + c Think “unfoil” Work down, Show all steps.
Addition Facts
Year 6 mental test 10 second questions Numbers and number system Numbers and the number system, fractions, decimals, proportion & probability.
ELSA English Longitudinal Study of Ageing The English Longitudinal Study of Ageing Rebecca Taylor National Centre for Social.
School Shop. Welcome to my shop. You have 10p How much change will you get? 7p 3p change.
SQL: The Query Language Part 2
The basics for simulations
Information Systems Today: Managing in the Digital World
(This presentation may be used for instructional purposes)
ABC Technology Project
Chapter 12 Joining Tables Part C. SQL Copyright 2005 Radian Publishing Co.
Chapter 3 Physical Evidence Part II Naturally shed hairs, such as a head hair dislodged through combing, display undamaged, club-shaped roots Forcibly.
O X Click on Number next to person for a question.
15. Oktober Oktober Oktober 2012.
Measurements and Their Uncertainty 3.1
Squares and Square Root WALK. Solve each problem REVIEW:
Created by Susan Neal $100 Fractions Addition Fractions Subtraction Fractions Multiplication Fractions Division General $200 $300 $400 $500 $100 $200.
1..
© 2012 National Heart Foundation of Australia. Slide 2.
Lets play bingo!!. Calculate: MEAN Calculate: MEDIAN
Past Tense Probe. Past Tense Probe Past Tense Probe – Practice 1.
Chapter 5 Test Review Sections 5-1 through 5-4.
SIMOCODE-DP Software.
GG Consulting, LLC I-SUITE. Source: TEA SHARS Frequently asked questions 2.
Addition 1’s to 20.
25 seconds left…...
Equal or Not. Equal or Not
Slippery Slope
Test B, 100 Subtraction Facts
11 = This is the fact family. You say: 8+3=11 and 3+8=11
Week 1.
We will resume in: 25 Minutes.
O X Click on Number next to person for a question.
PSSA Preparation.
DNA Forensics MUPGRET Workshop. “DNA evidence…offers prosecutors important new tools for the identification and apprehension of some of the most violent.
The Basics of CODIS Combined DNA Index System
Statistical weights of mixed DNA profiles Forensic Bioinformatics ( Dan E. Krane, Wright State University, Dayton, OH Forensic DNA.
Forensic Databases. What is a database? A collection of data that share common attributes, and can be used to match or compare to a submitted item Databases.
User Verification - DNA Profiling
Implications of database searches for DNA profiling statistics Forensic Bioinformatics ( Dan E. Krane, Wright State University, Dayton,
Chapter 22: The DNA Databases.  Developed to assist in solving violent crimes by creating a network for the exchange of information  Compare DNA profiles.
The Time Has Come to Analyze DNA Profile Databases Forensic Bioinformatics ( Dan E. Krane, Ph.D., Wright State University, Dayton,
Open Access DNA Database Duquesne University March, 2013 Pittsburgh, PA Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA Cybergenetics ©
How is DNA be used to solve crimes? 8 th Grade Forensic Science T. Trimpe
What can go wrong with DNA profiling Dan E. Krane, Wright State University, Dayton, OH Forensic DNA Profiling Video Series Forensic Bioinformatics (
Presentation transcript:

What is necessary (and unnecessary) for analyses of offender databases Forensic Bioinformatics ( Jason R. Gilder August 16, 2008

Offender databases Originally designed for convicted offenders –CODIS: Convicted Offender DNA Index System Expanded –Unsolved crime samples –Arrestees –Elimination profiles

CODIS COmbined DNA Index System –National: NDIS –State: SDIS - fewer restrictions –Local: LDIS - fewest restrictions Convicted Offender Profiles in NDIS: 6,031,000 Forensic Profiles in NDIS: 225,400 More than 71,800 cold hits

Why analyze a database? Questions remain regarding the weight of a DNA database match –Random Match Probability (RMP) –Database Match Probability (DMP) –Balding & Donnelly LR –Other Composition of database may affect chance of a coincidental match –Presence of relatives

Structure of a DNA database Collection of records Structured Query Language (SQL) format ID#FnameLnamePopSSNDateD3vWAFGA…D7 AC937JohnDoeCAU /2/0213, 1516, 1621, 2311, 14 BQ384JaneDoeHIS /23/0312, 1715, 1925, 2510, 10 BZ927FrankSmithAA /9/0613, 1514, 1524, 2612, 16

Examples of possible issues with the use of DNA databases Michigan v. Gary Leiterman –Evidence: blood found on victims hand –Cold hit to a 4-year-old boy R v. Sean Hoey –Evidence: explosive device –Cold hit to a 14-year-old boy Jaidyn Leskie inquest (Australia) –Evidence: clothing from deceased –Cold hit to a rape victim

Lab error and false cold hits

How a database can be analyzed Perform all pairwise profile comparisons –the Arizona Search P 1 with P 2, P 1 with P 3, P 1 with P 4, …, P 1 with P n P 2 with P 3, P 2 with P 4, P 2 with P 5, …, P 2 with P n Analyze profile similarity –Count number of matching loci and alleles –Perform kinship analyses

Arizona Match Data 65,493 Profiles –122 pairs matched at 9 of 13 loci –20 pairs matched at 10 of 13 –1 pair matched at 11 of 13 –1 pair matched at 12 of 13 LociAveStd Devp-value E E E-04

Review of Victoria State Database Krane/Paoletti analysis: >11,000 profiles each compared to all others across 9 loci: Shared allelesObserved occurrences Aussie Bump

# Matching Alleles # Observed

Issues with the release or analysis of a DNA database Privacy concerns –Names, social security numbers, DNA profiles, addresses, etc. Issues with analysis –Duplicate profiles, multiple databases, presence of relatives, processing time, CODIS requirements Legal issues –California Proposition 69

Issue 1: Privacy concerns Database contains private information that should not be released Answer: provide anonymous profiles only Accomplished through one command SELECT D3, vWA, FGA, …, D7 FROM CODIS_DB

Issue 2: Duplicate profiles Many databases contain at least 10-15% duplicate profiles Answer: ignore duplicates in analysis A fairly thorough database analysis can take place with duplicates removed –Also identify potential mistyping rate The lab may be able to cull out duplicates from the same individual with additional information (e.g. SSN)

Issue 2b: Multiple databases California DOJ contains information in two databases that can be cross referenced to remove duplicates –Login DB – contains unique CII ID and accession numbers of all samples for that individual –SDIS – contains accession number and profile Answer: JOIN the data with one command –Only select the first accession number profile SELECT D3, vWA, FGA, … D7 FROM SDIS JOIN LOGIN_DB WHERE (LOGIN_DB.ACCESSION1 = SDIS.ACCESSION)

Issue 3: Presence of relatives It is difficult to identify the presence of relatives by hand by simply looking at the CODIS records There are a significant, but unknown number, of such related individuals in Californias offender database. – Kenneth Konzack Answer: Exactly!

Issue 4: Processing time Performing an internal search of the database will take too long (a week or more) and will not allow for CODIS searches during that time Answer: perform an analysis on a separate computer or computers Pairwise database search is embarrassingly parallel

Issue 5: Legal issues Legal statutes (e.g., California Proposition 69) prohibit release of database to citizens Answer: 38 state statutes (including CA) allow for an outside review of their database for statistical analysis –Many require the removal of identifying information

Questions?