Representing and Solving Complex DNA Identification Cases Using Bayesian Networks Philip Dawid University College London Julia Mortera & Paola Vicard Università.

Slides:



Advertisements
Similar presentations
Attaching statistical weight to DNA test results 1.Single source samples 2.Relatives 3.Substructure 4.Error rates 5.Mixtures/allelic drop out 6.Database.
Advertisements

Mendel’s Laws.
Single Gene Inheritance
. Exact Inference in Bayesian Networks Lecture 9.
Patterns of Inheritance What patterns can be observed when traits are passed to the next generation?
Mendel’s Breakthrough
Tutorial #1 by Ma’ayan Fishelson
1 Statistical genetics and genetical statistics Thore Egeland, Rikshospitalet and Section of Medical Statistics Joint work with P. Mostad, NR, B. Olaisen,
PROBABILITY AND STATISTICS IN THE LAW Philip Dawid University College London.
Tutorial #5 by Ma’ayan Fishelson. Input Format of Superlink There are 2 input files: –The locus file describes the loci being analyzed and parameters.
Variation, probability, and pedigree
Basics of Linkage Analysis
Transmission Genetics: Heritage from Mendel 2. Mendel’s Genetics Experimental tool: garden pea Outcome of genetic cross is independent of whether the.
 How does the graph represent a gel? Each group filled in a ‘band’ that represents where different – sized DNA fragments would have migrated on a gel,
1 How many genes? Mapping mouse traits, cont. Lecture 2B, Statistics 246 January 22, 2004.
DATA ANALYSIS Module Code: CA660 Lecture Block 2.
DNA Forensics MUPGRET Workshop. “DNA evidence…offers prosecutors important new tools for the identification and apprehension of some of the most violent.
Philip Dawid University of Cambridge TexPoint fonts used in EMF.
Tutorial #5 by Ma’ayan Fishelson Changes made by Anna Tzemach.
New Technologies Y-STR DNA Pedigree (7 generations)
Forensic Statistics From the ground up…. Basics Interpretation Hardy-Weinberg equations Random Match Probability Likelihood Ratio Substructure.
explain how crime scene evidence is
Question 1___________________________ Question 2___________________________ Question 3 ___________________________ TotalAverage = 44 out of 50 points Important.
1 Father of genetics. Studied traits in pea plants.
 Dear teacher this was adapted from a presentation by:  Nancy Dow  Jill Hansen  Tammy Stundon  Who teach in Bay County  The missing words have been.
DNA evidence The DNA Double Helix Consists of so-called nucleobases always in pairs A-T, C-G. One part of the pair is inherited from the mother, the other.
DNA & Proteins B3a.
CATALYST Recall and Review: – What are chromosomes? – What are genes? – What are alleles? How do these terms relate to DNA? How do these terms relate to.
Mendelian Heredity (Fundamentals of Genetics) CH9 pg 173.
Today: Mendelian Genetics! Intro to Mitosis?. Gregor Mendel, The “Father” of Genetics?
TrueAllele ® Genetic Calculator: Implementation in the NYSP Crime Laboratory NYS DNA Subcommittee May 19, 2010 Barry Duceman, Ph.D New York State Police.
Watson & Crick Discovered the basic shape of DNA
Lecture 13: Linkage Analysis VI Date: 10/08/02  Complex models  Pedigrees  Elston-Stewart Algorithm  Lander-Green Algorithm.
Forensic Science: Fundamentals & Investigations, Chapter 7 1 Introduction and History of Biological Evidence in Forensics DNA fingerprinting or DNA profiling,
Tutorial #10 by Ma’ayan Fishelson. Classical Method of Linkage Analysis The classical method was parametric linkage analysis  the Lod-score method. This.
1 B-b B-B B-b b-b Lecture 2 - Segregation Analysis 1/15/04 Biomath 207B / Biostat 237 / HG 207B.
Using Familias and FamLink Athens 29 May, Familias 10 w w.umb.no NORWEGIAN UNIVERSITY OF LIFE SCIENCES.
Mendel’s Theory Section 2. Explaining Mendel’s Results Mendelian theory of heredity explains simple patterns of inheritance. In these patterns, two of.
Genetics *transmission of traits – heredity *variation *genetics.
1 Variation, probability, and pedigree Gamete production is source of variation and genetic diversity, an advantage of sex. –As a result of segregation.
Chapter 23: Evaluation of the Strength of Forensic DNA Profiling Results.
DNA Fingerprinting: The DNA of every individual is different. Loci where the human genome differs from individual to individual are called polymorphisms.
1/22/16 Starter: What determines the traits of an organism? 1/22/ Heredity and Genetics Application Notes Glue here when done Connection Ws.
Genetic Engineering and Biotechnology Notes. IB Assessment Statement 4.4.1Outline the use of polymerase chain reaction (PCR) to copy and amplify minute.
Bio II: Forensics.  DNA molecules are found in the nucleus of cells in the human body in chromosomes.  People have 23 pairs of chromosomes, with an.
Complex Patterns of Inheritance INCOMPLETE DOMINANCE, CODOMINANCE, MULTIPLE ALLELES, EPISTASIS AND POLYGENIC INHERITANCE.
Mendel & Genetic Variation Chapter 14. What you need to know! The importance of crossing over, independent assortment, and random fertilization to increasing.
Mendelian Genetics Creating Gametes Probability Genetic Terms.
Lecture 17: Model-Free Linkage Analysis Date: 10/17/02  IBD and IBS  IBD and linkage  Fully Informative Sib Pair Analysis  Sib Pair Analysis with Missing.
Explain how crime scene evidence is
DNA LiRa 2.0 R. Puch-Solis, M. Barron, R. Young, H. Tazeem
Explain how crime scene evidence is
Statistical Weights of DNA Profiles
Validating TrueAllele® genotyping on ten contributor DNA mixtures
Genome Wide Association Studies using SNP
explain how crime scene evidence is
Explain how crime scene evidence is
Explain how crime scene evidence is
Human genetics How to determine inheritance of a trait in humans
Unit 8: Mendelian Genetics 8.2 Probability and Punnett Squares
DNA Fingerprinting and Forensic Analysis
CATALYST Recall and Review: How do these terms relate to DNA?
Patterns of Inheritance
Explain how crime scene evidence is
explain how crime scene evidence is
Explain how crime scene evidence is
Biotechnology Mader 19.4.
David W. Bauer1, PhD Nasir Butt2, PhD Jeffrey Oblock2
Presentation transcript:

Representing and Solving Complex DNA Identification Cases Using Bayesian Networks Philip Dawid University College London Julia Mortera & Paola Vicard Università Roma Tre

FORENSIC USES FOR DNA PROFILES Murder/Rape/…: Is A the culprit? Paternity: Is A the father of B? Immigration: Is A the mother of B? How are A and B related? Disasters: 9/11, tsunami, Romanovs,…

Disputed Paternity c m pf We have DNA data D from a disputed child c, its mother m and the putative father pf child founder hypothesis Building blocks: founder, child query founder pf tf af If pf is not the true father tf, this is a “random” alternative father af, query

Disputed Paternity c m pf We have DNA data D from a disputed child c, its mother m and the putative father pf LIKELIHOOD RATIO Essen-Möller 1938 pf tf af If pf is not the true father tf, this is a “random” alternative father af

MISSING DNA DATA What if we can not obtain DNA from the suspect ? (or other relevant individual?) Sometimes we can obtain indirect information by DNA profiling of relatives But analysis is complex and subtle…

query child founder hypothesis Disputed Paternity Case Building blocks: founder, child, query

Complex Paternity Case c1 m1 pf c2 pf m2 b1 b2pf We have DNA from a disputed child c1 and its mother m1 but not from the putative father pf. We do have DNA from c2 an undisputed child of pf, and from her mother m2 as well as from two undisputed full brothers b1 and b2 of pf. founder child query hypothesis Building blocks: founder, child, query

Criminal Identification Case body CR bodywife CR c1c2 wife A body has been found, burnt beyond recognition, but there is reason to believe it might be that of a missing criminal CR. DNA is available from the body, from the wife of CR, and from two children c1 and c2 of CR and wife founder child founder query child hypothesis Building blocks: founder, child, query

founderchildqueryEach building block ( founder / child / query ) in a pedigree can be an INSTANCE of a generic CLASS network — which can itself have further structure The pedigree is built up using simple mouse clicks to insert new nodes/instances and connect them up Genotype data are entered and propagated using simple mouse clicks Object-Oriented Bayesian Network HUGIN 6

Under the microscope… Each CLASS is itself a Bayesian Network, with internal structure Recursive: can contain instances of further class networks Communication via input and output nodes

Marker VWA (Austro-German population allele frequencies) Single-marker analysis (multiply LR’s across markers)

Lowest Level Building Blocks STR MARKER having associated repertory of alleles together with their frequencies gene mendel MENDELIAN SEGREGATION Child’s gene copies paternal or maternal gene, according to outcome of fair coin flip GENOTYPE consisting of maximum and minimum of paternal and maternal genes genotype

founder FOUNDER INDIVIDUAL represented by a pair of genes pgin and mgin (instances of gene ) sampled independently from population distribution, and combined in instance gt of genotype gene genotype

child CHILD INDIVIDUAL paternal [maternal] gene selected by instances fmeiosis [mmeiosis] of mendel from father’s [mother’s] two genes, and combined in instance cgt of genotype mendel genotype

query QUERY INDIVIDUAL Choice of true father’s paternal gene tfpg [maternal gene mfpg] as either that of f1 or that of f2, according as tf=f1? is true or false. QUERY INDIVIDUAL Choice of true father’s paternal gene tfpg [maternal gene mfpg] as either that of f1 or that of f2, according as tf=f1? is true or false.

Complex Paternity Case founder child query hypothesis Measurements for 12 DNA markers on all 6 individuals Enter data, “propagate” through system Overall Likelihood Ratio in favour of paternity: 1300

MORE COMPLEX DNA CASES Mutation Silent/missed alleles,… Mixed crime stains –rape –scuffle Multiple perpetrators and stains Database search Contamination, laboratory errors –…–…

MUTATION mendel mut + appropriate network mut to describe mutation process

e.g. proportional mutation: founder Prob(otherg) ~ mutation rate mut – or build other, more realistic, models

SILENT ALLELES Code by additional allele (99) gene genotype unobserved + inherited e.g. 5 = 5/5 or 5/s

MISSED ALLELES genotype geneobs unobserved + non-inherited geneobs

COMBINATION Can combine any or all of above features (and others), by using all appropriate subnetworks Can use any desired pedigree network –no visible difference at top level Simply enter data (and desired parameter- values) and propagate…

Effect of accounting for silent allele Simple paternity testing Paternity testing with additional measured individuals

Marker VWA (Austro-German population allele frequencies)

Simple paternity testing – allowing for silent alleles Simple paternity testing – allowing for silent alleles

pr(silent)LR mgt = 12/20 pfgt = 13 cgt = 12 Paternal incompatibility p 12 = – rare allele with mutation ~ 0.005

pr(silent)LR 0 Impossible mgt = 16 pfgt = 18 cgt = 18 The mother must have passed a silent allele to the child –who must have inherited allele 18 from his father Maternal incompatibility

Paternity testing

Paternity testing with brother too

Overall likelihood ratio is Consider additional information carried by the brother’s data B: where D denotes data on triplet ( pf, c, m )

mgt = 12/15 pfgt = 14 cgt = 12 Incompatible triplet 16/20 12/ p(silent)LR D LR B B = p 22 =.0003 *Maximum LR overall is 1027, at p(silent) = *

mgt = 12/15 pfgt = 13 cgt = 12/13 Compatible triplet 13 13/1621/2222 p(silent)LR D LR B B =

Extensions Estimation of mutation rates from paternity data Peak area data – mixtures – contamination – low copy number

Network to estimate mutation rate

Marker:D8D18D21 Alleles: Peak Area (RFUs): Suspect alleles in yellow Excerpt of data on 6 markers from Evett et al. (1998) Mixed crime trace

Mixed crime trace – alleles only

Mixed crime trace – peak areas

Marker:D8D18D21 Alleles: Peak area: Mixed crime trace + 3 more… LR (alleles only): 25,000 LR (peak areas too): 170,000,000

Thanks to: Steffen Lauritzen Robert Cowell and The Leverhulme Trust