Peptide Binding motif Height of a column equal to I Relative height of a letter is p High information positions HLA-A0201
NetMHC-3.2 www.cbs.dtu.dk/services/NetMHC-3.2 79 ANN’s Covering Human, Primate and Mouse MHC Predictions can be made for 8-11 mer petides
The IMGT/HLA Sequence Database currently encompass more than 1500 HLA class I proteins Source: http://www.anthonynolan.com/HIG/index.html HLA polymorphism
< 70 HLA alleles are characterized by binding data Reliable MHC class I binding predictions (NetMHC-3.2) for 57 HLA A and B molecules No methods for HLA-C, and HLA-E Long way to over 1500!
More MHC molecules: more diversity in the presented peptides 1% probability that MHC molecule presents a peptide Different hosts sample different peptides from same pathogen.
HLA polymorphism Few human beings will share the same set of HLA alleles –Different persons will react to a pathogen infection in a non-similar manner A CTL based vaccine must include epitopes specific for each HLA allele in a population –A CTL based vaccine must consist of ~1000 HLA class I epitopes
Pan-specific method The contact residues are defined as being within 4.0 Å of the peptide in any of a representative set of HLA-A and -B structures with nonamer peptides. Only polymorphic residues from A, B, and C alleles are included Pseudo-sequence consisting of 34 amino acid residues. Include polymorphic residues in potential contact with the bound peptide
Evaluation. MHC ligands from SYFPEITHI Sort on binding Top Rank: F-rank=0.0 Random Rank: F-rank=0.5
SYFPEITHI benchmark (1400 ligands restricted to 46 HLA molecules)
Prediction Primate MHCs Can we predict binding specificities for non-human primates using the NetMHCpan method trained on human specificity data only?
Yes. Monkey are just like humans Sidney et al. (2006) Patr A*0101 Patr B*0101
And even Pigs and Cows are (somewhat) like humans
Epitope based vaccines and diagnostics Challenges Identify epitopes in pathogen genome A small viral genome contains >> 1000 potential CTL epitopes HLA diversity No two humans will induce the same reaction to a pathogen infection Viral escape No two viral strains will “host” the same set of T cell epitopes
Rational epitope selection We have more than 2000 MHC molecules We have more than 500 different pathogenic strains How to design a method to select a small pool of peptides that will cover both the MHC polymorphism and the pathogen diversity? –No peptide will bind to all MHC molecules and few (maybe even no) peptides will be present in all pathogenic strains
Polyvalent vaccines The equivalent of this in epitope based vaccines is to select epitopes in a way so that they together cover all strains. Strain 1 Strain 2 Strain 1 Strain 2 Epitope Uneven coverage, Average coverage = 2 Even coverage, Average coverage = 2
All HIV responsive patients respond to at least one of nine peptides Perez et al., JI, 2008
Class II MHC binding MHC class II binds peptides in the class II antigen presentation pathway Binds peptides of length 9-18 (even whole proteins can bind!) Binding cleft is open Binding core is 9 aa
HLA class II polymorphism More than 2000 HLA class II allele combinations –HLA-DR –HLA-DQ –HLA-DP Only data for 14 of the more than 500 known HLA-DR allele (< 3%) No data for HLA-DQ and HLA-DP
MTB (mycobacterium tuberculosis) Bacterial genome coding for more then 4000 proteins 700 known epitopes, found in only 30 proteins (ORFs)
MTB (mycobacterium tuberculosis) Bacterial genome coding for more then 4000 proteins 700 known epitopes, found in only 30 proteins (ORFs) Is this biology, or history? –More than 500.000 unique 9mer peptides –Where to start? Each HLA allele will binding ~5000 of these peptides..
Conclusions Rational epitope discovery is feasible –Prediction method is an important guide for epitope identification –Given a protein sequence and an HLA molecule, we can predict the peptide binders (the needle in the haystack) Pan-specific MHC prediction method can deal with the immense MHC polymorphism Epitope selection strategies can deal with pathogen diversity For large pathogens, we still have no handle on how to select immunogenic proteins (we cannot find the haystack)
CBS immunology web servers www.cbs.dtu.dk/services
Acknowledgements Immunological Bioinformatics group, CBS, DTU –Ole Lund - Group leader –Claus Lundegaard - Data bases, HLA binding predictions Collaborators –IMMI, University of Copenhagen Søren Buus: MHC binding –La Jolla Institute of Allergy and Infectious Diseases A. Sette, B. Peters: Epitope database and many, many more www.cbs.dtu.dk/services