CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Project in Immunological Bioinformatics Morten Nielsen, CBS, BioCentrum, DTU.

Slides:



Advertisements
Similar presentations
193 G 10. G 194 G G 10 G 197 G
Advertisements

Major Histocompatibility Complex. Principles of Immune Response Highly specific recognition of foreign antigens Mechanisms for elimination of microbes.
Improvement of Yin Yang site prediction by incorporating the interplay between phosphorylation and O-GlcNAcylation Chao Ji, Yinxing Guo, Quan Zhang.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Neural Network training Morten Nielsen, CBS, BioCentrum, DTU.
Performance measures Morten Nielsen, CBS, Department of Systems Biology, DTU.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS Technical University of Denmark - DTU Department of systems biology B cell epitopes and predictions.
Bioinformatical design of a vaccine against influenza virus N1 subtype Bonaccorsi, Irene; Clausen, Martin Bau; Høj, Leif Howalt; Kjær, Jesper and Sayyad,
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Sequence information, logos and Hidden Markov Models Morten Nielsen, CBS, BioCentrum,
HeleneAndersenMayaBondeAndersenSimonCarlsen MortenAhlgreenGronemann&MadsChristianHjortsø Figure 4, Alignment of the NS3 protein: Alignment of NS3 performed.
Gibbs sampling Morten Nielsen, CBS, BioSys, DTU. Class II MHC binding MHC class II binds peptides in the class II antigen presentation pathway Binds peptides.
Biological Databases Morten Nielsen BioSys, DTU. Different kinds of data DNA –NCBI GenBankNCBI GenBank –Organism specific databases Protein –UniProt SwissProt.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU T cell Epitope predictions using bioinformatics (Neural Networks and hidden.
Slide 1 of 38 T-cell EPITOPES PREDICTION OF HEMAGGLUTININ, NEURAMINIDASE AND MATRIX PROTEIN OF INFLUENZA A VIRUS USING SUPPORT VECTOR MACHINE AND HIDDEN.
Prediction of B cell epitopes Pernille Haste Andersen Immunological Bioinformatics CBS, DTU
© Tan,Steinbach, Kumar Introduction to Data Mining 4/18/ Other Classification Techniques 1.Nearest Neighbor Classifiers 2.Support Vector Machines.
An Analysis of “Coronavirus 3CL pro proteinase cleavage sites: Possible relevance to SARS virus pathology” Connie Wu.
Computer Aided Vaccine Design Dr G P S Raghava. Concept of Drug and Vaccine Concept of Drug Concept of Drug –Kill invaders of foreign pathogens –Inhibit.
Artificial Neural Networks 2 Morten Nielsen BioSys, DTU.
Algorithms in Bioinformatics Morten Nielsen BioSys, DTU.
Biological sequence analysis and information processing by artificial neural networks.
Class I pathway Prediction of proteasomal cleavage and TAP binding.
Artificial Neural Networks 2 Morten Nielsen Depertment of Systems Biology, DTU.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modeling Anne Mølgaard, CBS, BioCentrum, DTU.
Model Evaluation Metrics for Performance Evaluation
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS Department of Systems Biology Technical University of Denmark Immunological Bioinformatics Processing, combined.
Biological sequence analysis and information processing by artificial neural networks Morten Nielsen CBS.
Bioinformatics pipeline for detection of immunogenic cancer mutations by high throughput mRNA sequencing Jorge Duitama 1, Ion Mandoiu 1, and Pramod Srivastava.
CS 8751 ML & KDDEvaluating Hypotheses1 Sample error, true error Confidence intervals for observed hypothesis error Estimators Binomial distribution, Normal.
Performance measures Morten Nielsen, CBS, BioCentrum, DTU.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Can protein model accuracy be identified? Morten Nielsen, CBS, BioCentrum, DTU.
Class I pathway Prediction of proteasomal cleavage and TAP binidng Morten Nielsen, CBS, BioCentrum, DTU.
Class I pathway Prediction of proteasomal cleavage and TAP binidng Can Keşmir, TBB, Utrecht University, NL & CBS, BioCentrum, DTU.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU C programming Morten Nielsen, CBS, BioCentrum, DTU.
Artificial Neural Networks Thomas Nordahl Petersen & Morten Nielsen.
Selection of T Cell Epitopes Using an Integrative Approach Mette Voldby Larsen cand. scient. in biology ph.d. student.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Sequence motifs, information content, logos, and HMM’s Morten Nielsen, CBS,
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Can protein model accuracy be identified? Morten Nielsen, CBS, BioCentrum, DTU.
Biological sequence analysis and information processing by artificial neural networks.
Project list 1.Peptide MHC binding predictions using position specific scoring matrices including pseudo counts and sequences weighting clustering (Hobohm)
Immunological databases on the web Ole Lund Center for Biological Sequence Analysis BioCentrum-DTU Technical University of Denmark
Epitope Selection Rational Vaccine design. Why? Therapeutic vaccines Therapeutic vaccines Treatment of viral infections (e.g., HIV, HCV), and resistant.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU B cell epitopes and B cell epitope predictions Morten Nielsen, CBS, BioCentrum,
Prediction of CTL responses Mette Voldby Larsen cand. scient. in biology ph.d. student.
Algorithms in Bioinformatics Morten Nielsen Department of Systems Biology, DTU.
Selection of T Cell Epitopes Using an Integrative Approach Mette Voldby Larsen cand. scient. in Biology PhD in Immunological Bioinformatics.
From Genomic Sequence Data to Genotype: A Proposed Machine Learning Approach for Genotyping Hepatitis C Virus Genaro Hernandez Jr CMSC 601 Spring 2011.
1 Computer-aided Subunit Vaccine Design G.P.S. Raghava, Institute of Microbial Technology, Chandigarh  Understanding immune system  Breaking complex.
Classification Performance Evaluation. How do you know that you have a good classifier? Is a feature contributing to overall performance? Is classifier.
Sequence encoding, Cross Validation Morten Nielsen BioSys, DTU
Project list 1.Peptide MHC binding predictions using position specific scoring matrices including pseudo counts and sequences weighting clustering (Hobohm)
Artificiel Neural Networks 2 Morten Nielsen Department of Systems Biology, DTU IIB-INTECH, UNSAM, Argentina.
What is a Project Purpose –Use a method introduced in the course to describe some biological problem How –Construct a data set describing the problem –Define.
Introduction H5N1 is an avian influenza. It was detected in humans for the first time in 1997 in Hong Kong. Since then the spread to humans has been limited.
Data Mining Practical Machine Learning Tools and Techniques By I. H. Witten, E. Frank and M. A. Hall Chapter 5: Credibility: Evaluating What’s Been Learned.
© Devi Parikh 2008 Devi Parikh and Tsuhan Chen Carnegie Mellon University April 3, ICASSP 2008 Bringing Diverse Classifiers to Common Grounds: dtransform.
Computational methods for genomics-guided immunotherapy Sahar Al Seesi Computer Science & Engineering Department, UCONN Immunology Department, UCONN Health.
Artificiel Neural Networks 2 Morten Nielsen Department of Systems Biology, DTU.
1.7 Linear Independence. in R n is said to be linearly independent if has only the trivial solution. in R n is said to be linearly dependent if there.
Final Report (30% final score) Bin Liu, PhD, Associate Professor.
Stabilization matrix method (Ridge regression) Morten Nielsen Department of Systems Biology, DTU.
Prediction of T cell epitopes using artificial neural networks Morten Nielsen, CBS, BioCentrum, DTU.
Artificial Intelligence Research Laboratory Bioinformatics and Computational Biology Program Computational Intelligence, Learning, and Discovery Program.
Performance measures Morten Nielsen, CBS, Department of Systems Biology, DTU.
7. Performance Measurement
Computer-aided Vaccine and Drug Discovery G. P. S
MHCflurry: Open-Source Class I MHC Binding Affinity Prediction
Morten Nielsen, CBS, BioSys, DTU
Artificial Neural Networks Thomas Nordahl Petersen & Morten Nielsen
Evaluating Models Part 1
Artificial Neural Networks Thomas Nordahl Petersen & Morten Nielsen
Presentation transcript:

CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Project in Immunological Bioinformatics Morten Nielsen, CBS, BioCentrum, DTU

CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Selection of epitopes Conservation Proteasome cleavage –Cleavage of C-terminus Tap binding MHC binding CTL binding/activation

CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Prediction methods SYFPEITHI –Welcome to SYFPEITHIWelcome to SYFPEITHI Bimas –HLA Peptide Binding PredictionsHLA Peptide Binding Predictions

CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU EasyPred Web interface for training of weight matrices and neural networks Weight matrices –Sequence weighting –Pseudo counts Neural networks –Hidden neurons –Splitting of Test/Training data –Cross validated training

CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Weight matrices Is some pattern hidden in the sequences that binds MHC? –First step is to select peptides from the data pool that bind MHC –Next use the methods of sequences weighting and pseudo counts to estimate the weight matrix Evaluate the accuracy of the matrix on an independent data set.

CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Evaluation of prediction accuracy True positive proportion = TP/(AP)False positive proportion = FP/(AN) A roc =0.5 A roc =0.8 Roc curves Pearson correlation TPFP AP AN

CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU MHC peptide binding affinities Transformation o = 1 - log(aff nM)/log(50000) High binder aff < 50nM => o > Intermediate binder aff < 500nM => o > 0.426

CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Test/Training performance Test max 20% 80% Test TRain

CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU 5 fold test/training 20%

CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Your favorite protein Swiss-Prot – Sars sequence –Sars entrySars entry

CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Combination with Proteasomal cleavage Prediction of MHC binding 227 RLNQLESKV LLLDRLNQL ILLNKHIDA IVWVATEGA LLNKHIDAY GLPNNTASW YLGTGPEAS QIAQFAPSA QLQNSMSGA TTLPKGFYA QLPQGTTLP ELSPRWYFY TLLPAADMD ALNTPKDHI LLDRLNQLE TLPKGFYAE LLLLDRLNQ RLNQLESKV S..SS Netchop prediction server

CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Project plan Follow the project description in detail Write down detailed notes as you go through the project Aflever kort resultat summery tirsdag d. 27. April (1-2 sider) kl 13.00