BIOC3010: Bioinformatics - Revision lecture Dr. Andrew C.R. Martin

Slides:



Advertisements
Similar presentations
Lecture 2 Strachan and Read Chapter 13
Advertisements

Applications of genome sequencing projects 1) Molecular Medicine 2) Energy sources and environmental applications 3) Risk assessment 4) Bioarchaeology,
applications of genome sequencing projects
Background Goals Methods Results Conclusions Implications.
Protein Threading Zhanggroup Overview Background protein structure protein folding and designability Protein threading Current limitations.
Molecular Docking G. Schaftenaar Docking Challenge Identification of the ligand’s correct binding geometry in the binding site ( Binding Mode ) Observation:
Structural bioinformatics
Two Examples of Docking Algorithms With thanks to Maria Teresa Gil Lucientes.
Physical Mapping I CIS 667 February 26, Physical Mapping A physical map of a piece of DNA tells us the location of certain markers  A marker is.
Computational Molecular Biology (Spring’03) Chitta Baral Professor of Computer Science & Engg.
Bioinformatics Ch1. Introduction (continue-2) 阮雪芬 Nov7, 2002 NTUST
Introduction to Genomics, Bioinformatics & Proteomics Brian Rybarczyk, PhD PMABS Department of Biology University of North Carolina Chapel Hill.
. Protein Structure Prediction [Based on Structural Bioinformatics, section VII]
An Integrated Approach to Protein-Protein Docking
BL5203: Molecular Recognition & Interaction Lecture 5: Drug Design Methods Ligand-Protein Docking (Part I) Prof. Chen Yu Zong Tel:
Genomes summary 1.>930 bacterial genomes sequenced. 2.Circular. Genes densely packed Mbases, ,000 genes 4.Genomes of >200 eukaryotes (45.
Protein Structures.
Human Molecular Genetics Section 14–3
LSM2104/CZ2251 Essential Bioinformatics and Biocomputing Essential Bioinformatics and Biocomputing Protein Structure and Visualization (3) Chen Yu Zong.
Bioinformatics Jan Taylor. A bit about me Biochemistry and Molecular Biology Computer Science, Computational Biology Multivariate statistics Machine learning.
Overview of Bioinformatics A/P Shoba Ranganathan Justin Choo National University of Singapore A Tutorial on Bioinformatics.
Inverse Kinematics for Molecular World Sadia Malik April 18, 2002 CS 395T U.T. Austin.
Protein Tertiary Structure Prediction
Construyendo modelos 3D de proteinas ‘fold recognition / threading’
Development of Bioinformatics and its application on Biotechnology
Biotechnology SB2.f – Examine the use of DNA technology in forensics, medicine and agriculture.
Analyzing DNA Differences PHAR 308 March 2009 Dr. Tim Bloom.
Lecture 10 – protein structure prediction. A protein sequence.
CS 790 – Bioinformatics Introduction and overview.
Representations of Molecular Structure: Bonds Only.
Genome Organization and Evolution. Assignment For 2/24/04 Read: Lesk, Chapter 2 Exercises 2.1, 2.5, 2.7, p 110 Problem 2.2, p 112 Weblems 2.4, 2.7, pp.
Protein Folding Programs By Asım OKUR CSE 549 November 14, 2002.
Chapter 13 Table of Contents Section 1 DNA Technology
Modelling Genome Structure and Function Ram Samudrala University of Washington.
Protein Structure & Modeling Biology 224 Instructor: Tom Peavy Nov 18 & 23, 2009
Applied Bioinformatics Week 12. Bioinformatics & Functional Proteomics How to classify proteins into functional classes? How to compare one proteome with.
Biological Signal Detection for Protein Function Prediction Investigators: Yang Dai Prime Grant Support: NSF Problem Statement and Motivation Technical.
Introduction to Bioinformatics Dr. Rybarczyk, PhD University of North Carolina-Chapel Hill
Eukaryotic Genomes  The Organization and Control of Eukaryotic Genomes.
Chapter 5 The Content of the Genome 5.1 Introduction genome – The complete set of sequences in the genetic material of an organism. –It includes the.
Protein Structure Prediction: Homology Modeling & Threading/Fold Recognition D. Mohanty NII, New Delhi.
Central dogma: the story of life RNA DNA Protein.
Bioinformatics and Computational Biology
Human Genomics. Writing in RED indicates the SQA outcomes. Writing in BLACK explains these outcomes in depth.
Protein Folding & Biospectroscopy Lecture 6 F14PFB David Robinson.
Chapter 12 Assessment How could manipulating DNA be beneficial?
Structural classification of Proteins SCOP Classification: consists of a database Family Evolutionarily related with a significant sequence identity Superfamily.
BIOINFORMATICS Ayesha M. Khan Spring 2013 Lec-8.
Protein Tertiary Structure Prediction Structural Bioinformatics.
Notes: Human Genome (Right side page)
Molecular mechanics Classical physics, treats atoms as spheres Calculations are rapid, even for large molecules Useful for studying conformations Cannot.
Human Genomics Higher Human Biology. Learning Intentions Explain what is meant by human genomics State that bioinformatics can be used to identify DNA.
Molecular Modeling in Drug Discovery: an Overview
1 Finding disease genes: A challenge for Medicine, Mathematics and Computer Science Andrew Collins, Professor of Genetic Epidemiology and Bioinformatics.
Looking Within Human Genome King abdulaziz university Dr. Nisreen R Tashkandy GENOMICS ; THE PIG PICTURE.
bacteria and eukaryotes
APPLICATIONS OF BIOINFORMATICS IN DRUG DISCOVERY
Protein Structure Prediction and Protein Homology modeling
Virtual Screening.
Genome organization and Bioinformatics
Introduction to Bioinformatics II
Ligand Docking to MHC Class I Molecules
An Integrated Approach to Protein-Protein Docking
Homology Modeling.
Protein structure prediction.
Introduction to Bioinformatic
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Presentation transcript:

BIOC3010: Bioinformatics - Revision lecture Dr. Andrew C.R. Martin

Data Creation Analysis Prediction Presentation Searching Organizing Sequences DNA Protein Computers Structures

Introductionary Lecture

Introduction helps you create data example of fragment assembly Bioinformatics…

Introduction provides tools to store and search data databases and databanks primary/secondary/composite/gateways Bioinformatics…

Introduction allows you to make predictions prediction techniques –moving windows, –computer learning Bioinformatics…

Introduction allows you to create 3D models separate lecture Bioinformatics…

Introduction allows transfer of annotations homologous proteins likely to perform similar functions Bioinformatics…

Introduction Annotations… Pre-genome world Post-genome world Annotations will change

Genomes and Gene Prediction Lectures

Genome structure C-paradox Compare prokaryotes and eukaryotes Complexity of eukaryotes: –Introns/exons, –Repeated sequences, –Transposable elements, –Pseudogenes Problems introduced by these...

ORF Scanning in Eukaryotes exonintron exon 5’ 3’ Intron/exon splice sites

Finding Genes in Genomic DNA Ab initio methods Similarity based methods Integrated approaches TRY4

Prediction accuracy Nucleotide level Exon level Measures for assessment

Computing Lecture

Computers

Operating systems What is an operating system? Examples of operating systems Choice of operating systems for different areas of research

Computers and computer science Data structures and information retrieval –Relational databases –Design of databases to reduce errors in data Simple examples of SQL and structuring data into tables Must handle:

Computers and computer science Algorithms: how to solve a problem –Defined an algorithm –Looked at an example Must handle:

Computers and computer science Data mining and machine learning –Extract patterns, etc from data –Computer software which learns from examples and is then able to make predictions Must handle:

Comparative Modelling Lecture

What is comparative modelling? Build a three-dimensional (3D) model of a protein... …based on known structure of a (generally) homologous protein sequence "Homology Modelling" is misleading: –fold recognition and threading allow recognition of non-homologous sequences which adopt the same fold

Stages in CM 1. Identify templates (or ‘parents’)‏ 2. Align the target sequence with the parent(s), 3. Find: structurally conserved regions structurally variable regions 4. Inherit the SCRs from the parent(s)‏ 5. Build the SVRs 6. Build the sidechains 7. Refine the model 8. Evaluate errors in the model

Correct alignment is the structural alignment. Align target with parent(s)‏ Structure of Target Optimal alignment based on Structural Equivalents Structure of Parent We don’t have this! Guess structural alignment from sequence alignment

An example MLSA

Sequence alignment quality

Assessing the model Ideal is to compare the model with the true target structure - 4-6Å; 2Å; 0.5Å

Model quality The main factors are:  The sequence identity with the primary parent  The number and size of indels  The quality of the alignment  The amount of change which has been necessary to the parent(s) to create the model.

Summary of CASP2 results CASP8 ran summer

Medical Applications Lecture

Mutations, Alleles & Polymorphisms Mutation: –any change in DNA sequence Allele: –alternative form of a genetic locus; one inherited from each parent –e.g. eye colour locus - brown and blue alleles Polymorphism: –genetic variation present in >= 1% of a normal population

How are SNPs useful? Understanding evolution –Some alleles may be advantageous in one environment, but disadvantageous in another DNA fingerprinting Markers to map traits –diseases, characteristics Pharmacogenomics –genotype-specific medications

Drug responses Drug efficacy may be affected by: transporters metabolism receptors signalling pathways, etc.

Potentially lethal SNPs First described ~2000 years ago “What is food to some men may be fierce poison to others” Lucretius Caro

Protein Sequence DNA Sequence Protein StructureProtein Function Mutation Altered Sequence Altered Structure Altered Function Understand Structure & Function Restore Structure Restored Function Design Drugs

Looked at p53... Local level - effects of mutations General classes –Functional –Fold Preventing –Destabilizing Types of mutations

How human? Chimeric: 67% human Humanized: 90% human Mouse: 0% human

Antibody Humanization

Summary –Diagnosis of disease –Prediction of disease risk –Prognosis –Customized response to disease –Identifying drug targets - treatments –Engineering of proteins for therapy

Docking and Drug Design Lecture

Van der Waals forces Electrostatic (Salt bridge) Interaction Hydrogen bonds Hydrophobic bonding Surface complementarity

Six degrees of freedom - protein and ligand both treated as rigid - 3 rotations / 3 translations Docking methods - rigid body Just like docking the space shuttle with a satellite Image from NASA

Treat receptor as static / ligand as flexible Dock ligand into binding pocket - generate large number of possible orientations Evaluate and select by energy function Docking methods - flexible ligand

Ligand Matching Match sphere centres against ligand atoms Find possible ligand orientations Often >10,000 orientations possible Find the transformation (rotation + translation) to maximize sphere matching DOCK

Virtual Screening Docking can be used for virtual screening Scan a library of potential drug molecules Identify leads

LUDI (InsightII) - find fragments that can bind GRID - uses molecular mechanics potential to find interaction sites for probe groups X-site - uses an empirical potential to find interaction sites for probe groups De Novo Drug Design

Stupid mistakes... Don't confuse secondary databases with secondary structure! Ensure you know the difference between SCOP/PFam functional domains and CATH structural domains

Summary Find pockets Principles for docking - complementarity Docking –rigid body / ligand flexibility Virtual screening Identifying probe interaction sites –build ligands de novo