Aidan Budd, EMBL Heidelberg Multiple Sequence Alignments.

Slides:



Advertisements
Similar presentations
Secondary structure prediction from amino acid sequence.
Advertisements

Blast to Psi-Blast Blast makes use of Scoring Matrix derived from large number of proteins. What if you want to find homologs based upon a specific gene.
Homework Assignments due next session 1.Find a entry of interest in OMIM ( )
Bioinformatics Finding signals and motifs in DNA and proteins Expectation Maximization Algorithm MEME The Gibbs sampler Lecture 10.
© Wiley Publishing All Rights Reserved. Analyzing Protein Sequences.
Sequence Similarity Searching Class 4 March 2010.
Progressive MSA Do pair-wise alignment Develop an evolutionary tree Most closely related sequences are then aligned, then more distant are added. Genetic.
Multiple alignment June 29, 2007 Learning objectives- Review sequence alignment answer and answer questions you may have. Understand how the E value may.
Multiple sequence alignment and alignment editor.
Sequence similarity.
Similar Sequence Similar Function Charles Yan Spring 2006.
Sequence Alignment III CIS 667 February 10, 2004.
Multiple Sequence Alignments
Single Motif Charles Yan Spring Single Motif.
Making Sense of DNA and protein sequence analysis tools (course #2) Dave Baumler Genome Center of Wisconsin,
Chapter 5 Multiple Sequence Alignment.
Multiple Sequence Alignments. Multiple Alignments Generating multiple alignments  Web servers Analyzing a multiple alignment  what makes a ‘good’ multiple.
Multiple sequence alignment
Multiple sequence alignment Monday, December 6, 2010 Bioinformatics J. Pevsner
Wellcome Trust Workshop Working with Pathogen Genomes Module 3 Sequence and Protein Analysis (Using web-based tools)
An Introduction to Bioinformatics
Multiple sequence alignment (MSA) Usean sekvenssin rinnastus Petri Törönen Help contributed by: Liisa Holm & Ari Löytynoja.
Multiple Sequence Alignment School of B&I TCD May 2010.
Basic Introduction of BLAST Jundi Wang School of Computing CSC691 09/08/2013.
Protein Sequence Alignment and Database Searching.
Sequence Alignment Techniques. In this presentation…… Part 1 – Searching for Sequence Similarity Part 2 – Multiple Sequence Alignment.
Sequence analysis: Macromolecular motif recognition Sylvia Nagl.
Multiple Alignment and Phylogenetic Trees Csc 487/687 Computing for Bioinformatics.
Aidan Budd, EMBL Heidelberg Pairwise Alignments and Sequence Similarity-Based Searching.
Neural Networks for Protein Structure Prediction Brown, JMB 1999 CS 466 Saurabh Sinha.
Pairwise Sequence Alignment. The most important class of bioinformatics tools – pairwise alignment of DNA and protein seqs. alignment 1alignment 2 Seq.
Sequence Alignments Wednesday 6th October 2010 Aidan Budd Structural and Computational Biology Unit EMBL Heidelberg, Germany.
Multiple Alignments Motifs/Profiles What is multiple alignment? HOW does one do this? WHY does one do this? What do we mean by a motif or profile? BIO520.
DNA alphabet DNA is the principal constituent of the genome. It may be regarded as a complex set of instructions for creating an organism. Four different.
Sequence Based Analysis Tutorial NIH Proteomics Workshop Lai-Su Yeh, Ph.D. Protein Information Resource at Georgetown University Medical Center.
Applied Bioinformatics Week 8 Jens Allmer. Practice I.
Multiple sequence alignments Introduction to Bioinformatics Jacques van Helden Aix-Marseille Université (AMU), France Lab.
Construction of Substitution Matrices
Multiple sequence alignment
Protein Evolution: Introduction to Protein Structure and Function protEvolEllsEmblSept2009 Please open the.
Copyright OpenHelix. No use or reproduction without express written consent1.
Comparing Sequences AND Multiple Sequence Alignment Bioinformatics
Pattern Matching Rhys Price Jones Anne R. Haake. What is pattern matching? Pattern matching is the procedure of scanning a nucleic acid or protein sequence.
Multiple Alignment and Phylogenetic Trees Csc 487/687 Computing for Bioinformatics.
Manually Adjusting Multiple Alignments Chris Wilton.
Basic Local Alignment Search Tool BLAST Why Use BLAST?
(PSI-)BLAST & MSA via Max-Planck. Where? (to find homologues) Structural templates- search against the PDB Sequence homologues- search against SwissProt.
Sequence Based Analysis Tutorial March 26, 2004 NIH Proteomics Workshop Lai-Su L. Yeh, Ph.D. Protein Science Team Lead Protein Information Resource at.
Applied Bioinformatics Week 8 Jens Allmer. Theory I.
Sequence Alignment.
Construction of Substitution matrices
Protein backbone Biochemical view:
Copyright OpenHelix. No use or reproduction without express written consent1.
DNA makes RNA  Transcription RNA makes Proteins  Translation Information flows from genes  proteins – But not the other way! (usually)
1 Multiple Sequence Alignment(MSA). 2 Multiple Alignment Number of sequences >2 Global alignment Seek an alignment that maximizes score.
V diagonal lines give equivalent residues ILS TRIVHVNSILPSTN V I L S T R I V I L P E F S T Sequence A Sequence B Dot Plots, Path Matrices, Score Matrices.
Multiple Sequence Alignment Carlow IT Bioinformatics November 2006.
InterPro Sandra Orchard.
9/6/07BCB 444/544 F07 ISU Dobbs - Lab 3 - BLAST1 BCB 444/544 Lab 3 BLAST Scoring Matrices & Alignment Statistics Sept6.
Multiple Sequence Alignment
Multiple sequence alignment (msa)
Secondary structure prediction
Genome Center of Wisconsin, UW-Madison
There are four levels of structure in proteins
Sequence Based Analysis Tutorial
Adva Yeheskel Bioinformatics Unit, Tel Aviv University 8/5/2018
Sequence Based Analysis Tutorial
Basic Local Alignment Search Tool
Explore Evolution: Instrument for Analysis
Protein domains Jasmin sutkovic
Presentation transcript:

Aidan Budd, EMBL Heidelberg Multiple Sequence Alignments

Aidan Budd, EMBL Heidelberg Build a Sequence Alignment Sequences are usually aligned automatically MUSCLE, PRANK, CLUSTAL etc. Also possible 'manually' using tools such as JalView Hopefully, these demonstrations will highlight that Alignment is "trivial" (at one level, at least) only involves putting gap characters in the right places

Aidan Budd, EMBL Heidelberg Build an Automatic MSA Search Internet for "EBI Muscle"

Aidan Budd, EMBL Heidelberg Build an Automatic MSA Copy and paste sequences in FASTA format Click "Submit"

Aidan Budd, EMBL Heidelberg Build an Automatic MSA Wait for result to be returned Click "Download Alignment File" to reach plain-text version of alignment

Aidan Budd, EMBL Heidelberg Build an Automatic MSA Download file or copy-paste text into text editor to store alignment on local computer View alignment in MSA viewer (e.g. JalView) etc.

Aidan Budd, EMBL Heidelberg Choosing an MSA tool CLUSTALX, MUSCLE, PROBCONS divergent protein sequences NAST multiple alignment of 16S rRNA genes PRANK multiple alignment of relatively similar DNA sequences in an evolutionary context EXPRESSO(3DCoffee) multiple alignment of protein sequences, some of which have 3D structural information MAUVE, Enredo multiple alignment of genomes and many others... Different tools designed for different tasks

Aidan Budd, EMBL Heidelberg Examining MSAs: Recognising Patterns

Aidan Budd, EMBL Heidelberg Only one of many possible colouring schemes Good at highlighting variation in conservation between Designed for red/green colour-blindness CLUSTALX Colouring Scheme extract from an alignment of p53 proteins

Aidan Budd, EMBL Heidelberg Amino acids with similar properties drawn with the same colour extract from an alignment of p53 proteins e.g. basic residues arginine (R) and lysine (K) CLUSTALX Colouring Scheme

Aidan Budd, EMBL Heidelberg extract from an alignment of p53 proteins EXCEPT for P and G, which are always coloured Residues only coloured... e.g. lysine (K) in columns with: only "a few" other basic residues (uncoloured) "many" other basic residues (coloured)... if some proportion of residues in the column have the same property CLUSTALX Colouring Scheme

Aidan Budd, EMBL Heidelberg Hydrophobic: L V I M F W A C Polar: N T S Q Acidic: D E Basic: K R Secondary-structure breaking: GP Large Aromatic Polar: H Y (CLUSTALX help file fully describes the default colouring rules) CLUSTALX Colouring Scheme

Aidan Budd, EMBL Heidelberg bin/homstrad/showpage.cgi?family=response_reg&disp=str Response regulator receiver domain Common Patterns - Buried Beta- Strand

Aidan Budd, EMBL Heidelberg bin/homstrad/showpage.cgi?family=response_reg&disp=str Response regulator receiver domain Common Patterns - Amphipathic Partially-Buried Alpha-Helices

Aidan Budd, EMBL Heidelberg ubiquitin conjugating enzyme Common Patterns - Amphipathic Beta Strands

Aidan Budd, EMBL Heidelberg Different, more strongly biased (from equal representation of each of the 20 amino acids), sequence composition Sometimes more variable sequence more substitutions more gaps than globular/structured regions) Common Patterns - Non-Globular Sequence

Aidan Budd, EMBL Heidelberg [RK].L.{0,1}[FYLIVMP] LIG_CYCLIN_1 Mostly occur in disordered protein regions Often show greater conservation than neighbouring sequence Common Patterns - Short Linear Motifs

Aidan Budd, EMBL Heidelberg K Identifying Mis-Aligned Regions Identify a region of a sequence that you think is misaligned Decide how you would "fix" this misalignment Look at patterns of conservation, and sequences which

Aidan Budd,EMBL Heidelberg Unusual Sequences: Examples With CLUSTALX “”Quality”->”Show Low-Scorring Segments” switched on Short/fragmented sequences Unusual pattern of "conservation"

Aidan Budd, EMBL Heidelberg Using MSAs to Improve Prediction of Linear Motifs Demonstration and Exercise