Condor: BLAST Rob Quick Open Science Grid Indiana University.

2012 Africa Grid School

Before we begin…
Any questions on the lectures or exercises up to this point?

I hope you’re not getting too tired.

BLAST

Up to now, you’ve done toy examples:
- Simple, easy to use
- Illustrate the basics of what you need to know
- The Mandelbrot set is cool… but a toy

Let's try out a real application: BLAST
- More complex, not so easy to use

First, some honesty

I am a computer scientist.
I am not a biologist.
My knowledge of BLAST is shallow.
But it’s a way cooler application than what we’ve done so far!

BLAST Description

From the BLAST web page:

“The Basic Local Alignment Search Tool (BLAST) finds regions of local similarity between sequences. The program compares nucleotide or protein sequences to sequence databases and calculates the statistical significance of matches. BLAST can be used to infer functional and evolutionary relationships between sequences as well as help identify members of gene families.”

BLAST Description (my understanding)

Biologists have sequences:
- Nucleotides in DNA: ACGTTGCA…
- Amino acids in proteins: GECVASR…

They also have databases of lots of sequences:
- From lots of organisms, from tiny bacteria to humans

BLAST helps them answer questions:
- Which bacterial species have a protein that is related in lineage to another protein?
- What other genes encode proteins that exhibit structures or motifs such as ones that have just been determined?
- …

BLAST is widely used and considered important.

Is this just string comparison?

It’s harder than just comparing two strings: Is “GCTA == GCTA”?
BLAST can find “similar” sequences, based on metrics that biologists determine.
- “Similar” means this is more computationally expensive than just string comparison

BLAST is a very popular program for asking these questions.
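To make the difference between "equal" and "similar" concrete, here is a minimal Python sketch. It is not how BLAST works internally (BLAST uses faster heuristics and scoring matrices chosen by biologists); it is a toy Smith-Waterman local alignment, and the match, mismatch, and gap values are assumptions picked only for illustration.

```python
def local_alignment_score(a, b, match=2, mismatch=-1, gap=-2):
    """Best local alignment score between a and b (toy Smith-Waterman).

    The scoring values are illustrative assumptions, not BLAST defaults.
    """
    best = 0
    prev = [0] * (len(b) + 1)  # previous row of the scoring table
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            # Extend the alignment diagonally with a match or a mismatch.
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # A local alignment may restart anywhere, so never go below 0.
            curr[j] = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            best = max(best, curr[j])
        prev = curr
    return best


# Plain string comparison only answers yes or no.
print("GCTA" == "GCTT")
# A similarity score says how closely two sequences match.
print(local_alignment_score("GCTA", "GCTA"))
print(local_alignment_score("GCTA", "GCTT"))
```

The equality test is a single pass over the strings, while the scoring table needs work proportional to the product of the two lengths. That is the sense in which "similar" costs more than "equal", and it is why searching a large database is a good fit for many independent jobs.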

BLAST exercise

The final set of exercises has you run queries with BLAST.
They are a bit arbitrary, because I know less about the underlying biology.
But it’s a real application with real data!
Your challenge: run a bunch of BLAST queries and summarize the results. Do it all within a DAG.
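One possible shape for such a DAG, sketched as a small Python script that writes out DAGMan text: one job per BLAST query, plus a final job that runs only after all queries finish. The job names, the `.sub` submit file names, and the `summarize` step are all hypothetical; the slides do not say how the exercise files are laid out, so treat this as an illustration rather than the exercise solution.

```python
def build_dag(query_names, summary_submit="summarize.sub"):
    """Return DAGMan text: one job per query, then a summarize job.

    Assumes (hypothetically) that each query has a submit file named
    <query>.sub and that summary_submit describes the summarizing job.
    """
    if not query_names:
        raise ValueError("need at least one query")
    lines = []
    job_names = []
    for index, query in enumerate(query_names):
        job = f"blast{index}"
        job_names.append(job)
        # JOB <name> <submit file> declares one node of the DAG.
        lines.append(f"JOB {job} {query}.sub")
    lines.append(f"JOB summarize {summary_submit}")
    # The summarize node may start only after every query node succeeds.
    lines.append(f"PARENT {' '.join(job_names)} CHILD summarize")
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    print(build_dag(["query1", "query2", "query3"]), end="")
```

Because the query jobs do not depend on each other, DAGMan is free to run them in parallel, and the single PARENT/CHILD line is what makes the summary wait for all of them.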

Time to try it out!

Questions?

Questions? Comments?
Feel free to ask me questions later: Rob Quick