Introduction unit 1 BIOL221T: Advanced Bioinformatics for Biotechnology Irene Gabashvili, PhD

Slides:



Advertisements
Similar presentations
1 Genome information GenBank (Entrez nucleotide) Species-specific databases Protein sequence GenBank (Entrez protein) UniProtKB (SwissProt) Protein structure.
Advertisements

NCBI data, sliding window programs and dot plots Sept. 25, 2012 Learning objectives-Become familiar with OMIM and PubMed. Understand the difference between.
Genome databases and webtools for genome analysis Become familiar with microbial genome databases Use some of the tools useful for analyzing genome Visit.
On line (DNA and amino acid) Sequence Information Lecture 7.
Bioinformatics Tutorial I BLAST and Sequence Alignment.
The design, construction and use of software tools to generate, store, annotate, access and analyse data and information relating to Molecular Biology.
How to use the web for bioinformatics Molecular Technologies Ethan Strauss X 1171
Introduction to bioknoppix: Linux for the life sciences Carlos M Rodríguez Rivera Humberto Ortiz Zuazaga.
Bioinformatics for biomedicine Summary and conclusions. Further analysis of a favorite gene Lecture 8, Per Kraulis
Project presentation using TWiki Lim Yun Ping National University of Singapore.
Sequence Analysis MUPGRET June workshops. Today What can you do with the sequence? What can you do with the ESTs? The case of SNP and Indel.
Bioinformatics: a Multidisciplinary Challenge Ron Y. Pinter Dept. of Computer Science Technion March 12, 2003.
Bioinformatics and Phylogenetic Analysis
Welcome to Chem 434 Bioinformatics March 25, 2008 Review of course prerequisites Review of syllabus Review of Bioinformatics Course website Course objectives.
Biology Workbench Introduction. What is it used for? It is a web-browser to use bioinformatics tools to analyze and visualize nucleotide and protein sequences.
Sequence Analysis. Today How to retrieve a DNA sequence? How to search for other related DNA sequences? How to search for its protein sequence? How to.
STAT115 STAT215 BIO512 BIST298 Introduction to Computational Biology and Bioinformatics Spring 2015 Xiaole Shirley Liu Please Fill Out Student Sign In.
Welcome to Introduction to Bioinformatics Computing aka BIC1.
BIO337 Systems Biology/Bioinformatics (course # 50524) Spring 2014 Tues/Thurs 11 – 12:30 PM BUR 212 Edward Marcotte/Univ. of Texas/BIO337/Spring 2014.
Login: BITseminar Pass: BITseminar2011 Login: BITseminar Pass: BITseminar2011.
Bioinformatics.
1/ 47 COP 3503 FALL 2012 SHAYAN JAVED LECTURE 19 Programming Fundamentals using Java 1.
Basic Introduction of BLAST Jundi Wang School of Computing CSC691 09/08/2013.
Welcome to Introduction to Bioinformatics Computing aka BIC1.
Introduction to Bioinformatics CPSC 265. Interface of biology and computer science Analysis of proteins, genes and genomes using computer algorithms and.
NCBI Review Concepts Chuong Huynh. NCBI Pairwise Sequence Alignments Purpose: identification of sequences with significant similarity to (a)
Copyright OpenHelix. No use or reproduction without express written consent 2 Overview of Genome Browsers Materials prepared by Warren C. Lathe, Ph.D.
Biological Databases and Tools Sandra Sinisi / Kathryn Steiger November 25, 2002.
BME 110L / BIOL 181L Computational Biology Tools Introductory Remarks and Overview - who - why - what - how Logistics.
Copyright © 2010 Pearson Education Inc. Lecture 01 – Genetics & Genomics: An Introduction Based on Chapter 1 – Genetics: An introduction.
Genomics (BIO 426) James Madison University. Why are you here? Have you taught Genomics before? Plan to teach it soon? Might you teach it sometime? Just.
Bioinformatics: Theory and Practice – Striking a Balance (a plea for teaching, as well as doing, Bioinformatics) Practice (Molecular Biology) Theory: Central.

Institute of Biomedical Sciences (ICB) Malaria Nucleus Institute of Mathematics and Statistics (IME) BIOINFO-USP Nucleus Latin American Course on Bioinformatics.
Introduction to Bioinformatics Biostatistics & Medical Informatics 576 Computer Sciences 576 Fall 2008 Colin Dewey Dept. of Biostatistics & Medical Informatics.
REMINDERS 2 nd Exam on Nov.17 Coverage: Central Dogma of DNA Replication Transcription Translation Cell structure and function Recombinant DNA technology.
+ => Bioinformatics: from Sequence to Knowledge Outline: Introduction to bioinformatics The TAU Bioinformatics unit Useful bioinformatics issues and databases:
Development of an Information Service Program in Molecular Biology and Genetics Ansuman Chattopadhyay, PhD Information Specialist in Molecular Biology.
AdvancedBioinformatics Biostatistics & Medical Informatics 776 Computer Sciences 776 Spring 2002 Mark Craven Dept. of Biostatistics & Medical Informatics.
BIOLOGICAL DATABASES. BIOLOGICAL DATA Bioinformatics is the science of Storing, Extracting, Organizing, Analyzing, and Interpreting information in biological.
Class material and homework for February 9 today’s in-class topic: selected examples of contemporary biotechnology –polymerase chain reaction (PCR) –DNA.
Bioinformatics Curriculum Issues, goals, curriculum.
Bioinformatics for biologists Dr. Habil Zare, PhD PI of Oncinfo Lab Assistant Professor, Department of Computer Science Texas State University Presented.
An Introduction to NCBI & BLAST National Center for Biotechnology Information Richard Johnston Pasadena City College.
What is BLAST? Basic BLAST search What is BLAST?
Biotechnology and Bioinformatics: Bioinformatics Essential Idea: Bioinformatics is the use of computers to analyze sequence data in biological research.
BCH339N Systems Biology/Bioinformatics (course # 54040) Spring 2016 Tues/Thurs 11 – 12:30 PM BUR 212.
Presenter: Bradley Green.  What is Bioinformatics?  Brief History of Bioinformatics  Development  Computer Science and Bioinformatics  Current Applications.
Bioinformatics Shared Resource Bioinformatics : How to… Bioinformatics Shared Resource Kutbuddin Doctor, PhD.
Introducing Bioinformatics Using the Nitrogen Cycle Alyssa Bumbaugh Ron Peck Mark Radosevich.
STAT115 STAT215 BIO512 BIST298 Introduction to Computational Biology and Bioinformatics Spring 2016 Xiaole Shirley Liu.
Selection of Resources for the Development of an Information Service Program in Molecular Biology and Genetics Ansuman Chattopadhyay, PhD Information Specialist.
Bioinformatics Professor: Monica Bianchini Department of Information Engineering and Mathematics E–mail: Phone: 1012.
Web Application Development Instructor: Matthew Schurr Please sign in on the sheet at the front of the room when you arrive.
What is BLAST? Basic BLAST search What is BLAST?
Introduction to Bioinformatics and Functional Genomics
Basics of BLAST Basic BLAST Search - What is BLAST?
Hood College Master of Science in Bioinformatics (Proposed)
생물정보학 Bioinformatics.
What is Bioinformatics?
Mangaldai College, Mangaldai
Overview Bioinformatics: Analyzing biological data using statistics, math modeling, and computer science BLAST = Basic Local Alignment Search Tool Input.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
LESSON 1 INTNRODUCTION HYE-JOO KWON, Ph.D /
CSE 5290: Algorithms for Bioinformatics Fall 2009
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Problems from last section
Introduction to Bioinformatics
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Presentation transcript:

Introduction unit 1 BIOL221T: Advanced Bioinformatics for Biotechnology Irene Gabashvili, PhD

Course availability Lectures & Lab: every Wednesday, Duncan Hall, Room 550, 6:00 pm to 9:45 pm Lectures & Lab: every Wednesday, Duncan Hall, Room 550, 6:00 pm to 9:45 pm Office hours: Wednesday, 4pm-6pm (Room 554, phone: ) and by appointment Office hours: Wednesday, 4pm-6pm (Room 554, phone: ) and by appointment Lecture notes will be posted at: Lecture notes will be posted at:

DATES UnitsB&OTopicDue Jan Foreword, Intro, chapter3, lecture notes Introduction: information, databases, programming Survey PS0 Feb- March 1,2,5,11,12Sequence informaticsPS1 PS2 March -April 14, 16, Lecture Notes Network informaticsPS3 April8-10, 17Structure informaticsProjects MayReviewPS4 Exam

Final Grading Scenario 1Scenario 2 PS15%5% PS15%5% PS15%5% PS15%5% PS15%5% Projects20%40% Exam20%40% Voted for Voted against

Survey Compose a short message introducing yourself, your science background, bioinformatics interests and what you hope to learn from taking this course. Compose a short message introducing yourself, your science background, bioinformatics interests and what you hope to learn from taking this course. What bioinformatics databases and tools have you used in your previous courses/projects? What bioinformatics databases and tools have you used in your previous courses/projects? How familiar are you with resources/tools mentioned in this lecture and listed in the Survey? (? = not aware of / 0 = aware of, but never use / 1 = seldom use / 2 = weekly / 3 = daily ) How familiar are you with resources/tools mentioned in this lecture and listed in the Survey? (? = not aware of / 0 = aware of, but never use / 1 = seldom use / 2 = weekly / 3 = daily ) If you were to start a company, what bioinformatics service would you provide or need for the development of your solution? If you were to start a company, what bioinformatics service would you provide or need for the development of your solution?

The bioinformatics project An opportunity to use the tools and approaches taught in this course to research an area of personal interest.

Example 1 Choose a nucleotide or protein sequence with some presumed functional or structural importance, at least 140 residues in length. Define the problem or question, for example: Detection of distantly related (divergent) sequences. Detection of distantly related (divergent) sequences. Detection of sequence homologs in various species. Detection of sequence homologs in various species. Detection of homologous motifs in proteins of varied function. Detection of homologous motifs in proteins of varied function.

Example 1 Abstract Abstract Introduction: define the problem Introduction: define the problem Materials and Methods. Materials and Methods. Multiple sequence alignment figure. Multiple sequence alignment figure. Phylogenetic tree. Phylogenetic tree. Discussion. Discussion.

Example 1 cctgttaaaaatggtaaaattactaatgat  PVKNGKITND  EC Nucleic acid translator  O wl protein db  function & structure  drugs Nucleic acid translator  O wl protein db  function & structure  drugs Q– how many protein sequences? Q– how many protein sequences? BLAST (blastn, blastp?)  clustalw BLAST (blastn, blastp?)  clustalw BLAT  SNPdb BLAT  SNPdb

Example 2 Choose a disease. Find genes responsible or predisposing to this disease. Hypothesize on the disease pathway. Or find genes expressed in diseased tissue, compare to normal, research and report findings OMIM, biol. literature, even google  NCBI Gene  KEGG OMIM, biol. literature, even google  NCBI Gene  KEGG IPA IPA Unigene DDD or GEO DB  Pathway tools Unigene DDD or GEO DB  Pathway tools

Example 2: in the news six more gene regions associated with the severest form of lupus reported last Sunday six more gene regions associated with the severest form of lupus reported last Sunday ITGAM, located on Chromosome 16; ITGAM, located on Chromosome 16; BLK, on Chromosome 8; BLK, on Chromosome 8; KIAA1542, on Chromosome 11; KIAA1542, on Chromosome 11; rs , on Chromosome 1; rs , on Chromosome 1; PXK on Chromosome 3; and PXK on Chromosome 3; and BANK1, on Chromosome 4. BANK1, on Chromosome 4. Genes Linked to Height Also Tied to Osteoarthritis Genes Linked to Height Also Tied to Osteoarthritis Genes Stacked Against Weight Loss? Genes Stacked Against Weight Loss?

Example 3 Assay on New and Notable Assay on New and Notable Personal Genome Services: workflow, shortcomings, future trends (Decode Genetics, 23andMe, Knome, Navigenics) Personal Genome Services: workflow, shortcomings, future trends (Decode Genetics, 23andMe, Knome, Navigenics) Inexpensive whole-genome sequencing technologies Inexpensive whole-genome sequencing technologies

Projects: more ideas Comparing bioinformatics tools: Pathway Analysis Comparing bioinformatics tools: Pathway Analysis Research with Matlab Research with Matlab HCE, TreeView, SAM HCE, TreeView, SAM VectorNTI VectorNTI Visualization: Chimera, CN3D, Pymol Visualization: Chimera, CN3D, Pymol R and other statistics tools R and other statistics tools

Bioinformatics: A Practical Guide to the Analysis of Genes and Proteins, 3rd Edition Andreas D. Baxevanis (Editor), B. F. Francis Ouellette (Editor) Previously chosen for this course, still the main book

Developing Bioinformatics Computer Skills by Cynthia Gibas, Per Jambeck Introduction to Bioinformatics by Arthur M. Lesk Bioinformatics for dummies by Jean-Michel D. Claverie, etc

Other good books More computational

Online lectures and resources More links at the course page

Databases & Online Resources: NCBI databases: NCBI databases: The Protein Data Bank: The Protein Data Bank: Proteomics Software tools from ExPASy (Expert Protein Analysis System). Proteomics Software tools from ExPASy (Expert Protein Analysis System). NCBI BLAST can be used and downloaded from this site. NCBI BLAST can be used and downloaded from this site. UCSC Genome Browser: UCSC Genome Browser: EBI EBI Tree of Life: Tree of Life: KEGG: KEGG: More on the course website More on the course website

Software: Perl. Perl is open source software and may be downloaded for free from several sites. Perl. Perl is open source software and may be downloaded for free from several sites. Unix/Linux (Mac OS X) Unix/Linux (Mac OS X) MATLAB. Will be available in the Lab MATLAB. Will be available in the Lab IPA – trial version available for free, account in March IPA – trial version available for free, account in March R, Treeview, HCA, SAM – can be downloaded for free R, Treeview, HCA, SAM – can be downloaded for free Visualization: Rasmol, Chimera, VND, Cn3d, Pymol Visualization: Rasmol, Chimera, VND, Cn3d, Pymol

Why these choices? Why BLAST? Because you can learn a lot by comparing sequences, and BLAST is the standard program for this task. Why BLAST? Because you can learn a lot by comparing sequences, and BLAST is the standard program for this task. Why Unix? Because most bioinformatics applications were originally developed in Unix. Why Unix? Because most bioinformatics applications were originally developed in Unix. Why Perl? Because Perl (and BioPerl) is the most popular programming language in bioinformatics. Why Perl? Because Perl (and BioPerl) is the most popular programming language in bioinformatics.

Other Programming Languages Python (bioPython) also popular in Bioinformatics Python (bioPython) also popular in Bioinformatics Ruby is another scripting language with a rapid development cycle. Ruby is another scripting language with a rapid development cycle. Java, C++, and the like can be overkill for bioinformatics (vs hardcore coding/software development) Java, C++, and the like can be overkill for bioinformatics (vs hardcore coding/software development)

biomedical informatics? Definitions may differ, but objectives are the same What is

What is bioinformatics? Biologists using computers, or the other way around Biologists using computers, or the other way around Twenty-First Century Rocket Science Twenty-First Century Rocket Science The science of Blast searches The science of Blast searches Writing bioinformatics software is tougher and very competitive. You probably won’t get rich in this arena, but… Writing bioinformatics software is tougher and very competitive. You probably won’t get rich in this arena, but…

End of Unit 1 Please fill out the Survey Please fill out the Survey Demo for Problem Set 0 (Jan.30) Demo for Problem Set 0 (Jan.30) (to be continued after the break) (to be continued after the break)