Bioinformatics “half a year in the lab can easily save you an afternoon in front of the computer….” (unknown)

Slides:



Advertisements
Similar presentations
Integrating Genomes D. R. Zerbino, B. Paten, D. Haussler Science 336, 179 (2012) Teacher: Professor Chao, Kun-Mao Speaker: Ho, Bin-Shenq June 4, 2012.
Advertisements

JYC: CSM17 BioinformaticsCSM17 Week 10: Summary, Conclusions, The Future.....? Bioinformatics is –the study of living systems –with respect to representation,
Bioinformatics What is bioinformatics? Why bioinformatics? The major molecular biology facts Brief history of bioinformatics Typical problems of bioinformatics:
Bioinformatics For MNW 2 nd Year Jaap Heringa FEW/FALW Integrative Bioinformatics Institute VU (IBIVU) Tel ,
1 Genetics The Study of Biological Information. 2 Chapter Outline DNA molecules encode the biological information fundamental to all life forms DNA molecules.
Bioinformatics “half a year in the lab can easily save you an afternoon in front of the computer….” (unknown)
BIO513: Lecture 1. Central dogma “The central dogma of molecular biology deals with the detailed residue-by-residue transfer of sequential information.
. Class 1: Introduction. The Tree of Life Source: Alberts et al.
JYC: CSM17 BioinformaticsCSM17 Week 10: Summary, Conclusions, The Future.....? Bioinformatics is –the study of living systems –with respect to representation,
Introduction to Bioinformatics Spring 2008 Yana Kortsarts, Computer Science Department Bob Morris, Biology Department.
Bioinformatics: a Multidisciplinary Challenge Ron Y. Pinter Dept. of Computer Science Technion March 12, 2003.
Bioinformatics and Phylogenetic Analysis
Lecture 1 BNFO 240 Usman Roshan. Course overview Perl progamming language (and some Unix basics) Sequence alignment problem –Algorithm for exact pairwise.
Introduction to Genomics, Bioinformatics & Proteomics Brian Rybarczyk, PhD PMABS Department of Biology University of North Carolina Chapel Hill.
prepared with some help from friends...
BI420 – Course information Web site: Instructor: Gabor Marth Teaching.
Cbio course, spring 2005, Hebrew University Computational Methods In Molecular Biology CS-67693, Spring 2005 School of Computer Science & Engineering Hebrew.
Signaling Pathways and Summary June 30, 2005 Signaling lecture Course summary Tomorrow Next Week Friday, 7/8/05 Morning presentation of writing assignments.
Welcome to Introduction to Bioinformatics Computing aka BIC1.
Comparative Genomics of the Eukaryotes
Bioinformatics Jan Taylor. A bit about me Biochemistry and Molecular Biology Computer Science, Computational Biology Multivariate statistics Machine learning.
Elements of Molecular Biology All living things are made of cells All living things are made of cells Prokaryote, Eukaryote Prokaryote, Eukaryote.
1 Bio + Informatics AAACTGCTGACCGGTAACTGAGGCCTGCCTGCAATTGCTTAACTTGGC An Overview پرتال پرتال بيوانفورماتيك ايرانيان.
CSE 6406: Bioinformatics Algorithms. Course Outline
Welcome to Introduction to Bioinformatics Computing aka BIC1.
Introduction to Bioinformatics Prologue. Bioinformatics Living things have the ability to store, utilize, and pass on information Bioinformatics strives.
Introduction to Bioinformatics Spring 2002 Adapted from Irit Orr Course at WIS.
Introduction to the theory of sequence alignment Yves Moreau Master of Artificial Intelligence Katholieke Universiteit Leuven
Molecular Biology Primer. Starting 19 th century… Cellular biology: Cell as a fundamental building block 1850s+: ``DNA’’ was discovered by Friedrich Miescher.
Intelligent systems in bioinformatics Introduction to the course.
Sevas Educational Society All Rights Reserved, 2008 Module 1 Introduction to Bioinformatics.
What is Genetic Research?. Genetic Research Deals with Inherited Traits DNA Isolation Use bioinformatics to Research differences in DNA Genetic researchers.
CSCI 6900/4900 Special Topics in Computer Science Automata and Formal Grammars for Bioinformatics Bioinformatics problems sequence comparison pattern/structure.
Bioinformatics For MNW 2 nd Year Jaap Heringa FEW/FALW Centre for Integrative Bioinformatics VU (IBIVU) Tel ,
Introduction to Bioinformatics Biostatistics & Medical Informatics 576 Computer Sciences 576 Fall 2008 Colin Dewey Dept. of Biostatistics & Medical Informatics.
Genomics and Arabidopsis. What is ‘genomics’? Study of an organism’s entire genome –All the DNA encoded in the organism –Nucleus, mitochondria, chloroplasts.
Bioinformatics for Human Biologists Rasmus Wernersson, Associate Professor Center for Biological Sequence Analysis, DTU [ -
Course Sequence Analysis for Bioinformatics Master’s Bart van Houte, Radek Szklarczyk, Victor Simossis, Jens Kleinjung, Jaap Heringa
Overview of Bioinformatics 1 Module Denis Manley..
Introduction to Bioinformatics Dr. Rybarczyk, PhD University of North Carolina-Chapel Hill
AdvancedBioinformatics Biostatistics & Medical Informatics 776 Computer Sciences 776 Spring 2002 Mark Craven Dept. of Biostatistics & Medical Informatics.
Proteomics Session 1 Introduction. Some basic concepts in biology and biochemistry.
Central dogma: the story of life RNA DNA Protein.
EB3233 Bioinformatics Introduction to Bioinformatics.
An overview of Bioinformatics. Cell and Central Dogma.
Basic Biology for Bioinformatics: genes as information The central dogma of molecular genetics DNA to RNA to protein to phenotype Protein functions, synthesis.
Evolution and the Foundations of Biology
Bioinformatics Professor: Monica Bianchini Department of Information Engineering and Mathematics E–mail: Phone: 1012.
Computer Applications and Bioinformatics
BME435 BIOINFORMATICS.
Bioinformatics Overview
Statistical Applications in Biology and Genetics
Bioinformatics Madina Bazarova. What is Bioinformatics? Bioinformatics is marriage between biology and computer. It is the use of computers for the acquisition,
Workshop on the analysis of microbial sequence data using ARB
Genomes and Their Evolution
Bioinformatics Biological Data Computer Calculations +
BIOL 2416 Chapter 1: Genetics: An Introduction
Genome organization and Bioinformatics
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Bioinformatics For MNW 2nd Year
LESSON 1 INTNRODUCTION HYE-JOO KWON, Ph.D /
CISC 667 Intro to Bioinformatics (Spring 2007) Review session for Mid-Term CISC667, S07, Lec14, Liao.
The Study of Biological Information
Introduction to Bioinformatic
(Really) Basic Molecular Biology
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Introduction to Bioinformatics
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Presentation transcript:

Bioinformatics “half a year in the lab can easily save you an afternoon in front of the computer….” (unknown)

Bioinformatics - N 2004 Home page: Book: Krane & Raymer “Fundamental Concepts of Bioinformatics” Teachers: Leif Schauser, Thomas Mailund and others Instructors: Philippe Lamy

Defining Bioinformatics….. The term Bioinformatics was introduced in 1988 by Hwa Lim, combining ‘bio’ and ‘informatics’ into one word Early days Data compilation, organization Tracking, storing, retrieving … Data dissemination Analysis (string algorithms)

Defining Bioinformatics….. Today Computer Science Statistics/ Mathematics Biological Knowledge and Theory BIOINFORMATICS Modeling, Analysis, Collection, Management of Data Melting Pot Genomic and Other Molecular Data Other Fields

Jones & Baylin 2002 Our Own Genome 3∙10 9 Zoom 150,000 2∙10 4 TP53 One of genes

Our Own Genome GTCTGGCCCCTCCTCAGCATCTTATCCGAGTGGAAGG AAATTTGCGTGTGGAGTATTTGGATGACAGAAACACTT TTCGACATAGTGTGGTGGTGCCCTATGAGCCGCCTGAG EXON 6 consists of 113 base pairs …………LWKLLPENNVLSPLPSQAMDDL……. …that are translated, joint with other exons, into phosphoprotein p53. Part of the protein reads 2∙10 4 Zoom

Stepping Back p53 binding to DNA helix The p53 signaling pathway TP53 Gene 1 Gene 2Gene k More than 100 genes take part in p53 pathways Networks: Pathways, interactions Qualitative Features: size, directions Quantitative Features: expression levels

Over the years there has been an explosion in the amount of data …. The Amount of Data Improved technology More resources ‘Hot’ research areas Sequences (in thousands) Nucleotides (in millions) Ratio: Nuc/Seq van Nimwegen 2003 GenBank webpage 2004

The Amount of Data Janssen 2003

Leaving the cell Species LevelPopulation LevelIndividual Level DNA sequence DNA sequenceGene expression Man: ATTCGT1 st: AAAGTACGTumor: High Mouse: ATCCT2 nd: AACGTACGBlood: Low Zebra: TTCAATA3 rd: AAAGTATGLiver: Medium ……..……..…….. Other structures Gene expressionGene activation 1 st: HighTumor: Active 2 nd: HighBlood: Inactive 3 rd: Low …….. …….. Variation within a structure at a given level

Genome projects OrganismYearSize (Mbp)# of Genes Saccharomyces cerevisiae Caenorhabditis elegans Drosophila melanogaster Arabidopsis thaliana Human

Genomes First genomes were selected in order to reflect biological diversity. Database contains 20 10^9 bp Doubling time: 15 month –CPU doubling time 18 month Effective tools for sequence analysis needed

Bioinformatics Master Masters Thesis Algorithms in Bioinformatics Complex systemsProtein structure Algorithms and Datastructure Molecular Population Genetics and Evolution Biostatistics Basics in Programming Mathematics basic Molecular biology basics Intro: Bioinformatics Genome analysis

Topics Substitution matrices Pairwise alignment Multiple alignment Phylogenetic analysis Database searching RNA / protein secondary structure prediction Regulatory networks

Objectives Overview: understanding of topics and techniques –Motivation / principles –Mathematical and statistical models –Algorithms User-focus –When and how to use applications

Non-objectives To learn how to write programs To construct mathematical and statistical models To improve algorithms

Alignment: a central problem Alignments are basis of many analysis –Phylogeny reconstruction –Database searches –Predicting RNA / protein secondary structure –Genome analysis mm.

Why alignment? Discuss with your neighbour: –Which principle does an alignment represent

Why alignment? Biological sequences are related –Common ancestors –Duplication, mutation, speciation, variation –Principle of evolution

Why alignment?

What is an Alignment? HIGHLY RELATED: HBA_HUMAN GSAQVKGHGKKVADALTNAVAHVDDMPNALSALSDLHAHKL G+ +VK+HGKKV A+++++AH+D LS+LH KL HBB_HUMAN GNPKVKAHGKKVLGAFSDGLAHLDNLKGTFATLSELHCDKL RELATED: HBA_HUMAN GSAQVKGHGKKVADALTNAVAHV---D--DMPNALSALSDLHAHKL H+ KV + +A ++ +L L+++H+ K LGB2_LUPLU NNPELQAHAGKVFKLVYEAAIQLQVTGVVVTDATLKNLGSVHVSKG SPURIOUS ALIGNMENT: HBA_HUMAN GSAQVKGHGKKVADALTNAVAHVDDMPNALSALSD----LHAHKL GS+ + G + +D L ++ H+ D+ A +AL D ++AH+ F11G11.2 GSGYLVGDSLTFVDLL--VAQHTADLLAANAALLDEFPQFKAHQE How to filter out the last one & pick up the second?

Internet Explorer

Ribosome structure

Rimosome rRNA

Conclusions Bioinformatic methods are motivated by the explosion of sequence data This course gives a broad introduction to a number of analysis tools Most of these tools rely on the principle of evolution

Schedule Lectures take place mondays 11-13, auditorium G1, Department of Mathematical Sciences and on wednesday 11-12, auditorium D1. Computer / theoretical exercises take place on wednesdays (HOLD 1), and monday (HOLD 2) at G31.