Presentation is loading. Please wait.

Presentation is loading. Please wait.

Welcome to Chem 434 Bioinformatics March 29, 2005 Review of course prerequisites Course objectives Become proficient at using existing bioinformatics.

Similar presentations


Presentation on theme: "Welcome to Chem 434 Bioinformatics March 29, 2005 Review of course prerequisites Course objectives Become proficient at using existing bioinformatics."— Presentation transcript:

1

2 Welcome to Chem 434 Bioinformatics March 29, 2005 Review of course prerequisites Course objectives Become proficient at using existing bioinformatics software Understand the statistics behind bioinformatics Write an algorithm that answers a specific bioinformatics problem

3 Rationale for offering an introductory course in bioinformatics 1. Give the students a chance to understand how popular bioinformatics algorithms operate (Clustal W, BLAST, Scan Prosite). 2. Give the students a programming assignment that is a unique component of the course. 3. Give the students insight into bioinformatics so they can make an intelligent determination of whether or not to pursue this field.

4 Learning Goals of CSULA Intro to Bioinformatics Course Introduce software and databases currently used by bioinformaticists Introduce the organization of the data. How data is gathered and how it is annotated. Introduce statistics of data analysis. Introduce the concept of dynamic programming Give the students an opportunity to create an algorithm that analyzes sequence data Encourage the student to analyze the primary literature in the field and write about current research

5 Course logistics Course Website (http://www.calstatela.edu/faculty/jmomand /Bioinformaticscourse.html)http://www.calstatela.edu/faculty/jmomand /Bioinformaticscourse.html Syllabus Power point presentation Homework In-class Workshop References

6 Course logistics (continued) Grading Homework (100 pts) Quizzes (4 x 25 pts) Writing Assignment (100 pts) Final Project (200 pts)

7 Course logistics (continued) Prerequisites Upper division or graduate student status Majoring in Computer-related field or Molecular Life Science If Computer-related, you must have previously completed a Molecular Life Science course with grade C or better. If Molecular Life Science, you must have previously completed an object-oriented programming course or C++ programming course with grade C or better.

8 Definition of Bioinformatics Many definitions at the moment: Use of computers to catalog and organize biological information into meaningful entities. Conceptualization of biology in terms of molecules and the application of “informatics” techniques (from disciplines such as applied math, computer science and statistics) to understand and organize the information associated with these molecules

9 Bioinformatics is Multidisciplinary Computer Science Math Statistics Structural Biology Phylogenetics Drug Design Genomics Molecular Life Sciences

10 Unknown Function How much of the genome is defined?

11 How is Bioinformatics Used? Experimental proof is still the “Gold Standard”. Bioinformatics isn’t going to replace lab work anytime soon Bioinformatics is used to help “focus” the experiments of the benchtop scientist

12 Textbook Bioinformatics and Functional Genomics by Jonathan PevsnerJonathan Pevsner ISBN: 0471210048 Genome sequence acquisition and analysis. Basic and applied research with DNA microarrays. Proteomics.

13 Useful textbooks on the subject (2) Python Programming for the Absolute Beginner by M. Dawson ISBN: 1592000738

14 Useful textbooks on the subject (3) Learning Python, Second Edition by David Ascher andDavid Ascher Mark Lutz Mark Lutz ISBN: 0596002815

15 Useful textbooks on the subject (4) Bioinformatics: A Practical Guide to the Analysis of Genes and Proteins, Second Edition by Andreas D. Baxevanis (Editor), B. F. Francis Ouellette ISBN: 0471383910 GenBank database Sequence alignment programs

16 Basis of molecular life sciences Hierarchy of relationships (some exceptions): Genome Gene 1Gene 3Gene 2Gene X Protein 1Protein 2Protein 3Protein X Function 1Function 2Function 3Function X

17 How can one use bioinformatics to link diseases to genes? Positional cloning of genes 1. Find genetic markers associated with disease 2. Sequence DNA next to the markers 3. Compare DNA from afflicted individuals to DNA of normal individuals (database) 4. Find abnormality 5. Predict gene function from sequence information Disease Map Gene Function

18 What is the approach used to sequence genomes? Divide and conquer Split the genome into fragments Clone into vectors that can accept large fragments: yeast artificial chromosomes (YAC Library) Landmarks within the genome can be obtained using Sequence Tagged Sites (STS) Sequences of YAC clones are matched with each other. Sequences that overlap form contigs.

19 History of the Human Genome Project 1953 Watson, Crick DNA structure 1972 Berg, 1st recombinant DNA 1977 Maxam, Gilbert, Sanger sequence DNA 1980 Botstein, Davis, Skolnick White propose to map human genome with RFLPs 1982 Wada proposes to build automated sequencing robots 1984 MRC publishes first large genome Epstein-Barr virus (170 kb) 1985 Sinsheimer hosts meeting to discuss HGP at UCSanta Cruz; Kary Mullis develops PCR 1986 DOE begins genome studies with $5.3 million 1987 Gilbert announces plans to start company to sequence and copyright DNA; Burke, Olson, Carle develop YACs; Donis- Keller publish first map (403 markers)

20 History of the Human Genome Project (continued) 1987 (cont) Hood produces first automated sequencer; Dupont devolops fluorescent dideoxy- nucleotides 1988 NIH supports the HGP; Watson heads the project and allocates part of the budget to study social and ethical issues 1989 Hood, Olson, Botstein Cantor propose using STS’s to map the human genome 1990 Proposal to sequence 20 Mb in model organism by 2005; Lipman, Myers publish the BLAST algorithm 1991 Venter announces strategy to sequence ESTs. He plans to patent partial cDNAs; Uberbacher develops GRAIL, a gene finding program 1992 Simon develops BACs; US and French teams publish first physical maps of chromosome s; first genetic maps of mouse and human genome published 1993 Collins is named director of NCHGR; revise plan to complete seq of human genome by 2005 1995 Venter publishes first sequence of free-living organism: H. influenzae (1.8 Mb); Brown publishes on DNA arrays 1996 Yeast genome is sequenced (S. cerevisiae)

21 History of the Human Genome Project (continued) 1997 Blattner, Plunket complete E. coli sequence; a capillary sequencing machine is introduced. 1998 SNP project is initiated; rice genome project is started; Venter creates new company called Celera and proposes to sequence HG within 3 years; C. elegans genome completed 1999 NIH proposes to sequence mouse genome in 3 years; first sequence of chromosome 22 is announced 2000 Celera and others publish Drosphila sequence (180 Mb); human chromosome 21 is completely sequenced; proposal to sequence puffer fish; Arabadopsis sequence is completed 2001 Celera publishes human sequence in Science; the HGP consortium publishes the human sequence in Nature 2003 Completed genomes: 112 Microbial 18 Eukaryotes 1275 Viruses

22 Public funding vs. Private funding Public-Taxpayers’ money, international effort. Private-Companies that invest money hope to provide access to their information on a fee basis. Celera also allows some free information to small research groups. Both groups have published the sequence of the human genome.

23

24 Useful Websites that give information on Bioinformatics courses http://www.calstatela.edu/faculty/jmomand/Bioinformaticscourse.html http://gaia.ecs.csus.edu/~mei/bio296c.html http://www.smi.stanford.edu/projects/helix/bmi214/ http://www.sdsc.edu/~gribskov/bimm140/bimm140_lectures.html http://www.nslij-genetics.org/bioinfotraining/


Download ppt "Welcome to Chem 434 Bioinformatics March 29, 2005 Review of course prerequisites Course objectives Become proficient at using existing bioinformatics."

Similar presentations


Ads by Google