Presentation is loading. Please wait.

Presentation is loading. Please wait.

1 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” Doç. Dr. Nizamettin AYDIN Introduction to Bioinformatics.

Similar presentations


Presentation on theme: "1 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” Doç. Dr. Nizamettin AYDIN Introduction to Bioinformatics."— Presentation transcript:

1 1 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” Doç. Dr. Nizamettin AYDIN n.aydin@bahcesehir.edu.tr Introduction to Bioinformatics

2 2 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” Recommended Texts www.amazon.com

3 3 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” Recommended Texts - 2 www.amazon.com

4 4 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” Recommended Texts - 3

5 5 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” Recommended Texts - 4 Bioinformatics for Dummies Jean Claverie, Cedric Notredame Bioinformatics: A Practical Guide to the Analysis of Genes and Proteins Andreas D. Baxevanis, B. F. Ouellette, Ouellette B. F. Francis. Instant Notes in Bioinformatics D. R. Westhead, Richard M. Twyman, J. H. Parish Bioinformatics: Sequence and Genome Analysis, Vol. 5 David W. Mount, David Mount Developing Bioinformatics Computer Skills Cynthia Gibas, Per Jambeck, Lorrie LeJeune (Editor) Discovering Genomics, Proteomics, and Bioinformatics A. Malcolm Campbell, Laurie J. Heyer

6 6 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” Recommended Texts - 5 Structural Bioinformatics Philip E. Bourne (Editor), Helge Weissig Beginning Perl for Bioinformatics James Tisdall Mastering Perl for Bioinformatics James D. Tisdall

7 7 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” What is Bioinformatics?...

8 8 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN”...What is Bioinformatics?... Computational Biology Bioinformatics Genomics Proteomics Functional genomics Structural bioinformatics

9 9 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN”...What is Bioinformatics? Bioinformatics: collection and storage of biological information Computational biology: development of algorithms and statistical models to analyze biological data Bioinformatics/Computational Biology will be interchanged

10 10 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” Why is Bioinformatics Important? Applications areas include –Medicine –Pharmaceutical drug design –Toxicology –Molecular evolution –Biosensors –Biomaterials –Biological computing models –DNA computing

11 11 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” Why should I care? SmartMoney ranks Bioinformatics as #1 among next HotJobs Business Week 50 Masters of Innovation Jobs available, exciting research potential Important information waiting to be decoded!

12 12 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” Why is bioinformatics hot? Supply/demand: few people adequately trained in both biology and computer science Genome sequencing, microarrays, etc lead to large amounts of data to be analyzed Leads to important discoveries Saves time and money

13 13 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” The Role of Computational Biology Source: GenBank 3D Structures Growth: Source: http://www.rcsb.org/pdb/ holdings.html GenBank BASEPAIR GROWTH

14 14 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” Fighting Human Disease Genetic / Inherited –Diabetes Viral –Flu, common cold Bacterial –Meningitis, Strep throat

15 15 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” Drug Development Life Cycle Years 0 2 4 6 8 10 12 14 16 Discovery (2 to 10 Years) Preclinical Testing (Lab and Animal Testing) Phase I (20-30 Healthy Volunteers used to check for safety and dosage) Phase II (100-300 Patient Volunteers used to check for efficacy and side effects) Phase III (1000-5000 Patient Volunteers used to monitor reactions to long-term drug use) FDA Review & Approval Post-Marketing Testing $600-700 Million! 7 – 15 Years!

16 16 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” Drug lead screening 5,000 to 10,000 compounds screened 250 Lead Candidates in Preclinical Testing 5 Drug Candidates enter Clinical Testing; 80% Pass Phase I One drug approved by the FDA 30%Pass Phase II 80% Pass Phase III

17 17 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” What skills are needed? Well-grounded in one of the following areas: –Computer science –Molecular biology –Statistics Working knowledge and appreciation in the others!

18 18 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” Where Can I Learn More? ISCB: http://www.iscb.org/ NBCI: http://ncbi.nlm.nih.gov/ http://www.bioinformatics.org/ Journals Conferences

19 19 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” Overview of Molecular Biology Cells Chromosomes DNA RNA Amino Acids Proteins Genome/Transcriptome/Proteome

20 20 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” Cells Complex system enclosed in a membrane Organisms are unicellular (bacteria, baker’s yeast) or multicellular Humans: – 60 trillion cells –320 cell types Example Animal Cell www.ebi.ac.uk/microarray/ biology_intro.htm

21 21 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” Organisms Classified into two types: Eukaryotes: contain a membrane-bound nucleus and organelles (plants, animals, fungi,…) Prokaryotes: lack a true membrane-bound nucleus and organelles (single-celled, includes bacteria) Not all single celled organisms are prokaryotes!

22 22 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” Chromosomes In eukaryotes, nucleus contains one or several double stranded DNA molecules organized as chromosomes Humans: –22 Pairs of autosomes –1 pair sex chromosomes Human Karyotype http://avery.rutgers.edu/WSSP/StudentScholars/ Session8/Session8.html

23 23 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” Chromosomes Image source: www.biotec.or.th/Genome/whatGenome.html

24 24 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” DNA is the blueprint for life DNA: Deoxyribonucleic Acid Every cell in your body has 23 chromosomes in the nucleus The genes in these chromosomes determine all of your physical attributes. Single stranded molecule (oligomer, polynucleotide) chain of nucleotides 4 different nucleotides: –Adenosine (A) –Cytosine (C) –Guanine (G) –Thymine (T)

25 25 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” Mapping the Genome The human genome project has provided us with a draft of the entire human genome. Four bases: A, T, C, G 3.12 billion base- pairs 99% of these are the same Polymorphisms = where they differ

26 26 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” Nucleotide Bases Purines (A and G) Pyrimidines (C and T) Difference is in base structure Image Source: www.ebi.ac.uk/microarray/ biology_intro.htm

27 27 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” DNA Can be thought of as an alphabet with 4 characters 4 letter alphabet with sufficiently long words contains information to create complex organisms Not unlike a computer with a small alphabet

28 28 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” DNA polynucleotides(oligomers) Different nucleotides are strung together to form polynucleotides Ends of the polynucleotide are different A directionality is present Convention is to label the coding strand from 5 ’ to 3 ’ http://www.emc.maricopa.edu/faculty/farabee/BIOBK/BioBookDNAMOLGEN.html

29 29 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” Single Strand Polynucleotide Example polynucleotide: 5’ G  T  A  A  A  G  T  C  C  C  G  T  T  A  G  C 3’

30 30 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” Double Stranded DNA DNA can be single-stranded or double-stranded Double stranded DNA: second strand is the “ reverse complement ” strand Reverse complement runs in opposite direction and bases are complementary Complementary bases: –A, T –C, G

31 31 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” Double Stranded Sequence Example double stranded polynucleotide: 5’ G  T  A  A  A  G  T  C  C  C  G  T  T  A  G  C 3’ | | | | | | | | 3’ C  A  T  T  T  C  A  G  G  G  C  A  A  T  C  G 5’ http://www.emc.maricopa.edu/faculty/farabee/BIOBK/BioBookDNAMOLGEN.html

32 32 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” Double Stranded DNA

33 33 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” Double Helix Two complementary DNA strands form a stable DNA double helix This spring marks the 50 th anniversary of its discovery Image source; www.ebi.ac.uk/microarray/ biology_intro.htm

34 34 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN”

35 35 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” How does the code work? Template for construction of proteins

36 36 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” Proteins: Molecular machinery Proteins in your muscles allows you to move: myosin and actin

37 37 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” Proteins: Molecular machinery Enzymes (digestion, catalysis) Structure (collagen)

38 38 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” Proteins: Molecular machinery Signaling (hormones, kinases) Transport (energy, oxygen) Image source: Crane digital, http://www.cranedigital.com/

39 39 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” 1.Exposure & infection 2.HIV enters your cell 3.Your own cell reads the HIV “code” and creates the HIV proteins. 4.New viral proteins prepare HIV for infection of other cells. http://whyfiles.org/035aids/index.html © George Eade, Eade Creative Services, Inc. Example Case: HIV Protease

40 40 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” HIV Protease & Inhibition

41 41 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” Many drugs bind to protein active sites. This HIV protease can no longer prepare HIV proteins for infection, because an inhibitor is already bound in its active site. HIV Protease + Peptidyl inhibitor (1A8G.PDB) HIV Protease as a drug target

42 42 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” Target Identification –What protein can we attack to stop the disease from progressing? Lead discovery & optimization –What sort of molecule will bind to this protein? Toxicology –Does it kill the patient? –Does it have side effects? –Does it get to the problem spots? Drug Discovery

43 43 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” Put some of the infectious agent into thousands of tiny wells Add a known drug lead compound into each well. –Try nearly every drug lead known. See which ones kill the agent… –Too small to see, so we have to use chemical tests called assays Drug discovery: past & present

44 44 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” Once we have a target, how do we find some compounds that might bind to it? The old way: exhaustive screening The new way: computational screening! Finding drug leads

45 45 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” Complementarity –Shape –Chemical –Electrostatic ? ? Drug Lead Screening & Docking

46 46 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” Genomics –Gene finding –Annotation Sequence alignment and database search –Functional genomics Microarray expression, “gene chips” Proteomics –Structure prediction Comparative modeling –Function prediction Structural bioinformatics –Molecular docking, screening, etc. Problems in Bioinformatcs

47 47 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” RNA Ribonucleic Acid Similar to DNA Thymine (T) is replaced by uracil (U) RNA can be: –Single stranded –Double stranded –Hybridized with DNA

48 48 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” RNA RNA is generally single stranded Forms secondary or tertiary structures RNA folding will be discussed later Important in a variety of ways, including protein synthesis

49 49 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” RNA secondary structure E. coli Rnase P RNA secondary structure

50 50 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” mRNA Messenger RNA Linear molecule encoding genetic information copied from DNA molecules Transcription: process in which DNA is copied into an RNA molecule

51 51 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” mRNA processing Eukaryotic genes can be pieced together –Exons: coding regions –Introns: non-coding regions mRNA processing removes introns, splices exons together Processed mRNA can be translated into a protein sequence

52 52 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” mRNA Processing Image source: http://departments.oxy.edu/biology/Stillman/bi221/111300/processing_of_hnrnas.htm

53 53 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” tRNA Transfer RNA Well-defined three-dimensional structure Critical for creation of proteins

54 54 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” tRNA structure

55 55 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” tRNA Amino acid attached to each tRNA Determined by 3 base anticodon sequence (complementary to mRNA) Translation: process in which the nucleotide sequence of the processed mRNA is used in order to join amino acids together into a protein with the help of ribosomes and tRNA

56 56 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” Genetic Code 4 possible bases (A, C, G, U) 3 bases in the codon 4 * 4 * 4 = 64 possible codon sequences Start codon: AUG Stop codons: UAA, UAG, UGA 61 codons to code for amino acids (AUG as well) 20 amino acids – redundancy in genetic code

57 57 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” 20 Amino Acids Glycine (G, GLY) Alanine (A, ALA) Valine (V, VAL) Leucine (L, LEU) Isoleucine (I, ILE) Phenylalanine (F, PHE) Proline (P, PRO) Serine (S, SER) Threonine (T, THR) Cysteine (C, CYS) Methionine (M, MET) Tryptophan (W, TRP) Tyrosine (T, TYR) Asparagine (N, ASN) Glutamine (Q, GLN) Aspartic acid (D, ASP) Glutamic Acid (E, GLU) Lysine (K, LYS) Arginine (R, ARG) Histidine (H, HIS) START: AUG STOP: UAA, UAG, UGA

58 58 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” Amino Acids building blocks for proteins (20 different) vary by side chain groups Hydrophilic amino acids are water soluable Hydrophobic are not Linked via a single chemical bond (peptide bond) Peptide: Short linear chain of amino acids (< 30) polypeptide: long chain of amino acids (which can be upwards of 4000 residues long).

59 59 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” Proteins Polypeptides having a three dimensional structure. Primary –sequence of amino acids constituting the polypeptide chain Secondary – local organization into secondary structures such as  helices and  sheets Tertiary – three dimensional arrangements of the amino acids as they react to one another due to the polarity and resulting interactions between their side chains Quaternary – number and relative positions of the protein subunits

60 60 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” Protein Structure

61 61 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” Central Dogma DNA  RNA  PROTEIN

62 62 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” Central Dogma

63 63 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” What is a Gene? the physical and functional unit of heredity that carries information from one generation to the next DNA sequence necessary for the synthesis of a functional protein or RNA molecule

64 64 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” Genome chromosomal DNA of an organism number of chromosomes and genome size varies quite significantly from one organism to another Genome size and number of genes does not necessarily determine organism complexity

65 65 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” Genome Comparison ORGANISMCHROMOSOMESGENOME SIZEGENES Homo sapiens Homo sapiens (Humans) 233,200,000,000~ 30,000 Mus musculus (Mouse) 202,600,000,000~30,000 Drosophila melanogaster Drosophila melanogaster (Fruit Fly) 4180,000,000~18,000 Saccharomyces cerevisiae (Yeast) 1614,000,000~6,000 Zea mays (Corn)102,400,000,000???

66 66 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” Transcriptome complete collection of all possible mRNAs (including splice variants) of an organism. regions of an organism’s genome that get transcribed into messenger RNA. transcriptome can be extended to include all transcribed elements, including non-coding RNAs used for structural and regulatory purposes.

67 67 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” Proteome the complete collection of proteins that can be produced by an organism. can be studied either as static (sum of all proteins possible) or dynamic (all proteins found at a specific time point) entity


Download ppt "1 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” Doç. Dr. Nizamettin AYDIN Introduction to Bioinformatics."

Similar presentations


Ads by Google