Introduction to Bioinformatics 2. Genetics Background Course 341 Department of Computing Imperial College, London © Simon Colton.

Slides:



Advertisements
Similar presentations
Disease-causing bacteria (smooth colonies) Harmless bacteria (rough colonies) Heat-killed, disease- causing bacteria (smooth colonies) Control (no growth)
Advertisements

DNA and RNA.
Classical and Modern Genetics.  “Genetics”: study of how biological information is carried from one generation to the next –Classical Laws of inheritance.
Nucleic Acids - Informational Polymers
CHAPTER 2 THE STRUCTURE AND FUNCTION OF MACROMOLECULES Section E: Nucleic Acids - Informational Polymers 1.Nucleic acids store and transmit hereditary.
B1b 6.1 Inheritance Keywords:
Computational Problems in Molecular Biology Dong Xu Computer Science Department 109 Engineering Building West
DNA and RNA. I. DNA Structure Double Helix In the early 1950s, American James Watson and Britain Francis Crick determined that DNA is in the shape of.
Protein synthesis and replication
Introduction to Biological Sequences. Background: What is DNA? Deoxyribonucleic acid Blueprint that carries genetic information from one generation to.
All illustrations in this presentation were obtained from Google.com
Structure, Replication & Protein Synthesis. DNA  DNA (deoxyribonucleic acid) is the hereditary material for all living things.  contains the codes for.
DNA/RNA/Protein Synthesis All illustrations in this presentation were obtained from Google.com.
Protein Synthesis & Mutations All illustrations in this presentation were obtained from Google.com.
Copyright © 2007 Pearson Education, Inc., publishing as Pearson Addison-Wesley What Is a Gene? A gene is a section of DNA that contains instructions for.
DNA.
GENETICS AND HEREDITY Chapter 5. Genetics and Heredity Heredity- the passing of traits from parents to offspring Genetics- the study of how traits are.
DNA & PROTEIN SYNTHESIS CHAPTERS 9 &10. Main Idea How are proteins made in our bodies?
DNA and its Processes  Chapter 12  Material on Midterm.
End Show Slide 1 of 39 Copyright Pearson Prentice Hall 12-3 RNA and Protein Synthesis RNA and Protein Synthesis.
Molecular Genetics gene: specific region of DNA that determines the type of proteins to be made * Therefore, DNA is a type of genetic material, passed.
Chapter 11 DNA Within the structure of DNA is the information for life- the complete instructions for manufacturing all the proteins for an organism. DNA.
Genetics Chapter 20. Genetics  Study of HEREDITY  Traits that are passed from parent  offspring  Sexual Repro.  2 parents, offspring is a combo.
Introduction The amino acid sequence of a polypeptide is programmed by a gene. A gene consists of regions of DNA, a polymer of nucleic acids. DNA (and.
Hereditary Material - DNA In 1952, Alfred Hershey and Martha Chase studied the genetic material of the virus called T2 that infects the bacterium E.Coli.
CO341: Introduction to Bioinformatics Prof. Yi-Ke Guo
Chapter 11 DNA and GENES. DNA: The Molecule of Heredity DNA, the genetic material of organisms, is composed of four kinds nucleotides. A DNA molecule.
Chromosome Abnormalities Non-disjunction during meiosis can cause a gamete to have an extra chromosome Trisomy = three copies of the same chromosome. Most.
Biology: DNA, Transcription, Translation, and Protein Synthesis
C11- DNA and Genes Chapter 11.
Unit 6: DNA & Protein Synthesis Ch. 28: DNA—Life’s Code DNA = Deoxyribonucleic Acid.
DNA, mRNA, and Protein Synthesis TAKS Review for April 22 test.
RNA. What is RNA?  RNA stands for Ribonucleic acid  Made up of ribose  Nitrogenous bases  And a phosphate group  The code used for making proteins.
DNA & Protein Synthesis Chapter 4 Section 3. Vocabulary 1. DNA 2. nucleotide 3. nitrogen bases 4. base pairing 5. double helix 6. DNA replication 7. gene.
DNA: The Genetic Material Molecular Genetics Section 1 Griffith  Performed the first major experiment that led to the discovery of DNA as the genetic.
DNA Structure and Protein Synthesis (also known as Gene Expression)
Unit 2 The Molecule of Life Genes and Heredity. What is a gene?
DNA Deoxyribonucleic Acid. Nature vs. Nurture? DNA We know traits are inherited but how are they inherited?
Replication, Transcription and Translation. Griffith’s Experiment.
Microbiology Chapter 9 Genetics - Science of the study of heredity, variations in organisms that are transferable from generations to generation DNA is.
DNA and RNA Chapters 12 & 13. Hershey and Chase Performed two experiments to show that DNA is genetic material. Worked with viruses to determine if it.
DNA, RNA. Genes A segment of a chromosome that codes for a protein. –Genes are composed of DNA.
Double Helix DNA consists of two strips, made of sugars and phosphates, twisted around each other and connected by nitrogen bases. Looks like a spiral.
Gene Expression Gene: contains the recipe for a protein 1. is a specific region of DNA on a chromosome 2. codes for a specific mRNA.
The Discovery of DNA as the genetic material. Frederick Griffith.
DNA History  Genetics is the study of genes.  Inheritance is how traits, or characteristics, are passed on from generation to generation.  Chromosomes.
DNA Deoxyribose Nucleic Acid – is the information code to make an organism and controls the activities of the cell. –Mitosis copies this code so that all.
Biology Ch. 11 DNA and Genes DNA  DNA controls the production of proteins Living tissue is made up of protein, so DNA determines an organism’s.
DNA and RNA DNA and RNA. DNA DNA -Deoxyribonucleic acid DNA -Deoxyribonucleic acid The nucleic acid that stores and transmits the genetic information.
Biochemical Composition Evidence of Evolutionary Relationships.
DNA: Structure and Function Unit 7. Recall: DNA is a nucleic acid and made of nucleotides Nucleotides contain a sugar, phosphate, and a base In DNA, the.
Chapter 10: Nucleic Acids And Protein Synthesis Essential Question: What roles do DNA and RNA play in storing genetic information?
DNA Intro: DNA. Background Information: It is important to recall from the information from unit C about DNA. The acronym DNA stands for Deoxyribonucleic.
1 UNIT 4 PART 1: MODERN GENETICS In sexual reproduction the new individual develops from the zygote formed by the union of two gametes, one from each parent.
Unit 6: DNA & Protein Synthesis Ch 28: DNA—Life’s Code DNA = Deoxyribonucleic Acid.
DNA AND ITS STRUCTURE. DNA is located inside the nucleus.
CHAPTER 12 DNA, RNA, & Protein Synthesis Put these notes behind your meiosis notes.
DNA, RNA, and PROTEIN SYNTHESIS DNA, genome, instructions, blueprint, chromosomes, genes All MEAN DNA!!!! THEY ALL HAVE TO DO WITH DNA DNA is a molecule.
Unit 7 (A)-DNA Structure Learning Targets I can describe the role that Wilkins, Franklin, Watson, and Crick had in the discovery of the structure of DNA.
DNA The Secret of Life. Deoxyribonucleic Acid DNA is the molecule responsible for controlling the activities of the cell It is the hereditary molecule.
GENETICS. Objectives: Objective 10- Identify the differences between DNA & RNA. Objective Identify the mechanisms through which DNA can be mutated.
Ch 12 DNA and RNA 12-1DNA 12-2 Chromosomes and DNA Replication 12-3 RNA and Protein Synthesis 12-4 Mutations 12-5 Gene Regulation 12-1DNA 12-2 Chromosomes.
8.2 KEY CONCEPT DNA structure is the same in all organisms.
DNA and Protein Synthesis
What is a genome? The complete set of genetic instructions (DNA sequence) of a species.
Chapter 8 Notes/ DNA and RNA
Unit 3: Genetic Continuity
Genetics Unit Review.
Chapter 11: Lesson 3 Notes. Chapter 11: Lesson 3 Notes.
Presentation transcript:

Introduction to Bioinformatics 2. Genetics Background Course 341 Department of Computing Imperial College, London © Simon Colton

Coursework 1 coursework – worth 20 marks – Work in pairs Retrieving information from a database Using Perl to manipulate that information

The Robot Scientist Performs experiments Learns from results – Using machine learning Plans more experiments Saves time and money Team member: – Stephen Muggleton

Biological Nomenclature Need to know the meaning of: – Species, organism, cell, nucleus, chromosome, DNA – Genome, gene, base, residue, protein, amino acid – Transcription, translation, messenger RNA – Codons, genetic code, evolution, mutation, crossover – Polymer, genotype, phenotype, conformation – Inheritance, homology, phylogenetic trees

Substructure and Effect (Top Down/Bottom Up) Species Organism Cell Nucleus Chromosome DNA strand Gene Base Protein Amino Acid Folds into Affects the Function of Affects the Behaviour of Prescribes

Cells Basic unit of life Different types of cell: – Skin, brain, red/white blood – Different biological function Cells produced by cells – Cell division (mitosis) – 2 daughter cells Eukaryotic cells – Have a nucleus

Nucleus and Chromosomes Each cell has nucleus Rod-shaped particles inside – Are chromosomes – Which we think of in pairs Different number for species – Human(46),tobacco(48) – Goldfish(94),chimp(48) – Usually paired up X & Y Chromosomes – Humans: Male(xy), Female(xx) – Birds: Male(xx), Female(xy)

DNA Strands Chromosomes are same in every cell of organism – Supercoiled DNA (Deoxyribonucleic acid) Take a human, take one cell – Determine the structure of all chromosonal DNA – You’ve just read the human genome (for 1 person) – Human genome project 13 years, 3.2 billion chemicals (bases) in human genome Other genomes being/been decoded: – Pufferfish, fruit fly, mouse, chicken, yeast, bacteria

DNA Structure Double Helix (Crick & Watson) – 2 coiled matching strands – Backbone of sugar phosphate pairs Nitrogenous Base Pairs – Roughly 20 atoms in a base – Adenine  Thymine [A,T] – Cytosine  Guanine [C,G] – Weak bonds (can be broken) – Form long chains called polymers Read the sequence on 1 strand – GATTCATCATGGATCATACTAAC

Differences in DNA 2% tiny Roughly 4% Share Material DNA differentiates: – Species/race/gender – Individuals We share DNA with – Primates,mammals – Fish, plants, bacteria Genotype – DNA of an individual Genetic constitution Phenotype – Characteristics of the resulting organism Nature and nurture

Genes Chunks of DNA sequence – Between 600 and 1200 bases long – 32,000 human genes, 100,000 genes in tulips Large percentage of human genome – Is “junk”: does not code for proteins “Simpler” organisms such as bacteria – Are much more evolved (have hardly any junk) – Viruses have overlapping genes (zipped/compressed) Often the active part of a gene is spit into exons – Seperated by introns

The Synthesis of Proteins Instructions for generating Amino Acid sequences – (i) DNA double helix is unzipped – (ii) One strand is transcribed to messenger RNA – (iii) RNA acts as a template ribosomes translate the RNA into the sequence of amino acids Amino acid sequences fold into a 3d molecule Gene expression – Every cell has every gene in it (has all chromosomes) – Which ones produce proteins (are expressed) & when?

Transcription Take one strand of DNA Write out the counterparts to each base – G becomes C (and vice versa) – A becomes T (and vice versa) Change Thymine [T] to Uracil [U] You have transcribed DNA into messenger RNA Example: Start: GGATGCCAATG Intermediate: CCTACGGTTAC Transcribed: CCUACGGUUAC

Genetic Code How the translation occurs Think of this as a function: – Input: triples of three base letters (Codons) – Output: amino acid – Example: ACC becomes threonine (T) Gene sequences end with: – TAA, TAG or TGA

Genetic Code A=Ala=Alanine C=Cys=Cysteine D=Asp=Aspartic acid E=Glu=Glutamic acid F=Phe=Phenylalanine G=Gly=Glycine H=His=Histidine I=Ile=Isoleucine K=Lys=Lysine L=Leu=Leucine M=Met=Methionine N=Asn=Asparagine P=Pro=Proline Q=Gln=Glutamine R=Arg=Arginine S=Ser=Serine T=Thr=Threonine V=Val=Valine W=Trp=Tryptophan Y=Tyr=Tyrosine

Example Synthesis TCGGTGAATCTGTTTGAT Transcribed to: AGCCACUUAGACAAACUA Translated to: SHLDKL

Proteins DNA codes for – strings of amino acids Amino acids strings – Fold up into complex 3d molecule – 3d structures:conformations – Between 200 & 400 “residues” – Folds are proteins Residue sequences – Always fold to same conformation Proteins play a part – In almost every biological process

Evolution of Genes: Inheritance Evolution of species – Caused by reproduction and survival of the fittest But actually, it is the genotype which evolves – Organism has to live with it (or die before reproduction) – Three mechanisms: inheritance, mutation and crossover Inheritance: properties from parents – Embryo has cells with 23 pairs of chromosomes – Each pair: 1 chromosome from father, 1 from mother – Most important factor in offspring’s genetic makeup

Evolution of Genes: Mutation Genes alter (slightly) during reproduction – Caused by errors, from radiation, from toxicity – 3 possibilities: deletion, insertion, alteration Deletion: ACGTTGACTC  ACGTGACTC Insertion: ACGTTGACTC  AGCGTTGACTC Substitution: ACGTTGACTC  ACGATGACTT Mutations are almost always deleterious – A single change has a massive effect on translation – Causes a different protein conformation

Evolution of Genes: Crossover (Recombination) DNA sections are swapped – From male and female genetic input to offspring DNA

Bioinformatics Application #1 Phylogenetic trees Understand our evolution Genes are homologous – If they share a common ancestor By looking at DNA seqs – For particular genes – See who evolved from who Example: – Mammoth most related to African or Indian Elephants? LUCA: – Last Universal Common Ancestor – Roughly 4 billion years ago

Genetic Disorders Disorders have fuelled much genetics research – Remember that genes have evolved to function Not to malfunction Different types of genetic problems Downs syndrome: three chromosome 21s Cystic fibrosis: – Single base-pair mutation disables a protein – Restricts the flow of ions into certain lung cells – Lung is less able to expel fluids

Bioinformatics Application #2 Predicting Protein Structure Proteins fold to set up an active site – Small, but highly effective (sub)structure – Active site(s) determine the activity of the protein Remember that translation is a function – Always same structure given same set of codons – Is there a set of rules governing how proteins fold? – No one has found one yet – “Holy Grail” of bioinformatics

Protein Structure Knowledge Both protein sequence and structure – Are being determined at an exponential rate 1.3+ Million protein sequences known – Found with projects like Human Genome Project 20,000+ protein structures known – Found using techniques like X-ray crystallography Takes between 1 month and 3 years – To determine the structure of a protein – Process is getting quicker

Sequence versus Structure Year Number Protein sequence Protein structure

Database Approaches Slow(er) rate of finding protein structure – Still a good idea to pursue the Holy Grail Structure is much more conservative than sequence – 1.3m genes, but only 2,000 – 10,000 different conformations First approach to sequence prediction: – Store [sequence,structure] pairs in a database – Find ways to score similarity of residue sequences – Given a new sequence, find closest matches A good match will possibly mean similar protein shape E.g., sequence identity > 35% will give a good match – Rest of the first half of the course about these issues

Potential (Big) Payoffs of Protein Structure Prediction Protein function prediction – Protein interactions and docking Rational drug design – Inhibit or stimulate protein activity with a drug Systems biology – Putting it all together: “E-cell” and “E-organism” – In-silico modelling of biological entities and process

Further Reading Human Genome Project at Sanger Centre – Talking glossary of genetic terms – Primer on molecular genetics –