Thursday, 5 June 2008 Problems in sequence analysis Identification by sequence similarity Genes Determining Plant-Cyanobacterial Symbioses and Consideration.

Slides:



Advertisements
Similar presentations
SBI 4U November 14 th, What is the central dogma? 2. Where does translation occur in the cell? 3. Where does transcription occur in the cell?
Advertisements

The Central Dogma and Transcription Chapter 17: Sections
Additional Powerful Molecular Techniques Synthesis of cDNA (complimentary DNA) Polymerase Chain Reaction (PCR) Microarray analysis Link to Gene Therapy.
1 Gene Finding Charles Yan. 2 Gene Finding Genomes of many organisms have been sequenced. We need to translate the raw sequences into knowledge. Where.
RT-PCR lab You have a cell…is a certain gene on (by “on,” we mean active and producing mRNA?)? If a certain gene is on when the cell divides, the gene.
CHAPTER 31 Genetic Engineering and Biotechnology.
DNA Sequencing and Gene Analysis
TRANSLATION The process of converting the information stored in mRNA into a protein is called translation mRNA carries information from a gene to a structure.
Physical Mapping II + Perl CIS 667 March 2, 2004.
General Microbiology (Micr300) Lecture 11 Biotechnology (Text Chapters: ; )
Making, screening and analyzing cDNA clones Genomic DNA clones
Gene Regulation: What it is, and how to detect it By Jordan, Jennifer, and Brian.
CHAPTER 3 GENE EXPRESSION IN EUKARYOTES (cont.) MISS NUR SHALENA SOFIAN.
FROM GENE TO PROTEIN: TRANSCRIPTION & RNA PROCESSING Chapter 17.
DNA Replication DNA mRNA protein transcription translation replication Before each cell division the DNA must be replicated so each daughter cell can get.
By Moayed al Suleiman Suleiman al borican Ahmad al Ahmadi
BIOLOGY 3020 Fall 2008 Gene Hunting (DNA database searching)
Protein Synthesis The genetic code – the sequence of nucleotides in DNA – is ultimately translated into the sequence of amino acids in proteins – gene.
Nucleic Acid Secondarily Structure AND Primer Selection Bioinformatics
Analyzing your clone 1) FISH 2) “Restriction mapping” 3) Southern analysis : DNA 4) Northern analysis: RNA tells size tells which tissues or conditions.
Gene Technology Chapters 11 & 13. Gene Expression 0 Genome 0 Our complete genetic information 0 Gene expression 0 Turning parts of a chromosome “on” and.
Do Now Why is it important to learn about DNA and how can DNA be used to help people? NUA Notebook Check Today.
AP Biology: Chapter 14 DNA Technologies
Molecular Biology (MLMB-201) Lecturer: Dr. Mohamed Salah El-Din Department of Medical Laboratory Technology Faculty of Allied Medical Science.
DNA MICROARRAYS WHAT ARE THEY? BEFORE WE ANSWER THAT FIRST TAKE 1 MIN TO WRITE DOWN WHAT YOU KNOW ABOUT GENE EXPRESSION THEN SHARE YOUR THOUGHTS IN GROUPS.
Library screening Heterologous and homologous gene probes Differential screening Expression library screening.
Microarray Technology
RNA and Protein Synthesis
Protein synthesis mb.edu/cellbio/r ibosome.htm.
LECTURES 3/4. CONSTRUCTING and SCREENING cDNA LIBRARIES to ISOLATE NEW GENES ORIGINAL ARTICLES: CLONING BY COMPLEMENTATION: Lew, D, Dulic, V, and Reed.
DNA REPLICATION. What does it mean to replicate? The production of exact copies of complex molecules, such as DNA molecules, that occurs during growth.
Human awareness.  M16.1 Know that the DNA can be extracted from cells  Genetic engineering and /or genetic modification have been made possible by isolating.
Genetics 3: Transcription: Making RNA from DNA. Comparing DNA and RNA DNA nitrogenous bases: A, T, G, C RNA nitrogenous bases: A, U, G, C DNA: Deoxyribose.
Gene expression. The information encoded in a gene is converted into a protein  The genetic information is made available to the cell Phases of gene.
Protein Synthesis Transcription.
TRANSCRIPTION Copying of the DNA code for a protein into RNA Copying of the DNA code for a protein into RNA 4 Steps: 4 Steps: Initiation Initiation Elongation.
Genetic Engineering Genetic engineering is also referred to as recombinant DNA technology – new combinations of genetic material are produced by artificially.
Lecture 18 – Functional Genomics Based on chapter 8 Functional and Comparative Genomics Copyright © 2010 Pearson Education Inc.
ANALYSIS OF GENE EXPRESSION DATA. Gene expression data is a high-throughput data type (like DNA and protein sequences) that requires bioinformatic pattern.
Bioinformatics Workshops 1 & 2 1. use of public database/search sites - range of data and access methods - interpretation of search results - understanding.
Recombinant DNA Technology. DNA replication refers to the scientific process in which a specific sequence of DNA is replicated in vitro, to produce multiple.
Click anywhere to go on to the next slide This demonstration is best viewed as a slide show, enabling you to simulate a session and make changes in cursor.
Lesson 3 – Gene Expression
Expression of the Viral Genome in Host Cells (How do viruses express their genomes?)
DNA Microarray Overview and Application. Table of Contents Section One : Introduction Section Two : Microarray Technique Section Three : Types of DNA.
Transcription and The Genetic Code From DNA to RNA.
Use the image above to answer these questions. 1. Does the process shown above use ATP? 2. The process shown above moves molecules [up, down] the concentration.
Retroviruses and Trans(retro)posons
Title: Studying whole genomes Homework: learning package 14 for Thursday 21 June 2016.
Experiments by Matthew Meselsohn and Franklin Stahl proved DNA replication was semi conservative. Using Esherichia coli (bacterium), they used two isotopes.
Gene Expression = Protein Synthesis.
Protein Synthesis - Transcription
Part 3 Gene Technology & Medicine
Lesson: Sequence processing
Genomic and cDNA Libraries
Transcription.
DEFINITION WHAT IS GENOME?
Example of a DNA Array (note green, yellow red colors; also note that only part of the total array is depicted)
Chapter 14 Bioinformatics—the study of a genome
Small RNA Sample Preparation
mRNA Sequencing Sample Preparation
SBI 4U: Metablic Processes
Introduction to Bioinformatics II
General Animal Biology
Digital Gene Expression – Tag Profiling Sample Preparation
Expression of the Genome
Producing DNA fragments eg for manufacturing insulin
Comparison Of DNA And RNA Synthesis in Prokaryotes and Eukaryotes
General Animal Biology
Presentation transcript:

Thursday, 5 June 2008 Problems in sequence analysis Identification by sequence similarity Genes Determining Plant-Cyanobacterial Symbioses and Consideration of Blast This demonstration is best viewed as a slide show, enabling you to simulate a session and make changes in cursor position more obvious. To do this, click Slide Show on the top tool bar, then View show. Click anywhere to go on to the next slide

10 mM nitrate0.1 mM nitrate Gland development is stimulated by N-limitation What's special about the gland? Gland suppressed by presence of fixed N Plant starved for N makes gland to house cyanobacteria What genes are specifically expressed in glands?

Construction of a cDNA library from Gunnera gland mRNA ends with polyA tails Use modified polyT to direct synthesis of DNA copy of mRNA Reverse Transcriptase (RT) adds CCC to end. Add 2 nd adapter, using GGG to attach to CCC. Extend cDNA

Construction of a cDNA library from Gunnera gland (Same protocol, but with real sequences) 5'-NNNNNNNNNN... NNNNNNNNNNAAAAAAAAAAAAAAAAA...-3' 3'-TTTTTCTTTTTTCATGGCTGACGCTGAGACGCAACTATGGTGACGAA-5' Use modified polyT adapter to direct synthesis of DNA copy of mRNA

Construction of a cDNA library from Gunnera gland 5'-NNNNNNNNNN... NNNNNNNNNNAAAAAAAAAAAAAAAAA...-3' 3'-TTTTTCTTTTTTCATGGCTGACGCTGAGACGCAACTATGGTGACGAA-5' Use modified polyT adapter to direct synthesis of DNA copy of mRNA The adapter can bind to many positions in polyA tail, resulting in variation in number of T's in cDNA sequence.

Construction of a cDNA library from Gunnera gland 5'-NNNNNNNNNN... NNNNNNNNNNAAAAAAAAAAAAAAAAA...-3' 3'-TTTTTCTTTTTTCATGGCTGACGCTGAGACGCAACTATGGTGACGAA-5' Use modified polyT adapter to direct synthesis of DNA copy of mRNA The adapter can bind to many positions in polyA tail, resulting in variation in number of T's in cDNA sequence.

Construction of a cDNA library from Gunnera gland 5'-NNNNNNNNNN... NNNNNNNNNNAAAAAAAAAAAAAAAAA...-3' TTTTTCTTTTTTCATGGCTGACGCTGAGACGCAACTATGGTGACGAA-5' 3'-CCCNNNNNNNNNN... NNNNNNNNNN Reverse Transcriptase (RT) extends the adapter to the end of the mRNA and adds CCC to the 3' end.

3'-CCCNNNNNNNNNN... Construction of a cDNA library from Gunnera gland 5'-NNNNNNNNNN... NNNNNNNNNNAAAAAAAAAAAAAAAAA...-3' TTTTTCTTTTTTCATGGCTGACGCTGAGACGCAACTATGGTGACGAA-5' 3'-CCCNNNNNNNNNN... NNNNNNNNNN 5'-AAGCAGTGGTATCAACGCAGAGTGGCCATTACGGCCGGG A second adapter is added which (with the help of antibodies to) uses three G's to bind to the three.C's.

CCCNNNNNNNNNN... Construction of a cDNA library from Gunnera gland 5'-NNNNNNNNNN... NNNNNNNNNNAAAAAAAAAAAAAAAAA...-3' TTTTTCTTTTTTCATGGCTGACGCTGAGACGCAACTATGGTGACGAA-5' 3'-CCCNNNNNNNNNN... NNNNNNNNNN 5'-AAGCAGTGGTATCAACGCAGAGTGGCCATTACGGCCGGG The cDNA sequence is extended to the left, using the second adapter as a template. TTCGTCACCATAGTTGCGTCTCACCGGTAATGCCGG

CCCNNNNNNNNNN... Construction of a cDNA library from Gunnera gland 5'-NNNNNNNNNN... NNNNNNNNNNAAAAAAAAAAAAAAAAA...-3' TTTTTCTTTTTTCATGGCTGACGCTGAGACGCAACTATGGTGACGAA-5' 3'-CCCNNNNNNNNNN... NNNNNNNNNN 5'-AAGCAGTGGTATCAACGCAGAGTGGCCATTACGGCCGGGNNNNNNNNNN... The cDNA sequence is extended to the left, using the second adapter as a template… …and then the second cDNA is strand is synthesized left-to-right, using the first cDNA strand as the template. TTCGTCACCATAGTTGCGTCTCACCGGTAATGCCGG

CCCNNNNNNNNNN... Construction of a cDNA library from Gunnera gland 5'-NNNNNNNNNN... NNNNNNNNNNAAAAAAAAAAAAAAAAA...-3' TTTTTCTTTTTTCATGGCTGACGCTGAGACGCAACTATGGTGACGAA-5' 3'-CCCNNNNNNNNNN... NNNNNNNNNN 5'-AAGCAGTGGTATCAACGCAGAGTGGCCATTACGGCCGGGNNNNNNNNNN... TTCGTCACCATAGTTGCGTCTCACCGGTAATGCCGG Hundreds to thousands of nucleotides To give some perspective, the adapters are about 50 nucleotides, while the mRNA itself can be as large as a couple of thousands of nucleotides.

Construction of a cDNA library from Gunnera gland Of course there are thousands of different mRNA's in a cell, leading to thousands of cDNA's in the library, all in multiple copies.

Sequencing of cDNA library Limitations: - Only from ends - Only ~400 nt It would be nice to be able to sequence the cDNA's from end to end, but that's not presently possible. Sequencing has its limitations.

Sequencing of cDNA library Limitations: - Only from ends - Only ~400 nt Solution: - Break the cDNA The solution is to break up the cDNA so that there are multiple, overlapping ends from which to sequence. In this way, all the full length of the cDNA can be sequenced

Sequencing of cDNA library (1000's of cDNA's) The broken fragments are read from either end (at random). If there are enough reads, it is possible to use overlaps to reassemble the original sequence. Unfortunately, the adapters are also sequenced, and these complicate the assembly process, as they're interpreted as overlapping sequences, leading to misassembly. They need to be removed.

Sequencing of cDNA library (1000's of cDNA's) Given the number of sequences, the removal process obviously must be automated, but automated processes, while fast, are often stupid. We need to check to make sure they worked.

Identifying elements of cDNA library The assembly process should, in theory, also remove duplicate sequences.

Identifying elements of cDNA library The assembly process should, in theory, also remove duplicate sequences. In practice, partial duplicates may remain, and it is necessary to keep an eye out for them.

Identifying elements of cDNA library Predict function directly from sequence How to go from cDNA sequence to predicted function for the sequences? You might think that since we can readily predict a protein sequence from a DNA sequence, it should be possible to predict function as well.

Identifying elements of cDNA library Predict function directly from sequence Predict function from sequence similarity Nope. At present that's impossible. The best we can do is to compare sequences with sequences from other organisms where there is experimental evidence as to function.

Identifying elements of cDNA library Predict function directly from sequence Predict function from sequence similarity Blast is a tool to do just that, comparing a given sequence against at database of known sequences. It is important to understand the mind of Blast. But that is a subject for another time.

Genes Determining Plant-Cyanobacterial Symbioses and Consideration of Blast 1. Determine if primers been removed from sequences. 2. Determine if the library contains duplicates 3. Identify protein sequences similar to those encoded by cDNAs We've identified many things that need to be done: 4. (plus one extra) Find where in the cDNAs genes begin and end

Genes Determining Plant-Cyanobacterial Symbioses and Consideration of Blast Go into StaphyloBIKE through the BioBIKE portal (Gunnera isn't a member of the Staphylococcus, of course, but I put the cDNA sequences in that instance of BioBIKE) RUN-FILE "contig-resources.bike" SHARED (this makes the cDNA sequences available to you as a variable called gunnera-contigs and also provides you with a possibly useful tool READ-NAMED to extract specific sequences) These questions are ordinarily answered by high-powered computer types. But you can answer them yourself. First you need to read in the data.

Genes Determining Plant-Cyanobacterial Symbioses and Consideration of Blast SEQUENCE-SIMILAR-TO Accesses BLAST, using as targets either internal data (i.e. gunnera-contigs ) or external data (i.e. *GENBANK* ) Also used to look for nearly identical sequences, using the MISMATCHES option. READING-FRAMES-OF Translates the sequence in all six possible reading frames. Possibly useful functions: