Mitochondrial and Chloroplast DNA in Scaffolds

Goal
Determine which scaffolds have mitochondrial or chloroplast DNA
– Grape and Arabidopsis reference sets
Ideally, annotate the scaffolds/contigs in some way

Process
A lot of BLAST results
Program
– Splits up the BLAST results
– Counts the number of times a specific scaffold appears
Store the data in a format that is editable in Excel
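A minimal sketch of the counting step, not the program used in the talk. It assumes the BLAST results are in tab-separated tabular form with the scaffold name in the second column (the subject ID). The function names and the `scaffold_col` parameter are hypothetical. The output is CSV, which Excel opens directly.

```python
import csv
import io
from collections import Counter


def count_scaffold_hits(lines, scaffold_col=1):
    """Count how often each scaffold name appears in tabular BLAST output.

    Assumption: tab-separated rows with the scaffold ID in column
    `scaffold_col` (0-based). Blank lines and '#' comment lines are skipped.
    """
    counts = Counter()
    for line in lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        fields = line.split("\t")
        if len(fields) <= scaffold_col:
            continue  # malformed row, ignore it
        counts[fields[scaffold_col]] += 1
    return counts


def write_counts_csv(counts, handle):
    """Write scaffold/hit-count pairs as CSV, most frequent first."""
    writer = csv.writer(handle)
    writer.writerow(["scaffold", "hits"])
    for scaffold, hits in counts.most_common():
        writer.writerow([scaffold, hits])


if __name__ == "__main__":
    # Made-up rows for illustration only.
    demo = [
        "geneA\tscaffold_1\t98.0",
        "geneB\tscaffold_2\t91.5",
        "geneC\tscaffold_1\t88.2",
    ]
    out = io.StringIO()
    write_counts_csv(count_scaffold_hits(demo), out)
    print(out.getvalue())
```

If the scaffolds were the queries rather than the subjects, pass `scaffold_col=0` instead.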

454

Illumina

Why I used the most frequent hits

What Next?
A possible additional feature is to pull out the scaffold sequences that give x hits
Annotation issues with Geneious
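One way the "pull out scaffolds with x hits" feature might look, as a sketch under assumptions: the scaffolds are in a FASTA file whose header names match the names in the hit counts, and the counts are available as a name-to-count mapping. `read_fasta`, `select_scaffolds` and `min_hits` are hypothetical names.

```python
def read_fasta(lines):
    """Yield (name, sequence) pairs from FASTA-formatted lines.

    The name is the first whitespace-delimited token after '>'.
    """
    name, chunks = None, []
    for line in lines:
        line = line.strip()
        if line.startswith(">"):
            if name is not None:
                yield name, "".join(chunks)
            header = line[1:].split()
            name = header[0] if header else ""
            chunks = []
        elif name is not None and line:
            chunks.append(line)
    if name is not None:
        yield name, "".join(chunks)


def select_scaffolds(fasta_lines, hit_counts, min_hits):
    """Return the (name, sequence) pairs with at least `min_hits` hits.

    `hit_counts` maps scaffold name to number of BLAST hits; scaffolds
    missing from the mapping are treated as having zero hits.
    """
    return [
        (name, seq)
        for name, seq in read_fasta(fasta_lines)
        if hit_counts.get(name, 0) >= min_hits
    ]


if __name__ == "__main__":
    # Made-up data for illustration only.
    fasta = [">scaffold_1 length=8", "ACGT", "ACGT", ">scaffold_2", "GGCC"]
    counts = {"scaffold_1": 14, "scaffold_2": 2}
    for name, seq in select_scaffolds(fasta, counts, min_hits=5):
        print(f">{name}\n{seq}")
```

The selected sequences can then be written back out as FASTA and flagged as likely organellar.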

What we can do in Geneious
Scaffolds identified through BLAST and hit counting were re-BLASTed as queries against grape and Arabidopsis mitochondrial DNA
– Just took one of the scaffolds with the most hits and re-BLASTed it
Some alignment with ORFs

One scaffold against all mitochondrial genes for grape and Arabidopsis
Notice 14 total hits… Different e-value? Is the program wrong?
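One way to check whether a different e-value cutoff explains a differing hit total is to recount the same results under several cutoffs. The sketch below assumes standard 12-column tabular BLAST output, where the e-value is the 11th column. `hits_per_cutoff` and its parameters are hypothetical names, and the demo rows are invented.

```python
def hits_per_cutoff(lines, cutoffs, evalue_col=10):
    """Count hits with e-value <= each cutoff in tabular BLAST output.

    Assumption: tab-separated rows with the e-value in column
    `evalue_col` (0-based; 10 in the default 12-column layout).
    """
    evalues = []
    for line in lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        fields = line.split("\t")
        if len(fields) <= evalue_col:
            continue  # malformed row, ignore it
        evalues.append(float(fields[evalue_col]))
    return {c: sum(1 for e in evalues if e <= c) for c in cutoffs}


if __name__ == "__main__":
    def row(evalue):
        # Build one made-up 12-column row with the given e-value.
        return "\t".join(["scaffold_1", "gene", "90.0", "100", "5", "0",
                          "1", "100", "1", "100", evalue, "150"])

    demo = [row("1e-50"), row("1e-8"), row("0.5")]
    for cutoff, n in hits_per_cutoff(demo, [1e-10, 1e-5, 10]).items():
        print(cutoff, n)
```

If the total from the counting program matches the Geneious total at some cutoff, the discrepancy is probably a threshold difference and not a bug.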

Mitochondrial Gene Scaffold/contig

How Accurate is Geneious ORF?

Questions…
How do we want to use Geneious?
Is further work really helpful?
Or is it good enough to know that these scaffolds are flagged?