 Read quality  Adaptor trimming  Read sequence collapse Preprocessing Genome mapping  Map read to the spruce genome (Pabies1.0- genome.fa) using Patman-1.2.2.

Slides:



Advertisements
Similar presentations
Homology Based Analysis of the Human/Mouse lncRNome
Advertisements

Processing of miRNA samples and primary data analysis
Bioinformatics Tutorial I BLAST and Sequence Alignment.
Basics of Comparative Genomics Dr G. P. S. Raghava.
MiRNA in computational biology 1 The Nobel Prize in Physiology or Medicine for 2006 Andrew Z. Fire and Craig C. Mello for their discovery of "RNA interference.
Local alignments Seq X: Seq Y:. Local alignment  What’s local? –Allow only parts of the sequence to match –Results in High Scoring Segments –Locally.
Summary Protein design seeks to find amino acid sequences which stably fold into specific 3-D structures. Modeling the inherent flexibility of the protein.
Predicting RNA Structure and Function. Nobel prize 1989Nobel prize 2009 Ribozyme Ribosome RNA has many biological functions The function of the RNA molecule.
MicroRNA genes Ka-Lok Ng Department of Bioinformatics Asia University.
Sequence Analysis. Today How to retrieve a DNA sequence? How to search for other related DNA sequences? How to search for its protein sequence? How to.
Scaffold Download free viewer:
Arabidopsis Gene Project GK-12 April Workshop Karolyn Giang and Dr. Mulligan.
© Wiley Publishing All Rights Reserved. Searching Sequence Databases.
What is comparative genomics? Analyzing & comparing genetic material from different species to study evolution, gene function, and inherited disease Understand.
Identifying and classifying functional small RNAs from pine Ryan Morin BC Genome Sciences Centre (presenting research conducted in the lab of Dr. Peter.
EXPLORING DEAD GENES Adrienne Manuel I400. What are they? Dead Genes are also called Pseudogenes Pseudogenes are non functioning copies of genes in DNA.
발표자 석사 2 년 김태형 Vol. 11, Issue 3, , March 2001 Comparative DNA Sequence Analysis of Mouse and Human Protocadherin Gene Clusters 인간과 마우스의 PCDH 유전자.
MicroRNA identification based on sequence and structure alignment Presented by - Neeta Jain Xiaowo Wang†, Jing Zhang†, Fei Li, Jin Gu, Tao He, Xuegong.
Computational Identification of Drosophila microRNA Genes Journal Club 09/05/03 Jared Bischof.
1 Classification of real and pseudo microRNA precursors using local structure-sequence features and support vector machine Chenghai Xue, Fei Li, Tao He,
1 P6a Extra Discussion Slides Part 1. 2 Section A.
RNA Sequencing I: De novo RNAseq
Genome Annotation Rosana O. Babu.
Biological databases Exercises. Discovery of distinct sequence databases using ensembl.
Basic Local Alignment Search Tool BLAST Why Use BLAST?
SMARTAR: small RNA transcriptome analyzer Geuvadis RNA analysis meeting April 16 th 2012 Esther Lizano and Marc Friedländer Xavier Estivill lab Programme.
Cool BaRC Web Tools Prat Thiru. BaRC Web Tools We have.
The Central Dogma The Central Dogma traces the flow of genetic information DNA Replication, Transcription, and Translation take place in human cells as.
Blast 2.0 Details The Filter Option: –process of hiding regions of (nucleic acid or amino acid) sequence having characteristics.
Classification of real and pseudo microRNA precursors using local structure-sequence features and support vector machine 朱林娇 14S
Moderní metody analýzy genomu - analýza Mgr. Nikola Tom Brno,
Annotation of eukaryotic genomes
Abstract Premise Figure 1: Flowchart pri-miRNAs were collected from miRBase 10.0 pri-miRNAs were compared to hsa and ptr genomes using BlastN and potential.
What is BLAST? Basic BLAST search What is BLAST?
GENBANK FILE FORMAT LOCUS –LOCUS NAME Is usually the first letter of the genus and species name, followed by the accession number –SEQUENCE LENGTH Number.
Work Presentation Novel RNA genes in A. thaliana Gaurav Moghe Oct, 2008-Nov, 2008.
Biocomputational Languages December 1, 2011 Greg Antell & Khoa Nguyen.
What is sequencing? Video: WlxM (Illumina video) WlxM.
 Series of enzyme catalyzed reactions  Glycolysis - citrate cycle – oxidative phosphorylation  Sugar -> energy.
Building Excellence in Genomics and Computational Bioscience miRNA Workshop: miRNA biogenesis & discovery Simon Moxon
BLAST: Basic Local Alignment Search Tool Robert (R.J.) Sperazza BLAST is a software used to analyze genetic information It can identify existing genes.
What is BLAST? Basic BLAST search What is BLAST?
bacteria and eukaryotes
Figure 1. The overall workflow of RNA-seq QC
Identifying Conserved microRNAs in a Large Dataset of Wheat Small RNAs
Basics of BLAST Basic BLAST Search - What is BLAST?
Basics of Comparative Genomics
Sequence based searches:
S1 Supporting information Bioinformatic workflow and quality of the metrics Number of slides: 10.
Functional Annotation of Transcripts
Chapter 4 Recombinant DNA Technology
Figure 3. Schematic of the parameters to assess junctions in SpliceMap
GEP Annotation Workflow
Identify D. melanogaster ortholog
Comparative Genomics.
DNA & The Genetic Code The sequence (order) of bases in a strand of DNA acts as a template for DNA replication and makes the code for building proteins.
Basic Local Alignment Search Tool
Volume 21, Issue 22, Pages (November 2011)
Basic Local Alignment Search Tool (BLAST)
Volume 24, Issue 13, Pages (July 2014)
Fast, Quantitative and Variant Enabled Mapping of Peptides to Genomes
Identification of novel peptides in annotations from other plants.
Matthew W Jones-Rhoades, David P Bartel  Molecular Cell 
Baekgyu Kim, Kyowon Jeong, V. Narry Kim  Molecular Cell 
Basics of Comparative Genomics
CARPEL FACTORY, a Dicer Homolog, and HEN1, a Novel Protein, Act in microRNA Metabolism in Arabidopsis thaliana  Wonkeun Park, Junjie Li, Rentao Song,
Basic Local Alignment Search Tool
Computational Genomics of Noncoding RNA Genes
Figure 1a. Insertion of sequence into Claudi capsid gene
Derek de Rie and Imad Abuessaisa Presented by: Cassandra Derrick
Presentation transcript:

 Read quality  Adaptor trimming  Read sequence collapse Preprocessing Genome mapping  Map read to the spruce genome (Pabies1.0- genome.fa) using Patman  Raw abundance ≥ 10  Length between nt  Genomic matches on genome ≤ 20 Preliminary filter  minimal space size between miRNA and miRNA*: 5  minimal pair of miRNA and miRNA*: 14  maximal bulge of miRNA and miRNA*: 3  maximal asymmetry of miRNA/miRNA* duplex: 3 Stem-loop structure screen (modified miREAP)  BLAST against miRBase, version 20 (≤ 4 nucleotide differences) Homology search  Strand bias filter: sense/total ≥ 0.9  Abundance bias: (Top1 + Top2) / total ≥ 0.7 for novel miRNAs and 0.4 for known miRNAs  With star sequence found in corresponding library Strict filter (applied to at least one library) STEPS CRITERIA Supplemental figure S1. Workflow for miRNA identification in spruce 52 known miRNA families (313 precursors with 185 unique miRNAs) 181 novel miRNA families (272 precursors with 241 unique miRNAs) Spruce miRNAs (585 precursors with 426 unique miRNAs)  MIRNA locus redundancy remove  MIRNA structure check  Novel miRNA family classification  Novel miRNA name assignment Manual check

(1) Does not code an appreciable peptide ( ≤ 100 amino acids) 1,025 loci 196 loci (CP ≥ 0) (2) Not good protein-coding potential, CPC score (evaluated by CPC-0.9-r2) 606 loci 300 loci Non-coding PHAS loci Supplemental figure S2. Coding and non-coding classification of PHAS genes 614 loci (CP ≤ -1) 155 loci (E ≤ 1e-4) 60 loci (E > 1e-4) 1,420 loci Coding PHAS loci (3) No significant homology to known protein (BLASTX to UniRef90, e-value < 1e-4) 215 loci (-1 < CP < 0) 1,001 loci 674 loci (4) No overlap with annotated coding genes (Pabies1.0_HC.gff3) 68 loci 2,061 loci PHAS locus search

Supplemental figure S3. Stem-loop structures of miR482/2118 in spruce miRNA sequence is highlighted in red