I. Introduction and Red Line Education for Data-unlimited Science.

Slides:



Advertisements
Similar presentations
Evolution of bz Orthologous Regions in the Genus Zea and Relatives in the Andropogoneae Qinghua Wang and Hugo K. Dooner Waksman Institute, Rutgers University,
Advertisements

© Wiley Publishing All Rights Reserved. Using Nucleotide Sequence Databases.
Genomics: READING genome sequences ASSEMBLY of the sequence ANNOTATION of the sequence carry out dideoxy sequencing connect seqs. to make whole chromosomes.
LESSON 1: What is Genetic Research? PowerPoint slides to accompany Using Bioinformatics : Genetic Research.
Genome Annotation using MAKER-P at iPlant Collaboration with Mark Yandell Lab (University of Utah) iPlant: Josh Stein (CSHL) Matt Vaughn.
Transcriptome Sequencing with Reference
BIOINFORMATICS Ency Lee.
ECE 501 Introduction to BME
Genes. Outline  Genes: definitions  Molecular genetics - methodology  Genome Content  Molecular structure of mRNA-coding genes  Genetics  Gene regulation.
Integration of Bioinformatics into Inquiry Based Learning by Kathleen Gabric.
The Central Dogma of Molecular Biology (Things are not really this simple) Genetic information is stored in our DNA (~ 3 billion bp) The DNA of a.
Plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic.
Aequatus Browser, an open-source web-based tool developed at TGAC to visualise homologous gene structures among differing species or subtypes of a common.
NGS Analysis Using Galaxy
Using DNA Subway in the Classroom Red Line Lesson Sketch.
Using DNA Subway in the Classroom Red Line Lesson Sketch.
Genome Annotation using MAKER-P at iPlant Collaboration with Mark Yandell Lab (University of Utah) iPlant: Josh Stein (CSHL) Matt Vaughn.
Gramene Objectives Develop a database and tools to store, visualize and analyze data on genetics, genomics, proteomics, and biochemistry of grass plants.
What is comparative genomics? Analyzing & comparing genetic material from different species to study evolution, gene function, and inherited disease Understand.
Manifestations of a Code Genes, genomes, bioinformatics and cyberspace – and the promise they hold for biology education.
1 The Genome Browser allows you to –Browse the Rice-Japonica, Maize and Arabidopsis genomes. –View the location of a particular feature on the rice genome.
 GEP Digital Laboratory Notebook Nick Reeves, Mt. San Jacinto Community College.
Genome Organization and Evolution. Assignment For 2/24/04 Read: Lesk, Chapter 2 Exercises 2.1, 2.5, 2.7, p 110 Problem 2.2, p 112 Weblems 2.4, 2.7, pp.
I hear and I forget. I see and I remember. I do and I understand. - Confucius PCAST recommendation #2: “Advocate and provide support for replacing standard.
A new way of seeing genomes Combining sequence- and signal-based genome analyses Maik Friedel, Thomas Wilhelm, Jürgen Sühnel FLI Introduction: So far,
20.1 Structural Genomics Determines the DNA Sequences of Entire Genomes The ultimate goal of genomic research: determining the ordered nucleotide sequences.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Network for Integrating Bioinformatics into Life Sciences Education April, 2014.
IPlant Genomics in Education Workshop Genome Exploration in Your Classroom.
ANALYSIS AND VISUALIZATION OF SINGLE COPY ORTHOLOGS IN ARABIDOPSIS, LETTUCE, SUNFLOWER AND OTHER PLANT SPECIES. Alexander Kozik and Richard W. Michelmore.
What is Genetic Research?. Genetic Research Deals with Inherited Traits DNA Isolation Use bioinformatics to Research differences in DNA Genetic researchers.
Welcome to DNA Subway Classroom-friendly Bioinformatics.
Organizing information in the post-genomic era The rise of bioinformatics.
Plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic.
Web Databases for Drosophila Introduction to FlyBase and Ensembl Database Wilson Leung6/06.
+ => Bioinformatics: from Sequence to Knowledge Outline: Introduction to bioinformatics The TAU Bioinformatics unit Useful bioinformatics issues and databases:
Gramene Objectives Provide researchers working on grasses and plants in general with a bird’s eye view of the grass genomes and their organization. Work.
Sackler Medical School
Annotating genomes using MAKER-P and iPlant. What Are Annotations? Annotations are descriptions of features of the genome –Structural: exons, introns,
IPlant Genomics in Education Workshop Genome Exploration in Your Classroom.
Mark D. Adams Dept. of Genetics 9/10/04
 GEP Implementation at Mt. San Jacinto Community College Nick Reeves, Ph.D.
Epidemiology 217 Molecular and Genetic Epidemiology Bioinformatics & Proteomics John Witte.
IPlant Genomics in Education
Prospecting for Genes that Fueled the Green Revolution
By Chris Paine Genes Essential idea: Every living organism inherits a blueprint for life from its parents. Genes and.
The iPlant Collaborative Vision Enable life science researchers and educators to use and extend cyberinfrastructure.
How can we find genes? Search for them Look them up.
Genomes & The Tree of Life
The iPlant Collaborative Vision Enable life science researchers and educators to use and extend cyberinfrastructure.
Integration of Bioinformatics into Inquiry Based Learning by Kathleen Gabric.
SRB Genome Assembly and Analysis From 454 Sequences HC70AL S Brandon Le & Min Chen.
-1- Module 3: RNA-Seq Module 3 BAMView Introduction Recently, the use of new sequencing technologies (pyrosequencing, Illumina-Solexa) have produced large.
UCSC Genome Browser Zeevik Melamed & Dror Hollander Gil Ast Lab Sackler Medical School.
Tools in Bioinformatics Genome Browsers. Retrieving genomic information Previous lesson(s): annotation-based perspective of search/data Today: genomic-based.
Accessing and visualizing genomics data
Welcome to the combined BLAST and Genome Browser Tutorial.
Transforming Science Through Data-driven Discovery Genomics in Education University of Delaware – February 2016 Jason Williams, Education, Outreach, Training.
Using DNA Subway in the Classroom Genome Annotation: Red Line.
Visualizing Biosciences Genomics & Proteomics. “Scientists Complete Rough Draft of Human Genome” - New York Times, June 26, 2000 The problem: –3 billion.
IPlant Genomics in Education Workshop Genome Exploration in Your Classroom.
Using BLAST to Identify Species from Proteins
Tools and Services Workshop
Bioinformatics in the Dynamic Genome Course
JMC CGEMS SUMMER GENOMICS TRAINING WORKSHOPS
Genome organization and Bioinformatics
Genome Annotation w/ MAKER
Volume 8, Issue 6, Pages (June 2015)
Introduction to Bioinformatics II
Michael Freeling, Jie Xu, Margaret Woodhouse, Damon Lisch 
3.1 Genes Essential idea: Every living organism inherits a blueprint for life from its parents. Genes and hence genetic information is inherited from.
Presentation transcript:

I. Introduction and Red Line Education for Data-unlimited Science

ResearchEducation For the first time in the history of biology students can work with the same data at the same time and with the same tools as research scientists. Educational Challenge Context of scientific discovery

Monocots Dicots Time (million years) Present Oryza (rice) Avena (oats) Hordeum (barley) Triticum (wheat) Setaria (foxtail millet) Pennisetum (pearl millet) Sorghum Zea (maize) Arabidopsis Brachypodium Glycine max (soy) 2,500 Mb 750 Mb 20,000 Mb 270 Mb 430 Mb 145 Mb 1,115 Mb ?? Mb 5,200 Mb >20,000 Mb ?? Mb Plant Genomes Vary Widely in Size = Genome duplication event

Genome Duplication/Factionation

DNA Subway Concepts (Big Ideas) Genomes are complex and dynamic (queer). DNA sequence is information. DNA sequence is biological identity. Gene annotation adds meaning to DNA sequence. Concept of gene continues to evolve. A genome is more than genes.

Insights from Genomics in Education Washington University, June 16-19, participants from three worlds and three kingdoms Bioinformatics: Students have limited patience for pure computer work and want a wet bench hook. Student-scientists partnerships: Someone has to care about the data generated by students. Students as co-investigators: Projects should potentially lead to publication. Scale: Need to move from individual experiments to course-based and distributed research projects.

Walk or…

Ride…

DNA Subway an educational Discovery Environment Simplified bioinformatics workflows Developed with 25 collaborators at 11 institutions Since March 2010 launch: 2,905 registered users 52,591 visits, 24,593 unique visits Red Line: predict and annotate genes in <150 kb Yellow Line: identify homologs in sequenced genomes Blue Line: analyze DNA barcodes and build gene trees Green Line: align and analyze RNA-seq data (coming)

Red Line Learning Questions What is a gene and how does it relate to DNA sequence? What are the components of genes? How does a gene relate to the central dogma of molecular biology: DNA <> RNA > Protein? How does a gene encode a protein? How is the mathematical evidence used to predict genes? How does biological evidence (from RNA and proteins) confirm gene predictions?

Genes as Beads on a String

Morgan’s Beads on a String

Human Globin Locus on Chromosome 11 Is that really all there is?

Human Genome Insights (ENCODE) Majority of genome is transcribed ~50% transposons ~25% protein coding genes/1.3% exons ~23,700 protein coding genes ~160,000 transcripts Average Gene ~ 36,000 bp 7 ~ 300 bp 6 ~5,700 bp 7 alternatively spliced products (95% of genes)

Piano Keys? Keys dynamically placed by real data (features, coordinates)

This map can allow student to appreciate some of the complexity of the genome. Clicking on links to sequence confirms a relationship between something called a gene and a DNA sequence. What is a gene is and how does it relate to DNA ?

Submit Sequence Identify & Mask Repeats Predict Genes Search Datasets Build Gene Models Prospect Genomes Compare Annotations (Optional) Load User Data Predict Function Gene Annotation Workflow

Brent Buckner, Ph.D. Truman State University “I have found that students are overwhelmed by their first introduction to genome sequences viewed on a genome browser. Students who used DNA Subway needed little or no guidance when they moved on to use MaizeGDB and had an easier time transitioning to genomes depicted in different genome browsers.”

DNA Subway Case Study Brent Buckner, Ph.D., Truman State University Sophomore genetics class, spring 2010 and 2011 – 70 students used Red Line to annotate 3.7 mbp of maize genome – 12 hours effort, each student annotated 100 kb – Follow-up research projects by 7 undergraduates: Compared syntenic regions of maize Chr. 6 and sorghum 65 hours effort, each student annotated 1 million bp MaizeGDB, MaizeSequence.org, InterProScan, CoGE, PlexDB, Circos Sophomore genetic class, spring 2012 – 19 students used Red Line to visualize next-gen RNA-Seq data to investigate presence/absence variation (PAV) in maize – 12 hours effort, each student group annotated 100 kb and then imported next-gen RNA-Seq data from 5 different tissues in 30 maize inbred lines for a gene that they had previously shown exhibits PAV

Judy Brusslan, Ph.D. CSU, Long Beach “When I used the Red Line exercise in six lab sections of my General Genetics class this Fall, it went smoothly and best of all, there was a mass “Ah-ha” moment when the results of the gene prediction programs were displayed on the Genome Browser. The use of BLASTX and BLASTN within the Red Line allowed the students to visualize the different outputs and understand the value of sequenced cDNAs for gene prediction.”