Molecular Marker Evaluation Data. Laura Fredrick Marek, ISU/NCRPIS, Ames, IA. Supporting presentations by: Chuck Simon, PGRU, Geneva, NY (grape SSR information); Clare Coyne, WRPIS, Pullman, WA (pea SNP information).

Molecular Session Provide an overview of current organization of molecular descriptors/information in GRIN. Present and solicit suggestions about what information should be included in GRIN. Present and solicit suggestions about how the data should be organized and presented in GRIN. Does standardization of presentation format make searching GRIN easier and assist database connectivity? What do users want to see?

Crops with defined molecular descriptors: maize, sugar beet, cucumis, clover, hazelnut, woody-landscape, grape, sunflower, vaccinium, sorghum. Not many crops; good time to evaluate/modify data organization and content. Molecular category can be strictly defined.

A sampling of molecular descriptors. Currently every molecular descriptor is unique in both DQ name and definition. There is significant variation in definition field content. [Figure label: GRIN link name]

Molecular Descriptors. Molecular category can be strictly defined; descriptors involve specific DNA marker information. Indirect detection (gene products): isozymes, allozymes*. Direct detection (DNA sequence): SSRs*, RAPDs*, AFLPs, STRs, SNPs, RFLPs, others. (* marker types with data in GRIN)

Example of molecular GRIN data: isozymes. Data from multiple crops; data for the same enzyme from multiple crops.

Isozyme Raw Data: Malate Dehydrogenase (MDH1) sunflower gel (gel by M. Brothers). [Gel image; labels: allele 1, allele 2, invariant mito. band; lanes A to E]

Isozyme data set in GRIN: MDH1 in study SUNFLOWER.SUN.ISOZYME.CORE.03. One way to get the data into an Excel file is to search by descriptor in the Oracle Forms database (curators’ GRIN). Sample size is critical information to include.

Isozyme descriptors in GRIN

Cucumis molecular descriptors in GRIN. Field size limitation: link names for different descriptors appear identical.

Isozyme descriptors in GRIN. (* Summary statistics listed by accession.) Should a naming convention be used across crops for molecular descriptors? In general, this is done with morphological descriptors.

A sampling of molecular descriptors. Currently every molecular descriptor is unique in both DQ name and definition. There is significant variation in definition field content. Vaccinium DQ name is derived from the EST source name. [Figure label: GRIN link name]

SSR gel (Licor) of individuals from three alfalfa cultivars: Aragon, Hunter River and Yonca. [Gel image; lane labels: MWM, Aragon, Hunter River, Yonca] SSR gel image and information: Ted Kisha, W6.

SSR and other PCR-based descriptors in GRIN. The blueberry descriptor definition field includes more information than that of any other crop.

Blueberry SSR data model (Nahla Bassil, Corvallis). A tour through public GRIN… or, what the information looks like to our users…

Select “Research Crops and Descriptor/Evaluation Data Queries”.

Select “VACCINIUM”

Select “List of Descriptors”

Select “SSR-CA112F”. This page shows an abbreviated SSR definition. CA stands for “cold acclimated” (defined only in the embedded Excel worksheet). The actual descriptor name does not include “SSR-”, which affects the ability to search in the Oracle Forms database version. Combined GRIN/information based choice?

Select “Observed values”. (In sunflower, the equivalent field is named “code values”.) This page shows the full descriptor definition, which contains descriptor-specific information for this evaluation, in more detail than any other descriptor. The actual descriptor name does not include “SSR-”, which affects the ability to search in the Oracle Forms database version.

Definition indicated allele range of bp. Go back to the full descriptor definition page.

Select “CULTIVATED BLUEBERRY.SSR.2005” (called study or environment on this page). The actual descriptor name does not include “SSR-”, which affects the ability to search in the Oracle Forms database version.

The actual descriptor name does not include “VACCINIUM”, which affects the ability to search in the Oracle Forms database version. Selecting “VACCINIUM CA112F” goes back to the full definition page. If select “54 Accessions … (Called evaluation on this page.)
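
The naming mismatch noted on the last few slides can be illustrated with a small sketch. It assumes, based only on the remarks above, that the stored descriptor name is the bare code (for example “CA112F”) and that public pages add “SSR-” or “VACCINIUM ” in front of it. The prefix list and the function name are hypothetical and are not part of GRIN.

```python
# Prefixes seen on the public pages in this walkthrough.
# Assumption: the stored descriptor name is the link name without them.
DISPLAY_PREFIXES = ("SSR-", "VACCINIUM ")


def stored_descriptor_name(link_name, prefixes=DISPLAY_PREFIXES):
    """Strip one known display prefix so the name can be used as a search key."""
    name = link_name.strip()
    for prefix in prefixes:
        if name.upper().startswith(prefix):
            return name[len(prefix):]
    return name


for shown in ("SSR-CA112F", "VACCINIUM CA112F", "CA112F"):
    print(shown, "->", stored_descriptor_name(shown))
```

A lookup of this kind would not be needed if the link name and the stored descriptor name followed one convention, which is the question the session raises.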

Blueberry accessions are clonal. In an outcrossing seed crop, data in this category could look like…

Population sample size is critical information to include. Back to the evaluation page…

Select “an Excel Spreadsheet”.

Partial listing of CA-type Vaccinium SSRs (descriptors) and observation values. Other SSR types (NA, VCC) listed on separate worksheets in the file.

Is useful information missing from the blueberry model? Information: publication reference; BLAST date, search algorithm and database(s) searched; summary statistics (in publication?); population size or sampling information. Organization: put all common descriptor data in the evaluation page description.

Grape SSR data presented by Chuck

What information should be included? Currently each molecular descriptor is unique. Most will be. Significant variation in content in definition and evaluation fields. How to name descriptors? Should descriptors follow a naming convention across crops? Introduce a new field containing molecular marker type (SSR, SNP etc.) to facilitate cross-genera searching. Should definitions contain similar information across data types and crops? Experimental details, raw data: information to allow use of markers by other labs. Summary statistics: information to assess accession diversity. How to define evaluation (environment)? Descriptor definitions should contain descriptor-unique information. Evaluation/environment definitions should contain information common to all descriptors in that evaluation.
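
As an illustration of the proposed marker-type field, here is a minimal sketch of how such a field could support searching across genera. The record layout, the field names and the grape entry are assumptions made for this example; they do not describe the actual GRIN schema.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Descriptor:
    crop: str         # crop the descriptor belongs to
    name: str         # descriptor (DQ) name
    marker_type: str  # proposed new field: "SSR", "SNP", "ISOZYME", ...


def find_by_marker_type(descriptors, marker_type):
    """Return all descriptors of one marker type, regardless of crop."""
    wanted = marker_type.upper()
    return [d for d in descriptors if d.marker_type.upper() == wanted]


descriptors = [
    Descriptor("VACCINIUM", "CA112F", "SSR"),
    Descriptor("SUNFLOWER", "MDH1", "ISOZYME"),
    Descriptor("GRAPE", "EXAMPLE_SSR_1", "SSR"),  # hypothetical entry
]

ssr_crops = sorted({d.crop for d in find_by_marker_type(descriptors, "ssr")})
print(ssr_crops)
```

With the marker type held in its own field, the search does not depend on how each crop happens to name its descriptors.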

Structure of molecular data in GRIN. Three models (A, B, C) are currently in use; B is most common. [Diagram of the three models; labels: CROP, molecular, CROP-isozyme, CROP-SSR, CROP-SNP, isozyme#1, SSR#1, SSR#2, SSR#3, SNP#1, SNP#2, DQ name, enzyme locus, etc.] Consider how data organization affects searching ability in the GRIN curators’ database version (Oracle Forms). Which format best handles multiple accessions with multiple descriptors within multiple data types (blueberry model of 54/28 is a conservative amount of data)?

Re-categorize some molecular descriptors: other chromosome-related information is presented in the category “cytologic”; other cytoplasmic male sterility data is presented in the category “cytoplasm type”.

Summary: Molecular Session. What information should be included in GRIN? What do users want to see? Include allele sizes and frequency data in GRIN. Information to include in descriptor definition (as per SSR data model): primer sequences; GenBank reference; publication reference or information source; experimental details unique to the individual descriptor, such as population sampling size; for BLAST search results, the search algorithm, date of search and database(s) searched; sample gel image(s) with standards (molecular weight and standard accession). Information to include in evaluation/environment definition: experimental details common to all descriptors in the environment.
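
To show what “allele sizes and frequency data” with a recorded sample size might look like, the following sketch computes allele frequencies from SSR allele sizes. The genotypes are invented, diploid scoring is assumed, and expected heterozygosity is used only as one common example of a summary statistic; none of this comes from GRIN data.

```python
from collections import Counter


def allele_frequencies(genotypes):
    """genotypes: list of (allele1, allele2) sizes in bp, one pair per individual."""
    counts = Counter(allele for pair in genotypes for allele in pair)
    total = sum(counts.values())
    return {allele: n / total for allele, n in counts.items()}


def expected_heterozygosity(freqs):
    """One common diversity summary: 1 minus the sum of squared frequencies."""
    return 1.0 - sum(p * p for p in freqs.values())


# Made-up allele sizes (bp) for four individuals sampled from one accession.
sample = [(150, 150), (150, 154), (154, 158), (150, 158)]

freqs = allele_frequencies(sample)
summary = {
    "n_individuals": len(sample),  # sample size, flagged as critical above
    "frequencies": freqs,
    "expected_heterozygosity": expected_heterozygosity(freqs),
}
print(summary)
```

Storing the sample size next to the frequencies lets a user judge how much weight the estimates can bear.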

Summary: Molecular Session. How should the data be organized and presented in GRIN? Would standardization of presentation format make searching GRIN easier and assist database connectivity? Select descriptor names that allow cross-crop searching. Introduce a new field containing molecular marker type (SSR, SNP etc.) to facilitate cross-crop searches. Embed information links in GRIN, not on local web pages. Consider re-categorization of descriptors to streamline searching.

Helianthus niveus ssp. niveus on the Pacific coast sand dunes west of Vicente Guerrero, BC, Mex. Endemic to Baja California; the only Helianthus taxon not represented in NPGS.