Software and Databases for Managing and Selecting Molecular Markers

Presentation transcript:

Software and Databases for Managing and Selecting Molecular Markers
General introduction
Pathway approach for candidate gene identification and introduction to metabolic pathway databases
Identification of polymorphisms in database sequences

Databases (General and Crop Specific)
Germplasm: GRIN, TGRC
Sequence: NCBI, SGN
Metabolic: PlantCyc

New format to NCBI

Access current and past scientific literature

Increased emphasis on phenotypic data

Germplasm databases

Crop specific germplasm resources

Example: QTL for color uniformity in elite crosses

QTL   Trait    Origin
2     L, YSD   S. lyc.
4     YSD      S. lyc.
6     L, Hue   og c
7     L, Hue   S. hab.
11    L, Hue   S. lyc.

Audrey Darrigues, Eileen Kabelka

Carotenoid Biosynthesis: Candidate pathway for genes that affect color and color uniformity. Disclaimer: this is not the only candidate pathway…

Databases that link pathways to genes

Databases that link pathways to genes
External Plant Metabolic databases:
CapCyc (Pepper) (C. annuum)
CoffeaCyc (Coffee) (C. canephora)
SolCyc (Tomato) (S. lycopersicum)
NicotianaCyc (Tobacco) (N. tabacum)
PetuniaCyc (Petunia) (P. hybrida)
PotatoCyc (Potato) (S. tuberosum)
SolaCyc (Eggplant) (S. melongena)

Note: missing step (lycopene isomerase, tangerine)

Check boxes (Note: MetaCyc has many more choices, but no plants)

Scroll down the page: Capsicum annuum sequence retrieved

Select database

Zeaxanthin epoxidase
Probable location on Chromosome 2

Query CCACCACCATCCTCACTTTAACCCACAAATCCCACTTTCTTTGGCCTAATTAACAATTTT
      |||||||||||||||||||||||||||||||||| |||||||||||||||||||||||||
Sbjct CCACCACCATCCTCACTTTAACCCACAAATCCCATTTTCTTTGGCCTAATTAACAATTTT

Alignment of Z83835 and EF reveals 5 SNPs over ~2000 bp
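A minimal sketch of how the mismatch in the alignment window above could be located programmatically. It assumes the two sequences are already aligned to the same length (as in BLAST pairwise output); the function name `find_snps` is hypothetical, and the reported position is relative to the 60-base window shown, not to the gene or chromosome.

```python
def find_snps(query: str, subject: str):
    """Return (1-based position, query base, subject base) for each mismatch.

    Assumes both strings come from the same pairwise alignment and therefore
    have equal length. Gap characters ('-') are skipped, since an indel is
    not a SNP.
    """
    if len(query) != len(subject):
        raise ValueError("sequences must be aligned to the same length")
    return [
        (pos, q, s)
        for pos, (q, s) in enumerate(zip(query.upper(), subject.upper()), start=1)
        if q != s and q != "-" and s != "-"
    ]


# The 60-base window shown on the slide.
QUERY = "CCACCACCATCCTCACTTTAACCCACAAATCCCACTTTCTTTGGCCTAATTAACAATTTT"
SBJCT = "CCACCACCATCCTCACTTTAACCCACAAATCCCATTTTCTTTGGCCTAATTAACAATTTT"

snps = find_snps(QUERY, SBJCT)
for pos, q, s in snps:
    print(f"window position {pos}: {q} > {s}")
```

A mismatch found this way is only a candidate (an eSNP); it would still need to be validated as a marker in the germplasm of interest.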

51 annotated loci

Information missing from other databases is here… Candidates identified in other databases are here

Comment on the databases:
Information is not always complete or up to date.
Display is not always optimal, and several steps may be needed to go from pathway > gene > potential marker.
Sequence data has error associated with it. eSNPs are not the same as validated markers.
Germplasm data may also have error (e.g. PI )
There is a wealth of information organized and available.

The previous example detailed how we might identify sequence-based markers for trait selection.

Query CCACCACCATCCTCACTTTAACCCACAAATCCCACTTTCTTTGGCCTAATTAACAATTTT
      |||||||||||||||||||||||||||||||||| |||||||||||||||||||||||||
Sbjct CCACCACCATCCTCACTTTAACCCACAAATCCCATTTTCTTTGGCCTAATTAACAATTTT

Improving efficiency of selection in terms of 1) relative efficiency of selection, 2) time, 3) gain under selection and 4) cost will benefit from markers for both forward and background selection.

Remainder of presentation will focus on:
Where to apply markers in a program
Forward and background selection
Marker resources
Alternative population structures and size

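Relative efficiency of selection: r(gen) x (H_i / H_d)
Here r(gen) is the genetic correlation between the indirect selection criterion (the marker) and the target trait, and H_i and H_d are the heritability terms for indirect and direct selection.
Comparison of direct selection with indirect selection (MAS).
Ranking: line performance over locations > MAS > single plant.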
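The relative-efficiency expression above can be evaluated directly. The sketch below takes the two H terms exactly as the slide writes them and makes no assumption about how they were estimated; the function name and the input values are hypothetical, chosen only to show the arithmetic.

```python
def relative_efficiency(r_gen: float, h_indirect: float, h_direct: float) -> float:
    """Relative efficiency of indirect (marker) vs. direct selection.

    Implements r(gen) x (H_i / H_d) as written on the slide.
    A value above 1 favors indirect selection; below 1 favors direct selection.
    """
    if h_direct <= 0:
        raise ValueError("h_direct must be positive")
    return r_gen * h_indirect / h_direct


# Hypothetical inputs: a marker scored without error (H_i = 1.0), a genetic
# correlation of 0.8 with the trait, and a poorly heritable trait (H_d = 0.5).
example = relative_efficiency(r_gen=0.8, h_indirect=1.0, h_direct=0.5)
print(f"relative efficiency = {example:.2f}")
```

With a highly heritable trait (H_d close to H_i), the ratio falls toward r(gen), which is why direct selection on replicated line performance can still outrank MAS.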

Accelerating Backcross Selection
Expected proportion of Recurrent Parent (RP) genome in BC progeny (RP:donor, %)

F1    50:50
BC1   75:25
BC2   87.5:12.5
BC3   93.75:6.25
BC4   96.875:3.125
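The expected proportions follow from halving the donor share with each backcross. A short sketch, assuming no selection and no linkage drag (the function name is hypothetical):

```python
def expected_rp_percent(n_backcrosses: int) -> float:
    """Expected % recurrent-parent genome after n backcrosses (0 = the F1).

    Without selection the donor share halves each generation, so the
    RP share is 1 - (1/2)^(n + 1).
    """
    if n_backcrosses < 0:
        raise ValueError("n_backcrosses must be >= 0")
    return 100.0 * (1.0 - 0.5 ** (n_backcrosses + 1))


table = {
    ("F1" if n == 0 else f"BC{n}"): expected_rp_percent(n)
    for n in range(5)
}
for generation, rp in table.items():
    print(f"{generation:<4} {rp:g}:{100 - rp:g}")
```

These are population averages; individual progeny vary around them, which is the variation that background selection with markers exploits.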

References: Frisch, M., M. Bohn, and A.E. Melchinger. 1999. Comparison of Selection Strategies for Marker-Assisted Backcrossing of a Gene. Crop Science 39:

Progeny needed for Background Selection During MAS
Q10 indicates a 90% probability of success.
From Frisch et al., 1999.

Marker Data Points required (Modified from Frisch et al., 1999; based on assumption of 12 chromosomes; initial selection with 4 markers/chromosome)
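The genotyping load implied by these assumptions is simple arithmetic: progeny screened times markers scored per plant. The sketch below covers only the initial screen; the progeny count is a hypothetical input, not a value from Frisch et al.

```python
CHROMOSOMES = 12             # assumption stated on the slide
MARKERS_PER_CHROMOSOME = 4   # initial selection density stated on the slide


def initial_data_points(n_progeny: int) -> int:
    """Marker data points for the initial screen of one backcross generation."""
    if n_progeny < 0:
        raise ValueError("n_progeny must be >= 0")
    return n_progeny * CHROMOSOMES * MARKERS_PER_CHROMOSOME


# Hypothetical example: screening 100 progeny at 48 markers each.
points = initial_data_points(100)
print(f"{points} marker data points")
```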

For effective background selection we need:
Markers for our target locus (C > T SNP for Zep)
Markers on the target chromosome (Chrom. 2)
Markers unlinked to the target chromosome (~2 per chromosome arm)

Ovate

HBa0104A12

55 polymorphic markers 44 polymorphic markers

Where can we expect to be?
Data based on an estimated ~42% of sequence; therefore expect as many as 300 markers for a cross like E6203 x H1706.
Analysis by Buell et al., unpublished.

When is the time to move from reliance on public databases to in-house pipelines?
[Diagram labels: DOS, UNIX, CygWin (Unix emulator), BLAST, Perl, BioPerl, Cyc, NCBI, in-house database]

Complete genome sequences are available for: Soybean, Corn, Potato, Tomato, Cucumber, and more are coming….


QTLs mapped in a bi-parental cross may not be appropriate for MAS in all populations:
Marker allele and trait may not be linked in all populations.
Genetic background effects may be population specific.
Original association may be spurious.
QTL detection is dependent on the magnitude of the difference between alleles and the variance within marker classes.
Confirmation of phenotype along the way is very important!
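The dependence of QTL detection on both the allelic difference and the within-class variance can be illustrated with a pooled-variance t statistic for two marker classes. This is a generic single-marker test used here as a sketch, not the specific method behind the slides; the trait values are made up for illustration.

```python
from math import sqrt
from statistics import mean, variance


def marker_class_t(class_a, class_b) -> float:
    """Pooled-variance t statistic for the difference between marker-class means.

    Each argument is a list of trait values for plants carrying one marker
    allele. Each class needs at least two values with nonzero pooled variance.
    """
    n_a, n_b = len(class_a), len(class_b)
    pooled_var = (
        (n_a - 1) * variance(class_a) + (n_b - 1) * variance(class_b)
    ) / (n_a + n_b - 2)
    return (mean(class_a) - mean(class_b)) / sqrt(pooled_var * (1 / n_a + 1 / n_b))


# Same difference between class means (3 units), different within-class spread.
t_tight = marker_class_t([10, 11, 12], [7, 8, 9])
t_noisy = marker_class_t([6, 11, 16], [3, 8, 13])
print(f"low variance:  t = {t_tight:.2f}")
print(f"high variance: t = {t_noisy:.2f}")
```

The same allelic difference gives a much weaker signal when plants within a marker class vary widely, which is one reason a QTL detected in one population may go undetected in another.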

Take home messages:
Marker resources exist for forward and background selection in elite x elite crosses in tomato.
Marker resources are currently not sufficient for QTL discovery in bi-parental or AM populations; they will soon be.
The best time to use genetic markers: early-generation selection.
Restructuring of a breeding program to integrate markers may include:
1) Increasing genotypic replication (population size) at the expense of replication (consider augmented designs).
2) Collecting objective data.

References:
Kaepler, TAG 95:
Frisch, et al., Crop Science 39:
Knapp and Bridges, Genetics 126:
Yu et al., Nature Genetics 38:
Van Deynze et al., BMC Genomics 8:465