Genetic Diversity Analysis and Single-nucleotide Polymorphism Marker Development in Cultivated Bulb Onion Based on Expressed Sequence Tag–Simple Sequence Repeat Markers
Click on author name to view affiliation information
Click on author name to view affiliation information
Bulb onion (Allium cepa L.) is a globally significant crop, but the structure of genetic variation within and among populations is poorly understood. We broadly surveyed genetic variation in a cultivated onion germplasm using simple sequence repeat (SSR) markers and sequenced regions flanking expressed sequence tag (EST)-SSRs to develop single-nucleotide polymorphism (SNP) markers. Samples from 89 inbred and open-pollinated (OP) bulb onion populations of wide geographical adaptation and four related Allium L. accessions were genotyped with 56 EST-SSR and four genomic SSR markers. Multivariate analysis of genetic distances among populations resolved long-day, short-day, and Indian populations. EST-SSR markers frequently revealed two major alleles at high frequency in OP populations. The median proportion of single-locus polymorphic loci was 0.70 in OP and landrace populations compared with 0.43 in inbred lines. Resequencing of 24 marker amplicons revealed additional SNPs in 17 (68%) and five SNP assays were developed from these, suggesting that resequencing of EST markers can readily provide SNP markers for purity testing of inbreds and other applications in Allium genetics.
Bulb onion is one of the most widely cultivated vegetables but is poorly characterized at the genetic level. It is a biennial, outcrossing diploid that exhibits severe inbreeding depression (Havey, 1993) and is traditionally propagated from seed produced by insect crosspollination of large populations. Increasingly, commercial breeding is focused on F1 hybrids based on cytoplasmic male sterility, which necessitates inbreeding to fix nuclear restorer loci (Berninger, 1965; Jones and Clarke, 1943). Shallot (Allium cepa Aggregatum group) is a morphologically distinctive but closely related horticultural type that is mostly propagated asexually, although fertile types can freely intercross with bulb onion (Rabinowitch and Kamenetsky, 2002). As a result of a lack of wild relatives, poor representation in genetic resource collections (Cross, 1998; Kik, 2008), and the expense of asexual or seed maintenance of A. cepa germplasm, it is desirable to better understand the population structure of global resources to better target maintenance and exploitation. A more detailed understanding of genetic variation within and among populations is also required to develop marker-based methods for testing purity and authenticity of commercial inbreds and hybrids.
Studies of genetic diversity in onion have been hampered by a lack of portable codominant molecular markers (McCallum, 2007). Although a variety of molecular marker methods have been successfully used to resolve questions of genetic diversity and relatedness to species level in Allium (Klaas and Friesen, 2002), identifying robust and informative markers within A. cepa has proved much more challenging. Dominant randomly applied polymorphic DNA markers have been used in limited studies of A. cepa diversity (Dennequin et al., 1997; Tanikawa et al., 2002), but a more detailed evaluation by Bradeen and Havey (1995a) showed that identification of reliable, heritable polymorphisms is very challenging in onion. Isozyme (Cramer and Havey, 1999; Rouamba et al., 2001) and restriction fragment length polymorphism (RFLP) (Bark and Havey, 1995) markers have been used but are limited, respectively, by low polymorphism (Peffley and Orozco-Castillo, 1987) and the unusually large genome size typical of Allium species (Bennett and Leitch, 1995). Fischer and Bachmann (2000) reported development of a set of genomic dinucleotide simple sequence repeat (SSR) markers from onion. As a result of complex amplification requirements, these markers have not proved sufficiently portable to enable wider use in mapping and diversity studies, although Masuzaki et al. (2006a) reported optimization and chromosomal allocation of a subset of these. More recently, SSR and single-nucleotide polymorphism (SNP) markers have been developed from onion expressed sequence tag (EST) resources (Kuhl et al., 2004; Martin et al., 2005) and proved to be readily reproducible for mapping (McCallum et al., 2006a, 2006b) and cultivar discrimination (Jakse et al., 2005).
The goal of this study was to perform a broad survey of genetic variation within and among cultivated onion populations and assess potential of EST-derived markers for genetic identity testing. To permit a sufficiently broad survey and ensure relevance for seedlot identity testing, we estimated allele numbers in bulked samples, an approach previously used in similar surveys of maize (Zea mays L.) diversity. This revealed high levels of within-population heterozygosity and limited EST-SSR size polymorphism in cultivated onion. The results suggest that discrimination of inbred onion lines with SNP markers will be feasible and that such markers can be readily developed from onion EST resources.
A set of 82 bulb onion populations was selected to represent the broadest possible range of germplasms relevant to modern genetics and breeding, including mapping population parents, inbred and open-pollinated (OP) populations widely exploited by breeders and landraces. A supplementary table containing full details of this material is available online at www.ASHS.org. Seven doubled haploid (DH) bulb onion lines were obtained from Cornell University (Alan et al., 2004). Four related Allium outgroups were provided by single accessions of A. fistulosum L. (Ohara et al., 2005), A. roylei Stern (van der Meer and de Vries, 1990), shallot (A. cepa L. aggregatum group), and the amphidiploid between A. cepa and A. fistulosum ‘Beltsville Bunching’ (Jones and Clarke, 1942). DNA samples used were either isolated in earlier studies (Bark and Havey, 1995; Bradeen and Havey, 1995b; Havey, 2000; Leite et al., 1999) or purified from bulked tissue of at least 25 seedlings as described previously (McCallum et al., 2006a).
Immature, unopened flower buds were pooled in Dec. 2004 from multiple genotypes that included maintainer and restorer lines in nitrogen and sulfur cytoplasm. Total RNA was isolated as described previously (McCallum et al., 2002) and a nonnormalized cDNA library was constructed in pCMV.SPORT 6.1 (Invitrogen, Carlsbad, CA). Random clones (672) were purified using NucleoSpin 96 Flash Kit (Macherey-Nagel, Duren, Germany) and sequenced from the 5′ end using Big Dye version 3.0 chemistry (Applied Biosystems, Foster City, CA). Trimmed reads were submitted to the Genbank EST division (accession numbers ES449250 to ES449826).
EST-SSR primer sets were described previously (Kuhl et al., 2004; Martin et al., 2005; McCallum et al., 2006a, 2006b) or designed as described previously (Kuhl et al., 2004) from floral EST and other onion EST resources (Table 1). Genomic SSR primer sets were those originally reported by Fischer and Bachmann (2000) and chromosomally assigned by Masuzaki et al. (2006b). Polymerase chain reaction (PCR) amplification and analysis of SSR products was performed as described previously (Martin et al., 2005) with the following modifications. Primer ratios were modified from the method of Schuelke (2000) to use marker tailed forward plus reverse primers at 0.5 μM and fluorescently labeled universal primer at 0.2 μM. A “PIG-tail” sequence (GTTTCTT) was added to the 5′ end of reverse primers to minimize variability in nontemplated adenylation of amplicons (Brownstein et al., 1996).
Chromatograms were analyzed using visual and quantitative tools provided in GeneMarker (version 1.42; SoftGenetics, State College, PA). Peak interpretation was guided by reference to prior segregation data from mapping populations (Martin et al., 2005; McCallum et al., 2006a) and DH controls. Analyses were restricted to measures of allele presence judged on conservative criteria. Principal coordinates analysis (PCO) was performed in Genstat (Payne et al., 2006) on the similarity matrix calculated from 289 peaks revealed by 60 primer sets using the Jaccard coefficient. A total of 51 primer sets (48 EST-SSRs, three genomic SSRs), which yielded profiles of quality 1 or 2, as defined by Leigh et al. (2003), which could be reliably scored as single loci, were used to estimate population heterozygosity as the proportion of polymorphic loci [Pl (Berg and Hamrick, 1997)].
A small subset of two to four templates was selected for resequencing, including a DH line and one or more populations exhibiting high frequencies of an alternate size allele. PCR products were purified using the High Pure PCR Product Purification Kit (Roche Diagnostics, Mannheim, Germany) following the manufacturer's instructions and cloned into pGEM-T Easy Vector (Promega, Madison, WI). DNA for sequencing was amplified directly from at least eight white colonies from each template using the Templiphi Amplification Kit (Amersham Biosciences, Piscataway, NJ). Sequencing was performed with M13 forward and reverse primers using a Big Dye version 3.0 cycle sequencing kit (Applied Biosystems) and analyzed on an ABI3100 Genetic Analyser (Applied Biosystems). Sequence comparison and SNP detection was carried out using SeqScape version 2.1 (Applied Biosystems) using Genbank sequences of ESTs used in original primer design as reference sequences.
Restriction polymorphisms were identified using BlastDigester (Ilic et al., 2004) and if suitable polymorphisms could not be identified, potential for engineering these was identified using dCAPS finder 2.1 (Neff et al., 2002). Cleaved amplified polymorphic sequence or derived cleaved amplified polymorphic sequence primer sets for assaying the polymorphisms were designed using Primer 3 software (Rozen and Skaletsky, 2000). Ability of assays to reveal polymorphisms within bulb onion was confirmed by genotyping a small subset of surveyed germplasm, including a DH control.
We screened a total of 76 EST-SSR and 12 genomic SSR markers for ability to reveal polymorphism in this set of accessions and used data from 56 EST-SSR and four genomic SSR primer sets, which revealed reproducible patterns. Presence of DH lines in the panel greatly aided interpretation of chromatograms, because visual evaluation of aligned chromatograms suggested a high frequency of heterozygosity for most markers. As observed during earlier mapping studies (Martin et al., 2005), EST-SSR markers designed using standard criteria performed well across onion germplasm under uniform PCR conditions. Amplicons of similar size were amplified in the related species A. roylei and A. fistulosum by 33 and 32 of these markers, respectively, confirming their value for interspecific and comparative studies. By contrast, dinucleotide genomic SSRs (gSSRs) required optimization of conditions and amplified rarely in allied Allium species. Chromatograms of gSSRs revealed much higher levels of size polymorphism than EST-SSR but were more challenging to interpret in population bulk samples as a result of high levels of stutter such that only three were selected for estimation of locus heterozygosity. EST-SSR primer sets revealed an average of 3.59 alleles (range, one to 10 alleles), similar to the distribution observed in durum wheat (Triticum aestivum L.) EST-SSRs by Wang et al. (2007) with di- and trinucleotide SSRs revealing similar ranges of polymorphism (Table 2; Fig. 1). Five primer sets were monomorphic in bulb onion but revealed polymorphism between bulb onion and Allium outgroups.




Citation: Journal of the American Society for Horticultural Science J. Amer. Soc. Hort. Sci. 133, 6; 10.21273/JASHS.133.6.810
Population heterozygosity in the bulb onion accessions (excluding DH lines) was conservatively estimated as Pl revealed by 51 of the 60 markers (219 alleles), which revealed variation in bulb onion that could be confidently interpreted as representing alleles at one locus. Because we expect that alleles with frequencies below 5% may not be detected in a bulk PCR, this estimate of heterozygosity is conservative but directly relevant to PCR-based discrimination and authenticity testing of bulked seed samples. The heterozygosity detected by markers in the 82 non-DH bulb onion populations, expressed as the proportion of heterozygous populations (Pp ), shows a bimodal distribution (histogram in Fig. 1). The markers that detected polymorphism in less than 30% of populations surveyed were characterized by rare informative alleles, whereas the majority were characterized by several alleles at high frequency across the populations. Bulb onion populations exhibited a wide range of heterozygosity with Pl ranging from 0.12 to 0.84 (Fig. 2). Median Pl in landrace and OP or hybrid populations was 0.7 compared with 0.43 in the sample of inbred lines. The range of Pl observed in the noninbred populations is at the high end of the range observed in isozyme studies of outcrossing species (Berg and Hamrick, 1997; Hamrick and Godt, 1989). The wide range of Pl observed in the inbred lines is consistent with the fact that inbred development in onion can involve a wide range of bottlenecks, ranging from limited mass pollinations of three to 20 plants to one or more cycles of self-pollination. The narrower range of Pl in OP cultivars and breeding populations compared with landraces may reflect the success of modern plant breeding in increasing heterozygosity through intercrossing and selection for vigor. The DH line DH2178 showed heterozygosity at a few loci, which is believed to have arisen from crosscontamination in seed production (M. Mutschler, personal communication).




Citation: Journal of the American Society for Horticultural Science J. Amer. Soc. Hort. Sci. 133, 6; 10.21273/JASHS.133.6.810
These observations of high heterozygosity and low allele number confirm the previous report of Bark and Havey (1995), who commonly observed two alleles using RFLP and together suggest that allelic diversity in onion is lower than in maize. Similar genotyping of bulked samples has been widely used and well proven in maize (Reif et al., 2005), but more detailed analysis of allele frequencies within populations is required to provide a more quantitative picture of heterozygosity in breeding and landrace onion germplasm.
The complete data set comprising presence/absence data for 289 peaks scored on all 60 markers was subjected to multivariate analysis of the Jaccard similarity matrix. PCO was used for ordination and exploration of the similarity matrix (Fig. 3). The first PCO explaining 15% of the variability was highly correlated (r = −0.95) with Pl , indicating that this component represents differences in heterozygosity. The second PCO (7% of variation) provides resolution of populations congruent with their known geographical origins. In particular, long- and intermediate-day types originating from temperate growing regions are resolved from short-day (SD) tropical types. However, the most notable feature of this ordination is the clear resolution of a group of Indian cultivars and landraces from the main grouping of SD populations. This grouping may represent populations more closely related to ancestral bulb onion types from the center of origin. The survey of A. cepa by Fischer and Bachmann (2000), based on genomic SSRs, also revealed groupings related to geographical origin but focused on shallot and landrace onion germplasm, making comparison with our findings difficult.




Citation: Journal of the American Society for Horticultural Science J. Amer. Soc. Hort. Sci. 133, 6; 10.21273/JASHS.133.6.810
After observations of relatively low size allelic diversity revealed by EST-SSR markers, we conducted resequencing to determine whether these might provide a source of biallelic SNP markers more suitable for commercial and research purposes. Additional polymorphisms were identified in 17 of 24 loci sequenced [68% (Table 3)] suggesting that more general mutation scanning of 5′ UTR regions of onion genes may provide more polymorphic markers than sizing alone. From these, five polymorphic SNP assays were developed (Table 4; Fig. 4), suggesting that the existing onion EST resources can readily provide many more such markers if properly exploited. Although the levels of diversity observed in noninbred populations would make marker-based purity and authenticity testing using such markers inefficient, the higher levels of fixation observed among inbred lines (Fig. 2) suggests that testing inbreds and their derived hybrids with such single-locus markers derived from highly heterozygous loci would be quite practical. To improve the efficiency of such an approach, key inbreds could also be subjected to marker-aided inbreeding at one or more loci, as proposed by Tsukazaki et al. (2006) for A. fistulosum. Bunching onion appears to be more tolerant of inbreeding than onion, and therefore some validation that inbreeding did not compromise fitness would be required.




Citation: Journal of the American Society for Horticultural Science J. Amer. Soc. Hort. Sci. 133, 6; 10.21273/JASHS.133.6.810
These observations have implications for conservation and exploitation of genetic diversity in onion. In particular, the observation of high allelic diversity in OP populations confirms that these are key reservoirs of diversity in onion worthy of conservation. The lower values of heterozygosity observed in inbreds illustrate the potential for erosion of diversity associated with hybrid breeding. The observation that Indian populations form a genetically distinctive group suggests that this region may have potential to provide novel germplasm resources to broaden the base for breeding and genetic studies in bulb onion.




Contributor Notes
This research was funded by the New Zealand Foundation for Research Science and Technology contract C02X0203.
We gratefully acknowledge the provision of germplasm surveyed in this study by Cornell University, Terranova Seeds, New Zealand, Tropicasem Ltd., Senegal, and other collaborators.
Corresponding author. E-mail: mccallumj@crop.cri.nz.