Development and Characterization of Genic Microsatellites for the Ornamental Plant Green and Gold ( Chrysogonum virginianum )
Click on author name to view affiliation information
Click on author name to view affiliation information
The genus Chrysogonum is native to the eastern United States. Three entities have been recognized—either as three varieties of Chrysogonum virginianum or as two species, one of them with two varieties. The current study suggests that a fourth entity should be recognized. Several forms of the complex are in commercial trade as ornamentals. As very limited molecular information on Chrysogonum is available, we developed a set of genic simple sequence repeat markers (eSSRs) from de novo transcriptome sequencing. We tested a set of 17 eSSRs on a collection of C. virginianum genomic DNA samples from the three botanical varieties, and a new putative type observed in Tennessee, dubbed “Ocoee-type” for its geographic origin. The polymerase chain reaction and capillary electrophoresis analyses with downstream population genetics tools verified the usefulness of the eSSRs. By applying this approach, we showed recognizable variation within Chrysogonum, although it did not correspond exactly to previous infraspecific classifications. Finally, as demonstrated for the commercial cultivar Pierre included in the study, the eSSRs can be used for enhancing the future breeding or hybridization efforts of this ornamental plant.
The monotypic genus Chrysogonum (Linnaeus, 1754; Asteraceae: Heliantheae) is native to the eastern United States. The single species currently recognized taxonomically is Chrysogonum virginianum L. (Green and Gold; Goldenstar), a perennial herb that produces small, bright yellow flower heads. The following three botanical varieties of C. virginianum are separated by habit and geographic location: var. virginianum has the tallest plant height, lacks stolons, has leafy flowering stems, and is found in the northeastern portion of the generic range; var. australe is the shortest, has prominent, long stolons, leafless flowering stems, and occurs in the most southern and western part of the range; var. brevistolon is of intermediate plant height, has stolons of medium length, has both leafy and leafless flowering stems, and occurs in the middle portion of the generic range (Nesom, 2001; Stuessy, 1973, 1977). The relationships and generic limits of Chrysogonum have been a source of controversy, as has the circumscription of taxa within the genus (Gray, 1882; Nesom, 2001; Stuessy, 1973, 1977). An alternative taxonomy of the genus was proposed (Weakley, 2015); it recognized the “australe” entity at specific rank, presumably because of its allopatry and distinct morphology, in contrast to previous categorization that overlooked that fact (Nesom, 2001). Issues with subtribal relationships were resolved with molecular phylogenetic data, starting with the studies based on enzymatic restriction of the chloroplast genome that explored the phylogenetic position of Chrysogonum in the Heliantheae tribe (Panero and Jansen, 1997; Panero et al., 1999). Sequence-based studies analyzing the nuclear ribosomal sequences of the internal transcribed spacer and the external transcribed spacer provided support for removal of all but one of the classically recognized species of Chrysogonum and placement of the genus Chrysogonum in subtribe Engelmanniinae, together with Silphium L. and four other small genera (Clevinger and Panero, 2000). The genome sequence of C. virginianum has not been reported, and before this study only 14 sequences were available in the National Center for Biotechnology Information GenBank for the species. The species ploidy is assumed to be diploid (2N = 2× = 32; Nesom, 2001; Solbrig et al., 1972), although some authors suggested it is tetraploid, based on the relatively high base chromosome number (Stuessy, 1973). Only scant phytopathological information is available on Green and Gold and includes a recent report on powdery mildew (Golovinomyces spadiceus; Trigiano et al., 2018a) and aerial blight (Sclerotinia sclerotiorum; Trigiano et al., 2018b). Green and Gold is used as a home garden ornamental plant in shady, moist areas with several cultivars of all three taxa available commercially, and some of them recognized by patents (Hawks, 2008; Hoffman et al., 2005). At the basal part of Green and Gold diaspores, there is an elaiosome, a fleshy, oil-bearing structure. Ants foraging on the elaiosomes contribute to the plant dispersal (Gaddy, 1986; Nesom, 1978). The plant flowering time was used as a marker of global climate change in the Washington, DC area and was confirmed to advance by 31 d over 30 years of studies, ranking it as the fifth most altered among the 89 plant species investigated (Abu-Asab et al., 2001).
Among the neutral molecular markers, the microsatellites or simple sequence repeats (SSRs) are often used for population genetics and (sub)species delimitation (Duminil and Di Michele, 2009; Dupuis et al., 2012; Esfandani Bozchaloyi et al., 2017; Meudt et al., 2009). Compared with the genomic SSRs (gSSRs) developed on the nuclear DNA sequences, the eSSRs are developed from the assembled transcriptomes (Ellis and Burke, 2007; Varshney et al., 2005). eSSRs have become an attractive and efficient high-throughput molecular tool owing to comparatively lower cost for their development from species lacking sequenced genomes. The eSSRs conserved in the transcriptomes usually display somewhat lower diversity indices and higher conservation across taxa than gSSRs (Ellis and Burke, 2007; Eujayl et al., 2001; Varshney et al., 2002).
To better substantiate the current taxonomic subspecies delineation in the complex C. virginianum, we developed eSSRs based on de novo RNA sequencing (RNAseq). We selected 17 eSSRs and assessed their utility using a collection of 32 genomic DNA (gDNA) samples from the three currently recognized C. virginianum varieties from the eastern United States and a recently-discovered putative new form (“Ocoee-type”) first observed in eastern Tennessee. Our survey of Chrysogonum also included a sample of the commercial cultivar Pierre, whose varietal attribution was not described. The purpose of this report was to present the results of the Chrysogonum eSSR development and the initial results on variability within the genus.
Samples of C. virginianum were obtained to represent a variety of locations from throughout its range in the southeastern United States. Sources included herbarium specimens collected between 1967 and 2003 and living material collected 2016 through 2018. The samples (n = 31) belonged to the three currently recognized varieties assigned following Nesom (2001) and the newly observed putative “Ocoee-type” (Ocoee and Hiawassee River Valley samples, Tennessee). The “Ocoee-type” is distinct in having a tapering, cuneate leaf base similar to var. australe but lacking its distinctive long stolons. gDNA was isolated from dried leaves (herbarium specimens) or fresh leaf material (Table 1) using the DNA Plant Mini Kit (Qiagen, Germantown, MD), according to the manufacturer’s protocol. A single leaf sample from a home garden (a morphologically typical C. virginianum of the “Ocoee-type,” collected and transplanted from the Hiawassee River Valley, TN; ‘Chry_2’, Maryville, TN, collected 2018 and stored on dry ice) was also included. The gDNA of this sample was isolated from leaf tissue using the DNA Plant Mini Kit (Qiagen) according to the manufacturer’s protocol. A leaf sample of the same living plant was also processed for total RNA using the Ribospin II kit (GeneAll Biotechnology, Seoul, Korea) as per the manufacturer’s protocol.
The RNA of C. virginianum specimen ‘Chry_2’ was submitted for Illumina MiSeq [pair-end 2 × 250 base pairs (bp)] sequencing (GeneWiz, South Plainfield, NJ). RNAseq data for C. virginianum produced were assessed for read quality using the quality control program FastQC, which identifies base content, sequence quality, and sequence duplication in fastq files of interest (Andrews, 2014). Adapter content was removed using the trimming program Skewer (Jiang et al., 2014) (minimum length: 30 bp, defaults used for all other parameters). To enable microsatellite identification, reads were assembled using Assembly By Short Sequences (ABySS), specifically its paired-end option, abyss-pe, using a k-mer size of 81 and default settings for all other options (Simpson et al., 2009). Low-complexity regions of the transcriptome were masked out with Dustmasker, with a level of 1, before finding SSRs within the assembly (Morgulis et al., 2006). This study used a custom perl script to identify SSRs and call primers with Primer3 (Staton and Ficklin, 2018; Untergasser et al., 2012). This script searched for di-, tri-, and tetra-repeating motifs with an amplified fragment range of 100 to 250 bases.
From the developed eSSRs, we initially screened a pool of 50 primer pairs (20 of di-, 20 of tri-, and 10 of tetra-motif repeats) on the gDNA sample whose leaf was also used for RNA isolation and sequencing (‘Chry_2’). The polymerase chain reaction (PCR) reaction volume was 10 µL and included 5 µL of 2 × AccuStart II PCR SuperMix (Quanta BioSciences, Inc., Beverly, MA), 0.25 µM of each primer, and 4 ng of gDNA. The PCR thermal profile used was as follows: initial denaturation at 94 °C for 4 min, 10 touch-down cycles (94 °C for 20 s, 65 °C −1.0 °C per cycle for 20 s, 72 °C for 30 s), 30 cycles (94 °C for 20 s, 55 °C for 20 s, 72 °C for 30 s), and final extension at 72 °C for 5 min. The PCR products were visualized using capillary electrophoresis (QIAxcel Advanced Electrophoresis System; Qiagen) and analyzed using a 25 to 500 bp DNA size marker and an internal 15/600 bp alignment marker. The 17 best-performing eSSRs (four di-, eight tri, and five tetra-motif repeats) based on agreement with the predicted size and one or two unambiguous alleles, were selected and used for genotyping of the C. virginianum gDNA collection. The resultant dataset was binned into allelic classes using FlexiBin (an MSExcel macro; Amos et al., 2007).
The binned dataset was analyzed for an array of population genetics parameters. To estimate the basic indices, including number of alleles per locus, observed and expected heterozygosity, evenness, Shannon’s diversity index, and allelic richness (defined as the number of alleles divided by the number of samples without missing data at that locus), we used the following packages: poppr (Kamvar et al., 2014, 2015), hierfstat (Goudet, 2005; Goudet et al., 1996), and strataG (Archer et al., 2017) in R version 3.4.3 (The R Core Team, 2014). Gene flow (Nm) was estimated using GenAlEx version 6.503 (Peakall and Smouse, 2012). The package poppr in R was used for calculations of the (pairwise) linkage disequilibrium. Discriminant Analysis of Principal Components (DAPC) was performed in R using the package adegenet version 2.1.1 (Jombart et al., 2010, 2018) and included cross-check and optimization by 1000 permutations over principal components number ranging from 1 to 32.
After pre-processing the RNAseq data through Skewer, 7,054,370 of the initial 7,145,419 read pairs were available for further analysis; of these, 991,875 were trimmed using Skewer, and 6,062,495 did not require trimming. The assembly generated by running ABySS on these remaining reads contained 841,375 scaffolds. A total of 4940 SSRs with primers were identified from the C. virginianum assembled transcriptome; of these, 3162 were dinucleotide repeats, 995 were trinucleotide repeats, and 783 were tetranucleotide repeats. All unique di-repeat SSR types were discovered. Similarly, of 60 unique tri-repeat SSRs, only [AGT]n-types (and their redundant iterations) were missing in the assembled transcriptome of C. virginianum. From among the total of 252 unique tetra-SSRs, only 118 were discovered in the assembled transcriptome of C. virginianum with at least one SSR present. The number of SSRs that yielded primers as per our algorithm corresponded with their respective counts of SSRs discovered in the assembled transcriptome, regarded both in the repeat-length fashion, or repeat-motif (Fig. 1). Similar to other studies (Arias et al., 2011), the most frequent SSRs were di-repeats with [GC]n ranking the lowest among them. In contrast to other Asteraceae plants (Peng et al., 2014), the tetra-repeat SSRs occurred almost as frequently as the tri-repeat SSRs and were among the most frequently discovered in the unique motif manner.
Citation: HortScience horts 54, 2; 10.21273/HORTSCI13739-18
Based on the genotyping dataset accrued, the used eSSRs are well dispersed over the C. virginianum genome (Fig. 2A). The small range of pairwise values of Index of Association d (from −0.2 to 0.3) corroborates this result. Furthermore, the overall value of
d = 0.014 (P = 0.083) (Fig. 2B) indicative of the outcrossing mode of reproduction, is in agreement with the species biology (Nesom, 2001; Stuessy, 1973, 1977).
Citation: HortScience horts 54, 2; 10.21273/HORTSCI13739-18
With regard to the diversity indices calculated, the selected 17 eSSRs detected from 4 to 15 alleles per locus with a mean of ≈8 (Table 2). The allelic richness ranged from 0.09 to 0.52, with a high cumulative mean of 0.32. The observed heterozygosity (0.26) was much lower than the expected heterozygosity (0.70). This was likely derived from the low number of accessions present in this collection, or, given the growth habit, the plants may be relatively inbred. The very low value for Shannon’s diversity index (1.17) indicative of generally low allele species richness/evenness and far from saturating the expected species diversity, may have stemmed from the same relatively low sample number in this study. The same limitation also applied to the relatively high values of the eSSR evenness indices, ranging from 0.56 to 0.96, with a mean of 0.74. Very high values of the Wright’s fixation index (F) implied an excess of homozygosity relative to that expected under the Hardy-Weinberg equillibrium of the varieties of C. virginianum, despite rather strong support for the outcrossing mode of this plant reproduction (Fig. 2B). High gene flow calculated for our C. virginianum collection (Nm = 1.68) likely had a smoothening effect on the allele frequencies, rendering high values of the evenness index. All these characteristics, however, must be taken with caution owing to the limited sampling and, although promising, certainly needs to be confirmed in a large-scale study.
Despite the low number of samples included in this study, we achieved reliable separation of some of the C. virginianum varieties with DAPC (Fig. 3). Specifically, we observed distant placement of the var. australe from the other varieties, and a possible emergence of an additional new botanical variety, tentatively dubbed ‘Ocoee-type’ for its place of origin. In contrast, vars. virginianum and brevistolon were not clearly delineated from each other. The morphological difference between typical var. virginianum and var. brevistolon is small, but both herbarium and field observations showed that the geographic boundary is sharply defined (as mapped), thus some kind of genic differentiation should be expected (G.L. Nesom, personal communication). Notably, our observations are tentative in character and need to be tested on a large-scale study, possibly involving auxiliary molecular evidence (e.g., sequencing of chosen informative or “barcoding” loci; CBOL Plant Working Group, 2009), as well as descriptive taxonomy using the geographical and ecological methods (Duminil and Di Michele, 2009; Reed and Frankham, 2001). Several reports successfully used the Amplified Fragment Length Polymorphism as molecular markers neutral in character and of nuclear origin, for plant species delimitation (Chen et al., 2013; Meudt et al., 2009; Segatto et al., 2017) or SSR alike (Chen et al., 2013; Dupuis et al., 2012; Meudt et al., 2009). In particular, the eSSR markers may be of interest due to their conservation and informativeness among the closely related species (Ellis and Burke, 2007). SSRs also help increase the power of low taxonomic level distinction in agreement with the “one marker is not enough” approach (Dupuis et al., 2012; Segatto et al., 2017). The tentative assignment of the commercial cultivar Pierre to the large cluster with samples of vars. virginianum and brevistolon (which are not distinguished here) serves as proof of concept for such analyses in the C. virginianum subspecies complex. Although our collection did not allow for reliable distinction between those two varieties, we postulate that the eSSRs developed in this study can serve such a general purpose. Thus, an investigation using broader sampling of C. virginianum specimens may clarify the taxonomic classification of the genus at the lowest ranks (Nesom, 2001; Stuessy, 1973, 1977). In addition, this might lead to changes of varietal circumscriptions (Clausen et al., 1939; Duminil and Di Michele, 2009; Lowry, 2012).
Citation: HortScience horts 54, 2; 10.21273/HORTSCI13739-18
In conclusion, our de novo eSSR markers provide a material progress in the analyses of C. virginianum complex in several aspects. As exemplified with the analyzed collection, the eSSRs can be used for species diversity studies, including conservation research. By applying the eSSR genotyping and downstream analyses, we achieved reliable separation of the Chrysogonum varieties, confirmed the molecular distinction of a tentative new variety, and postulated an enhanced separation for the vars. virginianum and brevistolon. Finally, as demonstrated for the commercial cultivar Pierre, the eSSRs can be used for enhancing the future breeding or hybridization efforts of this ornamental plant.
Contributor Notes
This study was funded fully by an Agricultural Research Service–U.S. Department of Agriculture grant (58-6062-6) to the Trigiano laboratory for ornamental plants research. The funder had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. Use of trade names is for identification purposes only and does not imply their endorsement by the authors or the study funding entities. The authors are grateful to the plant material collectors and herbaria for making them available to this study. The RNA sequencing raw read files are available as National Center for Biotechnology Information Sequence Read Archive accession SRR8181940.
Corresponding author. E-mail: mnowicki@utk.edu