Analysis of Genetic Diversity and Population Structure for the Native Tree Viburnum rufidulum Occurring in Kentucky and Tennessee
Click on author name to view affiliation information
Click on author name to view affiliation information
Viburnum rufidulum is a deciduous tree native to North America that has four-season appeal, which provides commercial horticultural value. In addition, the plant has unique and attractive red pubescence on leaf buds and petioles, common to no other Viburnum species. As habitat undergoes development and subsequent fragmentation of native plant populations, it is important to have baseline genetic information for this species. Little is known about the genetic diversity within populations of V. rufidulum. In this study, seven microsatellite loci were used to measure genetic diversity, population structure, and gene flow of 235 V. rufidulum trees collected from 17 locations in Kentucky and Tennessee. The genotype data were used to infer population genetic structure using the program InStruct and to construct an unweighted pair group method with arithmetic mean dendrogram. A single population was indicated by the program InStruct and the dendrogram clustered the locations into two groups; however, little bootstrap support was evident. Observed and expected heterozygosity were 0.49 and 0.78, respectively. Low-to-moderate genetic differentiation (FST = 0.06) with evidence of gene flow (Nm = 4.82) was observed among 17 populations of V. rufidulum. A significant level of genetic diversity was evident among V. rufidulum populations with most of the genetic variations among individual trees (86.37%) rather than among populations (13.63%), and a Mantel test revealed significant correlation between genetic and geographical distance (r = 0.091, P = 0.001). The microsatellites developed herein provide an initial assessment or a baseline of genetic diversity for V. rufidulum in a limited area of the southeastern region of the United States. The markers are a genetic resource and can be of assistance in breeding programs, germplasm assessment, and future studies of V. rufidulum populations, as this is the first study to provide genetic diversity data for this native species.
Viburnum rufidulum is a member of Adoxaceae, which includes the genera Viburnum, Sambucus, Sinadoxa, and Adoxa. The genus Viburnum is estimated to include ≈170 species with worldwide distribution (Winkworth and Donoghue, 2004). Viburnum rufidulum, one of the 12 species native to North America (U.S. Department of Agriculture, 2008), is a small deciduous tree capable of growing to a height of about 3.0 to 9.0 m (Dirr, 2007). Cymes of small white flowers attract a variety of pollinators in late spring (Miliczky and Osgood, 1979), which facilitate outcrossing of this self-incompatible species and attractive drupes are produced in late summer that begin as pale green and mature to almost black-purple in late fall. The species can be easily identified by the red pubescence that occurs on leaf buds, petioles, and abaxial surfaces of leaves. This abundant pubescence gives V. rufidulum its common name of rusty blackhaw, and the attractive and unique rust-colored pubescence distinguishes it from all other Viburnum species (Dirr, 1998). The native range of V. rufidulum extends from the southeastern United States, spreading north into the southern portions of Illinois, Indiana, Missouri, and Ohio, and extending westward to eastern sections of Kansas, Oklahoma, and Texas. Viburnum rufidulum is often found growing as an understory tree on limestone outcroppings in shallow, xeric environments (Homoya, 1987).
The genus Viburnum has many species that are economically important to ornamental horticulture (Clement and Donoghue, 2012; Dean et al., 2011). Many nonnative Viburnum species and cultivars are popular; however, the anthropogenic introduction of nonnative species is a major threat (second only to habitat loss) to the biodiversity of native plants (Gurevitch and Padilla, 2004; Vitousek and Walker, 1989). In addition, urbanization has led to loss of habitat and fragmentation that divides contiguous populations into many smaller patches where they are at risk of genetic isolation (Fahrig, 2003; Lienert, 2004). Spatial isolation may promote inbreeding and the reduction of gene flow among populations (den Nijs and Oostermeijer, 1997). Moreover, genetic differentiation, gene flow, and levels of heterozygosity of native plant populations are of interest to biologists and serve as indicators of local genetic diversity of species in plant and animal communities (Kati et al., 2004; Lindenmayer et al., 1999).
Viburnum rufidulum has not been genetically characterized and there is a dearth of information concerning population structure and genetic diversity. Molecular studies can provide valuable insight into the genetic composition of populations and this knowledge can assist in planning long-range management of at-risk habitat. This type of study is often the first step taken to manage and conserve native populations. Moreover, genetic characterization of native North American species can become important to successful conservation efforts, should such actions be required (Hamrick and Godt, 1996; Holsinger and Gottlieb, 1991; Newton et al., 1999).
Microsatellites, also referred to as SSR (SSRs), are a class of codominant markers that are useful in assessing genetic diversity of plants due to their high level of polymorphism (Gupta et al., 1996). These markers are often used for population genetic studies that involve closely related individuals and are a valuable tool for studying gene flow, inbreeding, and levels of differentiation between populations (Arnold et al., 2002; Hadziabdic et al., 2012; Zhang and Hewitt, 2003).
In this study, the native North American species V. rufidulum was genetically characterized. To our knowledge, this is the first study to examine the genetic diversity and population structure of V. rufidulum in Kentucky and Tennessee. Therefore, this study affords a baseline of genetic diversity and population structure for this species. The specific objectives of this study were to develop and use microsatellite markers to: 1) quantify genetic diversity, 2) analyze population structure of various indigenous populations of V. rufidulum, and 3) generate necessary data to serve as a genetic resource for breeding programs, germplasm assessment, and future studies.
During late winter and early spring of 2011 through 2013, 235 samples of V. rufidulum leaves and flower buds were collected from trees at 17 sampling areas located in Kentucky (KY) and Tennessee (TN). There were three sites in Fayette County (KY), six sites in Knox County (TN), six in Loudon County (TN), and two sites in Anderson County (TN) (Table 1). The sample sizes from the various sites ranged from 2 to 33 trees. We elected to include small populations in the study as each of those collection sites was undergoing significant environmental disturbances from urbanization or had invasive species present. In addition, small population sizes did not affect estimates of gene diversity in prior studies (Nybom and Bartish, 2000; Semaan and Dodd, 2008). Individual samples were labeled and global positioning system (GPS) coordinates recorded (Table 1). Plant materials were transported from the field to laboratory on ice and then stored at −80 °C until DNA isolation.

Genomic DNA was extracted from unexpanded leaves or closed floral buds with the DNeasy® Plant Mini Kit (Qiagen, Valencia, CA). DNA purity was evaluated and quantified with a spectrophotometer (NanoDrop ND-1000; NanoDrop Technologies, Wilmington, DE). A small insert genomic DNA library (V. rufidulum accession no. 24 collected in Anderson County, TN) was constructed and enriched for (CA)n motif microsatellites using the methods of Dean et al. (2011), Wadl et al. (2011), and Wang et al. (2007). Colonies with putative microsatellites were sequenced using T3 and T7 primers on a capillary electrophoresis DNA sequencer (ABI 3730XL; Applied Biosystems, Foster City, CA). After sequencing, microsatellite repeats were identified using Imperfect SSR Finder (Stieneke and Eujayl, 2007) and vector flanking regions were trimmed from the insert using the program VecScreen (National Center for Biotechnology Information, 2013). Forty-seven primer pairs were designed using Primer 3.0 (Untergrasser et al., 2012) with the following optimum parameters: primer oligonucleotide length = 20 bp, GC percent = 50%, and an annealing temperature of greater than 56 °C. Primer pairs were synthesized by Integrated DNA Technologies (Coralville, IA). Primer pairs were optimized for polymerase chain reaction (PCR) using genomic DNA from 20 V. rufidulum samples and a 10-μL amplification reaction consisted of the following: 4 ng of genomic DNA, 0.25 μM forward and reverse primers, 2.0 mm MgCl2, 0.2 mm dNTPs, 1 × PCR Gold Buffer (Lifescience Technologies, Carlsbad, CA), 0.4 U AmpliTaq Gold DNA polymerase (Lifescience Technologies), and 5% dimethyl sulfoxide. Thermal cycler conditions were as follows: one cycle of 94 °C for 3 min, and 35 cycles of 94 °C for 40 s, 58 °C for 40 s, 72 °C for 30 s, and one cycle of 72 °C for 4 min. Amplicons were sized on a capillary electrophoresis system (QIAxcel, Qiagen). Of the 47 primer pairs, seven microsatellite markers that amplified DNA from most samples and exhibited polymorphic alleles were optimized and used to amplify DNA from the 235 V. rufidulum trees collected in KY and TN (Table 2). Reactions that failed to produce amplified products were repeated twice before an individual was considered null at that locus.

Before analyses, the data from the raw allele-length bp sizes were placed into allelic-size classes using the automated binning program FlexiBinV2 (Amos et al., 2007). The capillary electrophoresis system was used for PCR analyses and a conservative 2 bp binned allele size determination error rate was used to permit reproducibility of data. To determine genetic diversity of V. rufidulum, 235 trees from Fayette County (KY) and Anderson, Knox, and Loudon counties (TN) were evaluated. Expected and observed heterozygosity (He and Ho), and the number of alleles (Na) for each polymorphic microsatellite locus across all collection sites were calculated using FSTAT version 2.9.2.3 (Goudet, 1995, 2001). The same program was used to calculate allelic richness (AR), using a minimum sample size of 54 individuals [the smallest number of samples based on grouping of sites by county (Fayette County, KY)] or 108 alleles. Genetic differentiation or fixation index (FST), inbreeding (FIS), overall inbreeding coefficients (FIT), and gene flow (Nm) were calculated with a 95% confidence level using the program GenAlEx version 6.4.1 (Peakall and Smouse, 2006). Principal coordinate analysis (PCoA) was completed to visualize structure in the relationships among V. rufidulum collection sites using a covariance matrix with data standardization. This multivariate analysis identifies major patterns among a multivariate data set and creates two-dimensional scatter plots by using the first and third principal coordinates. All loci were tested for deviation from Hardy–Weinberg equilibrium [HWE (P < 0.001)] in FSTAT using a modified Markov chain method described by Guo and Thomson (1992) with 9999 permutations.
Genetic structure of V. rufidulum populations was examined using two Bayesian clustering programs, InStruct (Gao et al., 2007) and Structure version 2.3.4 (Pritchard et al., 2000). Program InStruct eliminates HWE assumptions within clusters and is recommended for populations where inbreeding is occurring or with a partially selfing species (Gao et al., 2007). The following parameters were used in the program InStruct: 10 Markov chain Monte Carlo (MCMC) chains for each value of K, iterations of 1,000,000 a burn-in period of 500,000 with a thinning interval of 20 iterations, followed by the admixture mode. For estimation of number of subpopulations, we used the deviance information criterion [DIC (Gao et al., 2011)]. Structure version 2.3.4, which is widely used for inferring the number of populations (K) in population genetic studies, was used to compare with InStruct analysis. The program Structure group individuals based on their multilocus genotypes with an assumption that HWE or linkage disequilibria are the result of population structure, and therefore it attempts to identify those populations that are not in disequilibrium (Pritchard and Donnelly, 2001). However, Gilbert et al. (2012) reported that the program Structure was unable to reproduce the value of K in 30% of the analyses preformed. To maximize reproducibility of Structure results (Gilbert et al., 2012), we used 20 iterations/independent runs for each value of K (K = 1 to 10) with a burn-in period of 250,000, and 250,000 MCMC replications after burn-in. The admixture (mixed ancestry among individuals that may be present) and the correlated allele frequencies (allele frequencies may be similar among the populations) models were also employed. The most probable value of K was selected based on the calculation of [P (X|K)] (probability distribution), and by estimating the number of populations with the ad hoc statistic ΔK (Evanno et al., 2005) using Structure Harvester version 0.6.1 (Earl and von Holdt, 2012). Although, ΔK helps in identifying the correct number of genetic clusters in many data sets, it should not be used when K = 1, which is the artifact of Evanno’s methodology (Evanno et al., 2005).
Analysis of molecular variance (AMOVA) was performed using Arlequin version 3.5.1.2 (Excoffier and Lischer, 2010) to determine the hierarchical distribution of genetic variation of individuals within and among populations. The following three AMOVA analyses were performed: 1) all V. rufidulum individuals were placed into one hierarchical group from the 17 collection sites, 2) V. rufidulum individuals were grouped geographically based on county location, and 3) the state in which samples were collected (Table 1). The summary statistic was used to quantify genetic differentiation and the hierarchical partitions between, among, and within groups or populations (Excoffier, 2007; Holsinger and Weir, 2009; Kronholm et al., 2010). The program Arlequin was used to estimate genetic differentiation between 17 populations of V. rufidulum using pairwise FST calculations with 10,000 permutations (P < 0.05).
The correlation between genetic and geographic distance was analyzed with the Mantel test for isolation by distance with 10,000 permutations (Mantel, 1967) using GenAlEx version 6.4.1 (Peakall and Smouse, 2006). An unweighted pair group method with arithmetic mean (UPGMA) dendrogram was created using standard genetic distance (Nei, 1972) among 17 V. rufidulum populations. In addition, a bootstrap analysis was performed with the program Populations 1.2.32 (Langella, 2002) and branch support values were estimated using 500 bootstrap randomizations. A consensus tree with bootstrap support values was generated and viewed using TreeView version 1.6.6 (Page, 1996).
Seven polymorphic microsatellite markers amplified 8 to 35 alleles/locus or 132 alleles. Locus VR008 had the fewest alleles (8) and VR004 revealed the most alleles (35). Without the assumption of random mating within the samples, all markers deviated from Hardy–Weinberg equilibrium [P < 0.001 (Table 3)]. The mean values for inbreeding and overall inbreeding coefficient were FIS = 0.36 and FIT = 0.40, which supported the finding of an overall heterozygote deficiency. The allelic richness (AR) across four county localities (corrected by rarefaction of minimum sample size of 52 individuals) had a mean of 11.13, ranging from 6.23 (VR008) to 18.48 (VR004) (Table 3). The greatest degree of gene diversity was found in Anderson County, TN (He = 0.84) and at locus VR004 (He = 0.87), whereas the least was in Knox County, TN (He = 0.71) and locus VR028 (He = 0.65) (Table 3). Our results indicated low-to-moderate genetic differentiation across all loci (FST = 0.06) with high levels of gene flow (Nm = 4.82) (Table 3).

The Bayesian clustering analysis using InStruct indicated that the highest value of K occurred at K = 1 (Fig. 1A). However, Structure analysis indicated that the highest value of ΔK occurred at K = 2. According to Structure, two genetic clusters emerged with subpopulations PNS, FR2, TL1, TL2, TL4, and OL being admixed almost evenly (Fig. 1B), whereas the remaining collections were more uniform in genetic composition. However, the limitation of Evanno’s ad hoc statistic ΔK (2005) is that it fails to identify ΔK = 1, but was retained here for overall comparison and educational purposes when using similar data sets. Results from InStruct were also in agreement with the PCoA scatter plot (Fig. 2). The first three principal coordinates from the genetic distance matrix explained 32.09%, 50.02%, and 65.37% of variation across large, noncontiguous (fragmented), single population, respectively.





Citation: Journal of the American Society for Horticultural Science J. Amer. Soc. Hort. Sci. 140, 6; 10.21273/JASHS.140.6.523





Citation: Journal of the American Society for Horticultural Science J. Amer. Soc. Hort. Sci. 140, 6; 10.21273/JASHS.140.6.523
Significant genetic differentiation exists among populations of V. rufidulum (Table 4). When all 17 sample locations were grouped into one hierarchical group, AMOVA analysis revealed that the greatest amount of genetic variation was found among individuals (86.37%), rather than between V. rufidulum populations (13.63%), indicating moderate-to-high genetic differentiation (FST = 0.13, P < 0.001) (Table 4). When V. rufidulum populations were grouped based on geographical grouping (county and state), results revealed a similar pattern of individual variation rather than differences among (59.91%) and within populations (60.40%) of individual variation for four counties and two states. However, genetic differentiation was low-to-moderate when four counties were compared (FST = 0.06, P < 0.001) and very low when two states (TN and KY) were compared (FST = 0.01, P < 0.001) (Table 4). The FIS and FIT values indicated a deficit of heterozygotes due to nonrandom breeding at the population level.

Considering all 17 V. rufidulum collection sites, the greatest differentiation occurred between sites FR1 (population 6) and VA (population 8) in Knox County, TN (0.46, P < 0.05). Populations PNS (population 4) and RR2 (population 16) from Knox and Fayette counties had the smallest genetic differentiation (0.03, P < 0.05) (Table 5). The UPGMA analysis (Fig. 3) clustered the 17 collections into two major clades, but the relationships were not supported by bootstrap values. Analysis of isolation by distance across all V. rufidulum individuals showed a positive significant correlation between genetic and geographical distance [r = 0.091, P = 0.001 (Fig. 4)].






Citation: Journal of the American Society for Horticultural Science J. Amer. Soc. Hort. Sci. 140, 6; 10.21273/JASHS.140.6.523





Citation: Journal of the American Society for Horticultural Science J. Amer. Soc. Hort. Sci. 140, 6; 10.21273/JASHS.140.6.523
Moderate levels of genetic diversity exist within the 17 V. rufidulum collection sites in KY and TN. Furthermore, our study indicated moderate genetic differentiation between the collection sites and significant isolation of the populations by distance.
The moderate levels of genetic diversity found at the 17 collection sites of V. rufidulum are similar to other studies involving the population genetics of outcrossing trees (Furches et al., 2013; Hamrick et al., 1992; Hamrick and Godt, 1996). The average gene diversity of outcrossing plant species in other studies employing microsatellite markers was 0.65 (Nybom, 2004). However, our study revealed an average gene diversity of 0.78, which was higher than that reported for other studies of woody species (Hornero et al., 2001; Kikuchi and Isagi, 2002). High levels of diversity can be expected in plants that have outcrossing mating systems, are long lived, and have more than one generation that coincides simultaneously with other generations at the same location (Hamrick and Godt, 1996; Marra et al., 2013; Nybom, 2004; Till-Bottraud and Gaudeul, 2002). The high mean value of gene diversity within V. rufidulum populations could be related to the extensive geographic range of the species. Gene diversity contributes to the stability of populations when adverse conditions occur (such as disease and insect epidemics) and to successful adaptation and survival during environmental changes (Freeman and Herron, 1998; Leonard, 1969). The highest levels of diversity in our study were at sites that boasted large areas of unspoiled habitat, which included sites in Anderson and Loudon counties in TN and Fayette County, KY, whereas the least gene diversity was found at urban sites in Knox County, TN. Interestingly, while the vast area at the Forks of the Rivers sites (Knox County) is an ideal habitat for V. rufidulum, very few trees were growing there, and those that were discovered were in tight clusters. Furthermore, the trees growing at two sites at the Forks of the Rivers in Knox County (FR1 and FR2) lacked the vigor of the trees growing at the sites with higher gene diversity.
Structure is one of the most widely used population genetics programs (Gao et al., 2007), but calculates ΔK (number of subpopulations) in a manner (Evanno et al., 2005) in which the minimum value obtainable is K = 2—a subpopulation finding of 1 is mathematically impossible using this methodology. In our study, Structure analysis partitioned V. rufidulum individuals from the 17 collection sites into two principal genetic clusters; however, admixture of the two populations was noted within almost all of the collection areas. The presence of admixture within the two genetic clusters, as inferred by the program Structure, implied that interbreeding or sharing of alleles has occurred between the two populations. Most of the individuals from 17 collection sites displayed relatively equal membership assignment probability to one of the two genetic clusters identified by Structure with the exception of sampling areas WC1, WC2, and TL5 in which limited admixture was observed. However, population analysis using the program Structure is not adequate when genotypic data indicate nonrandom mating or mating among close relatives (Gao et al., 2007; Gilbert et al., 2012). The aforementioned authors argue that the program Structure should be modified when inferring population structure of inbreeding species or species not conforming to HWE proportions (Gao et al., 2007). Therefore, an analysis was performed with the program Instruct, which reduces the false-positive rates, corrects for bias toward spurious admixture, and allows identification of different mating systems in structured as well as unstructured populations (Gao et al., 2007). A finding of K = 1 using DIC values from program InStruct indicated that the trees in this study are part of large, noncontiguous, single population with low-to-moderate genetic differentiation, and high gene flow. In contrast, the program Structure indicated two populations and that the population is fragmented. However, the FIS values indicate a high inbreeding coefficient and suggest the trees are in fact breeding with closely related individuals. Therefore, we conclude that from our study, the result of the Structure analysis, K = 2 populations, is an artifact of the Evanno’s ad hoc statistic ΔK (Evanno et al., 2005) and V. rufidulum subpopulations represent single genetic cluster across KY and TN. These findings also support the contention of Gao et al. (2007) and Gilbert et al. (2012) that the program Structure should not be used with inbreeding populations. As with other obligate outcrossing species, the overall heterozygote deficit and deviation from HWE proportions found in this study may be explained by the ability of this V. rufidulum to mate with half-sibs, resulting in inbreeding among closely related individuals (Ægisdóttir et al., 2009; Hadziabdic et al., 2012).
This study provides the first genetic assessment of V. rufidulum and makes available a foundation of genetic information for the species. However, because of the small scope of this study, it is not possible to ascertain if these results and observations can be extrapolated to the larger distribution of V. rufidulum and additional studies are warranted. In addition, future studies would benefit from using more microsatellite markers and include a much larger sampling area. Sampling from a larger area and greater number of locations within the natural range of V. rufidulum would provide a much more robust representation of genetic diversity and population structure of the species throughout its native range. This study provides the data and information necessary to serve as a genetic resource for breeding programs, germplasm assessment, assistance in maintenance of the natural genetic structure, and future biodiversity and genetic studies of this native species.




Contributor Notes
We thank USDA (grant number 58-6404-1-637) and University of Tennessee Institute of Agriculture, AgResearch for financial and facilities support of this project. We also express our gratitude to the reviewers and editors for comments and suggestions.
These authors contributed equally to this work.
Corresponding author. E-mail: rtrigian@utk.edu.