Skip to main content

The level of genetic diversity and differentiation of tropical lotus, Nelumbo nucifera Gaertn. (Nelumbonaceae) from Australia, India, and Thailand



Nelumbo nucifera Gaertn., a perennial aquatic macrophyte species, has been cultivated in several Asian countries for its economic importance, and medicinal uses. Two distinct ecotypes of the species are recognized based on the geographical location where the genotypes are adapted, i.e., tropical lotus and temperate lotus. The genetic diversity levels and differentiation of the tropical lotus from poorly studied geographic regions still remain unclear. Here, the population genetic diversity and structure of 15 tropical lotus populations sampled from the previous understudied natural distribution ranges, including India, Thailand, and Australia, were assessed using nine polymorphic SSR markers.


The SSR markers used to genotype the 216 individuals yielded 65 alleles. The highest and lowest genetic diversity estimates were found in Thailand and Indian populations, respectively. STRUCTURE analysis revealed three distinct genetic clusters, with relatively low admixtures, supported by PCoA cluster analysis. Low levels of gene flow (mean N⁠m = 0.346) among the three genetic clusters signified the Mantel test for isolation by distance, revealing the existence of a positive correlation between the genetic and geographic distances (r = 0.448, P = 0.004). Besides, AMOVA analysis revealed a higher variation among populations (59.98%) of the three groups. Overall, the populations used in this study exposed a high level of genetic differentiation (FST = 0.596).


The nine polymorphic microsatellite markers used in our study sufficiently differentiated the fifteen tropical N. nucifera populations based on geography. These populations presented different genetic variability, thereby confirming that populations found in each country are unique. The low genetic diversity (HE = 0.245) could be explained by limited gene flow and clonal propagation. Conserving the available diversity using various conservation approaches is essential to enable the continued utilization of this economically important crop species. We, therefore, propose that complementary conservation approaches ought to be introduced to conserve tropical lotus, depending on the genetic variations and threat levels in populations.


Nelumbo nucifera Gaertn. (Lotus), a perennial aquatic macrophyte species, belongs to the genus Nelumbo in the family Nelumbonaceae. Cultivation of lotus dates long back in history as an ornamental and vegetable in several Asian countries (Guo 2009; Yang et al. 2012; Zhang et al. 2014). N. nucifera is mainly distributed in Asia and Australia (Han et al. 2007), and has also been utilized for its economical importance (Yang et al. 2013). In China, for example, N. nucifera seeds are widely used for the preparation of Chinese herbal medicine (Chen et al. 2008; Li et al. 2010), and the rhizome of this species is a common vegetable (Tian et al. 2008). N. nucifera flowers are the main traditional flowers in China, while in India and Vietnam, they are regarded as the national flowers (Chen et al. 2008; Tian et al. 2014).

Lotus flowers are protogynous and usually out-crossed by insects (Kubo et al. 2009). This species can be propagated either by seeds or rhizomes (Goel et al. 2001; Pan et al. 2011). Lotus is capable of producing new hybrids through hybridization between wild and domesticated varieties (Liu et al. 2012). So far, a sizable number of cultivars have been developed from N. nucifera (Li et al. 2015). Notably, the wild lotus populations have served as essential germplasm sources for breeding purposes (Xue et al. 2006; Han et al. 2007), and varied agro-climatic conditions have contributed to the existence of diverse genotypes of wild lotus in China (Liu et al. 2012).

Recently, morphological features, ecological adaptation, and genetic studies in lotus indicated that the South-eastern Asia lotus is distinct from Chinese lotus (Li et al. 2010). Zhang and Wang (2006) grouped the N. nucifera populations into two distinct ecotypes based on the geographical location where the genotypes are adapted, i.e., tropical lotus and temperate lotus. These ecotypes have shown differences in the duration of flowering, growth, and rhizome morphology. The temperate lotus have annual growth habits and big rhizome, whereas the tropical lotus is perennial, has a small rhizome and long flowering period (Zhang and Wang 2006). Lotus grown in East and North-east Asian countries belong to the temperate group, whereas the lotus grown in South-east Asian countries and Australia are considered as tropical ecotype (Zhang and Wang 2006; Li et al. 2010). A previous study revealed that the Thailand lotus, one of the tropical lotus groups, had 2 to 3 months longer flowering periods than the Chinese cultivars (Li et al. 2010; Yang et al. 2013). Tropical lotus is often used for enhancing the ornamental value of temperate lotus by providing valuable traits for developing varieties with a more extended flowering period (Li et al. 2010; Liu et al. 2012; Yang et al. 2013).

Future breeding programs and conservation of N. nucifera will depend on the available knowledge of genetic variation among populations (Han et al. 2009; Hu et al. 2012). In addition, genetic diversity and structure studies avail platforms for undertaking evidence-based management planning (Luo et al. 2018). Previous studies have assessed the genetic diversity of N. nucifera (Han et al. 2009; Pan et al. 2011), with much consideration being accorded to the temperate lotus. These studies have revealed higher genetic diversity levels for N. nucifera using varied molecular markers (Na et al. 2009; Han et al. 2009; Pan et al. 2011). On the contrary, the population genetic studies on tropical lotus have mostly utilized lotus populations from Thailand, however, with relatively low sampling (Li et al. 2010; Hu et al. 2012). Comparing the genetic diversity levels of the two ecotypes yields striking results. For instance, Liu et al. (2012) indicated that tropical lotus had lower genetic diversity than temperate lotus. However, a more recent study by Yang et al. (2013) showed that the wild tropical lotus had higher genetic diversity than the temperate ecotype. Hu et al. (2012) also reported that the natural lotus accessions from Thailand differentiated from other natural lotus accessions in South-east Asian countries and China using variable molecular markers (AFLPs and SSRs). Among these studies, only a few samples of the tropical lotus were included, and the representations of the tropical lotus were insignificant in comparison to temperate groups. To this day, the genetic diversity of the tropical N. nucifera ecotypes has not explicitly been addressed from the other major distribution regions, including India and Australia, compared to Thailand populations. The genetic diversity levels and differentiation of the tropical lotus from these poorly studied geographic regions remain unclear. Therefore, there is the need to conduct population genetic studies of tropical lotus from these understudied areas.

Here, we genotyped 15 tropical N. nucifera populations sampled from the natural distribution ranges in Australia, India, and Thailand using nine polymorphic microsatellite markers. We aim to (i) evaluate the level of genetic diversity of the tropical lotus populations from the previous poorly studied natural distribution ranges, and (ii) estimate the degree of differentiation and population structure of N. nucifera.


Sample collections and DNA extraction

Fifteen wild tropical N. nucifera populations comprising of 216 individuals were sampled from the natural distribution range in Australia, India, and Thailand (Table 1; Fig. 1). N. nucifera is a clonal species, and therefore, to reduce the resampling of the same individuals, leaves samples were collected at a minimum 10 m apart. The collected leaves were dried with silica gel and preserved in the refrigerator until DNA extraction. The DNA extraction and quantification followed a similar protocol as published in Islam et al. (2020), followed by preservation in a freezer at − 20 °C for subsequent analysis.

Table 1 Locations and sample size of N. nucifera populations investigated in the present study
Fig. 1
figure 1

Sample collection sites of 15 N. nucifera tropical populations in Australia, India, and Thailand. The pie charts indicate the proportion of admixtures in the three genetic groups (K = 3), which were yielded by STRUCTURE analysis

SSR genotyping and PCR amplifications

Nine SSR markers previously developed for N. nucifera by Tian et al. (2008), Kubo et al. (2009), and Pan et al. (2010), were selected for the present study (Additional file 1: Table S1). Fluorescent dye FAM (Applied Biosystems, Foster City, CA, USA) was used to label all forward primers. The polymerase chain reactions (PCR) and amplifications were performed following Islam et al. (2020). PCR products were confirmed by electrophoresis on 2.0% (W/V) agarose gel stained with ethidium bromide. Later, ABI 3730 XL automated sequencer (Wuhan Gene Create Biological Engineering Co. Ltd., Wuhan, China) was used to identify the products. GeneScan 500 LIZ (Applied biosystems) was used to check the dye sizes in each lane to allow the correct determination of fragment size. Lastly, the allele sizes were detected manually using GeneMarker2.2.0 (Soft Genetics) with the default settings.

Data analysis

Genetic diversity indices

The Cervus version 3.0 program (Kalinowski et al. 2007) was used to assess Hardy–Weinberg Equilibrium deviations with Bonferroni corrections as well as the polymorphic information content (PIC) for each SSR marker and inbreeding coefficient (FIS) for the 15 N. nucifera populations were estimated. GenAlEx version 6.5 (Peakall and Smouse 2012) was used to determine the genetic diversity characteristics of all loci and populations. The following parameters were assessed for each SSR; the effective number of alleles (Ne), expected heterozygosity (He), observed heterozygosity (Ho), the number of alleles per locus (Na). Population-based characteristics estimated included; the effective number of alleles (NE), expected heterozygosity (HE), observed heterozygosity (HO), the number of alleles per locus (NA), Shannon’s information index (IS), the number of private alleles (Np), and inbreeding coefficient (FIS).

Population structure

To examine the level of genetic variation among N. nucifera populations, and estimates of genetic differentiation (FST), the analysis of molecular variance (AMOVA) was done in Arlequin version 3.1 (Excoffier et al. 2005). STRUCTURE version 2.3.3 (Pritchard et al. 2000) that uses a Bayesian algorithm was used to assign populations to genetic clusters. 100,000 burn-in steps and ten iterations for each K from 1 to 15 were run independently (where K = Number of populations), followed by 1,000,000 Markov Chain Monte Carlo (MCMC). An online tool, STRUCTURE HARVESTER ( (Earl 2012), was used to analyze the results and predict the suitable number of genetic clusters. K consistent values were readjusted in CLUMPP version 1.1.2 (Jakobsson and Rosenberg 2007) while employing the Greedy algorithm with 10,000 replications. The resulting genetic structure of the 15 populations N. nucifera was constructed and displayed in DISTRUCT version 1.1 (Rosenberg 2004). Using Nei’s genetic distance matrix (Nei et al. 1983), GenAlEx version 6.5 (Peakall and Smouse 2012) executed the principal coordinate analysis (PCoA).

Bottleneck analysis

Bottlenecks across the 15 N. nucifera populations were assessed using the Bottleneck version 1.2.02 program (Piry et al. 1999). Wilcoxon’s sign-rank tests were performed with 10,000 simulations at the 5% significance, using the two-phase model (TPM = 70%SMM +30%IAM), the step-wise mutation model (SMM), and the infinite allele model (IAM). The deviation of the populations from normal L-shaped distribution (mode shift), which indicates a demographic bottleneck on populations, was also checked (Luikart and Cornuet 1998).

Estimation of historical gene flow

The number of migrants per generation (Nm) among the genetic groups (K = 3), in the previous 4Ne generations (Ne = effective population size), was estimated by MIGRATE-n version 3.7.2 program (Beerli 2012). A Bayesian and coalescent inference approach (Beerli 2006) was used while applying the Brownian approximation model. The θ (mutational scaled effective population size) and M (mutation scaled migration rate) were obtained from the program with settings attuned to default, then used to approximate Nm. The Nm was estimated as in the equation; \({\text{Nm }} = \left[ {\left( {\uptheta {\text{a }} \times {\text{Mb}} \to {\text{a}}} \right)/4} \right],\) i.e., population b migrants per generation to population a (Beerli 2012).


Characteristics of the microsatellite markers

All SSRs markers observed significant Hardy–Weinberg deviations. The nine microsatellite markers used in the present study yielded high polymorphisms in all populations. For each microsatellite marker, the effective number of alleles (Ne) varied from 1.223 to 1.956 (mean = 1.475). Observed (Ho) and expected heterozygosity (He) estimates ranged from 0.067 to 0.631 and 0.140 to 0.474, respectively (mean, Ho = 0.274 and He = 0.245) (Additional file 1: Table S1). Nelumbo-13 and PR05 markers had the highest number of alleles (Na = 10). PIC is considered as a measure of the informativeness of the SSR markers (Babu et al. 2014), and high PIC values are reported to have a high discriminating ability and recommended for population genetic diversity studies (Ngailo et al. 2016). PIC values of the microsatellite markers used in our study varied from 0.322 at locus Nelumbo-32 to 0.775 at locus PR05 (mean = 0.593). Only, Nelumbo-32 and NNEST17 markers had PIC values less than 0.50.

Genetic diversity of N. nucifera

Sixty-five alleles were identified in the 15 tropical N. nucifera populations, ranging from six to ten alleles per locus (mean = 7.220) (Additional file 1: Table S1). The number of effective alleles (NE) and the number of observed (NA) per population ranged from 1.140 to 2.023 and 1.333 to 2.667, respectively. The heterozygosity levels, observed and expected, ranged from 0.044 to 0.824 and 0.081 to 0.470, respectively. The average expected heterozygosity (HE = 0.358) was higher in Thailand than in both India and Australia. Similarly, Shannon’s information index varied from 0.129 to 0.730. Private alleles were detected in nine of the 15 populations examined. Eleven of the 19 observed private alleles were found in populations sampled from Thailand, and population T4 had the highest count (6) (Table 2).

Table 2 The genetic diversity parameters measures among the 15 N. nucifera populations

Eleven populations showed low levels of coefficient of inbreeding (FIS). This observation reflects the presence of high cross-pollination levels among populations. However, only four populations (A1, A6, I1, and I6) had positive FIS values, suggesting that there existed inbreeding among the individuals of these populations. Populations T4 and I3 had the highest and lowest genetic diversity (HE), respectively. Overall, the microsatellite markers showed a low genetic variation in N. nucifera populations.

Genetic structure of N. nucifera

The Bayesian clustering in STRUCTURE suggested three genetic clusters in the N. nucifera populations, according to delta K. These populations were divided geographically according to the three countries (India, Thailand, and Australia), except for two Australian populations that were assigned together with the Indian populations (Fig. 2). Among the 15 tropical N. nucifera populations, the highest (2.383) and lowest (0.005) genetic distance was found in the populations sampled from Australia (Additional file 2: Table S2). The PCoA analysis revealed similar clustering patterns as STRUCTURE results, including the assignment of the two Australian populations to the Indian cluster (Fig. 3). The first and second axes in the PCoA explained 72.75% of the total variation (Fig. 3).

Fig. 2
figure 2

Genetic structuring of the15 tropical N. nucifera populations obtained from the STRUCTURE analysis, K = 3 (shown on the left); and the plot of K against delta K (shown on the right)

Fig. 3
figure 3

The scatter plot of principal coordinate analysis (PCoA) based on the microsatellite data. Australia populations (A), India populations (I), and Thailand populations (T). Coord.1 (53.91%) and Coord. 2 (18.84%) refer to the first and second principal components, respectively

Results of AMOVA revealed a higher variation among populations (59.98%) than within populations (40.02%) of the three countries, supported by high levels of genetic differentiation (FST = 0.596) (Table 3). In addition, the Mantel’s test confirmed the existence of a significant positive correlation between Nei’s genetic distance (Nei et al. 1983) and geographic distance (km) for all pairwise populations (r = 0.448, P = 0.004) (Fig. 4). Mantel test results indicated that the geographical distribution of the populations had contributed significantly to the observed genetic diversity.

Table 3 Analysis of molecular variance (AMOVA) for the 15 N. nucifera populations
Fig. 4
figure 4

Mantel test for isolation by distance between Nei’s genetic distance and geographic distance (km) for the N. nucifera populations

Population demographic bottlenecks and historical gene flow

The results of the bottleneck analysis are outlined in Additional file 3: Table S3. The three models used detected recent bottlenecks in T4, whereas only the infinite allele model (IAM) detected bottleneck in T5. Similarly, five populations (A1, A6, I1, I2, and I4) showed a shifted mode indicating the effect of recent population bottlenecks. The Migrate-n results of gene flow (Nm) among the three groups of N. nucifera populations revealed that the mutation scale effective population size (θ) for the three genetic clusters were 0.09137, 0.09711 and 0.09759, respectively. Among the populations the computed mutation scaled migration rate (M) ranged from 4.163 (M2- > 1) to 23.631 (M2- > 3). The gene flow was higher from India to Thailand populations (Nm = 0.577). Low gene flow occurred from India to Australia populations (Nm = 0.095). The study also found bidirectional gene flow among the genetic clusters. Overall, low gene flow (mean Nm = 0.394) was obtained among the three genetic clusters of N. nucifera populations. The details of the historical gene flow analysis among populations are shown in Additional file 4: Table S4.


Genetic diversity in wild N. nucifera populations

In the current study, we included 15 tropical N. nucifera populations sampled from natural distribution ranges in Australia, India, and Thailand. The genetic diversity (HE) values varied from 0.160–0.277; 0.081–0.216 and 0.254–0.470, in Australia, India and Thailand populations, respectively. This highlights the presence of wide genetic variability within each genetic group. The overall findings of this study exhibited low genetic diversity levels (mean HE = 0.245), which is mainly explained by low gene flow levels and clonal propagation. The result of the inbreeding coefficient analysis revealed that A1, A2, I1, and I2 populations had significant positive FIS values, which may be attributed to the high level of matings between closely related individuals. This phenomenon might have contributed to the observed low level of genetic diversity in these populations. The mean expected heterozygosity found in this study is greater than that recorded for the tropical lotus (HE = 0.152) by Liu et al. (2012). This value is much lower than the value (HE = 0.320) reported for the same ecotype by Yang et al. (2013). Moreover, the genetic diversity level found in this study is lower than the values reported for other aquatic plant species such as Ottelia acuminata (HE = 0.351) by Zhai et al. (2018) and Ottelia acuminata var. jingxiensis (HE = 0.441) by Li et al. (2019) using microsatellites markers. Comparably, the highest genetic diversity estimate (HE) in this study was found among the populations from Thailand (HE = 0.360), whereas the least genetic diversity was found in Indian populations (HE = 0.156). The higher genetic diversity found in Thailand populations might be related to the inherent broad genetic base of the germplasm or the presence of suitable growing conditions for the species in this country. The highest genetic diversity values (HE = 0.470), was found in T4. From the result, we can suggest that the higher genetic diversity level revealed in this population might have been accumulated during a long evolutionary history of the population. The highest genetic diversity and private alleles (HE = 0.470 and NP = 6, respectively), were found in T4 (Table 2). From the result, we can suggest that the higher genetic diversity level revealed in this population might have been accumulated during a long evolutionary history of the population. Besides, a previous study reported that, if a population had a large size, stable, and persisted for a long period, that population could still maintain high genetic diversity even after experiencing bottleneck events (Assis et al. 2013), as evidenced in T4.

The mean PIC value (0.593) detected in this study indicates that the markers are effective for population genetic studies of N. nucifera. The value is comparable with the PIC value reported by Liu et al. (2012), which observed a mean value of 0.537. Recently, the study conducted by Islam et al. (2020) on N. lutea sampled from the USA revealed higher PIC values (0.793), generally greater than the current study PIC. Overall, the present results portrayed that the SSR markers used are handy for genetic diversity studies in the N. nucifera germplasm.

Unlike sexual reproduction, in clonal propagation, there is no genetic recombination, and only a rhizome is used as a seed (Chen et al. 2008). Therefore, a cultivar’s total heterozygosity remains the same when using the same rhizomes vegetative propagation. As a result, plants propagated by clonal methods generally have low genetic variation than sexually propagated ones (Chen et al. 2008; Xue et al. 2006). Li et al. (2015) inferred that asexual reproduction through rhizomes in N. nucifera contributed to low genetic diversity. Positive FIS values observed in four N. nucifera populations (Table 2) reflect the presence of excess homozygote individuals, and it is expected to contribute to the lower genetic diversity in these populations, an aspect supported by Hyten et al. (2006) study. Similarly, Beatty and Provan (2011) stated that the habitat of species found at the peripheral areas are highly fragmented, and the populations are often found at the edge of their ranges. In the present study, most of the N. nucifera populations were sampled from the peripheral areas, for instance, the Australian populations and some Indian populations. Therefore, it is likely that these populations had already been affected by habitat fragmentation, which eventually leads to lower genetic diversity.

Population genetic structure

The higher percentage of variation in N. nucifera was found among, compared to within populations of the three countries; however, this dissimilarity was not significant. The PCoA investigation revealed that N. nucifera populations were distinct. Similarly, the STRUCTURE analysis showed three distinct genetic clusters with low admixtures, supported by PCoA cluster analysis (Fig. 2). A1 and A6 consistently clustered together with the Indian populations. This clustering pattern is difficult to explain in terms of the proximity of geographical distance. However, we infer that the populations either might have diverged a long time ago from the same ancestors or recently introduced by humans. Xue et al. 2006 suggested that birds can occasionally disperse seeds. The geographic distance between A1 and A6 populations is approximately 106 km, and gene flow can occur between these populations. Because of this, the populations might have possessed common ancestral polymorphism, which differentiates them from other Australian populations. The low sampling of the two populations might have also contributed to the observed clustering pattern. The high level of differentiation (FST = 0.596) in the present study is lower than the previous findings reported for N. nucifera (Han et al. 2007; Pan et al. 2011). A recent study by Islam et al. (2020) identified a lower level of gene flow, founder effect, inbreeding, and common ancestry as the major reasons for genetic differentiation in N. lutea populations in the USA. Slatkin (1987) reported that a lower gene flow (less than one) can cause genetic differentiation among populations. Hence, the high FST (0.596) and the low gene flow (0.346) found in this study contributed to the observed genetic structure, supported by significant IBD patterns in the study area (r = 0.448, P = 0.004). Zhang et al. (2019) submitted that asexual propagation would also reduce genetic differences among individuals within populations and increase differences among populations.

Gene flow estimation and bottleneck analysis

Gene flow may have a significant impact on the genetic differentiation of the local populations (Storfer 1999). It plays a vital role in influencing genetic variations within populations by limiting inbreeding depression (Robledo-Arnuncio et al. 2014). Results of Migrate-n analysis indicated that the highest gene flow (Nm = 0.577) was observed from India to Thailand, and the lowest (Nm = 0.095) was from India to Australia. The agents of gene flow in lotus can be insects, birds, water currents (Kubo et al. 2009; Xue et al. 2006), and humans. Besides, due to the large geographical distance between Thailand and India, gene flow may not be carried out by insects attributed to the insect’s short flight ranges. Therefore, water currents, birds or anthropogenic introductions may be significant among the main drivers of gene flow in lotus between the two countries. Slatkin (1987) indicated that genetic drift results in higher genetic differentiation when the gene flow among populations is less than one (Nm < 1). We, therefore, suggest that genetic drift might have influenced the observed genetic differentiation in the N. nucifera populations hence the low gene flow. Li et al. (2010) also reported a low level of recurrent gene flow among the wild populations of N. nucifera sampled from China, Japan, India, and Thailand. The bottleneck analysis revealed that seven of the 15 N. nucifera populations had experienced bottlenecks, of which, T4 and T5 had significant probabilities (Additional file 3: Table S3). Chen et al. (2019) outlined that habitat loss, fragmentation, and over-exploitation were the major factors that contributed to bottlenecks in N. nucifera populations. Hence, from the observation made in the present study, we presume that some of the populations (T4 and T5) have already been affected by fragmentation.

Implication for conservations

The presence of high genetic diversity (HE) within crop species plays a critical role in crop improvement programs (Salgotra et al. 2015). Besides, genetic diversity determines the potential of species survival and adaptation in the changing environmental conditions (Otálora et al. 2015; Chen et al. 2019). The lotus varieties currently found under production were obtained by continuous selection from the wide diversity available in the agricultural fields and wild states (Tian et al. 2008). According to Hu et al. (2012), wild N. nucifera populations found in Thailand and northeastern China are valuable germplasm in lotus breeding work. In another study, it was reported that the tropical lotus germplasm found in Thailand was used in breeding to improve the ornamental and economic values of Chinese lotus varieties (Yang et al. 2013). Notably, breeders used the genetic variation found in wild species to identify agriculturally important traits and introducing them into new varieties (Samiei et al. 2010). This suggests that countries are in one way or the other dependent on other countries’ genetic resources for improving their indigenous species. At present, our study realized low genetic variability in most populations. The wetlands used as habitat for tropical lotus have been turned into agriculture, and other land uses (La-ongsri et al. 2009). This phenomenon will likely affect the populations, and the genetic diversity might continue to decline. Two (T4 and T5) out of the seven populations that had experienced recent bottlenecks had significant probabilities, indicating that anthropogenic and natural factors had already threatened them. Hence, conservation of these threatened tropical lotus germplasm deserves special attention to ensure their continued availability. The highly diverse populations found in this study could be valuable germplasm for future breeding programs of the crop. Conservation priority should be given to populations with the highest genetic diversity, and those that have exhibited recent bottlenecks. Hence, we suggest the implementation of complementary conservation (i.e., in situ and ex situ) approaches for this species.


Population genetic structure studies of N. nucifera are essential to identify populations with unique traits and design appropriate conservation methods. The nine polymorphic microsatellite markers used in our study sufficiently differentiated the 15 tropical N. nucifera populations based on geography. The populations showed different genetic variability, and the results confirmed that the populations found in each country are unique. Geographically separated populations will likely develop genetic differences due to the adaptation to different habitats. We recommend that future breeding programs and conservation of N. nucifera, to utilize the germplasms of tropical populations with high genetic levels, as yielded in our study.

Further studies using additional samples from all the species distribution areas and more markers should be conducted to gain more insights into the population genetic structure of N. nucifera. Conserving the available diversity using various conservation approaches is essential to enable the continued utilization of this economically important crop species. Therefore, based on the findings of this study, conservation priority should be given to populations with a high level of genetic diversity (e.g., T4, in Thailand), and to those that have exhibited bottlenecks. We recommend that complementary conservation approaches should be effected to maintain endangered and the declining populations of tropical lotus.

Availability of data and materials

All data generated or analyzed during this study are included in this published article and its additional files.



Analysis of molecular variance

HE :

Expected heterozygosity

HO :

Observed heterozygosity


Inbreeding coefficient


Genetic differentiation


Hardy–Weinberg equilibrium


The infinite allele model


Isolation by distance

IS :

Shannon’s information index


Mutation scaled migration rate


Markov chain Monte Carlo

NA :

Number of alleles per locus

NE :

Effective number of alleles


Historical gene flow


The number of private alleles


Principal coordinate analysis


Polymerase chain reactions


Polymorphic information content


Step-wise mutation model


Simple sequence repeats


Mutational scaled effective population size


  • Assis J, Nelson Castilho Coelho FA, Valero M, Raimondi P, Reed D, Serrão EA (2013) High and distinct range-edge genetic diversity despite local bottlenecks. PLoS ONE 8(7):e68646.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Babu BK, Meena V, Agarwal V, Agrawal PK (2014) Population structure and genetic diversity analysis of Indian and exotic rice (Oryza sativa L.) accessions using SSR markers. Mol Biol Rep 41(7):4329–4339.

    Article  CAS  PubMed  Google Scholar 

  • Beatty GE, Provan J (2011) High clonal diversity in threatened peripheral populations of the yellow bird’s nest (Hypopitys monotropa; syn. Monotropa hypopitys). Ann Bot 107(4):663–670.

    Article  PubMed  PubMed Central  Google Scholar 

  • Beerli P (2006) Comparison of Bayesian and maximum-likelihood inference of population genetic parameters. Bioinformatics 22(3):341–345

    Article  CAS  PubMed  Google Scholar 

  • Beerli P (2012) Migrate, documentation version 3.7.2. Florida State University, Tallahasee

    Google Scholar 

  • Chen Y, Zhou R, Lin X, Wu K, Qian X, Huang S (2008) ISSR analysis of genetic diversity in sacred lotus cultivars. Aquatt Bot 89(3):311–316.

    Article  CAS  Google Scholar 

  • Chen YY, Wang WC, Fan XR, Sun JY, Li W, Li XL, Liu YL (2019) Genetic discontinuities and abundant historical gene flow in wild lotus Nelumbo nucifera populations from the Yangtze River. Aquat Bot 158:103130.

    Article  Google Scholar 

  • Earl DA (2012) STRUCTURE HARVESTER: a website and program for visualizing STRUCTURE output and implementing the Evanno method. Conserv Genet Res 4(2):359–361

    Article  Google Scholar 

  • Excoffier L, Laval G, Schneider S (2005) Arlequin (version 3.0): an integrated software package for population genetics data analysis. Evol Bioinform 1:117693430500100003

    Article  Google Scholar 

  • Goel A, Sharma SC, Sharga AN (2001) The conservation of the diversity of Nelumbo (Lotus) at the National Botanical Research Institute, Lucknow, India. Bot Gard Conserv News 3(6):52–54

    Google Scholar 

  • Guo HB (2009) Cultivation of lotus (Nelumbo nucifera Gaertn ssp. nucifera) and its utilization in China. Genet Resour Crop Evol 56(3):323–330

    Article  Google Scholar 

  • Han YC, Teng CZ, Zhong S, Zhou MQ, Hu ZL, Song YC (2007) Genetic variation and clonal diversity in populations of Nelumbo nucifera (Nelumbonaceae) in central China detected by ISSR markers. Aquat Bot 86(1):69–75.

    Article  Google Scholar 

  • Han YC, Teng CZ, Wahiti GR, Zhou MQ, Hu ZL, Song YC (2009) Mating system and genetic diversity in natural populations of Nelumbo nucifera (Nelumbonaceae) detected by ISSR markers. Plant Syst Evol 277(1–2):13–20

    Article  Google Scholar 

  • Hu J, Pan L, Liu H, Wang S, Wu Z, Ke W, Ding Y (2012) Comparative analysis of genetic diversity in sacred lotus (Nelumbo nucifera Gaertn.) using AFLP and SSR markers. Mol Biol Rep 39(4):3637–3647

    Article  CAS  PubMed  Google Scholar 

  • Hyten DL, Song Q, Zhu Y, Choi IY, Nelson RL, Costa JM, Specht JE, Shoemaker RC, Cregan PB (2006) Impacts of genetic bottlenecks on soybean genome diversity. Proc Natl Acad Sci 103(45):16666–16671

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Islam MR, Zhang Y, Li ZZ, Liu H, Chen JM, Yang XY (2020) Genetic diversity, population structure, and historical gene flow of Nelumbo lutea in USA using microsatellite markers. Aquat Bot 160:103162

    Article  Google Scholar 

  • Jakobsson M, Rosenberg NA (2007) CLUMPP: a cluster matching and permutation program for dealing with label switching and multimodality in analysis of population structure. Bioinformatics 23(14):1801–1806

    Article  CAS  PubMed  Google Scholar 

  • Kalinowski ST, Taper ML, Marshall TC (2007) Revising how the computer program CERVUS accommodates genotyping error increases success in paternity assignment. Mol Ecol 16(5):1099–1106

    Article  PubMed  Google Scholar 

  • Kubo N, Hirai M, Kaneko A, Tanaka D, Kasumi K (2009) Classification and diversity of sacred and American Nelumbo species: the genetic relationships of flowering lotus cultivars in Japan using SSR markers. Plant Genet Resour Charact Util 7(3):260–270.

    Article  CAS  Google Scholar 

  • La-ongsri W, Trisonthi C, Balslev H (2009) Management and use of Nelumbo nucifera Gaertn. in Thai wetlands. Wetl Ecol Manag 17:279–289.

    Article  Google Scholar 

  • Li Z, Liu X, Gituru RW, Juntawong N, Zhou M, Chen L (2010) Genetic diversity and classification of Nelumbo germplasm of different origins by RAPD and ISSR analysis. Sci Hortic 125(4):724–732.

    Article  CAS  Google Scholar 

  • Li C, Mo H, Tian D, Xu Y, Meng J, Tilt K (2015) Genetic diversity and structure of American lotus (Nelumbo lutea Willd.) in North America revealed from microsatellite markers. Sci Hortic 189:17–21

    Article  CAS  Google Scholar 

  • Li ZZ, Lu MX, Gichira AW, Islam MR, Wang QF, Chen JM (2019) Genetic diversity and population structure of Ottelia acuminata var. jingxiensis, an endangered endemic aquatic plant from southwest China. Aquat Bot 152:20–26.

    Article  Google Scholar 

  • Liu Y, Mei Y, Xiang Q, Xu L, Zeng Z, Bao MB (2012) Characterization of microsatellite markers and their application for the assessment of genetic diversity among lotus accessions. J Am Soc Hortic Sci 137(3):180–188

    CAS  Google Scholar 

  • Luikart G, Cornuet JM (1998) Empirical evaluation of a test for identifying recently bottlenecked populations from allele frequency data. Conserv Biol 12:228–237

    Article  Google Scholar 

  • Luo X, Cao S, Hao Z, Hou L, Cao D, Zhang J, Li H, Niu J, Xue H, Chen L (2018) Analysis of genetic structure in a large sample of pomegranate (Punica granatum L.) using fluorescent SSR markers. J Hortic Sci Biotechnol 93(6):659–665

    Article  Google Scholar 

  • Na A, Hong-Bo G, Wei-dong K (2009) Genetic variation in rhizome lotus (Nelumbo nucifera Gaertn ssp. nucifera) Germplasms from China assessed by RAPD markers. Agric Sci China 8(1):31–39.

    Article  Google Scholar 

  • Nei M, Tajima F, Tateno Y (1983) Accuracy of estimated phylogenetic trees from molecular data. J Mol Evol 19:153–170

    Article  CAS  PubMed  Google Scholar 

  • Ngailo S, Shimelis H, Sibiya J, Amelework B, Mtunda K (2016) Genetic diversity assessment of Tanzanian sweet potato genotypes using simple sequence repeat markers. South African J Bot 102:40–45

    Article  CAS  Google Scholar 

  • Otálora MA, Belinchón R, Prieto M, Aragón G, Izquierdo P, Martínez I (2015) The threatened epiphytic lichen Lobaria pulmonaria in the Iberian Peninsula: genetic diversity and structure across a latitudinal gradient. Fungal Biol 119(9):802–811

    Article  PubMed  Google Scholar 

  • Pan L, Xia Q, Quan Z, Liu H, Ke W, Ding Y (2010) Development of novel EST–SSRs from sacred lotus (Nelumbo nucifera Gaertn) and their utilization for the genetic diversity analysis of N. nucifera. J Hered 101(1):71–82

    Article  CAS  PubMed  Google Scholar 

  • Pan L, Quan ZW, Hu JH, Wang GY, Liu SN, He Y, Ke WD, Ding Y (2011) Genetic diversity and differentiation of lotus (Nelumbo nucifera) accessions assessed by simple sequence repeats. Ann Appl Biol 159:428–441.

    Article  CAS  Google Scholar 

  • Peakall R, Smouse PE (2012) GenAlEx 6.5: genetic analysis in Excel. Population genetic software for teaching and research—an update. Bioinformatics 28:2537–2539.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Piry S, Luikart G, Cornuet JM (1999) BOTTLENECK: a computer program for detecting recent reductions in the effective population size using allele frequency data. J Hered 90:502–503

    Article  Google Scholar 

  • Pritchard JK, Stephens M, Donnelly P (2000) Inference of population structure using multilocus genotype data. Genetics 155:945–959

    CAS  PubMed  PubMed Central  Google Scholar 

  • Robledo-arnuncio JJ, Klein EK, Muller-landau HC, Santamaría L (2014) Space, time and complexity in plant dispersal ecology. Mov Ecol 2:1–17.

    Article  Google Scholar 

  • Rosenberg NA (2004) DISTRUCT: a program for the graphical display of population structure. Mol Ecol Notes 4(1):137–138

    Article  Google Scholar 

  • Salgotra RK, Gupta BB, Bhat JA, Sharma S (2015) Genetic diversity and population structure of basmati rice (Oryza sativa L.) germplasm collected from north western Himalayas using trait linked SSR markers. PLoS ONE 10(7):1–19.

    Article  CAS  Google Scholar 

  • Samiei L, Naderi R, Khalighi A, Shahnejat-Bushehri AA, Mozaffarian V, Esselink GD, Osaloo K, Smulders MJ (2010) Genetic diversity and genetic similarities between Iranian rose species. Hortic Sci Biotechnol 85(3):231–237

    Article  CAS  Google Scholar 

  • Slatkin M (1987) Gene flow and the geographic structure of natural populations. Science 236(4803):787–792

    Article  CAS  PubMed  Google Scholar 

  • Storfer A (1999) Gene flow and endangered species translocations: a topic revisited. Biol Conserv 87:173–180

    Article  Google Scholar 

  • Tian H, Xue J, Wen J, Mitchell G, Zhou S (2008) Genetic diversity and relationships of lotus (Nelumbo) cultivars based on allozyme and ISSR markers. Sci Hortic 116:421–429.

    Article  CAS  Google Scholar 

  • Tian DK, Mo HB, Zhang WW, Huang X, Li C, Xu YY (2014) Progress on international lotus registration and construction of international Nelumbo database. Proceedings from the 6th international symposium on the taxonomy of cultivated plants. Acta Hortic 1035:79–85

    Article  Google Scholar 

  • Xue J, Zhuo L, Zhou S (2006) Genetic diversity and geographic pattern of wild lotus (Nelumbo nucifera) in Heilongjiang Province. Chin Sci Bull 51(4):421–432

    Article  CAS  Google Scholar 

  • Yang M, Han Y, Xu L, Zhao J, Liu Y (2012) Comparative analysis of genetic diversity of lotus (Nelumbo) using SSR and SRAP markers. Sci Hortic 142:185–195.

    Article  CAS  Google Scholar 

  • Yang M, Liu F, Han Y, Xu L, Juntawong N, Liu Y (2013) Genetic diversity and structure in populations of Nelumbo from America, Thailand, and China: implications for conservation and breeding. Aquat Bot 107:1–7.

    Article  Google Scholar 

  • Zhai SH, Yin GS, Yang XH (2018) Population genetics of the endangered and wild edible plant Ottelia acuminata in southwestern China using novel SSR markers. Biochem Genet 56:235–254.

    Article  CAS  PubMed  Google Scholar 

  • Zhang Q, Wang Q (2006) The discovery of tropical lotus flowers and the classification system of lotus varieties. Chin Landsc Archit 82–85 (in Chinese with English abstract)

  • Zhang W, Tian D, Huang X, Xu Y, Mo H, Liu Y, Meng J, Zhang D (2014) Characterization of flower-bud transcriptome and development of genic SSR markers in Asian lotus (Nelumbo nucifera Gaertn.). PLoS ONE 9(11):1–11.

    Article  CAS  Google Scholar 

  • Zhang X, Su H, Yang J, Feng L, Li Z, Zhao G (2019) Population genetic structure, migration, and polyploidy origin of a medicinal species Gynostemma pentaphyllum (Cucurbitaceae). Ecol Evol 9(19):11145–11170.

    Article  PubMed  PubMed Central  Google Scholar 

Download references


The authors would like to thank Josphat K. Saina for his critical review and valuable suggestions on the draft manuscript.


This work was supported by grants from the National Natural Science Foundation of China (No. 31570220), Hubei Provincial Natural Science Foundation of China (No. 956 2019CFB275) and Bureau of Landscaping and Forestry of Wuhan Municipal (No. 2018-28).

Author information

Authors and Affiliations



J-MC, TS and X-YY conceived, designed the experiments, and secured funds. J-MC collected the samples. S-XH and Z-ZL performed the experiments. BKN, YM, and Z-ZL performed statistical analyses. YM, BKN, Z-ZL, KFO and Y-TL interpreted the results of the statistical analyses. YM wrote the paper. All authors read and approved the final manuscript.

Corresponding authors

Correspondence to Jin-Ming Chen or Xing-Yu Yang.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Additional file 1: Table S1.

Polymorphism information of the nine SSRs markers used in the present study.

Additional file 2: Table S2.

Pairwise fixation index (FST) values between populations of tropical N. nucifera (below diagonal), and Nei genetic distance (above diagonal).

Additional file 3: Table S3.

Bottleneck analysis in 15 tropical N. nucifera populations.

Additional file 4: Table S4.

Migrate-n results of historical gene flow among the three genetic groups.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Mekbib, Y., Huang, SX., Ngarega, B.K. et al. The level of genetic diversity and differentiation of tropical lotus, Nelumbo nucifera Gaertn. (Nelumbonaceae) from Australia, India, and Thailand. Bot Stud 61, 15 (2020).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: