Development of STS and CAPS markers for variety identification and genetic diversity analysis of tea germplasm in Taiwan
Botanical Studies volume 55, Article number: 12 (2014)
Tea (Camellia sinensis) is an important economic crop in Taiwan. Particularly, two major commercial types of tea (Paochong tea and Oolong tea) which are produced in Taiwan are famous around the world, and they must be manufactured with specific cultivars. Nevertheless, many elite cultivars have been illegally introduced to foreign countries. Because of the lower cost, large amount of “Taiwan-type tea” are produced and imported to Taiwan, causing a dramatic damage in the tea industry. It is very urgent to develop the stable, fast and reliable DNA markers for fingerprinting tea cultivars in Taiwan and protecting intellectual property rights for breeders. Furthermore, genetic diversity and phylogenetic relationship evaluations of tea germplasm in Taiwan are imperative for parental selection in the cross-breeding program and avoidance of genetic vulnerability.
Two STS and 37 CAPS markers derived from cytoplasmic genome and ESTs of tea have been developed in this study providing a useful tool for distinguishing all investigated germplasm. For identifying 12 prevailing tea cultivars in Taiwan, five core markers, including each one of mitochondria and chloroplast, and three nuclear markers, were developed. Based on principal coordinate analysis and cluster analysis, 55 tea germplasm in Taiwan were divided into three groups: sinensis type (C. sinensis var. sinensis), assamica type (C. sinensis var. assamica) and Taiwan wild species (C. formosensis). The result of genetic diversity analysis revealed that both sinensis (0.44) and assamica (0.41) types had higher genetic diversity than wild species (0.25). The close genetic distance between the first (Chin-Shin-Oolong) and the third (Shy-Jih-Chuen) prevailing cultivars was found, and many recently released varieties are the descents of Chin-Shin-Oolong. This implies the potential risk of genetic vulnerability for tea cultivation in Taiwan.
We have successfully developed a tool for tea germplasm discrimination and genetic diversity analysis, as well as a set of core markers for effective identification of prevailing cultivars in Taiwan. According to the results of phylogenetic analysis on prevailing tea cultivars, it is necessary to broaden genetic diversity from wild species or plant introduction in future breeding programs.
Tea (Camellia sinensis) is one of the most important beverage crops around the world and also a significant economic crop in Taiwan. Currently, there are 14,091 hectares of tea farms in Taiwan, producing 17,310 tons per year (Council of Agriculture 2012). Tea has been planted in Taiwan since 200 years ago, and has been manufactured into different types of tea in accordance with different eras and production areas (Chiu 1988; Jun and Lin 1997). Because different types of tea are produced with specific cultivars, numerous tea cultivars are grown in Taiwan. Paochong tea and Oolong tea are two major types of tea currently produced in Taiwan, whereas black and green tea are considered to be minor types. The cultivars suitable for making Paochong or Oolong tea are cultivars Chin-Shin-Oolong, TTES-12, Shy-Jih-Chuen, Chin-Shin-Dahpan, and TTES-13. Whereas cultivar Chin-Shin-Gantzy is fitting for green tea, and TTES-8 and TTES-18 are suitable for black tea (Tsai et al. 2004b). Furthermore, there are many other germplasm including landraces, introduced varieties, and wild species that could be selected or utilized for breeding new varieties.
Tea is a woody, perennial, and out-crossing crop that is highly heterozygous (Barua 1963). In tea breeding, the key points for parental selection are superior traits from parents and their wide-ranging genetic diversity that prevent the weakness of progenies (Bandyopadhyay 2011). Furthermore, many elite cultivars developed in Taiwan have been illegally introduced to China, Vietnam, Thailand, Indonesia, and so on. Because of the lower cost, large number of “Taiwan-type tea” are produced and imported to Taiwan, causing a dramatic damage in the tea industry. Therefore, seedlings and products of tea have been protected by the “Plant Variety and Plant Seed Act” which was enacted in 2004. In addition, the scientific database for identifying and examining varieties of tea should be well developed for the suspicious torts.
The simple method for genetic diversity assessment and variety identification of tea or its commercial product (processed tea) is based on the morphological traits. However, the available morphological traits are limited in number and easily affected by environments and growth stages of tea (Gunasekare 2007; Bandyopadhyay 2011). DNA markers are genetic markers that came from various classes of DNA mutations and rearrangements (Collard et al. 2005). Compared with morphological traits, DNA markers have numerous advantages such as multiple marker types, relative abundance of polymorphism, extensive genomic coverage, not disturbed by growth stage and tissue of plants, not affected by environment and gene expression, only a small quantity of DNA needed for assay, only a short period required for analyzing large amounts of samples, and more reproducible (Powell et al. 1996; Collard et al. 2005; Jones et al. 2009). DNA markers, including RAPD (randomly amplified polymorphic DNA), ISSR (inter-simple sequence repeat) and AFLP (amplified fragment length polymorphism) have been well developed for genetic fingerprinting and phylogenetic studies of tea in Taiwan (Lai et al. 2001; Tsai et al. 2003; Hu et al. 2005; Lin et al. 2005). Nevertheless, these markers are dominant, and their reproducibility and capacity for variety identification are less than targeted and locus-specific DNA markers, such as STS and CAPS.
STS (ssequence tagged site) is a relatively short and single-copy DNA sequence that can be specifically amplified by PCR (Olson et al. 1989). CAPS (cleaved amplified polymorphic sequence) or PCR-RFLP (polymerase chain reaction - restriction fragment length polymorphism) utilizes amplified DNA fragments digested with a restriction endonuclease to display restriction site polymorphisms (Konieczny and Ausubel 1993). STS and CAPS markers are co-dominant, locus-specific, and more reproducible. They have various advantages including their genotypes which are easily scored and interpreted, and only a small quantity of DNA is needed for one assay. Also the cleaved and un-cleaved amplification products can be adjusted arbitrarily by the appropriate placement of the PCR primers. The procedure is technically simple with robust results because the amplification product is always obtained (Drenkard et al. 1997).
DNA markers could be developed from whole nuclear genome or expressed sequence tags (ESTs). Because the whole genome sequences of tea plant are not available and updated, it is feasible to develop nuclear markers from ESTs database. ESTs are short cDNA sequences reversely transcribed from mRNA. In general, by using EST-derived primer pairs to amplify nuclear genome, the amplicons may consist of intron sequences that displayed higher variation to develop informative markers for variety identification (Shu et al. 2010). Besides, DNA markers could be also derived from the cytoplasmic genome, such as the mitochondria genome (mtDNA) and chloroplast genome (cpDNA). The cytoplasmic CAPS markers are not only maternal inherited from haploid genome (Kaundun and Matsumoto 2011), but also have a slower nucleotide substitution rate than the nuclear DNA (Palmer 1992). Because of conservative evolution, they have been widely used in detecting geographical origins of plant species (Kaundun and Matsumoto 2002; Katoh et al. 2003) and population differentiation (Schaal and Olsen 2000).
The aim of this study is to develop a stable, fast and reliable STS and CAPS DNA markers for fingerprinting commercial tea varieties in Taiwan and protect intellectual property rights for breeders. Furthermore, genetic diversity and phylogenetic relationship of tea germplasm in Taiwan are assessed to provide information for parental selection.
Plant materials and DNA extraction
A total of 55 germplasm were analyzed in this study, including 22 selected from crossing between varieties, nine local cultivars (landraces), 16 introduced varieties, and eight wild species. According to taxonomy, 22 C. sinensis var. sinensis (S), 12 C. sinensis var. assamica (A), 11 C. sinensis var. sinensis × var. assamica (SA), two C. sinensis var. assamica × var. assamica (AS), seven C. formosensis (F), and one C. formosensis var. yungkangensis (FY) are classified (Hu et al.2005; Su 2007; Su et al. 2009). Except four (I4 ~ I7) samples were obtained from the tea garden of Tung Pang Black Tea CO. LTD. in Nantou County, Taiwan. All samples were collected from the germplasm garden at the Tea Research and Extension Station in Taoyuan County, Taiwan (Table 1).
Design of STS markers
To develop cytoplasmic STS markers, primer pairs of two chloroplast and seven mitochondria were designed according to Hu (2004). In addition, 54 primer pairs of nuclear STSs, including four developed by Kaundun and Matsumoto (20032004), and 50 based on the public EST database (dbEST) of NCBI (National Center for Biotechnology Information, USA, http://www.ncbi.nlm.nih.gov/) were designed, using the Primer3 software (Rozen and Skaletsky 2000) (http://frodo.wi.mit.edu/). These primer pairs were prescreened with eight cultivars comprising of the following: TTES-8; TTES-12; TTES-13; TTES-18; TTES-19; TTES-20; Chin-Shin-Oolong and Shy-Jih-Chuen. The amplification was performed in a total volume of 38 μL containing 80 ng genomic DNA, 0.3 μM each primer, 4.7 mM MgCl2, 0.27 mM dNTPs, and 1 U of Taq DNA polymerase (Invitrogen by Life Technologies). The amplification was done by T-Gradient (Biometra, Germany) with denaturation at 94°C for 4 min; 40 cycles of 94°C for 30 s, 55-60°C (depending on the primer pair) for 30 s, and 72°C for 30 s; and final extension at 72°C for 4 min.
Design of CAPS markers
Nuclear amplicons that amplified two bands with length polymorphisms were directly applied as STS markers. Meanwhile, the DNA bands were sequenced by ABI PRISM 3730 DNA Analyzer (Applied Biosystems, USA) once the PCR products were less than 1 kb. For SNPs (single nucleotide polymorphism) and InDels (insertion/deletion) screening, sequence analyses were conducted with SeqMan Pro v.7.1 software (DNAStar, Inc., Madison, WI, USA). The sequences with SNPs or InDels were converted to CAPS markers by SNP2CAPS software (Thiel et al. 2004). To check restriction patterns, PCR reactions were performed in a final volume of 11.7 μL with 1× Taq buffer, 2 mM MgCl2, 0.27 mM dNTPs, 0.26 μM each primer, 1 U Taq DNA polymerase (Invitrogen by Life Technologies), and 40 ng DNA. Amplification was done by T-Gradient (Biometra, Germany) with programmed for 5 min preheating at 94°C followed by 35 cycles of 30 s at 94°C, 30 s at 55-60°C (depending on the primer pair) and 1 min at 72°C for the denaturation, annealing and extension steps, respectively. There was a final incubation for 10 min at 72°C. Amplification products were analyzed on 2% agarose gels stained with ethidium bromide to check the fragments being amplified. Amplified fragments were digested with restriction enzymes to detect CAPS and the products were resolved by electrophoresis on 2% agarose gels.
Data analysis and variety identification
The haploid and diploid types for cytoplasmic and nuclear markers were respectively scored, and each allele was assigned an alphabet for a particular primer set/enzyme combination. The polymorphism information of STS and CAPS markers was analyzed by PowerMarker v.3.25 (Liu and Muse 2005) to investigate the number of alleles and polymorphism information content (PIC) per marker.
, where Pi and Pj are the frequencies of the i th and j th alleles, and n is the number of alleles (Botstein et al. 1980).
Based on the identified STS and CAPS markers, the core markers and the flow chart for identifying 12 prevailing cultivars in Taiwan, including Chin-Shin-Oolong, TTES-12, Shy-Jih-Chuen, Chin-Shin-Dahpan, TTES-13, Chin-Shin-Gantzy, TTES-8, TTES-18, TTES-7, TTES-19, TTES-20, and TTES-21, were developed.
Genetic diversity analysis
In this study, the tea germplasm consist of three main groups, including sinensis type (S and SA), assamica type (A and AS) and wild species in Taiwan (F and FY) (shown in Table 1). The genetic diversity of those germplasm was analyzed by Popgene v.1.32 (Yeh and Boyle 1997) to estimate the observed number of alleles (NA), the effective number of alleles (Ne), the observed heterozygosity (HO), the Nei’s gene diversity (H), and Shannon’s Information index (I) per group.
Cluster analysis and principle coordinates analysis
Both the analyses of average genetic distances among three main groups and genetic distances between the pairs of germplasm were based on modified Roger’s distance (MRD) method (Wright 1978) by using TFPGA v.1.3 (Miller 1997). Upon the genetic distances between all pairwise combinations MRD, the cluster analysis and principal coordinate analysis (PCoA) were completed with NTSYSpc v.2.10 (Rohlf 1997). A dendrogram of the genetic relationships was developed by unweighted pair group method with arithmetic mean algorithm (UPGMA) using cluster analysis. The principal coordinate analysis (PCoA) was performed and the first two extracted coordinates extracted were used to derive the PCoA plot.
Polymorphism of STS and CAPS markers
The STS and CAPS markers in this study were derived from cytoplasmic genome and nuclear ESTs. From six polymorphic DNA sequences of cytoplasmic genome, 14 SNPs and same amount of InDels were screened and successfully designed for three chloroplast CAPS (C01 ~ C03) and seven mitochondria CAPS (M01 ~ M07) markers. A total of 54 nuclear EST primer pairs, including four pairs from the previous study (Kaundun and Matsumoto 20032004) and 50 pairs designed from public EST database of NCBI, as well as 27 primer pairs which amplified the expected size of amplicons. However, the remaining 27 primer pairs did not yield any scorable amplicon or yielded amplicons longer than 1 kb. In the expected size of 27 amplicons, 11 had no SNP, three had SNP (but without the restriction site), and the remaining 13 amplicons had 90 SNPs. Meanwhile, the four InDels could be successfully transferred into two STS (PAL and F3H) and 27 CAPS markers (including G01 ~ G27). For example, one SNP of an EST sequence coding zinc finger protein was designed for CAPS marker shown in Additional file 1: Figure S1. The detailed information of the two STS and 37 CAPS markers (including 10 cytoplasmic markers and 27 nuclear markers) are listed in Table 2.
A total of 98 alleles out of 39 polymorphic loci were detected in 55 germplasm. In 10 cytoplasmic CAPS loci, the average number of alleles was 2 and polymorphism information content (PIC) ranged from 0.13 (C02) to 0.35 (M06), with an average of 0.25 per locus. In 29 nuclear STS and CAPS markers, the number of alleles varied from 2 to 7, with an average of 2.7 per locus. The PIC values widely varied from 0.04 (G16) to 0.62 (G22), with an average of 0.34 per locus (Table 2).
Identification of the prevailing tea cultivars in Taiwan
Two STS and 37 CAPS markers developed in this study can be used to distinguish all 55 core germplasm in Taiwan, and their band patterns are shown in Additional file 1: Table S1. For the identification of 12 prevailing tea cultivars in Taiwan, the electrophoresis patterns of cleaved fragments in each STS and CAPS marker are shown in Additional file 1: Figure S2. In order to establish a flow chart for identifying 12 prevailing tea cultivars in Taiwan, five core markers, including M02 (mitochondria), C02 (chloroplast), G01 G03, and G04 (nuclear), were selected by variety-specific marker and PIC value. First, the sinensis type and the assamica type groups were distinguished by using the M02 marker. Secondly, the G03 and C02 can be employed to discriminate four cultivars within assamica type group, and the G03, G01 and G04 were used to separate eight cultivars within sinensis type group (Figure 1). In addition to five core markers, the remaining 34 markers could be used as a supplementary tool if more new varieties need to be identified in the future.
Genetic diversity of tea germplasm in Taiwan
On the basis of taxonomy, 55 tea germplasm in Taiwan can be divided into three classifications, including sinensis type (S and SA), assamica type (A and AS) and wild species in Taiwan (F and FY) (Table 1). The average genetic distance among the three groups are shown in Table 3. The average genetic distance between sinensis type (S and SA) and wild species (F and FY) is 0.45, and that between assamica type (A and AS) and wild species (F and FY) is 0.47. Both distances are larger than that between sinensis type (S and SA) and assamica type (A and AS) (0.28). According to the genetic distance matrix of MRD coefficients among all 55 core germplasm (Table 4 and Additional file 1: Table S2), the average distance of wild species (0.25) is less than that of sinensis type (S and SA) (0.44) and assamica type (A and AS) (0.41). Based on two indices for estimating genetic variation within the populations, involving the observed number of alleles (NA) and effective number of alleles (Ne), it showed that the assamica type (A and AS) (NA = 2.34, Ne = 1.66) were more similar to sinensis type (S and SA) (NA = 2.34, Ne = 1.77) compared to the wild species (F and FY) (NA = 1.72, Ne = 1.24) (Table 4). It reveals that the genetic diversity within the cultivated species (C. sinensis) is higher than that within wild species in Taiwan (C. formosensis). Besides, larger parameters on the observed heterozygosity (HO), Nei’s gene diversity (H) and Shannon’s Information index (I), were detected in both assamica type (A and AS) (HO = 0.35, H = 0.34, I = 0.55) and sinensis type (S and SA) (HO = 0.38, H = 0.36, I = 0.58) compared to the wild species (F and FY) (HO = 0.16, H = 0.15, I = 0.26) (Table 4). It also demonstrated that the cultivated species (C. sinensis) had greater genetic diversity than the wild species (C. formosensis).
Cluster analysis and principle coordinates analysis of tea germplasm in Taiwan
The genetic distances between all pairwise combinations were listed in Additional file 1: Table S2. The values among 55 surveyed germplasm in Taiwan ranged from 0.08 to 0.69, with an average value of 0.49. Among the germplasm of 47 cultivated tea (C. sinensis), the average values was 0.47. If only 37 sinensis type tea (S and SA) were surveyed, the genetic distances among this group ranged from 0.11 (TTES-14 and TTES-15) to 0.62 (Chin-Shin-Oolong and TTES-17), with an average value of 0.44. As for 14 assamica type tea (A and AS), the genetic distances ranged from 0.08 (TTES-8 and Jaipuri; Shan-1 and Shan-2) to 0.58 (Shan and Manipuri), with an average value of 0.41. However, the genetic distances among eight wild species (F and FY) were relatively small, ranging from 0.11 (Long-Tou wild tea and Le-Ye wild tea; Ming-Hai wild tea and Nan-Fong wild tea) to 0.38 (De-Hua-She wild tea and Yung-Kang wild tea), with an average value of 0.25.
In PCoA based on MRD estimates of all 55 germplasm, the first, second and third principle coordinates (abbreviated to PC1, PC2 and PC3) explained 24.5%, 15.9% and 11.3% of the molecular variance, respectively, while the cumulative contribution was 51.8%. The first two principle coordinates were used to develop the PCoA plot shown in Figure 2. In the PC1, the 55 tea germplasm were divided into two major groups, cultivated tea (C. sinensis) and wild species in Taiwan (C. formosensis). In the PC2, the cultivated tea (C. sinensis) were divided into two major groups, sinensis type (S and SA) and assamica type (A and AS).
The UPGMA dendrogram was constructed to separate the 55 germplasm into three major groups (Figure 3). Based on genetic distance coefficient of 0.57, the first group (GroupI) including C. formosensis could be isolated from the cultivated germplasm (C. sinensis). When the coefficient was reduced to 0.51, the assamica type (A and AS) (GroupII) and the sinensis type (S and SA) (GroupIII) germplasm were distinguished. Group III was divided into three subgroups, namely Group IIIa, Group IIIb, and Group IIIc. Many famous varieties belonged to Group IIIa including Chin-Shin-Oolong, Shy-Jih-Chuen, Bair-Mau-Hour, Wuu-Yi, Horng-Shin-Dahpan, and its derived varieties (TTES-3, TTES-4, and TTES-9). The Group IIIb comprised of Tiee-Guan-In, Hwang-Gan, and its derived varieties (TTES-10, TTES-12, TTES-14, TTES-15, and TTES-19). The Group IIIc, on the other hand, contained Chin-Shin-Gantzy, Chin-Shin-Dahpan and its derived variety (TTES-1), Dah-Yeh-Oolong, and its derived varieties (TTES-2, TTES-11, TTES-16, TTES-17, and TTES-20), Ying-Jy-Horng-Shin its derived variety (TTES-13).
Polymorphism of STS and CAPS markers
In this study, 11 nuclear CAPS markers including G03, G12, G14, G15, G17, G18, G20, G22, G23 and G26 showed multi-allele patterns, while the others had only two alleles (bi-allele) (Table 2). There was only one restriction site within each CAPS locus resulting in the bi-allele markers, and their genotypes were easily scored and interpreted. Otherwise, the multi-allele markers were based on different point mutation positions within the locus that had more than two restriction sites. They yielded more complicated genotypes but may still be considered very useful. For example, the multi-allele CAPS markers could be used widely in pepper breeding for viral resistance (Yeam et al. 2005).
Polymorphism information content (PIC) means different informative levels of a locus and it also implies the genetic variation of a marker. The value larger than 0.5, ranging from 0.25 to 0.5, and smaller than 0.25 suggest that the locus is highly informative, reasonably informative, and slightly informative, respectively (Botstein et al. 1980). Of all the 39 cytoplasmic and nuclear markers examined in this study, the PIC ranged from 0.04 to 0.62, with an average of 0.32. The PIC of 10 cytoplasmic markers was 0.25, and seven of them were reasonably informative. Otherwise, the remaining three were slightly informative (Table 2). The 29 nuclear markers had an averaged PIC of 0.34, in which six were found to be highly informative, 16 were reasonably informative, and the remaining seven were slightly informative. The averaged PIC of the nuclear markers was higher than the cytoplasmic, and the average of the mtDNA markers (0.29) was higher than the cpDNA (0.18) (Table 2). Similar results were also reported by Ishii’s group, in which they found that the nuclear microsatellites (the averaged PIC is 0.89) had higher PIC values than the chloroplast microsatellites (the averaged PIC is 0.38) among A-genome species of rice (Ishii et al. 2001). Because the variation of cytoplasmic markers are lower than nuclear markers, the former could be used to examine relationship among distant-related taxa, and the latter are more suitable for the assessment of genetic diversity of close- related taxa.
In our previous study, the observed number of EST-SSR alleles (NA) per locus was 5.6 (Hu et al. 2011). However, in this study, the values of STS and CAPS markers derived from cytoplasmic and nuclear were 2.00 and 2.69, respectively. The PIC per locus for EST-SSR (0.62) was higher than those of STS and CAPS from cytoplasmic (0.25) and nuclear (0.34). Because small size difference between polymorphic bands was shown in the EST-SSR markers, there was high resolution of agarose gel, polyacrylamide gel electrophoresis or Genetic Analyzer (Hu et al. 2011). However, large size difference between polymorphic bands was found in STS or CAPS markers, and it suggested that only less expensive agarose gel was needed to obtain accurate data.
Identification of 12 prevailing tea cultivars in Taiwan
In this study, 12 dominant cultivars were selected for variety identification based on the following criteria: (1) the acreage under cultivation of each variety; (2) the variety suitable for manufacturing unique tea; and (3) the newly bred varieties. According to statistics data from Tea Research and Extension Station in 2011, these 12 cultivars take over 98% acreage of Taiwan. Of these 12 cultivars, Chin-Shin-Oolong, TTES-12, Shy-Jih-Chuen, Chih-Shih-Dahpan, and TTES-13 are the top five cultivars in Taiwan that has been found to be suitable for both Paochong tea and Oolong tea. Shy-Jih-Chuen and Chih-Shih-Dahpan are mainly grown in Nantou County and north-west region of Taiwan, respectively, while others are distributed around Taiwan (estimated by Tea Research and Extension Station in 2011). Besides, Chin-Shin-Gantzy is fitted for green tea, and TTES-18, TTES-8, and TTES-7 are the excellent cultivars for making black tea. Chin-Shin-Gantzy is cultivated in New Taipei City, and the other three cultivars are mainly planted in Nantou County (estimated by Tea Research and Extension Station in 2011). In addition, varieties TTES-19 and TTES-20 were bred for manufacturing Paochong and Oolong tea, having been protected by the “Plant Variety and Plant Seed Act” in Taiwan since 2004 (Tsai et al. 2004a). TTES-21, on the other hand, was designated in 2008 for black tea procession (Chiu et al. 2009). These cultivars are most urgently desirable for variety identification in Taiwan.
Tea commercial products are manufactured through the application ofhigh temperature and the use of fermentation treatments at a panning step. These processes could eventually lead to dramatic DNA degradation. Additionally, tea merchants or farmers often blend the tea with different varieties to increase its flavor or reduce material cost. To solve the above problems, we have reported that DNA markers less than 1 kb are less affected by procession treatments and are useful for variety identification. Moreover, the chloroplast DNA markers with haploid genotypes and maternal inheritance could be effectively applied to identify the mixed-varieties of tea products (Hu et al. 2006). Since most STS and CAPS markers in this study are less than 850 bp, they may have application potential in identifying different varieties or mixed-varieties of processed tea.
Genetic diversity of tea germplasm in Taiwan
The consistent results of germplasm classification were found in the principal coordinate analysis and cluster analysis. A total of 55 germplasm can be divided into three groups: sinensis type (S and SA), assamica type (A and AS) and Taiwan wild species (F and FY). The sinensis type (S and SA) and assamica type (A and AS) are generally called cultivated tea (C. sinensis). The former is a shrub with small leaves and can withstand cold climates; while the latter has tall tree-like structure with large leaves and is suitable for warm tropical climates (Banerjee 1992). Besides, the latter has more flavanols content so it was found to be more suitable for making black tea. Meanwhile, the sinensis type has been found to be suitable for manufacturing green tea or Oolong tea (Takeo 1992). In Taiwan, cultivated tea is mainly distributed in Nantou County (48.4%), Chiayi County (15.5%) and New Taipei City (10.6%) (Council of Agriculture 2012). The assamica type tea retains about 3.9% acreage which is mainly distributed in Nantou County, while the sinensis type is about 96.1% which is widely distributed in Taiwan (estimated by Tea Research and Extension Station in 2011). On the other hand, wild tea species is distributed in the central, southern and eastern regions of Taiwan. Various names have been given to the wild species, and Camellia formosensis is the official name based on the RPB2 (large sub-unit of RNA polymerase) gene of nuclear DNA sequence and morphological analyses (Su et al. 2007; Su et al. 2009). It can be well distinguished from cultivated tea (C. sinensis) by the glabrous ovaries and winter buds (Su et al. 2007). In this study, the results of both principal coordinate analysis and cluster analysis have supported that the wild species (C. formosensis) is monophyletic and independent from the cultivated tea (C. sinensis).
The genetic diversity can be accessed by many parameters. The NA (observed number of alleles) is a count of the mean number of alleles with nonzero frequency across loci; the Ne (effective number of alleles) is an estimate of the mean number of equally frequent alleles in an ideal population; the Ho (observed heterozygosity) is an estimate proportion of observed heterozygotes at a given locus; the H (Nei’s gene diversity) is estimated proportion of expected heterozygotes under random mating; the I (Shannon information index) is an index as a measure of gene diversity (Yeh and Boyle 1997). According to the genetic diversity analysis, all parameters or indices showed that higher genetic diversity or genetic variation were detected in the sinensis type (S and SA) and the assamica type (A and AS) than wild species (Table 4). One possible explanation is that the cultivated tea (A, AS, S and SA) originated from diverse regions (China, Myanmar, Thailand, India, and so on) and had frequent inter-crossings. However, genetic recombination only occurred in a limited local wild species in Taiwan. This rationalization differs from that of Lai et al. (2001), in which they used RAPD and ISSR markers to evaluate the gene diversity of 37 tea samples in Taiwan. They reported that the native Taiwan wild species had the highest genetic diversity, followed by the sinensis type and the assamica type (Lai et al. 2001). There are two contrarieties that could be raised against this: first, two (Laitou and Shueijing wild tea) of six native Taiwan wild tea samples in Lai et al. (2001) are C. furfuracea instead of C. formosensis authenticated by Su (2007). This would lead to overestimate the diversity of native wild species. Second, all of three assamica varieties surveyed in Lai et al. (2001) merely originated from India, which are not representative of the tea wild species.
The tea industry in Taiwan began in the Jiaqing era of Ching Dynasty (AD 1796 to 1820), and a few tea varieties were introduced from China (Jun 1997). During the Japanese occupation period (AD 1896 to 1945), four landraces including Chin-Shin-Oolong, Dah-Yeh-Oolong, Chin-Shin-Dahpan, and Ying-Jy-Horng-Shin were recommended to the tea farmers. In addition, Hwang-Gan and Horng-Shin-Dahpan were also the prevailing cultivars at that time. Since 1945, the above six varieties have been used as female parents for hybridization breeding (Sanui 2011; Shyu and Juan 1993). According to cluster analysis in this study (Figure 3), Chin-Shin-Oolong and Horng-Shin-Dahpan belonged to Group IIIa, Hwang-Gan was classified in Group IIIb, and the remaining three landraces (Dah-Yeh-Oolong, Chin-Shin-Dahpan, Ying-Jy-Horng-Shin) were categorized in Group IIIc. However, all of these six varieties were introduced from Fukien or Guangdong of China (Sanui 2011).
Genetic vulnerability is a common problem in most of the tea-production countries, because only a few specific varieties are grown in large-scale and not many varieties have been used as the breeding parents (Yao et al. 2008). For example, a famous cultivar Yabukita contributes more than 80% of its tea acreage in Japan for making green tea (Kaundun and Matsumoto 2004). Besides, the other prevailing varieties including Kanayamidori, Sayamakaori, Saemidori, Okumidori, Meiryoku etc. were selected from Yabukita (Tanaka 2012). This could possibly lead some alleles to be eliminated and result in genetic erosion when most cultivars are replaced by a few varieties. Once the dramatically biotic or abiotic stress occurs, it is more likely to cause reduction in the production of the same or close-related cultivars, which could induce a crisis in the tea industry, leading to its possible collapse. In fact, a similar problem also exists in Taiwan. The top three prevailing cultivars in Taiwan take over 84.2% acreage including Chin-Shin-Oolong (57.3%), TTES-12 (13.7%) and Shy-Jih-Chuen (13.2%) (estimated by Tea Research and Extension Station in 2011). According to leaf morphological characters and ISSR DNA markers, a high similarity between Chin-Shin-Oolong and Shy-Jih-Chuen was found previously (Hu 2004). In this study, the genetic distance between these two cultivars (0.21) is far below the average (0.49) (Table 4 and Additional file 1: Table S2), and the alleles of all 10 cytoplasmic markers are identical (Additional file 1: Table S1). It was confirmed that Shy-Jih-Chuen originated from Chin-Shin-Oolong. In addition, these two cultivars accounting for 70.5% of all tea plantations in Taiwan, and Chin-Shin-Oolong is also the male parent of another two new varieties, TTES-19 and TTES-20, which were released in 2004 (Tsai et al. 2004a). In order to avoid the genetic vulnerability and increase the genetic diversity of tea varieties in Taiwan, the new parental lines could be referred to as the dendrogram of cluster analysis in this study (Figure 3). The elite parents from different geographical origins or genetic background could also be chosen.
Tea is an important economic crop in Taiwan. Attributed to different eras and production areas, many unique types of tea have been expanded in the island, and accordingly, various genetic resources including introduced varieties, landraces, bred varieties and wild species were adopted. In order to develop a stable, fast and reliable marker system for variety identification and assessing genetic diversity of germplasm in Taiwan, 37 CAPS and two STS markers were successfully designed. Above all, five core markers have been found to be sufficient in identifying the prevailing varieties.
According to the genetic diversity analysis, principal coordinate analysis and cluster analysis on tea germplasm in Taiwan, three points of perception have been proposed. First, the high genetic diversity was found between the cultivated (C. sinensis) and wild species (C. formosensis) in Taiwan, although the genetic resources of wild species have not been used very well. Next, the genetic diversity of wild species among different areas of Taiwan was relatively small. Finally, the genetic relationship among the top prevailing cultivars is too close. Therefore, broadening the genetic diversity of the tea varieties is necessary for tea breeding in Taiwan.
Amplified fragment length polymorphism
Cleaved amplified polymorphic sequence
Expressed sequence tag
Expressed sequence tag - simple sequence repeat
Inter-simple sequence repeat
Modified Roger’s distance
Principal coordinate analysis
Polymerase chain reaction - restriction fragment length polymorphism
Polymorphism information content
Randomly amplified polymorphic DNA
Single nucleotide polymorphism
Sequence tagged site
Unweighted pair group method with arithmetic mean algorithm.
Bandyopadhyay T: Molecular marker technology in genetic improvement of tea. Int J Plant Breed Genet 2011, 5(1):23–33.
Banerjee B: Botanical classification of tea. In Tea: cultivation to consumption. Edited by: Willson KC, Clifford MN. London: Chapman & Hall Press; 1992:25–51.
Barua PK: Classification of the tea plant. Two and a Bud 1963, 10: 3–11.
Botstein D, White RL, Skolnick M, Davis RW: Construction of a genetic linkage map in man using restriction fragment length polymorphisms. Am J Hum Genet 1980, 32(3):314–331.
Chiu TF: Tea production and research in Taiwan. In Recent development in tea production. Edited by: Chiu TF, Wang CH. Taoyuan, Taiwan: Taiwan Tea Experiment Station; 1988:121–129.
Chiu CF, Lin JC, Huang CC, Lin JH, Shiau JH: New cultivar of black tea - TTES No. 21. Taiwan Tea Res Bull 2009, 28: 1–18. (in Chinese with English abstract) (in Chinese with English abstract)
Collard BCY, Jahufer MZZ, Brouwer JB, Pang ECK: An introduction to markers, quantitative trait loci (QTL) mapping and marker-assisted selection for crop improvement: the basic concepts. Euphytica 2005, 142(1–2):169–196.
Council of Agriculture: Agricultural statistics yearbook in 2011. Taiwan, R.O.C., Taipei, Taiwan: Council of Agriculture, Executive Yuan; 2012.
Doyle JJ, Doyle JL: Isolation of plant DNA from fresh tissue. Focus 1990, 12: 13–15.
Drenkard E, Glazebrook J, Preuss D, Ausubel FM: Use of cleaved amplified polymorphic sequences (CAPS) for genetic mapping and typing. In DNA markers: protocols, applications, and overviews. Edited by: Caetano-Anolles G, Gresshoff PM. New York: Wiley-Liss, Inc.; 1997:187–198.
Gunasekare MTK: Applications of molecular markers to the genetic improvement of Camellia sinensis L. (tea) - a review. J Hortic Sci Biotechnol 2007, 82(2):161–169.
Hu CY: Studies on the variations in leaf characters and DNA sequences of tea germplasm in Taiwan. Taipei, Taiwan: Master thesis, National Taiwan University; 2004. (in Chinese with English abstract) (in Chinese with English abstract)
Hu CY, Tsai YZ, Lin SF: Using ISSR DNA markers to evaluate genetic diversity of tea germplasm in Taiwan. J Agri Assoc China 2005, 6(5):463–480. (in Chinese with English abstract) (in Chinese with English abstract)
Hu CY, Tsai YZ, Lin SF: Evaluating the feasibility of molecular identification for made tea varieties. J Agri Assoc China 2006, 7(5):499–510. (in Chinese with English abstract) (in Chinese with English abstract)
Hu CY, Lin YC, Hsieh WT, Tseng YH, Lin SF, Tsai YZ: Using EST-SSR markers to identify tea ( Camellia sinensis ) cultivars in Taiwan. Taiwan Tea Res Bull 2011, 30: 9–22. (in Chinese with English abstract) (in Chinese with English abstract)
Ishii T, McCouch SR, Xu Y: Nuclear- and chloroplast-microsatellite variation in A-genome species of rice. Genome 2001, 44: 658–666. 10.1139/g01-044
Jones N, Ougham H, Thomas H, Pasakinskiene I: Markers and mapping revisited: finding your gene. New Phytol 2009, 183(4):935–966. 10.1111/j.1469-8137.2009.02933.x
Jun IM: The culture and science of tea in Taiwan. Taiwan Tea Res Bull 1997, 16: 79–85. (in Chinese with English abstract) (in Chinese with English abstract)
Jun IM, Lin ML: Present status of tea industry in Taiwan. Taiwan Tea Res Bull 1997, 16: 87–97.
Katoh Y, Katoh M, Takeda Y, Omori M: Genetic diversity within cultivated teas based on nucleotide sequence comparison of ribosomal RNA maturase in chloroplast DNA. Euphytica 2003, 134(3):287–295.
Kaundun SS, Matsumoto S: Heterologous nuclear and chloroplast microsatellite amplification and variation in tea, Camellia sinensis . Genome 2002, 45(6):1041–1048. doi:Doi 10.1139/G02–070 doi:Doi 10.1139/G02-070 10.1139/g02-070
Kaundun SS, Matsumoto S: Identification of processed Japanese green tea based on polymorphisms generated by STS-RFLP analysis. J Agric Food Chem 2003, 51(7):1765–1770. 10.1021/jf020821i
Kaundun SS, Matsumoto S: PCR-based amplicon length polymorphisms (ALPs) at microsatellite loci and indels from non-coding DNA regions of cloned genes as a means of authenticating commercial Japanese green teas. J Sci Food Agric 2004, 84(8):895–902. 10.1002/jsfa.1665
Kaundun S, Matsumoto S: Molecular evidence for maternal inheritance of the chloroplast genome in tea, Camellia sinensis (L.) O. Kuntze. J Sci Food Agric 2011, 91(14):2660–2663. 10.1002/jsfa.4508
Konieczny A, Ausubel FM: A procedure for mapping Arabidopsis mutations using co-dominant ecotype-specific PCR-based markers. Plant J 1993, 4(2):403–410. 10.1046/j.1365-313X.1993.04020403.x
Lai JA, Yang WC, Hsiao JY: An assessment of genetic relationships in cultivated tea clones and native wild tea in Taiwan using RAPD and ISSR maskers. Bot Bull Acad Sin 2001, 42(2):93–100.
Lin SY, Chen IZ, Tsai CM, Chen YL: Detection of genetic relationship in Taiwan tea variety ( Camellia sinensis (L.) O. Kuntze) with RAPD markers. J Chinese Soc Hort Sci 2005, 51(4):357–366. (in Chinese with English abstract) (in Chinese with English abstract)
Liu K, Muse SV: PowerMarker: an integrated analysis environment for genetic marker analysis. Bioinformatics 2005, 21(9):2128–2129. 10.1093/bioinformatics/bti282
Miller MP: TFPGA (Tools for population genetic analyses) ver. 1.3. Flagstaff, Arizona: Northern Arizona University; 1997.
Olson M, Hood L, Cantor C, Botstein D: A common language for physical mapping of the human genome. Science 1989, 245(4925):1434–1435. 10.1126/science.2781285
Palmer JD: Mitichondrial DNA in plant Systematics: applications and limitations. In Molecular systematics of plants. Edited by: Soltis PS, Soltis DE, Doyle JJ. New York: Chapman & Hall; 1992:36–49.
Powell W, Machray GC, Provan J: Polymorphism revealed by simple sequence repeats. Trends Plant Sci 1996, 1(7):215–222. 10.1016/1360-1385(96)86898-1
Rohlf FJ: NTSYS-pc: numerical taxonomy and multivariate analysis system. Setauket, New York, USA: Exeter Software; 1997.
Rozen S, Skaletsky HJ: Primer3 on the WWW for general users and for biologist programmers. In Bioinformatics methods and protocols: methods in molecular biology. Edited by: Krawetz S, Misener S. Totowa, NJ: Humana Press; 2000:365–386.
Sanui H: Tea breeding in Taiwan. In Tea breeding in Taiwan during Japanese occupation period. Edited by: Shyu YS. Tea Research and Extension Station: Taoyaun, Taiwan; 2011:4–51. (in Chinese) (in Chinese)
Schaal BA, Olsen KM: Gene genealogies and population variation in plants. Proc Natl Acad Sci USA 2000, 97(13):7024–7029. 10.1073/pnas.97.13.7024
Shu Y, Li Y, Zhu Y, Zhu Z, Lv D, Bai X, Cai H, Ji W, Guo D: Genome-wide identification of intron fragment insertion mutations and their potential use as SCAR molecular markers in the soybean. Theor Appl Genet 2010, 121(1):1–8. 10.1007/s00122-010-1285-x
Shyu YS, Juan IM: Retrospect of tea breeding in Taiwan. Taiwan Tea Res Bull 1993, 12: 1–17. (in Chinese with English abstract) (in Chinese with English abstract)
Su MH: Taxonomic study of Camellia formosensis (Masamune et Suzuki) M. H. Su, C. F. Hsieh et C. H. Tsou (Theaceae). Taipei, Taiwan: Doctoral dissertation, National Taiwan University; 2007. in Chinese with English abstract in Chinese with English abstract
Su MH, Tsou CH, Hsieh CF: Morphological comparisons of Taiwan native wild tea plant ( Camellia sinensis (L.) O. Kuntze forma formosensis Kitamura) and two closely related taxa using numerical methods. Taiwania 2007, 52(1):70–83.
Su MH, Hsieh CF, Tsou CH: The confirmation of Camellia formosensis (Theaceae) as an independent species based on DNA equence analyses. Bot Stud 2009, 50(4):477–485.
Takeo T: Green and semi-fermented teas. In Tea: cultivation to consumption. Edited by: Willson KC, Clifford MN. London: Chapman & Hall Press; 1992:413–457.
Tanaka J: Japanese tea breeding history and the future perspective. In Global tea breeding: achievements, challenges and perspectives. Edited by: Chen L, Apostolides Z, Chen Z-M. Zhejiang, China: Zhejiang University Press; 2012:227–239.
Thiel T, Kota R, Grosse I, Stein N, Graner A: SNP2CAPS: a SNP and INDEL analysis tool for CAPS marker development. Nucleic Acids Res 2004, 32(1):e5. 10.1093/nar/gnh006
Tsai HT, Tsai IC, Liaw WR, Chang CK, Wang YW: Study on the genetic diversity among the selected Taiwan tea cultivars/lines using AFLP and RAPD markers. Taiwan Tea Res Bull 2003, 22: 17–32. (in Chinese with English abstract) (in Chinese with English abstract)
Tsai CM, Chang CK, Chen IZ, Chen KR, Tsai YZ, Chiou CF, Lin IC, Fan HJ: Breeding report of two 2004 registered new tea cultivar TTES No. 19 and 20. Taiwan Tea Res Bull 2004, 23: 57–78. (in Chinese with English abstract) (in Chinese with English abstract)
Tsai YS, Liu SL, Wang HF, Ou SM: Comparison of catechins contents and antioxidant activity of green teas from Taiwan major tea cultural cultivars. Taiwan Tea Res Bull 2004, 23: 115–132. (in Chinese with English abstract) (in Chinese with English abstract)
Wright S: Evolution and the genetics of populations., vol 4. Variability within and among natural populations. Chicago: University of Chicago Press; 1978.
Yao MZ, Chen L, Liang YR: Genetic diversity among tea cultivars from China, Japan and Kenya revealed by ISSR markers and its implication for parental selection in tea breeding programmes. Plant Breed 2008, 127(2):166–172. doi:10.1111/j.1439–0523.2007.01448.x doi:10.1111/j.1439-0523.2007.01448.x 10.1111/j.1439-0523.2007.01448.x
Yeam I, Faber N, Jahn MM, Frantz JD, Kang BC, Lindeman W: Allele-specific CAPS markers based on point mutations in resistance alleles at the pvr1 locus encoding eIF4E in Capsicum. Theor Appl Genet 2005, 112: 178–186. 10.1007/s00122-005-0120-2
Yeh FC, Boyle TJB: Population genetic analysis of co-dominant and dominant markers and quantitative traits. Belg J Bot 1997, 129: 157.
The authors declare that they have no competing interests.
C-Y H designed primers, developed markers, analyzed data, and wrote the manuscript. Y-Z T contributed suggestions and discussions for this study. S-F L provided guidance and editorial assistance for the manuscript. All authors read and approved the final manuscript.
Electronic supplementary material
Additional file 1: Table S1:The band patterns of each STS and CAPS marker for all 55 core tea germplasm in Taiwan. Table S2. Matrix of genetic distance among pairs of 55 tea germplasm in Taiwan based on modified Roger’s distance coefficients. Figure S1. A. Partial nucleotide sequences of three cultivars amplified with G01 primer set, and arrow points indicated SNP sites. B. G01 CAPS marker designed from SNP sites of G01 primer set. Figure S2. The cleaved fragment patterns of each STS and CAPS marker for 12 prevailing tea cultivars in Taiwan. (DOCX 3 MB)
About this article
Cite this article
Hu, CY., Tsai, YZ. & Lin, SF. Development of STS and CAPS markers for variety identification and genetic diversity analysis of tea germplasm in Taiwan. Bot Stud 55, 12 (2014). https://doi.org/10.1186/1999-3110-55-12