Using a zero-inflated model to assess gene flow risk and coexistence of Brassica napus L. and Brassica rapa L. on a field scale in Taiwan

Su, Yuan-Chih; Wang, Po-Shung; Yang, Jhih-Ling; Hong, Hong; Lin, Tzu-Kai; Tu, Yuan-Kai; Kuo, Bo-Jein

doi:10.1186/s40529-020-00294-2

Original Article
Open access
Published: 20 May 2020

Using a zero-inflated model to assess gene flow risk and coexistence of Brassica napus L. and Brassica rapa L. on a field scale in Taiwan

Yuan-Chih Su¹,
Po-Shung Wang¹,
Jhih-Ling Yang¹,
Hong Hong¹,
Tzu-Kai Lin²,
Yuan-Kai Tu^1,3 &
…
Bo-Jein Kuo ORCID: orcid.org/0000-0001-5144-9586^1,4

Botanical Studies volume 61, Article number: 17 (2020) Cite this article

2936 Accesses
3 Citations
1 Altmetric
Metrics details

Abstract

Background

The cropping area of genetically modified (GM) crops has constantly increased since 1996. However, currently, cultivating GM crops is associated with many concerns. Transgenes are transferred to non-GM crops through pollen-mediated gene flow, which causes environmental problems such as superweeds and introgressive hybridization. Rapeseed (Brassica napus L.), which has many GM varieties, is one of the most crucial oil crops in the world. Hybridization between Brassica species occurs spontaneously. B. rapa grows in fields as a weed and is cultivated as a crop for various purposes. Both B. rapa weeds and crops participate in gene flow among rapeseed. Therefore, gene flow risk and the coexistence of these two species should be studied.

Results

In this study, field experiments were conducted at two sites for 4 years to evaluate gene flow risk. In addition, zero-inflated models were used to address the problem of excess zero values and data overdispersion. The difference in the number of cross-pollination (CP) events was nonsignificant between upwind and downwind plots. The CP rate decreased as the distance increased. The average CP rates at distances of 0.35 and 12.95 m were 2.78% and 0.028%, respectively. In our results, zero-inflated negative binomial models were comprehensively superior to zero-inflated Poisson models. The models predicted isolation distances of approximately 1.36 and 0.43 m for the 0.9% and 3% threshold labeling levels, respectively.

Conclusions

Cultivating GM crops is prohibited in Taiwan; however, the study results can provide a reference for the assessment of gene flow risk and the coexistence of these two species in Asian countries establishing policies for GM crops.

Background

The acreage of global genetically modified (GM) crops has increased to approximately 191,700,000 ha since 1996 (ISAAA 2018). The most common GM crops are maize, soybean, cotton, and canola. GM crops can be classified according to herbicide tolerance (HT), insect resistance, stacked traits, virus tolerance, and other traits; HT GM crops are the most common. Because of the increasing world population, GM crops are considered a solution for ensuring the food security of the world population (Taheri et al. 2017). For example, HT GM crops can provide convenient weed control at a relatively low price (Brookes and Barfoot 2016). Although GM crops have benefits, some issues should be considered. GM crop cultivation is associated with several concerns, including biodiversity, economics, agricultural production, and consumer choice (Smyth et al. 2002). GM crops affect non-GM crops through gene flow and cause the contamination of non-GM crops with transgenes. Therefore, many countries have established a threshold of GM content among non-GM products. The strictest threshold is 0.9% in the Regulation (EC) No. 1830/2003 of the European Union (EU). Therefore, the coexistence of GM and non-GM crops is an issue that must be discussed.

Rapeseed (Brassica napus L.) is a cross-pollinated crop of the Brassica genus that is typically pollinated by insects. Bees (Apoidea superfamily) are its main pollinator (Scheffler et al. 1995). Although B. napus is typically pollinated by insects, studies have indicated that B. napus can be pollinated without insects (Eisikowitch 1981). Research on gene flow between non-GM and GM B. napus has been conducted in the past few years (Beckie et al. 2003). There is evidence that pollination occurs between B. napus and its related species (Warwick et al. 2003), and the probability of GM genes being transferring to related species should be examined. Gene flow between B. napus and B. juncea L. was evaluated in a previous study (Zhang et al. 2018a). Studies have also reported that spontaneous hybridization is more likely to occur between B. napus and Brassica. rapa L. than between B. napus and other Brassica crops (Landbo et al. 1996). A wild B. rape population near a B. napus field was revealed to have a hybridization rate of 1.1–17.5% (Simard et al. 2006). Furthermore, a study indicated that introgression hybridization may have occurred between B. napus and B. rapa (Hansen et al. 2001). Hence, the risk of gene flow between B. napus and B. rapa is relatively higher than that between B. napus and other Brassica species.

The most common measure used for determining the coexistence of GM and non-GM B. napus is isolation distance. Models that fit the relationship between the cross-pollination (CP) rate and isolation distance have been developed in previous studies, and the optimal isolation distance can be derived from the model with the best fit (Funk et al. 2006; Walklate et al. 2004; Weekes et al. 2005). The pollen dispersal model can be divided into empirical and mechanistic models. Because mechanistic models are difficult to set up for insect pollination, the B. napus pollen dispersal model is classified as an empirical model (Klein et al. 2006). The variability of data from dispersal experiments is typically great (Bensadoun et al. 2016). Data are overdispersed when the observed variance is higher than the theoretical variance because of the excess zero values in the observed dispersal count data. To fit this type of data, the zero-inflated Poisson (ZIP) distribution is an appropriate method (Bensadoun et al. 2014).

Small farming systems are common in many Asian countries. In Taiwan, fields are small and scattered (Hsu 2014). An average of 0.3 ha of agricultural land is owned by each person among farming families (Council of Agriculture 2017). Gene flow in Asian farm systems has not been thoroughly studied. Therefore, establishing an optimal field design for GM and non-GM crops to coexist would be beneficial for Asian agricultural development. Few studies have assessed the coexistence of B. napus and B. rapa on a small field scale. In Taiwan, B. rapa is cultivated in fields as a green manure, vegetable, or honey plant. Therefore, adjacent fields of B. rapa and GM B. napus may cause unexpected gene flow between these species. This study provides new insights into gene flow between non-GM B. rapa and GM B. napus and how the wind direction and distance affect gene flow during a 4-year experiment. Models that fit the CP rate (%) were also developed. This study provides a valuable reference for researchers and growers interested in preventing gene flow in coexisting of non-GM B. rapa and GM B. napus.

Materials and methods

Plant materials

Non-GM B. napus “Deza oil No. 18” (AACC, 2n = 38) was used as the pollen donor in this study. This cultivar has recessive genetic male sterility and is double cross variety, and its growth period is approximately 224 days. The pollen recipient plant was the open-pollination variety (Nongxing 80-day) of B. rapa (AA, 2n = 20), which is mainly used as a green manure crop in Taiwan. B. napus seedlings were treated with vernalization to ensure flowering in Taiwan. B. napus seedlings were cooled to 4 °C for at least 30 days. After B. napus vernalization, B. napus and B. rapa seedlings were planted in 128-well plastic trays in a greenhouse. The seedlings were transplanted to a field until the five-leaf stage.

Experiment design

The pollen dispersal experiments were conducted from the fall of 2013 to the spring of 2017 at Taiwan Agricultural Research Institute (TARI), Council of Agriculture (COA), Executive Yuan (24° 03′ N, 120° 69′ E), and Agricultural Experiment Station (AES), College of Agriculture and Natural Resources, National Chung Hsing University (24° 07′ N, 120° 71′ E). The experiments were replicated eight times, four times for each site. The total area of the two experimental sites was approximately 0.054 ha (36 × 15 m²; Fig. 1; Hong et al. 2016; Su 2015; Wang 2017; Yang 2018).

The two pollen recipient plots were located next to the pollen donor plot to simulate adjacent field arrangements in Taiwan (Nieh et al. 2014). The field design of the experiment was established at TARI, where the two recipient plots were located on the north and south sides of the source plot. At the AES site, the two recipient plots were set up on the west and east sides of the source plot. Each experimental field had 12 furrows, and each furrow had two rows. There were 696 and 1776 B. napus and B. rapa plants in each field, respectively. Blooming was controlled through cutting early flowers to assure flower synchronization.

Meteorological information was recorded by a weather station located at TARI. The daily maximum frequency of the wind direction was taken as the field prevailing wind direction of each day. The proportion of each wind direction during the flowering period was defined as the field prevailing wind direction.

The recipient plants were sampled in two rows of each furrow (except the first and last furrow) at different distances. The sampling distance was in the range of 0.35–12.95 m at 0.7-m intervals. One or two flower stalks were cut for each plant. Mature pods were dried, threshed, and stored for inspecting the hybridization of recipient plants.

Hybrid progeny screening

A previous study discovered that the hybrids of B. rapa × B. napus could be distinguished from their parents through morphology (Jørgensen and Andersen 1994; Lu et al. 2001; Tu et al. 2020). The morphology characteristics of B. napus, B. rapa and B. rapa × B. napus (F1) were described in Tu et al. (2020). The difference between F1 hybrid and parents also showed in the genome size and molecular marker (Tu et al. 2020). In this study, leaf characteristics were used to differentiate between hybrid and nonhybrid progenies at the two-leaf stage. The hybrid leaves were circular, dark green, and displayed a trichome and strong dentation at the margin (Fig. 2a, b). By contrast, the nonhybrid leaves were thin oval shape, light green (Fig. 2c, d).

For each sample, 384 seeds were sowed in plastic trays, and the number of hybrid progenies was counted. The CP rate (%) was calculated by counting the number of outcrossing progenies in each seeding sample, as follows (Eq. 1):

$$ {\text{CP }}\left( {\text{\% }} \right) = \frac{{n_{c} }}{N} \times 100\% $$

(1)

where n_c is the number of hybrid progenies, and N is the total seedling number of the sample. Because of model fitting requirements, the CP rates were transformed into count data by multiplying them by 384 and rounding the value.

Zero-inflated model

According to previous studies, the CP rate decreases with increasing distance (Beckie et al. 2003; Damgaard and Kjellsson 2005). Therefore, this may result in a relatively large number of zero values in the CP data. Most of models typically demonstrate poor predictive performance when fitted with excess zero values (Rodriguez 2013). The zero-inflated model has been proposed to address the problem of excess zero-count data (Greene 1994; Lambert 1992).

The ZIP model is a model consisting of a fixed zero count and a Poisson distribution. The ZIP model increases the probability of the occurrence of zero values to address excess zero counts. Assume that the probability of zero counts is π_i, and the response variable Y_i, i = 1, 2, 3…, n, is a counting variable with a probability density function (pdf; Eq. 2):

$$ {\text{P}}\left( {Y_{i} = y_{i} ;\mu_{i} , \pi_{i} } \right) = \left\{ {\begin{array}{*{20}l} {\pi_{i} + \left( {1 - \pi_{i} } \right)e^{{ - \mu_{i} }} , \quad y_{i} = 0} \\ {\left( {1 - \pi_{i} } \right)\frac{{\mu_{i}^{{y_{i} }} }}{{y_{i} !}}e^{{ - \mu_{i} }} , \qquad y_{i} > 0} \\ \end{array} } \right. $$

(2)

where μ_i is the parameter of the Poisson distribution. The parameter μ_i satisfies the log link function (Eq. 3). We defined the predictor of μ_i as Q × r(x, y). The parameter Q and dispersal kernel function r(x, y) were introduced in a previous study (Bullock et al. 2017). Dispersal kernel functions include log-sech, exponential power, power law, logistic, 2Dt, gamma, WALD, Weibull, Exponential, log-normal, and Gaussian. Variables x and y are the two-dimensional coordinates. The parameter π_i is defined as the logit link function (Eq. 4). The predictor for π_i is the same as that for μ_i.

$$ \mu_{i} = \exp \left( {Q \times r\left( {x,y} \right)} \right) $$

(3)

$$ \pi_{i} = \frac{{\mu_{i} }}{{1 + \mu_{i} }} $$

(4)

Bias may remain in parameter estimation when the ZIP model fits the overdispersed data. Therefore, another zero-inflated model, the zero-inflated negative binomial (ZINB) model, was suggested to solve this problem. The concept of the ZINB model is similar to that of the ZIP model. Because the ZINB model adds a parameter to evaluate the dispersion of data, it is more suitable for overdispersed data. The pdf of the ZINB model is analogous to that of the ZIP model (Eq. 5).

$$ {\text{P}}\left( {Y_{i} = y_{i} ; \mu_{i} , \pi_{i} } \right) = \left\{ {\begin{array}{*{20}l} {\pi_{i} + \left( {1 - \pi_{i} } \right) \cdot g\left( {y_{i} } \right), \quad y_{i} = 0} \\ {\left( {1 - \pi_{i} } \right) \cdot g\left( {y_{i} } \right), \qquad y_{i} > 0} \\ \end{array} } \right. $$

(5)

$$ g\left( {y_{i} } \right) = \frac{{\varGamma \left( {y_{i} + \alpha^{ - 1} } \right)}}{{\varGamma \left( {\alpha^{ - 1} } \right)\varGamma \left( {y_{i} + 1} \right)}}\left( {\frac{1}{{1 + \alpha \mu_{i} }}} \right)^{{\alpha^{ - 1} }} \left( {\frac{{\alpha \mu_{i} }}{{1 + \alpha \mu_{i} }}} \right)^{{y_{i} }} $$

(6)

The function ɡ(y_i) is the pdf of the negative binominal distribution, where Γ is the gamma function, and α is the shape parameter. The definitions of μ_i and π_i in the ZINB model are the same as those in the ZIP model (Eqs. 3 and 4).

To apply the two-dimensional function r(x, y), the distance between individual plants is calculated using Eq. 7. The experimental field is considered a two-dimensional coordinate plane where plant positions are defined by a coordinate point. In Eq. 7, coordinate points (x, y) and (x’, y’) define the positions of the recipient and donor plants, respectively.

$$ {\text{distance}} = \sqrt {\left( {x - x^{\prime}} \right)^{2} - \left( {y - y^{\prime}} \right)^{2} } $$

(7)

Statistical analysis

We expected that wind would not influence the number of CP events. A CP event was defined as the occurrence of CP at a sampling point. We compared the number of CP events in the two recipient plots by using a z-test to evaluate the wind effect. In addition, this study conducted an ANOVA to test the wind effect to the variation of CP rate. Examination of excess zero values was conducted by counting the frequency of zero values among the data and comparing this with the predicted zero frequency of the Poisson distribution. There were excess zero values if the number of zero events was more than expected. Overdispersion was examined based on the assumption of Poisson distribution. If the variance was higher than the mean, then overdispersion may have occurred in the data. In addition, we calculated the deviance by fitting the data with the Poisson distribution, and we computed the ratio of deviance to the degree of freedom (d.f.). A dataset with a ratio of > 1 is considered to be overdispersed (McCullagh and Nelder 1989).

The data of each year and site were combined, and 70% of the total data were randomly selected to train the model. The remaining 30% of the data were the validation dataset. The performance of model fitting was evaluated based on root mean square error (RMSE), adjusted coefficient of determination (adj. R²), Akaike’s information criterion (AIC), and Schwarz’s Bayesian information criterion (BIC; Akaike 1974; Schwarz 1978). We selected models with small values of RMSE, AIC, and BIC. A large adj. R² value demonstrated a good model fit. The predictive capability of the model was assessed based on the mean squared prediction error (MSPR). In our study, a model with a small MSPR value was selected as the best model (Jung and Hu 2015). The model selection procedures identified models with a good predictive ability based on the aforementioned criteria recommended for application. The conservative isolation distance at various thresholds was estimated through 500 bootstrapping simulations. The 95th percentile of the distance generated through the simulations was considered the conservative isolation distance. All statistical analyses were performed using SAS 9.4 (SAS Institute Inc., Cary, NC, USA) and R v 3.4.0 (R Development Core Team 2017) software.

Results

Wind direction during the flowering period

An overlap of at least 24 days occurred between the donor and recipient plant flowering periods during the eight experiments (Additional file 1: Table S1). In most experiments, the overlap period was longer than 1 month. The AES and TARI sites were located nearby; therefore, we applied meteorological data from the same weather station to both sites. The prevailing wind direction during the flowering period was north (Additional file 2: Table S2).

The relative frequency of northerly winds ranged from 25 to 88%. The two recipient plots in the TARI experiments were assumed to be upwind and downwind plots to evaluate the wind effect on pollination. Because the field arrangement and prevailing wind direction were not parallel, the recipient plots in the AES experiments could not be defined as upwind and downwind plots.

Distance and wind effects on CP

In the TARI experiments, the CP rates of the upwind and downwind plots were observed separately. The CP rates of both recipient plots of the AES experiments were observed jointly. The average CP rate fluctuated between 0.48% and 5.07% over the shortest distance (0.35 m; Table 1).

Table 1 Mean and standard deviation of the observed cross-pollination rate (%)

Full size table

The maximum and minimum CP rates over 0.35 m were 13.75% and 0%, respectively, in the TARI experiments. The mean CP rate decreased rapidly with increasing distance and was less than 1% at 1.75 m. The CP rate was relatively stable at distances exceeding 5.25 m. Some CP events were still observed at the maximum distance in most experiments. The mean CP rate in the upwind plots was higher than that in the downwind plots at the minimum distance. The standard deviation of the CP rate at 0.35 m was also higher in upwind plots, except for in the 2013-1 experiment. The z-values of experiments 2013-1, 2014-1, 2015-1, and 2016-1 were − 0.7133, − 0.225, 0, and − 0.724, respectively (Additional file 3: Table S3). Based on the z-test results, wind effects on these four experiments were nonsignificant (all p > 0.05). In addition, we combined the data and calculated the z-values. The overall z-value was − 0.8208, and the wind effect remained nonsignificant. The result of ANOVA showed that the wind direction do not have an effect on the variation of CP rate. (Additional file 4: Table S4).

Testing of excess zeros and overdispersion

In our study, a zero event was defined as an event with a CP rate of 0%. The proportion of zero events among each experiment was 74%, 75%, 71%, 90%, 75%, 77%, 76%, and 79% (Additional file 5: Table S5). The expected proportion of zero events was calculated with the pdf of the Poisson distribution and compared with the observed proportion of zero events. All expected proportions of zero events were smaller than the observed proportions. Therefore, all experimental data had the problem of excess zeros.

The mean and variance of the CP progeny numbers were calculated and compared to roughly check data overdispersion. In each experiment, the variance of the CP progeny number was larger than its mean (Additional file 6: Table S6). Thus, data from the eight experiments may have been overdispersed. Furthermore, all ratios of deviance to d.f. were larger than 1, except for experiment 2014-2. Both approaches indicated that the experimental data were over dispersed.

Model fitting

Given the absence of overdispersion, the data for model training and validation excluded the data of experiment 2014-2. The remaining data were divided into training and validation datasets, which contained 70% and 30% of the total data, respectively. This study applied the ZIP and ZINB models with dispersal kernel functions to fit the excess zeros and overdispersed data. The ZIP and ZINB models were fitted with the training dataset and were evaluated separately.

According to the criteria, the ZIP model with logistic (ZIP-logistic), 2Dt (ZIP-2Dt), and Weibull (ZIP-Weibull) dispersal kernel functions were the three preferred candidate models (Table 2). All RMSE values of these models were 0.01043. The ZIP-logistic and ZIP-2Dt models were identified as the best models based on the adj. R² criterion (both adj. R² = 0.01097). AIC and BIC also indicated that ZIP-logistic and ZIP-2Dt were the best models (AIC = − 16,978; BIC = − 16,969). The adj. R², AIC, and BIC values of the ZIP-Weibull model were 0.01064, −16,977, and −16,968, respectively. Based on the adj. R² criterion, we selected ZINB-Weibull, ZINB-exponential power, and ZINB-log-sech as the preferred candidate models. All RMSE values of these models were 0.00823. AIC and BIC also identified these models as the best among the ZINB models. The adj. R², AIC, and BIC values of ZINB-Weibull, which is the optimal ZINB model, were 0.38947, −17,860, and −17,853, respectively. The ZINB models were superior to the ZIP models. Even the ZINB model with the worst fitting criterion values performed better than did the ZIP-Weibull model. Consequently, the candidate ZINB models were chosen for the validation procedure.

Table 2 Fitting criteria of the ZIP and ZINB models with the training dataset

Full size table

Model validation and isolation distance recommendation

In accordance with the MSPR, the ZINB-log-sech, ZINB-exponential power, ZINB-gamma, and ZINB-Weibull models had a good predictive ability in the new dataset. These models had small MSPR values of 0.000068767, 0.000068742, 0.000068764, and 0.000068764, respectively (Table 3). The MSPR values of these four models were similar. Based on the best fit, the ZINB-exponential power and ZINB-Weibull models were selected as the final models. The predicted CP rates of the ZINB-exponential power and ZINB-Weibull models were compared with the observed CP rate. The predicted CP rates were higher than the observed CP rates at distances of 0.35, 1.05, and 1.75 m (Table 4). At distances of 2.45, 3.15, 3.85, and 4.55 m, both models underestimated the CP rate. The predicted CP rate varied little and was overestimated at distances larger than 5.25 m.

Table 3 Predicting criteria of the ZIP and ZINB models with the validation dataset

Full size table

Table 4 Actual and predicted cross-pollination rate (%) of the ZINB models by distance

Full size table

The thresholds of cross-pollination rates for recommendation were 3%, 1%, and 0.9%, with reference to regulations in Taiwan, Australia, and the EU, respectively. The recommended distance of each threshold was approached in both models (Table 5). For the 3% threshold, 0.425 and 0.431 m were the distances recommended by the ZINB-exponential power and ZINB-Weibull models, respectively. A distance of approximately 1.35 m was recommended to avoid exceeding the 0.9% threshold.

Table 5 Isolation distance (m) evaluated by both zero-inflated negative binomial (ZINB)-exponential power and ZINB-Weibull models under threshold values 3%, 1%, and 0.9%, respectively

Full size table

Discussion

Estimating the CP rate involved setting targets to develop strategies to eliminate hybridization as part of the hybridization risk assessment between GM B. napus and B. rapa (Wilkinson et al. 2003). According to a study of gene flow between GM and non-GM B. napus, the mean CP rates at 2 and 16 m were 2.33% and 0.46%, respectively (Zhang et al. 2018a).

In other studies, the average CP rates observed at 0.5, 1, and 15 m were 2.50%, 1.28%, and 0.13%, respectively (Zhao et al. 2013). The mean CP rates of 2.88% and 1.02% at 0.35 and 1.05 m, respectively, in our study were similar to those in previous studies. However, the mean CP rate of 0.45% and 0.030% at 1.75 and 12.95 m, respectively, in our study were lower than those in previous studies. Given the pollen competition between species and the plant density, the relatively low CP rate was predictable. The spontaneous hybridization rate between GM B. napus and B. rapa was 0.196% when the two species were planted in adjacent rows (Xiao et al. 2009). A hybridization rate of 1.46% was observed in a wild B. rapa population within 30 m of B. napus fields in the United Kingdom (Wilkinson et al. 2003). For B. rapa interplantation with a B. napus field, the hybridization rate was approximately 7% (Warwick et al. 2003). The results of gene flow may differ under particular conditions. In this study, the results reflected gene flow between two small adjacent fields, a typical field arrangement in Asian countries. According to the average CP rate in our experiments, B. rapa plants within 1.05 m contained approximately 1.8% of hybrid progenies. Those hybrids may result in immediate harvest loss for a farmer. Furthermore, hybrids containing a transgene may develop into a volunteer population. The volunteer population with the transgene may become transgene donors or herbicide-resistant weeds and cause economic loss in the future. A volunteer population with a transgene may affect the agricultural ecosystem. Consequently, the coexistence of these two species and evaluation of long-term effects on the environment are necessary in Asian countries.

Brassica napus and B. rapa are pollinated by insects. Several studies have indicated that the wind direction does not affect the gene flow of B. napus (Funk et al. 2006; Rieger et al. 2002). To evaluate the wind effect, the prevailing wind direction was recorded in the TARI field, and the recipient plots were established on the upwind and downwind sides of the donor plot. For each experiment at TARI, the proportion of CP events in the two recipient plots was nonsignificantly different. Even after combining data from the four experiments, the proportion of CP events between the upwind and downwind plots was not considerably different. This indicated that wind did not influence gene flow. Another study posited that wind only affects gene flow and contributes to pollination when insect pollinators are scarce (Hayter and Cresswell 2006). A study investigated wind pollination without insects by using nets (Zhang et al. 2018b). Therefore, the contribution of pollination to B. napus gene flow may depend on the abundance of insects. Insect pollinators were sufficiently abundant for pollination in the experimental fields; thus, the wind effect was minor in this study.

In a previous study, the ZIP model was introduced to fit the corn CP rate data (Bensadoun et al. 2014). The number of cross-pollinated progenies was assumed to follow a Poisson distribution. However, the CP data typically presented excess zeros and overdispersion; thus, the CP data were assumed to follow a ZIP distribution. In the present study, the test results for excess zeros and overdispersion indicated that our experimental data contained excess zeros and overdispersion, except for experiment 2014-2. The CP rate for short distances was lower in the 2014-2 experiment than in the other experiments, and overdispersion was not present in the 2014-2 experimental data. Due to data characteristics, we used the ZIP and ZINB models to estimate the CP rate. The experimental data were combined, with the exclusion of the 2014-2 experimental data. According to all criteria, the ZINB model was superior to the ZIP model, and the ZINB model was more appropriate for handling count data in excess zeros and overdispersion (Zulkifli et al. 2011). The ZINB-exponential power and ZINB-Weibull models were the two best models for fitting the data. The adj. R² values for the ZINB-exponential power and ZINB-Weibull models were 0.38925 and 0.38947, respectively. The performance of both models was better than the results obtained in a previous study that modeled the CP rate between B. napus and its relatives (Zhang et al. 2018a). Model fitting was affected by the variation of the CP rate at short distances. High variation at short distances has also been observed in other studies (Beckie et al. 2003; Damgaard and Kjellsson 2005). The CP rate variation within a few meters of the donor plot may be attributed to insect behavior (Funk et al. 2006). Although the predicted CP rates for the two models were overestimated within 1.75 m, it was acceptable because of the high variation at a short distance. The overall predicted CP rates within 4 m were similar to the average CP rate. The recommended distances were similar for both models. The ZINB-Weibull model provided a relatively conservative isolation distance at strict thresholds. The recommended distance for GM and non-GM B. napus at a 0.9% threshold was 0 m (Weekes et al. 2005). For gene flow between B. napus and B. rapa, 1.35 m was applicable for the 0.9% threshold in our study. The CP process was affected by many factors: differences in experimental scale, species, and model may have led to various results. A method that can integrate all factors is necessary to predict scenarios in future research.

Conclusion

This study conducted eight experiments at two sites for 4 years to evaluate the risk of gene flow between B. napus and B. rapa on a small field scale, similar to typical field sizes in Taiwan. The multiple sites and years of these experiments addressed variation in field conditions of each year and site. Therefore, the result was robust to different years and sites. The risk of long-distance gene flow between B. napus and B. rapa was negligible. However, the risk remains beyond the short distances of adjacent fields. The experiments provided a preliminary gene flow risk assessment between these two species in Taiwan and provided insights for further research and coexistence strategies.

Availability of data and materials

The data used and analyzed in this study can be provided from the corresponding author for scientific, non-profit purpose.

Abbreviations

GM:: Genetically modified
CP:: Cross-pollination
HT:: Herbicide tolerance
EU:: European Union
ZIP:: Zero-inflated Poisson
TARI:: Taiwan Agricultural Research Institute
AES:: Agricultural Experiment Station
ZINB:: Zero-inflated negative binomial
RMSE:: Root mean square error
adj. R² :: Adjusted coefficient of determination
AIC:: Akaike’s information criterion
BIC:: Schwarz’s Bayesian information criterion
MSPR:: Mean squared prediction error

References

Akaike H (1974) A new look at the statistical model identification. IEEE Trans Autom Control 19:716–723
Google Scholar
Beckie HJ, Warwick SI, Nair H, Séguin-Swartz G (2003) Gene flow in commercial fields of herbicide-resistant canola (Brassica napus). Ecol Appl 13:1276–1294
Article Google Scholar
Bensadoun A, Monod H, Angevin F, Makowski D, Messéan A (2014) Modeling of gene flow by a Bayesian approach: a new perspective for decision support. Ag Bio Forum 17:213–220
Google Scholar
Bensadoun A, Monod H, Makowski D, Messéan A (2016) A Bayesian approach to model dispersal for decision support. Environ Model Softw 78:179–190. https://doi.org/10.1016/j.envsoft.2015.12.018
Article Google Scholar
Brookes G, Barfoot P (2016) Global income and production impacts of using GM crop technology 1996-2014. GM Crops Food 7:38–77. https://doi.org/10.1080/21645698.2016.1176817
Article PubMed PubMed Central Google Scholar
Bullock JM, Mallada González L, Tamme R, Götzenberger L, White SM, Pärtel M, Hooftman DAP (2017) A synthesis of empirical plant dispersal kernels. J Ecol 105:6–19. https://doi.org/10.1111/1365-2745.12666
Article Google Scholar
Council of Agriculture (2017) Agricultural statistics yearbook 2017. Council of Agriculture, Executive Yuan, Taipei
Google Scholar
Damgaard C, Kjellsson G (2005) Gene flow of oilseed rape (Brassica napus) according to isolation distance and buffer zone. Agric Ecosyst Environ 108:291–301. https://doi.org/10.1016/j.agee.2005.01.007
Article Google Scholar
Eisikowitch D (1981) Some aspects of pollination of B. napus (Brassica napus L.). J Agric Sci 96:321–326
Article Google Scholar
European Commission (2003) Commission Recommendation of 23 July 2003 on guidelines for the development of national strategies and best practices to ensure the coexistence of genetically modified crops with conventional and organic farming. Off J Eur Commun 189:36–47
Google Scholar
Funk T, Wenzel G, Schwarz G (2006) Outcrossing frequencies and distribution of transgenic oilseed rape (Brassica napus L.) in the nearest neighbourhood. Eur J Agron 24:26–34. https://doi.org/10.1016/j.eja.2005.04.002
Article Google Scholar
Greene WH (1994) Accounting for excess zeros and sample selection in poisson and negative binomial regression models. Working paper EC-94-10. Department of Economics, Stern School of Business, New York University, New York, NY
Hansen LB, Siegismund HR, Jørgensen RB (2001) Introgression between oilseed rape (Brassica napus L.) and its weedy relative B. rapa L. in a natural population. Genet Resour Crop Evol 48:621–627. https://doi.org/10.1023/A:1013825816443
Article Google Scholar
Hayter KE, Cresswell JE (2006) The influence of pollinator abundance on the dynamics and efficiency of pollination in agricultural Brassica napus: implications for landscape-scale gene dispersal. J Appl Ecol 43:1196–1202. https://doi.org/10.1111/j.1365-2664.2006.01219.x
Article Google Scholar
Hong H, Lin TK, Yu YK, Kuo BJ (2016) Identifying the F1 hybrids of the Simulated GM Brassica napus and Brassica rapa. Crop Environ Bioinform 13:53–66 (in Chinese with English abstract)
Google Scholar
Hsu YH (2014) Maize Pollen dispersal model: using nonlinear, quasi-mechanistic models and neural networks to evaluate the recommended isolation distance for coexistence between GM and non-GM crops in Taiwan. Dissertation, National Chung Hsing University, Taichung, Taiwan. (in Chinese with English abstract)
ISAAA (2018) Global status of commercialized biotech/GM Crops in 2018: biotech crop continuous to help meet the challenges of increased population and climate change. ISAAA Brief No. 54. ISAAA, Ithaca, NY
Jørgensen RB, Andersen B (1994) Spontaneous hybridization between B. napus (Brassica napus) and weedy B. rapa (Brassicaceae): a risk of growing genetically modified B. napus. Am J Bot 1:1620–1626
Article Google Scholar
Jung Y, Hu J (2015) A K-fold averaging cross-validation procedure. J Nonparametr Stat 27:167–179
Article Google Scholar
Klein EK, Lavigne C, Picault H, Renard M, Gouyon PH (2006) Pollen dispersal of oilseed rape: estimation of the dispersal function and effects of field dimension. J Appl Ecol 43:141–151. https://doi.org/10.1111/j.1365-2664.2005.01108.x
Article Google Scholar
Lambert D (1992) Zero-Inflated Poisson regression, with an application to defects in manufacturing. Technometrics 34:1–14
Article Google Scholar
Landbo L, Andersen B, Jørgensen RB (1996) Natural hybridisation between oilseed rape and a wild relative: hybrids among seeds from weedy B. campestris. Hereditas 125:89–91
Article Google Scholar
Lu C, Shen F, Hu K (2001) Heterosis in interspecific hybrids between Brassica napus and B. rapa. SABRAO J Breed Genet 33:73–85
Google Scholar
McCullagh P, Nelder JA (1989) Generalized linear models, 2nd edn. Chapman and Hall, London
Book Google Scholar
Nieh SC, Lin WS, Hsu YH, Shieh GJ, Kuo BJ (2014) The effect of flowering time and distance between pollen source and recipient on maize. GM Crops Food 5:287–295. https://doi.org/10.4161/21645698.2014.947805
Article PubMed PubMed Central Google Scholar
Rieger MA, Lamond M, Preston C, Powles SB, Roush RT (2002) Pollen-mediated movement of herbicide resistance between commercial canola fields. Science 296:2386–2388. https://doi.org/10.1126/science.1071682
Article CAS PubMed Google Scholar
Rodriguez G (2013) Models for count data with overdispersion. Princet Stat: 1-7
Scheffler JA, Parkinson R, Dale PJ (1995) Evaluating the effectiveness of isolation distances for field plots of oilseed rape (Brassica napus) using a herbicide-resistance transgene as a selectable marker. Plant Breed 114:317–321. https://doi.org/10.1111/j.1439-0523.1995.tb01241.x
Article Google Scholar
Schwarz G (1978) Estimating the dimension of a model. Ann Stat 6:461–464
Article Google Scholar
Simard MJ, Légère A, Warwick SI (2006) Transgenic Brassica napus fields and Brassica rapa weeds in Quebec: sympatry and weed-crop in situ hybridization. Can J Bot 84:1842–1851. https://doi.org/10.1139/b06-135
Article CAS Google Scholar
Smyth S, Khachatourians GG, Phillips PWB (2002) Liabilities and economics of transgenic crops. Nat Biotechnol 20:537–541. https://doi.org/10.1038/nbt0602-537
Article CAS PubMed Google Scholar
Su YC (2015) Using 1D and 2D models to simulate the pollen-mediate gene flow (PMGF) between GM Brassica napus and Brassica rapa: a case study in Wufeng Districty, Taichung City. Thesis, Chung Hsing University, Taichung (in Chinese with English abstract)
Google Scholar
Taheri F, Azadi H, D’Haese M (2017) A world without hunger: organic or GM crops? Sustain 9:1–17. https://doi.org/10.3390/su9040580
Article Google Scholar
Tu YK, Chen HW, Tseng KY, Lin YC, Kuo BJ (2020) Morphological and genetic characteristics of F₁ hybrids introgressed from Brassica napus to B. rapa in Taiwan. Bot Stud 61:1. https://doi.org/10.1186/s40529-019-0279-5
Article CAS PubMed PubMed Central Google Scholar
Walklate PJ, Hunt JCR, Higson HL, Sweet JB (2004) A model of pollen-mediated gene flow for oilseed rape. Proc R Soc B Biol Sci 271:441–449. https://doi.org/10.1098/rspb.2003.2578
Article CAS Google Scholar
Wang PS (2017) Establishing the pollen dispersal model to simulate the isolation distance between GM Brassica napus and Brassica rapa: a case study in Wufeng District, Taichung City. Thesis, Chung Hsing University, Taichung (in Chinese with English abstract)
Google Scholar
Warwick SI, Simard MJ, Légère A, Beckie HJ, Braun L, Zhu B, Mason P, Séguin-Swartz G, Stewart CN (2003) Hybridization between transgenic Brassica napus L. and its wild relatives: Brassica rapa L., Raphanus raphanistrum L., Sinapis arvensis L., and Erucastrum gallicum (Willd.) O.E. Schulz. Theor Appl Genet 107:528–539. https://doi.org/10.1007/s00122-003-1278-0
Article CAS PubMed Google Scholar
Weekes R, Deppe C, Allnutt T, Boffey C, Morgan D, Morgan S, Bilton M, Daniels R, Henry C (2005) Crop-to-crop gene flow using farm scale sites of oilseed rape (Brassica napus) in the UK. Transgenic Res 14:749–759. https://doi.org/10.1007/s11248-005-0943-2
Article CAS PubMed Google Scholar
Wilkinson MJ, Elliott LJ, Allainguillaume J, Shaw MW, Norris C, Welters R, Alexander M, Sweet J, Mason DC (2003) Hybridization between Brassica napus and B. rapa on a National Scale in the United Kingdom. Science 302:457–459. https://doi.org/10.1126/science.1088200
Article CAS PubMed Google Scholar
Xiao L, Lu C, Zhang B, Bo H, Wu Y, Wu G, Cao Y, Yu D (2009) Gene transferability from transgenic Brassica napus L. to various subspecies and varieties of Brassica rapa. Transgenic Res 18:733–746. https://doi.org/10.1007/s11248-009-9261-4
Article CAS PubMed Google Scholar
Yang JL (2018) Using zero-inflated models to simulate the distance of pollen dispersal between GM Brassica napus and Brassica rapa: a case study in Wufeng District, Taichung City. Thesis, Chung Hsing University, Taichung (in Chinese with English abstract)
Google Scholar
Zhang CJ, Yook MJ, Park HR, Lim SH, Kim JW, Nah G, Song HR, Jo BH, Roh KH, Park S, Kim DS (2018a) Assessment of potential environmental risks of transgene flow in smallholder farming systems in Asia: Brassica napus as a case study in Korea. Sci Total Environ 640–641:688–695. https://doi.org/10.1016/j.scitotenv.2018.05.335
Article CAS PubMed Google Scholar
Zhang CJ, Yook MJ, Park HR, Lim SH, Kim JW, Song JS, Nah G, Song HR, Jo BH, Roh KH, Park S, Jang YS, Noua IS, Kim DS (2018b) Evaluation of maximum potential gene flow from herbicide resistant Brassica napus to its male sterile relatives under open and wind pollination conditions. Sci Total Environ 634:821–830. https://doi.org/10.1016/j.scitotenv.2018.03.390
Article CAS PubMed Google Scholar
Zhao XX, Tang T, Chen GM, Liu FX, Wang XL, Bu CP, Lu CM (2013) Rationalizing the isolation distance needed for field trials involving genetically modified rapeseed (Brassica napus L.) in China. Chin Sci Bull 58:1558–1567. https://doi.org/10.1007/s11434-012-5595-z
Article CAS Google Scholar
Zulkifli M, Noriszura I, Ismail N, Razali AM (2011) Zero-inflated Poisson versus zero-inflated negative binomial: application to theft insurance data Time Series Analysis, Data Analysis View project Estimation of Extreme Hydro-meteorology Events using Non-stationary Intensity-Duration-Frequency Curves Based on Bayesian Framework View project Zero-inflated Poisson versus zero-inflated negative binomial:Application to theft insurance data. The 7th IMT-GT International Conference on Mathematics, Statistics and its Applications, Bangkok, Thailand

Download references

Acknowledgements

The experiments were assisted by the Taiwan Agriculture Research Institute Council of Agriculture, Executive Yuan, Taiwan (R.O.C.). This study was partially supported by the “Innovation and Development Center of Sustainable Agriculture” from The Featured Areas Research Center Program within the framework of the Higher Education Sprout Project by the Ministry of Education (MOE) in Taiwan (R.O.C.). This manuscript was edited by Wallace Academic Editing.

Funding

This study was partially funded by the Ministry of Science and Technology (MOST 104-2313-B-005-010-MY3).

Author information

Authors and Affiliations

Department of Agronomy, National Chung Hsing University, No. 145 Xingda Road, South District, Taichung City, 40227, Taiwan (R.O.C.)
Yuan-Chih Su, Po-Shung Wang, Jhih-Ling Yang, Hong Hong, Yuan-Kai Tu & Bo-Jein Kuo
Division of Crop Science, Taiwan Agricultural Research Institute, No. 189, Zhongzheng Road, Wufeng District, Taichung City, 41362, Taiwan (R.O.C.)
Tzu-Kai Lin
Division of Biotechnology, Taiwan Agricultural Research Institute, No. 189, Zhongzheng Road, Wufeng District, Taichung City, 41362, Taiwan (R.O.C.)
Yuan-Kai Tu
Innovation and Development Center of Sustainable Agriculture (IDCSA), National Chung Hsing University, No. 145 Xingda Road, South District, Taichung City, 40227, Taiwan (R.O.C.)
Bo-Jein Kuo

Authors

Yuan-Chih Su
View author publications
You can also search for this author in PubMed Google Scholar
Po-Shung Wang
View author publications
You can also search for this author in PubMed Google Scholar
Jhih-Ling Yang
View author publications
You can also search for this author in PubMed Google Scholar
Hong Hong
View author publications
You can also search for this author in PubMed Google Scholar
Tzu-Kai Lin
View author publications
You can also search for this author in PubMed Google Scholar
Yuan-Kai Tu
View author publications
You can also search for this author in PubMed Google Scholar
Bo-Jein Kuo
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

YS, BK conceived the study. PW, JY, HH, TL performed the experiments. PW, JY drafted the manuscript. YS, YT, BK edited the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Bo-Jein Kuo.

Ethics declarations

Ethics approval and consent to participate

Not applicable, the study involves no human participants.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Additional file 1. Table S1.

Flowering periods and overlapping days in all experiments.

Additional file 2: Table S2

. Wind direction, frequency, and relative frequency in 4-year study period.

Additional file 3: Table S3.

Results of z-test for wind direction and outcrossing events.

Additional file 4: Table S4

. ANOVA result of wind effect to the variation of CP rate

Additional file 5: Table S5.

Percentage of observed zero events and probability of zero events in all experiments.

Additional file 6: Table S6.

Variance and mean of variable counts in all experiments.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Su, YC., Wang, PS., Yang, JL. et al. Using a zero-inflated model to assess gene flow risk and coexistence of Brassica napus L. and Brassica rapa L. on a field scale in Taiwan. Bot Stud 61, 17 (2020). https://doi.org/10.1186/s40529-020-00294-2

Download citation

Received: 03 February 2020
Accepted: 13 May 2020
Published: 20 May 2020
DOI: https://doi.org/10.1186/s40529-020-00294-2

Using a zero-inflated model to assess gene flow risk and coexistence of Brassica napus L. and Brassica rapa L. on a field scale in Taiwan

Abstract

Background

Results

Conclusions

Background

Materials and methods

Plant materials

Experiment design

Hybrid progeny screening

Zero-inflated model

Statistical analysis

Results

Wind direction during the flowering period

Distance and wind effects on CP

Testing of excess zeros and overdispersion

Model fitting

Model validation and isolation distance recommendation

Discussion

Conclusion

Availability of data and materials

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher's Note

Supplementary information

Additional file 1. Table S1.

Additional file 2: Table S2

Additional file 3: Table S3.

Additional file 4: Table S4

Additional file 5: Table S5.

Additional file 6: Table S6.

Rights and permissions

About this article

Cite this article

Share this article

Keywords