福建省表层土壤和水稻籽实硒含量特征及富硒产区预测

Selenium Content Characteristics of Surface Soil and Rice Seed in Fujian Province and Prediction of Selenium-Enriched Production Areas

  • 摘要: 传统富硒(Se)土地划分仅依据土壤Se含量,容易导致遗漏和误判,对水稻籽实Se含量进行准确预测,可以更精准地划分富硒产区。本文利用福建省大规模的土壤和水稻籽实样品数据,通过机器学习中的随机森林(RF)模型,建立了水稻籽实Se含量预测模型,并采用网格搜索方法提高模型预测的准确性,旨在为研究区富硒土地区划提供数据支撑。结果表明,福建省表层土壤Se含量平均值为0.34mg/kg,且具有很强的空间变异性,其含量处于足硒和富硒状态的比例分别是62.83%、27.05%,不存在硒中毒现象。土壤硒对成土母质具有很强的继承性,煤系地层、硅质岩等高硒岩石发育的土壤中Se含量较高。采样区水稻籽实富硒率为44.09%,对应的根系土富硒率仅为10.54%。土壤pH、铁氧化物含量、有机质含量、磷含量、降雨量和温度是影响水稻籽实Se生物富集系数的重要因素。将随机森林模型预测结果与传统的线性回归模型进行对比,随机森林的均方根误差(RMSE)为0.018,低于线性回归的RMSE值(0.096),相关系数(R)为0.957,高于多元线性回归的R值(0.653),随机森林模型的性能显著优于多元线性回归模型。基于优化的随机森林模型,将福建省富硒产区划分为优先发展区和潜在发展区,该省富硒水稻优先发展区主要位于宁德市西北部、三明市西部和漳州市中南部,潜在发展区主要位于南平市中南部和三明市中部。

     

    Abstract: The traditional classification of selenium-enriched land is based solely on soil selenium content, which may lead to omissions and misjudgments. Accurate prediction of selenium content in rice grains can provide a more precise basis for delineating selenium-enriched production areas. This study utilized large-scale soil and rice grain sample data from Fujian Province to establish a selenium content prediction model for rice grains through machine learning. The grid search method was employed to enhance the accuracy of the model's predictions, aiming to provide data support for the regionalization of selenium-enriched soil in the study area. The results indicated that the average selenium content in the surface soil of Fujian Province was 0.34mg/kg, showing strong spatial variability. The proportions of soil in the sufficient selenium and selenium-enriched states were 62.83% and 27.05%, respectively, with no selenium poisoning observed. Soil selenium content has a strong inheritance from parent material, with higher selenium content in soils developed from high-selenium rocks such as coal-bearing strata and siliceous rocks. The selenium-enriched rate of rice grains in the sampling area was 44.09%, while the selenium-enriched rate of the corresponding root zone soil was only 10.54%. Soil pH, iron oxide content, organic matter content, phosphorus content, rainfall, and temperature were important factors influencing the selenium bioaccumulation coefficient in rice grains. By comparing the prediction results of the random forest model with those of the traditional multiple linear regression model (MLR), the root mean square error (RMSE) of the random forest model was 0.018, lower than that of MLR model (0.096), and the correlation coefficient (R) was 0.957, higher than that of the MLR model (0.653). The performance of the random forest model was significantly superior to that of MLR model. Based on the optimized random forest model, the selenium-enriched production areas in Fujian Province were classified into priority development areas and potential development areas. The priority development areas for selenium-enriched rice in this province are mainly located in the northwest of Ningde City, the west of Sanming City, and the central and southern parts of Zhangzhou City, while the potential development areas are mainly in the central and southern parts of Nanping City and the central part of Sanming City.

     

/

返回文章
返回