高级检索

    基于ChatGPT的科尔沁沙地杨树人工林健康评价

    Health assessment of poplar plantations in the Horqin Sandy Land, Inner Mongolia of northern China based on ChatGPT

    • 摘要:
      目的 现有森林健康评价方法在处理复杂多维数据时存在局限性,难以准确反映森林健康的真实状况。针对这个问题,本研究提出一种结合ChatGPT大语言模型与机器学习算法的新型评价框架,以优化森林健康评价方法,并探索影响科尔沁沙地杨树(Populus spp.)人工林健康的关键指标。
      方法 基于地面调查数据与森林景观照片,构建了包含15个指标的综合健康评价体系,利用ChatGPT模型通过few-shot学习实现健康等级预测,并结合YOLOv5提取图像视觉特征进一步优化健康评估结果,同时采用K均值聚类自动划分森林健康等级。此外,通过DeepSeek大语言模型、模糊综合评价法以及8种机器学习模型对ChatGPT预测结果进行交叉验证和比较,并利用SHAP分析方法识别影响森林健康的主导因子。
      结果 科尔沁沙地杨树人工林健康等级所占比例依次为中健康(53.3%) > 不健康(21.7%) > 亚健康(20.7%) > 健康(4.3%),森林整体处于中健康状态。DeepSeek模型的预测准确率为76.1%;ChatGPT的预测结果与传统方法(模糊评价法)无显著差异(p = 0.29);8种机器学习模型中,随机森林分类模型的验证准确率最高,达到84.2%。研究识别出林分平均高和土壤有机碳密度为影响森林健康的主要指标。
      结论 本研究构建的基于ChatGPT与多源数据融合的森林健康评价方法科学可靠,能够有效提升森林健康等级划分的准确性和解释性,为干旱半干旱地区人工林健康监测与管理提供了新思路。

       

      Abstract:
      Objective Conventional forest health assessment methods face limitations when handling complex, high-dimensional data, making it difficult to accurately reflect the true state of forest health. To address the issue, this paper proposes a novel assessment framework integrating the ChatGPT large language model with machine learning algorithms to improve forest health assessment and identify key indicators affecting the health of poplar (Populus spp.) plantations in the Horqin Sandy Land, Inner Mongolia of northern China.
      Method A comprehensive health evaluation system comprising 15 indicators was developed using field survey data and forest landscape images. The ChatGPT model was applied for health level prediction via few-shot learning. YOLOv5 was used to extract visual features from images to enhance assessment accuracy. K-means clustering was employed to automatically classify forest health levels. Additionally, the predictions from ChatGPT were cross-validated and compared using the DeepSeek large language model, a fuzzy comprehensive evaluation method, and eight machine learning models, respectively. SHAP analysis was used to identify the main factors affecting forest health.
      Result The proportions of forest health levels in poplar plantations were as follows: moderate health (53.3%) > unhealthy (21.7%) > sub-healthy (20.7%) > healthy (4.3%), indicating an overall moderately healthy forest condition. The prediction accuracy of DeepSeek model was 76.1%. No significant difference was observed between the results of ChatGPT and the traditional fuzzy evaluation method (p = 0.29). Among the eight machine learning models, the random forest classifier achieved the highest validation accuracy of 84.2%. Stand mean height and soil organic carbon density were identified as the main factors influencing forest health.
      Conclusion The proposed ChatGPT-based evaluation approach, integrated with multi-source data, is scientifically reliable and effectively enhances both the accuracy and interpretability of forest health classification. This method provides a new pathway for monitoring and managing the health of plantations in arid and semi-arid regions.

       

    /

    返回文章
    返回