thumbnail

Topic

Technologies and technical equipment for agriculture and food industry

Volume

Volume 75 / No. 1 / 2025

Pages : 680-690

Metrics

Volume viewed 0 times

Volume downloaded 0 times

MILLET EAR DETECTION METHOD IN UAV IMAGES BASED ON IMPROVED YOLOX

基于改进YOLOX的无人机图像谷穗检测方法

DOI : https://doi.org/10.35633/inmateh-75-59

Authors

Fumin MA

College of Energy and Power Engineering, Lanzhou University of Technology, Lanzhou 730050, China

(*) Shaonian LI

College of Energy and Power Engineering, Lanzhou University of Technology, Lanzhou 730050, China

Juxia LI

College of Information Science and Engineering, Shanxi Agricultural University, Taigu 030801, China

Yanwen LI

College of Information Science and Engineering, Shanxi Agricultural University, Taigu 030801, China

Lei DUAN

College of Information Science and Engineering, Shanxi Agricultural University, Taigu 030801, China

Linwei LI

College of Information Science and Engineering, Shanxi Agricultural University, Taigu 030801, China

Jing TAN

College of Energy and Power Engineering, Lanzhou University of Technology, Lanzhou 730050, China

Yifan WANG

College of Energy and Power Engineering, Lanzhou University of Technology, Lanzhou 730050, China

(*) Corresponding authors:

Lsn19@163.com |

Shaonian LI

Abstract

Millet ears are critical agronomic indicators for assessing the yield and quality of millet. Rapid and accurate detection of millet ears can provide essential data for yield estimation and phenotypic studies. However, traditional detection methods primarily rely on manual observation, which are both subjective and labor-intensive. To address this, this study employs Unmanned Aerial Vehicle (UAV) for image data collection of millet ear and proposes the YOLOX-CBAM-EIoU model to facilitate real-time detection, focusing on challenges such as small millet ear size, dense distribution, and severe occlusion in the dataset. The model incorporates the CBAM attention module between the Neck and Prediction layers of YOLOX, enabling the reallocation of channel weights to enhance the extraction of fine-grained features and deeper semantic information. Additionally, EIoU Loss is utilized as the loss function for bounding box regression to mitigate missed detections in dense scenes. Results indicate that the improved model achieves an average precision (AP) of 90.30%, a 6.44 percentage point increase over the original YOLOX model, significantly enhancing detection performance for densely distributed millet ear. The modified model also demonstrates a Precision of 91.01%, Recall of 89.45%, and F1-score of 90.22, showcasing strong robustness and generalization capabilities. These findings substantiate the efficacy of the YOLOX-CBAM-EIoU model in improving detection performance under dense distribution and occlusion conditions, providing a valuable technical reference for further UAV-based analyses of millet ear phenotypes and yield predictions.

Abstract in Chinese

谷穗是评估谷子产量和质量的关键农学指标。快速、准确地检测谷穗能为产量预估及表型研究提供重要依据。然而,传统的谷穗检测主要依靠人工观察,不仅主观性强且耗时耗力。为此,本研究通过无人机进行谷穗图像数据采集,主要针对数据集中谷穗体积小、分布密集、遮挡严重等问题提出了YOLOX-CBAM-EIoU模型对谷穗进行实时检测。该模型首先在YOLOX的颈部层和预测层之间引入了CBAM注意力模块,通过重新分配不同通道的权重,获得了更浅层的细粒度特征和更深层的语义信息,以提高对谷穗表型的特征提取能力;其次,采用EIOU 函数作为边界框回归的损失函数,以改善密集场景下谷穗目标的漏检问题。结果表明,改进后的模型检测平均精度(AP)达到90.30%,与原YOLOX模型相比提高了6.44个百分点,显著提高了密集分布的谷穗目标检测性能。改进后模型的精确率达到91.01%,召回率达到89.45%,F1分数达到90.22,表现出较强的鲁棒性和泛化能力。这些结果充分证明了YOLOX-CBAM-EIoU模型能显著提高谷穗在密集分布及遮挡条件下的检测效果,为进一步使用无人机分析谷穗表型和产量预测提供了技术参考。

Indexed in

Clarivate Analytics.
 Emerging Sources Citation Index
Scopus/Elsevier
Google Scholar
Crossref
Road