YOLOV11-SPA: A REAL-TIME VISUAL MODEL FOR SORGHUM SEED DEFECT DETECTION
YOLOV11-SPA:一种用于高粱籽粒缺陷检测的实时视觉模型
DOI : https://doi.org/10.35633/inmateh-78-46
Authors
Abstract
To address the low inspection efficiency and limited recognition accuracy in sorghum grain quality assessment for brewing enterprises, this study proposes YOLOv11-SPA, an efficient and real-time detection model based on an improved YOLOv11n architecture. First, the space-to-depth convolution module (SPDConv) is introduced into the backbone network to replace conventional convolution blocks, effectively mitigating the loss of spatial information for small targets caused by downsampling operations. Second, the parallelized patch-aware attention (PPA) module is integrated into the neck network to enhance local feature representation and improve the detection of subtle defect features such as moldy and cracked grains. Third, an adaptive threshold focal loss (ATFL) is proposed to dynamically adjust sample weights, improving the model’s discrimination capability for visually similar categories (e.g., grains with husk residue and intact grains). Experimental results on a self-constructed sorghum seed dataset show that YOLOv11-SPA achieves 80.1% Precision, 79.7% Recall, and 85.9% mAP50, outperforming the baseline YOLOv11n by 5.6, 5.9, and 6.2 percentage points, respectively. With only 3.4 M parameters, the proposed model achieves an inference speed of 205 FPS, meeting real-time detection requirements while maintaining high accuracy. These results demonstrate that YOLOv11-SPA provides an effective solution for automated sorghum grain defect inspection and offers promising potential for intelligent quality control in the modern brewing industry.
Abstract in Chinese



