VISION-BASED NON-CONTACT DONKEY FACE LOCALIZATION AND INDIVIDUAL IDENTIFICATION USING IMPROVED YOLO11
基于改进YOLO11的非接触式驴脸定位与个体识别
DOI : https://doi.org/10.35633/inmateh-78-119
Authors
Abstract
Accurate individual identification is essential for precision donkey farming, but conventional methods based on manual records or physical tags are labor-intensive, inefficient, and vulnerable to tag loss or damage. To address these limitations, this study proposes MFW-YOLO11, a non-contact donkey face localization and individual identification model based on an improved YOLO11 framework. A total of 6,531 valid donkey face images were collected from a real farm environment and used to construct an individual identity dataset under diverse conditions, including different illumination, poses, occlusions, and backgrounds. In the proposed network, MANet is introduced into the backbone and head to strengthen fine-grained identity-related features, such as the eyes, nasal bridge, muzzle region, facial contour, and coat texture. A MANet-FasterCGLU composite module is further designed to adaptively filter effective facial responses and suppress interference from railings, donkey bodies, troughs, and complex backgrounds. In addition, a weighted feature union module is embedded in the neck to enhance the adaptive fusion of shallow texture details and deep semantic information. Experimental results show that MFW-YOLO11 achieves a precision of 90.7%, recall of 79.8%, mAP50 of 88.0%, mAP50–95 of 74.4%, FPS of 68.93, and GFLOPs of 6.3. Compared with the original YOLO11, the proposed model improves precision, recall, mAP50, and mAP50–95 by 6.5, 8.8, 6.2, and 6.4 percentage points, respectively, while maintaining real-time inference performance. These results indicate that MFW-YOLO11 provides an effective and practical solution for non-contact donkey individual identification in precision livestock management.
Abstract in Chinese



