DESIGN OF POTATO SEGMENTATION ALGORITHM FOR STICKY SOIL ENVIRONMENTS
面向粘性土壤下马铃薯分割算法设计
DOI : https://doi.org/10.35633/inmateh-78-48
Authors
Abstract
To address edge blurring, soil-clod interference, and limited feature representation in potato image segmentation under sticky-soil conditions, this study proposes an improved MTFormer segmentation model. The model combines the complementary strengths of convolutional neural networks and Transformers, and further enhances segmentation performance through a multi-stage optimization strategy. For more informative and robust feature learning, a residual CNN-based extractor is introduced to strengthen multi-level feature representations. In addition, an MS-CAM attention mechanism is used to reduce channel redundancy, which helps mitigate adhesion-related target confusion in challenging scenes. Building on these features, the TBFE module promotes cross-channel feature fusion, while a Fourier-based FFCM structure compresses and reconstructs deep features in the frequency domain to improve feature compactness. Experiments on our self-built dataset show that MTFormer achieves an F1 score of 85.19%, an mIoU of 84.62%, and a pixel accuracy of 95.67%. Compared with the baseline model, U-Net, and DeepLabV3+, pixel accuracy increases by 1.75, 0.35, and 1.67 percentage points, respectively. Overall, the proposed approach improves segmentation reliability by strengthening feature representation while limiting unnecessary computation, providing practical support for accurate potato segmentation in sticky-soil environments.
Abstract in Chinese



