Author
Listed:
- Zhiyong Zhang
(College of Agricultural Engineering, Shanxi Agricultural University, Jinzhong 030801, China
Dryland Farm Machinery Key Technology and Equipment Key Laboratory of Shanxi Province, Jinzhong 030801, China)
- Shuo Wang
(College of Agricultural Engineering, Shanxi Agricultural University, Jinzhong 030801, China)
- Chen Wang
(College of Agricultural Engineering, Shanxi Agricultural University, Jinzhong 030801, China)
- Li Wang
(College of Agricultural Engineering, Shanxi Agricultural University, Jinzhong 030801, China)
- Yanqing Zhang
(College of Agricultural Engineering, Shanxi Agricultural University, Jinzhong 030801, China
Dryland Farm Machinery Key Technology and Equipment Key Laboratory of Shanxi Province, Jinzhong 030801, China)
- Haiyan Song
(College of Agricultural Engineering, Shanxi Agricultural University, Jinzhong 030801, China
Dryland Farm Machinery Key Technology and Equipment Key Laboratory of Shanxi Province, Jinzhong 030801, China)
Abstract
The precise segmentation of Zanthoxylum bungeanum clusters is crucial for developing picking robots. An improved Mask R-CNN model was proposed in this study for the segmentation of Zanthoxylum bungeanum clusters in natural environments. Firstly, the Swin-Transformer network was introduced into the model’s backbone as the feature extraction network to enhance the model’s feature extraction capabilities. Then, the SK attention mechanism was utilized to fuse the detailed information into the mask branch from the low-level feature map of the feature pyramid network (FPN), aiming to supplement the image detail features. Finally, the distance intersection over union (DIOU) loss function was adopted to replace the original bounding box loss function of Mask R-CNN. The model was trained and tested based on a self-constructed Zanthoxylum bungeanum cluster dataset. Experiments showed that the improved Mask R-CNN model achieved 84.0% and 77.2% in detection mAP 50 box and segmentation mAP 50 mask , respectively, representing a 5.8% and 4.6% improvement over the baseline Mask R-CNN model. In comparison to conventional instance segmentation models, such as YOLACT, Mask Scoring R-CNN, and SOLOv2, the improved Mask R-CNN model also exhibited higher segmentation precision. This study can provide valuable technology support for the development of Zanthoxylum bungeanum picking robots.
Suggested Citation
Zhiyong Zhang & Shuo Wang & Chen Wang & Li Wang & Yanqing Zhang & Haiyan Song, 2024.
"Segmentation Method of Zanthoxylum bungeanum Cluster Based on Improved Mask R-CNN,"
Agriculture, MDPI, vol. 14(9), pages 1-15, September.
Handle:
RePEc:gam:jagris:v:14:y:2024:i:9:p:1585-:d:1476437
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jagris:v:14:y:2024:i:9:p:1585-:d:1476437. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.