Author
Listed:
- Jingxi Luo
(Bangor College China, Central South University of Forestry and Technology, Changsha 410004, China
These authors contributed equally to this work and should be considered co-first authors.)
- Zhanwei Yang
(College of Mechanical and Intelligent Manufacturing, Central South University of Forestry and Technology, Changsha 410004, China
Faculty of Modern Agricultural Engineering, Kunming University of Science and Technology, Kunming 650500, China
These authors contributed equally to this work and should be considered co-first authors.)
- Ying Cao
(College of Mechanical and Intelligent Manufacturing, Central South University of Forestry and Technology, Changsha 410004, China
Faculty of Modern Agricultural Engineering, Kunming University of Science and Technology, Kunming 650500, China)
- Tao Wen
(College of Mechanical and Intelligent Manufacturing, Central South University of Forestry and Technology, Changsha 410004, China)
- Dapeng Li
(College of Mechanical and Intelligent Manufacturing, Central South University of Forestry and Technology, Changsha 410004, China)
Abstract
The accurate detection of citrus surface defects is essential for automated citrus sorting to enhance the commercialization of the citrus industry. However, previous studies have only focused on single-modal defect detection using visible light images (RGB) or near-infrared light images (NIR), without considering the feature fusion between these two modalities. This study proposed an RGB-NIR multimodal fusion method to extract and integrate key features from both modalities to enhance defect detection performance. First, an RGB-NIR multimodal dataset containing four types of citrus surface defects (cankers, pests, melanoses, and cracks) was constructed. Second, a Multimodal Compound Domain Attention Fusion (MCDAF) module was developed for multimodal channel fusion. Finally, MCDAF was integrated into the feature extraction network of Real-Time DEtection TRansformer (RT-DETR). The experimental results demonstrated that RT-DETR-MCDAF achieved Precision, Recall, mAP@0.5, and mAP@0.5:0.95 values of 0.914, 0.919, 0.90, and 0.937, respectively, with an average detection performance of 0.598. Compared with the model RT-DETR-RGB&NIR, which used simple channel concatenation fusion, RT-DETR-MCDAF improved the performance by 1.3%, 1.7%, 1%, 1.5%, and 1.7%, respectively. Overall, the proposed model outperformed traditional channel fusion methods and state-of-the-art single-modal models, providing innovative insights for commercial citrus sorting.
Suggested Citation
Jingxi Luo & Zhanwei Yang & Ying Cao & Tao Wen & Dapeng Li, 2025.
"RT-DETR-MCDAF: Multimodal Fusion of Visible Light and Near-Infrared Images for Citrus Surface Defect Detection in the Compound Domain,"
Agriculture, MDPI, vol. 15(6), pages 1-21, March.
Handle:
RePEc:gam:jagris:v:15:y:2025:i:6:p:630-:d:1613713
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jagris:v:15:y:2025:i:6:p:630-:d:1613713. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.