Author
Listed:
- Zhongli Ma
(College of Automation, Chengdu University of Information Technology, Chengdu 610103, China)
- Yi Wan
(College of Automation, Chengdu University of Information Technology, Chengdu 610103, China)
- Jiajia Liu
(College of Automation, Chengdu University of Information Technology, Chengdu 610103, China)
- Ruojin An
(College of Automation, Chengdu University of Information Technology, Chengdu 610103, China)
- Lili Wu
(College of Automation, Chengdu University of Information Technology, Chengdu 610103, China)
Abstract
Visual-based object detection systems are essential components of intelligent equipment for water surface environments. The diversity of water surface target types, uneven distribution of sizes, and difficulties in dataset construction pose significant challenges for water surface object detection. This article proposes an improved YOLOv5 target detection method to address the characteristics of diverse types, large quantities, and multiple scales of actual water surface targets. The improved YOLOv5 model optimizes the extraction of bounding boxes using K-means++ to obtain a broader distribution of predefined bounding boxes, thereby enhancing the detection accuracy for multi-scale targets. We introduce the GAMAttention mechanism into the backbone network of the model to alleviate the significant performance difference between large and small targets caused by their multi-scale nature. The spatial pyramid pooling module in the backbone network is replaced to enhance the perception ability of the model in segmenting targets of different scales. Finally, the Focal loss classification loss function is incorporated to address the issues of overfitting and poor accuracy caused by imbalanced class distribution in the training data. We conduct comparative tests on a self-constructed dataset comprising ten categories of water surface targets using four algorithms: Faster R-CNN, YOLOv4, YOLOv5, and the proposed improved YOLOv5. The experimental results demonstrate that the improved model achieves the best detection accuracy, with an 8% improvement in mAP@0.5 compared to the original YOLOv5 in multi-scale water surface object detection.
Suggested Citation
Zhongli Ma & Yi Wan & Jiajia Liu & Ruojin An & Lili Wu, 2023.
"A Kind of Water Surface Multi-Scale Object Detection Method Based on Improved YOLOv5 Network,"
Mathematics, MDPI, vol. 11(13), pages 1-18, June.
Handle:
RePEc:gam:jmathe:v:11:y:2023:i:13:p:2936-:d:1183715
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:11:y:2023:i:13:p:2936-:d:1183715. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.