UWO-YOLO:一种水下密集场景大菱鲆特征点检测算法
DOI:
CSTR:
作者:
作者单位:

1.青岛科技大学 信息科学技术学院;2.中国水产科学研究院黄海水产研究所

作者简介:

通讯作者:

中图分类号:

S917.4;TP391.41;TP183

基金项目:

国家重点研发计划(2023YFD2400404)


UWO-YOLO: An Algorithm for Feature Point Detection of Flounder and Halibut in Underwater Dense Scenes
Author:
Affiliation:

1.School of Information Science and Technology,Qingdao University of Science and Technology,Qingdao,Shandong;2.Yellow Sea Fisheries Research Institute

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    为解决大菱鲆在密集场景下特征点检测不准确的问题,提出基于YOLOv11n的鱼体特征点检测算法UWO-YOLO。算法对基础架构进行了针对性优化,增强模型在大菱鲆水下密集场景的特征点感知能力,首先在检测头引入LQE(Localization Quality Estimation)模块,基于边界框定位质量评估的改进方法,融合分布统计信息与分类分数的优化思路,通过优化边界框质量评分提升定位精度;其次在主干网络集成MambaVision改进的C3K2模块,用“MambaBlock+AttentionBlock”串联结构替换原模块中的Bottleneck模块,实现局部-全局特征融合并兼顾计算效率;最后引入EDFFN(EVSSM模型中的高效频率前馈神经网络模块)改进的C2PSA模块,通过分块注意力与频域增强,协同提升检测精度与推理速度。结果显示,UWO-YOLO在密集场景下较原YOLOv11n检测精确率提升1.7%、mAP@50-95提升1.7%,mAP@50召回率提升1.5%;且资源消耗低参数量降低6.9%,计算量降低10.7%,为渔业养殖环境中水产目标密集情况下的鱼体长度无接触精准测量提供高效技术路径。

    Abstract:

    To address the issue of inaccurate keypoint detection for turbot in dense scenarios, this paper proposes a fish keypoint detection algorithm named UWO-YOLO based on YOLOv11n. The algorithm incorporates targeted optimizations to the basic architecture, enhancing the model's keypoint perception capability in underwater dense scenarios for turbot. Firstly, a Localization Quality Estimation (LQE) module is introduced into the detection head. This improved method, based on bounding box localization quality assessment, integrates the optimization idea of fusing distribution statistical information and classification scores to improve localization accuracy by optimizing bounding box quality scores. Secondly, a modified C3K2 module enhanced by MambaVision is integrated into the backbone network, where the Bottleneck module in the original structure is replaced with a cascaded "MambaBlock+AttentionBlock" configuration to achieve local-global feature fusion while balancing computational efficiency. Finally, a modified C2PSA module improved by the Efficient Frequency Feed-Forward Network (EDFFN) from the EVSSM model is introduced, which synergistically enhances detection accuracy and inference speed through block-wise attention and frequency-domain enhancement. Experimental results show that compared with the original YOLOv11n, UWO-YOLO achieves a 1.7% improvement in detection precision, a 1.7% increase in mAP@50-95, and a 1.5% rise in mAP@50 recall in dense scenarios. Additionally, it features low resource consumption with a 6.9% reduction in parameter count and a 10.7% decrease in computational complexity. This study provides an efficient technical approach for non-contact accurate measurement of fish length in dense aquatic target scenarios within aquaculture environments.

    参考文献
    相似文献
    引证文献
引用本文
分享
相关视频

文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2025-11-20
  • 最后修改日期:2026-01-07
  • 录用日期:2026-01-07
  • 在线发布日期:
  • 出版日期:
文章二维码