基于RepID-YOLO的智能驾驶车辆行人检测方法

RepID-YOLO: A novel vehicles and pedestrian detection method for intelligent driving

  • 摘要: 针对智能驾驶场景中目标检测面临小目标漏检率偏高、模型计算效率与检测精度难以协同平衡等问题,提出一种轻量化多尺度检测框架RepID-YOLO. 首先,利用大核深度可分离卷积,构建RepID特征提取单元,在此基础上引入卷积加法自注意力机制形成RepIDatt模块,在降低参数量同时强化多尺度特征表达能力;其次,引入SPD空间金字塔下采样策略,有效保留小目标特征细节,降低小目标漏检率;最后,设计加权特征融合与SimAM注意力机制协同策略,实现上下文特征动态筛选与增强,提升对复杂场景的适配能力. 实验结果表示,RepID-YOLO在KITTI数据集以91.6%的mAP@0.5检测精度优于YOLOv11等主流YOLO系列算法,并且提升了小目标检测能力,在参数量和计算量得到降低的同时保证了推理速度.

     

    Abstract: Aiming at the problems that object detection in intelligent driving scenarios is faced with, such as a high missing detection rate of small objects and the difficulty in achieving a collaborative balance between model computational efficiency and detection accuracy, a lightweight multi-scale detection framework RepID-YOLO is proposed. Firstly, large-kernel depthwise separable convolution is used to construct the RepID feature extraction unit. On this basis, a convolutional additive self-attention mechanism is introduced to form the RepIDatt module, which reduces the number of parameters while enhancing the multi-scale feature representation capability. Secondly, the SPD spatial pyramid downsampling strategy is introduced to effectively retain the feature details of small objects and reduce their missing detection rate. Finally, a collaborative strategy of weighted feature fusion and SimAM attention mechanism is designed to realize the dynamic screening and enhancement of contextual features, thus improving the adaptability to complex scenarios. Experimental results show that RepID-YOLO achieves a detection accuracy of 91.6% mAP@0.5 on the KITTI dataset, which is superior to mainstream YOLO series algorithms such as YOLOv11. Meanwhile, it improves the small object detection capability, and ensures the inference speed while reducing the number of parameters and computational complexity.

     

/

返回文章
返回