| 428 | 1 | 233 |
| 下载次数 | 被引频次 | 阅读次数 |
目标检测指对图像内的物体类型进行识别并且定位。一阶段目标检测算法从深层网络输出的特征图中获得分类信息与目标位置信息,然而深层特征由于经过长距离卷积与下采样处理而缺乏空间信息。针对该问题,参考语义分割的思想,通过将骨干网络中的浅层特征与上采样得到的深层特征结合的方法,对一阶段目标检测YOLOv5算法进行改进,利用骨干网络ResNet-50可以对特征图信息进行有效提取。在残差块中引入注意机制,对浅层特征提取阶段进行有效选择对象信息,并且可以将更多的权重分配给小的和弱的对象,以改善特征表达准确探测小物体的能力。另外,根据行人检测的数据特点,对预选框的生成方式和损失函数进行改进。采用INRIA和Caltech这2个数据集进行实验,结果表明,提出的改进模型在检测效果与检索速度方面均有所提升。
Abstract:Target detection refers to the identification and positioning of object types in the image. The one-stage target detection algorithm obtains classification information and target location information from the feature map output by the deep network. However, the deep features lack spatial information due to long distance convolution and down-sampling processing. To solve this problem, refer to the idea of semantic segmentation. The YOLOv5 algorithm for target detection in the first stage is improved by combining the shallow features in the backbone network with the deep features obtained from up sampling. The backbone network is Resnet-50 to effectively extract the feature map information. Attention mechanism is introduced into the residual block to effectively select the object information in the shallow feature extraction stage, and more weights can be allocated to small and weak objects to improve the ability of feature representation to detect small objects accurately. In addition, according to the characteristics of pedestrian detection data, the generation method and loss function of the preselected box are improved. The experimental results on INRIA and Caltech data sets show that the proposed improved model improves the detection effect and retrieval speed.
[1] 顾德英,罗聿伦,李文超.基于改进YOLOv5算法的复杂场景交通目标检测[J].东北大学学报(自然科学版),2022,43(8):1073-1079.
[2] 何东,陈金令,王熙.基于改进YOLOv3的红外行人目标检测[J].中国科技论文,2021,16(7):762-769.
[3] 谢永明,王红蕾.复杂背景下远距离及小尺寸行人检测改进算法[J].计算机工程与设计,2021,42(5):1323-1330.
[4] 翟明浩,张威,黄子龙,等.结合语义分割和特征融合的行人检测方法[J].东华大学学报(自然科学版),2021,47(2):65-72.
[5] ZHANG W W,MA X,ZHANG Y Z,et al.SMYOLO:Lightweight Pedestrian Target Detection Algorithm in Low-altitude Scenarios[J].Future Internet,2022,14(1):21.
[6] YANG L,ZHONG J H,ZHANG Y,et al.An Improving Faster-RCNN with Multi-attention ResNet for Small Target Detection in Intelligent Autonomous Transport with 6G[J].IEEE Transactions on Intelligent Transportation Systems,2023,24(7):7717-7725.
[7] DENG R,GUAN B,LI Z,et al.Pedestrian detection based on multi-layer feature fusion[J].Academic Journal of Computing & Information Science,2021,4(5).
[8] 赵世阳,王晓峰.基于多尺度上下文信息的遮挡行人检测[J].计算机工程与应用,2022,58(11):141-149.
[9] 刘胜利,郭裕兰,王刚.基于注意力机制与动态激活的空间目标检测算法[J].激光与光电子学进展,2022,59(14):236-242.
[10] 曹杰程,陶重犇.基于Stereo RCNN的锚引导3D目标检测算法[J].仪器仪表学报,2021,42(12):191-201.
[11] LYU S L,LI R Y,ZHAO Y W,et al.Green Citrus Detection and Counting in Orchards Based on YOLOv5-CS and AI Edge System[J].Sensors,2022,22(2):576.
[12] YAO J,QI J M,ZHANG J,et al.A Real-time Detection Algorithm for Kiwifruit Defects Based on YOLOv5[J].Electronics,2021,10(14):1711.
[13] BENJUMEA A,TEETI I,CUZZOLIN F,et al.YOLO-Z:Improving Small Object Detection in YOLOv5 for Autonomous Vehicles[J].arXiv preprint arXiv:2112.11798,2021.
[14] 郝帅,高山,马旭,等.基于跨尺度特征聚合与分层注意力映射的红外行人检测[J].光子学报,2022,51(6):419-435.
[15] 宣勇,韩超,沙文瀚.改进的Tiny YOLOv4算法及其在行人检测中的应用[J].激光与光电子学进展,2022,59(12):1215002.
[16] 张本康,胡滨.基于情景记忆的运动小目标行人检测神经网络[J].计算机工程与应用,2022,58(15):169-183.
[17] 何飞麒.四旋翼无人机俯拍视角下的行人检测与轨迹追踪[J].电子测量技术,2022,45(10):50-56.
基本信息:
中图分类号:TP391.41;TP183
引用信息:
[1]兰娅勋.基于改进YOLOv5网络的行人目标检测方法[J].微型电脑应用,2024,40(10):217-222.
2024-10-20
2024-10-20