Article(id=1200070661998736332, tenantId=1146029695717560320, journalId=1189918454225211397, issueId=1200070646895051378, articleNumber=null, orderNo=null, doi=10.20104/j.cnki.1674-6546.20240224, pmid=null, cstr=null, oa=null, hot=null, price=null, onlineType=0, articleFormat=0, articleType=null, articleTypeStr=null, receivedDate=null, receivedDateStr=null, revisedDate=1720800000000, revisedDateStr=2024-07-13, acceptedDate=null, acceptedDateStr=null, onlineDate=1764048742055, onlineDateStr=2025-11-25, pubDate=1723651200000, pubDateStr=2024-08-15, doiRegisterDate=null, doiRegisterDateStr=null, onlineIssueDate=1764048742055, onlineIssueDateStr=2025-11-25, onlineJustAcceptDate=null, onlineJustAcceptDateStr=null, onlineFirstDate=null, onlineFirstDateStr=null, sourceXml=null, magXml=null, createTime=1764048742055, creator=13701087609, updateTime=1764048742055, updator=13701087609, issue=Issue{id=1200070646895051378, tenantId=1146029695717560320, journalId=1189918454225211397, year='2024', volume='', issue='8', pageStart='1', pageEnd='48', issueExtLink='null', onlineDate='null', pubDate='null', beforeIssueId=null, nextIssueId=null, price=null, status=1, issueComplete=1, articleOrder=1, issueType=-1, specialIssue=null, createTime=1764048738454, creator=13701087609, updateTime=1764049350066, updator=13701087609, preIssue=null, nextIssue=null, ext={EN=IssueExt(id=1200073212257203051, tenantId=1146029695717560320, journalId=1189918454225211397, issueId=1200070646895051378, language=EN, specialIssueTitle=, coverIllustrator=null, specialIssueEditor=, specialIssueAbout=), CN=IssueExt(id=1200073212257203052, tenantId=1146029695717560320, journalId=1189918454225211397, issueId=1200070646895051378, language=CN, specialIssueTitle=, coverIllustrator=null, specialIssueEditor=, specialIssueAbout=)}, issueFiles=null}, startPage=1, endPage=7, ext={EN=ArticleExt(id=1200070662413972476, articleId=1200070661998736332, tenantId=1146029695717560320, journalId=1189918454225211397, language=EN, title=Vehicle and Pedestrian Target Detection Algorithm Based on Multi-Scale Feature Fusion, columnId=1200070647679386243, journalTitle=Automotive Engineer, columnName=Special Issue on Intelligent Vehicle Environmental Perception and Target Detection Technology, runingTitle=null, highlight=null, articleAbstract=

In response to the complex and diverse nature of the road traffic environment, where vehicle and pedestrian detection is prone to false and missed detections, this paper proposes a vehicle and pedestrian target detection algorithm YOLOv8-RC based on multi-scale feature fusion. Initially, the RCS-OSA module is introduced within the structure of the base network YOLOv8 to replace the original module, thereby enhancing and integrating the extracted feature information. Additionally, a lightweight Context-Aware Adaptive Feature Reorganization (CARAFE) is employed to replace the original upsampling operator, enhancing the network’s capability for global multi-scale information fusion. Subsequently, a detection dataset consisting of 6 000 images of vehicle and pedestrian targets is constructed through public datasets and network collection. The algorithm’s detection performance is quantitatively evaluated using accuracy, recall rate, mean Average Precision at a 50% intersection over union threshold (mAP50), and mAP50-95. Compared to YOLOv8-N, YOLOv8-RC demonstrates an improvement of 1.7 percentage in accuracy, 1.2 percentage in recall rate, 0.9 percentage in mAP50, and 0.5 percentage in mAP50-95, thus validating the algorithm’s effectiveness.

, correspAuthors=null, authorNote=null, correspAuthorsNote=null, copyrightStatement=null, copyrightOwner=null, extLink=null, articleAbsUrl=null, sourceXml=null, magXml=null, pdfUrl=null, pdf=null, pdfFileSize=null, pdfExtLink=null, richHtmlUrl=null, mobilePdfUrl=null, reviewReport=null, pdfFirstPage=null, abstractGraph=null, abstractGraphContent=null, abstractVideo=null, citation=null, cebUrl=null, magXmlContent=null, mapNumber=null, authorCompany=null, fund=null, authors=null, authorsList=Xiangheng Li, Hongsu Fang, Yalin Yang, Wei Yang), CN=ArticleExt(id=1200070665060577453, articleId=1200070661998736332, tenantId=1146029695717560320, journalId=1189918454225211397, language=CN, title=基于多尺度特征融合的车辆及行人目标检测算法*, columnId=1200070647847158411, journalTitle=汽车工程师, columnName=智能车辆环境感知与目标检测技术专刊, runingTitle=null, highlight=null, articleAbstract=

针对道路交通环境复杂多样,车辆及行人检测易出现错检及漏检的问题,提出一种基于多尺度特征融合的车辆及行人目标检测算法YOLOv8-RC。首先,在基础网络YOLOv8的结构中引入RCS-OSA模块代替原有模块,对所提取的特征信息进行增强及融合,并引入轻量级上采样算子内容感知特征重组(CARAFE)代替原上采样算子,提高网络对全局多尺度信息的融合能力。其次,通过公开数据集及网络收集的方式构建了由6 000张车辆及行人目标图片构成的检测数据集,并采用准确率、召回率、平均精度均值mAP50及mAP50-95对算法检测效果进行定量评价,相比于YOLOv8-N,YOLOv8-RC的精确率提升1.7百分点,召回率提升1.2百分点,mAP50提升0.9百分点,mAP50-95提升0.5百分点,证明了算法的有效性。

, correspAuthors=null, authorNote=null, correspAuthorsNote=null, copyrightStatement=null, copyrightOwner=null, extLink=null, articleAbsUrl=null, sourceXml=SM9UsYkGPFPhlO6XLVkcbg==, magXml=8QTL2cXXGRfJ5NNa7/8jcg==, pdfUrl=null, pdf=HuSMjPGEmjLqc/1O/awT1g==, pdfFileSize=8163198, pdfExtLink=null, richHtmlUrl=null, mobilePdfUrl=null, reviewReport=null, pdfFirstPage=null, abstractGraph=nBiZLwIvPZ6aUlpZTEbE6A==, abstractGraphContent=null, abstractVideo=null, citation=null, cebUrl=null, magXmlContent=OVuxnT3tiq1vij1byX+dpA==, mapNumber=null, authorCompany=null, fund=null, authors=null, authorsList=李相衡, 方虹苏, 杨娅琳, 杨炜)}, authors=[Author(id=1200070665798775012, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, orderNo=0, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=null, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1200070665945575664, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, authorId=1200070665798775012, language=EN, stringName=Xiangheng Li, firstName=Xiangheng, middleName=null, lastName=Li, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, address=1 Henan Polytechnic University, Jiaozuo 454000, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1200070666071404795, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, authorId=1200070665798775012, language=CN, stringName=李相衡, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, address=1 河南理工大学, 焦作 454000, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1200070665362567364, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, xref=1, ext=[AuthorCompanyExt(id=1200070665366761670, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, companyId=1200070665362567364, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1 Henan Polytechnic University, Jiaozuo 454000), AuthorCompanyExt(id=1200070665375150279, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, companyId=1200070665362567364, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1 河南理工大学, 焦作 454000)])]), Author(id=1200070666205622534, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, orderNo=1, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=null, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1200070666306285841, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, authorId=1200070666205622534, language=EN, stringName=Hongsu Fang, firstName=Hongsu, middleName=null, lastName=Fang, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=2, address=2 Chang’an University, Xi’an 710064, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1200070666423726364, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, authorId=1200070666205622534, language=CN, stringName=方虹苏, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=2, address=2 长安大学, 西安 710061, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1200070665647780049, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, xref=2, ext=[AuthorCompanyExt(id=1200070665660362964, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, companyId=1200070665647780049, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=2 Chang’an University, Xi’an 710064), AuthorCompanyExt(id=1200070665681334488, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, companyId=1200070665647780049, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=2 长安大学, 西安 710061)])]), Author(id=1200070666541166886, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, orderNo=2, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=null, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1200070666687967536, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, authorId=1200070666541166886, language=EN, stringName=Yalin Yang, firstName=Yalin, middleName=null, lastName=Yang, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=2, address=2 Chang’an University, Xi’an 710064, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1200070666817990967, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, authorId=1200070666541166886, language=CN, stringName=杨娅琳, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=2, address=2 长安大学, 西安 710061, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1200070665647780049, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, xref=2, ext=[AuthorCompanyExt(id=1200070665660362964, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, companyId=1200070665647780049, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=2 Chang’an University, Xi’an 710064), AuthorCompanyExt(id=1200070665681334488, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, companyId=1200070665647780049, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=2 长安大学, 西安 710061)])]), Author(id=1200070666964791615, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, orderNo=3, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=null, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1200070667099009354, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, authorId=1200070666964791615, language=EN, stringName=Wei Yang, firstName=Wei, middleName=null, lastName=Yang, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=2, address=2 Chang’an University, Xi’an 710064, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1200070667262587220, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, authorId=1200070666964791615, language=CN, stringName=杨炜, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=2, address=2 长安大学, 西安 710061, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1200070665647780049, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, xref=2, ext=[AuthorCompanyExt(id=1200070665660362964, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, companyId=1200070665647780049, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=2 Chang’an University, Xi’an 710064), AuthorCompanyExt(id=1200070665681334488, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, companyId=1200070665647780049, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=2 长安大学, 西安 710061)])])], keywords=[Keyword(id=1200070668487324000, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, language=EN, orderNo=1, keyword=Deep learning), Keyword(id=1200070668604764522, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, language=EN, orderNo=2, keyword=Target detection), Keyword(id=1200070668764148087, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, language=EN, orderNo=3, keyword=YOLOv8), Keyword(id=1200070668902560131, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, language=EN, orderNo=4, keyword=Pedestrian detection), Keyword(id=1200070669049360781, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, language=CN, orderNo=1, keyword=深度学习), Keyword(id=1200070669150024084, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, language=CN, orderNo=2, keyword=目标检测), Keyword(id=1200070669280047523, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, language=CN, orderNo=3, keyword=YOLOv8), Keyword(id=1200070669422653868, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, language=CN, orderNo=4, keyword=行人检测)], refs=[Reference(id=1200070671779852880, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, doi=null, pmid=null, pmcid=null, year=2024, volume=20, issue=7, pageStart=430, pageEnd=436, url=null, language=null, rfNumber=[1], rfOrder=0, authorNames=LIU P Y, MA Y X, journalName=Optoelectronics Letters, refType=null, unstructuredReference=LIU P Y, MA Y X. Key Points and Visible Part Fusion Attention Network for Occluded Pedestrian Detection in Traffic Environments[J]. Optoelectronics Letters, 2024, 20(7): 430-436., articleTitle=Key Points and Visible Part Fusion Attention Network for Occluded Pedestrian Detection in Traffic Environments, refAbstract=null), Reference(id=1200070672966840916, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, doi=null, pmid=null, pmcid=null, year=2024, volume=null, issue=null, pageStart=249, pageEnd=null, url=null, language=null, rfNumber=[2], rfOrder=1, authorNames=JAIN D K, ZHAO X D, GARCIA S, journalName=Expert Systems with Applications, refType=null, unstructuredReference=JAIN D K, ZHAO X D, GARCIA S, et al. Robust Multi-Modal Pedestrian Detection Using Deep Convolutional Neural Network with Ensemble Learning Model[J]. Expert Systems with Applications, 2024, 249., articleTitle=Robust Multi-Modal Pedestrian Detection Using Deep Convolutional Neural Network with Ensemble Learning Model, refAbstract=null), Reference(id=1200070673218499166, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, doi=null, pmid=null, pmcid=null, year=2024, volume=21, issue=4, pageStart=5782, pageEnd=5802, url=null, language=null, rfNumber=[3], rfOrder=2, authorNames=LIANG W J, journalName=Mathematical Biosciences and Engineering, refType=null, unstructuredReference=LIANG W J. Research on a Vehicle and Pedestrian Detection Algorithm Based on Improved Attention and Feature Fusion[J]. Mathematical Biosciences and Engineering, 2024, 21(4): 5782-5802., articleTitle=Research on a Vehicle and Pedestrian Detection Algorithm Based on Improved Attention and Feature Fusion, refAbstract=null), Reference(id=1200070673365299815, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, doi=null, pmid=null, pmcid=null, year=2024, volume=6, issue=2, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[4], rfOrder=3, authorNames=KIRAN K V, DASH S, PARIDA P, journalName=Engineering Research Express, refType=null, unstructuredReference=KIRAN K V, DASH S, PARIDA P. Vehicle Detection in Varied Weather Conditions using Enhanced Deep YOLO with Complex Wavelet[J]. Engineering Research Express, 2024, 6(2)., articleTitle=Vehicle Detection in Varied Weather Conditions using Enhanced Deep YOLO with Complex Wavelet, refAbstract=null), Reference(id=1200070673503711853, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, doi=null, pmid=null, pmcid=null, year=2024, volume=21, issue=3, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[5], rfOrder=4, authorNames=ANDIKA J L, KHAIRUDDIN A S M, RAMIAH H, journalName=Journal of Real-Time Image Processing, refType=null, unstructuredReference=ANDIKA J L, KHAIRUDDIN A S M, RAMIAH H, et al. Improved Feature Extraction Network in Lightweight YOLOv7 Model for Real-Time Vehicle Detection on Low-Cost Hardware[J]. Journal of Real-Time Image Processing, 2024, 21(3)., articleTitle=Improved Feature Extraction Network in Lightweight YOLOv7 Model for Real-Time Vehicle Detection on Low-Cost Hardware, refAbstract=null), Reference(id=1200070673637929587, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, doi=null, pmid=null, pmcid=null, year=2024, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[6], rfOrder=5, authorNames=胡倩, 皮建勇, 胡伟超, journalName=计算机工程, refType=null, unstructuredReference=胡倩, 皮建勇, 胡伟超, 等. 基于改进YOLOv5的密集行人检测算法研究[J/OL]. 计算机工程(2024-04-30)[2024-06-25]. https://doi.org/10.19678/j.issn.1000-3428.0068753., articleTitle=基于改进YOLOv5的密集行人检测算法研究, refAbstract=null), Reference(id=1200070673738592891, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, doi=null, pmid=null, pmcid=null, year=2024, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[6], rfOrder=6, authorNames=HU Q, PI J Y, HU W C, journalName=Computer Engineering, refType=null, unstructuredReference=HU Q, PI J Y, HU W C, et al. Research on Dense Pedestrian Detection Algorithm Based on Improved YOLOv5[J/OL]. Computer Engineering (2024-04-30)[2024-06-25]. https://doi.org/10.19678/j.issn.1000-3428.0068753., articleTitle=Research on Dense Pedestrian Detection Algorithm Based on Improved YOLOv5, refAbstract=null), Reference(id=1200070673864422015, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, doi=null, pmid=null, pmcid=null, year=2024, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[7], rfOrder=7, authorNames=黄丹丹, 张新茹, 刘智, journalName=吉林大学学报(工学版), refType=null, unstructuredReference=黄丹丹, 张新茹, 刘智, 等. 基于Transformer的双分支检测和重识别的多行人追踪[J/OL]. 吉林大学学报(工学版)(2024-03-28)[2024-06-25]. https://doi.org/10.13229/j.cnki.jdxbgxb.20231415., articleTitle=基于Transformer的双分支检测和重识别的多行人追踪, refAbstract=null), Reference(id=1200070674011222663, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, doi=null, pmid=null, pmcid=null, year=2024, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[7], rfOrder=8, authorNames=HUANG D D, ZHANG X R, LIU Z, journalName=Journal of Jilin University (Engineering Edition), refType=null, unstructuredReference=HUANG D D, ZHANG X R, LIU Z, et al. Multi Pedestrian Tracking Based on Transformer’s Dual Branch Detection and Re-Identification[J/OL]. Journal of Jilin University (Engineering Edition) (2024-03-28) [2024-06-25]. https://doi.org/10.13229/j.cnki.jdxbgxb.20231415., articleTitle=Multi Pedestrian Tracking Based on Transformer’s Dual Branch Detection and Re-Identification, refAbstract=null), Reference(id=1200070674107691662, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, doi=null, pmid=null, pmcid=null, year=2024, volume=60, issue=1, pageStart=87, pageEnd=96, url=null, language=null, rfNumber=[8], rfOrder=9, authorNames=李林红, 杨杰, 冯志成, journalName=南京大学学报(自然科学), refType=null, unstructuredReference=李林红, 杨杰, 冯志成, 等. 面向站口行人检测的改进型Yolov5s算法[J]. 南京大学学报(自然科学), 2024, 60(1): 87-96., articleTitle=面向站口行人检测的改进型Yolov5s算法, refAbstract=null), Reference(id=1200070674241909395, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, doi=null, pmid=null, pmcid=null, year=2024, volume=60, issue=1, pageStart=87, pageEnd=96, url=null, language=null, rfNumber=[8], rfOrder=10, authorNames=LI L H, YANG J, FENG Z C, journalName=Journal of Nanjing University (Natural Science), refType=null, unstructuredReference=LI L H, YANG J, FENG Z C, et al. Improved Yolov5s Algorithm for Pedestrian Detection at Station Entrances[J]. Journal of Nanjing University (Natural Science), 2024, 60(1): 87-96., articleTitle=Improved Yolov5s Algorithm for Pedestrian Detection at Station Entrances, refAbstract=null), Reference(id=1200070674350961303, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, doi=null, pmid=null, pmcid=null, year=2024, volume=44, issue=3, pageStart=149, pageEnd=160, url=null, language=null, rfNumber=[9], rfOrder=11, authorNames=周力, 惠飞, 张嘉洋, journalName=长安大学学报(自然科学版), refType=null, unstructuredReference=周力, 惠飞, 张嘉洋, 等. 基于RDB-YOLOv5的遥感图像车辆检测[J]. 长安大学学报(自然科学版), 2024, 44(3): 149-160., articleTitle=基于RDB-YOLOv5的遥感图像车辆检测, refAbstract=null), Reference(id=1200070674451624605, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, doi=null, pmid=null, pmcid=null, year=2024, volume=44, issue=3, pageStart=149, pageEnd=160, url=null, language=null, rfNumber=[9], rfOrder=12, authorNames=ZHOU L, HUI F, ZHANG J Y, journalName=Journal of Chang’an University (Natural Science Edition), refType=null, unstructuredReference=ZHOU L, HUI F, ZHANG J Y, et al. Remote Sensing Image Vehicle Detection Based on RDB-YOLOv5[J]. Journal of Chang’an University (Natural Science Edition), 2024, 44(3): 149-160., articleTitle=Remote Sensing Image Vehicle Detection Based on RDB-YOLOv5, refAbstract=null), Reference(id=1200070674539704990, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, doi=null, pmid=null, pmcid=null, year=2024, volume=39, issue=1, pageStart=87, pageEnd=94, url=null, language=null, rfNumber=[10], rfOrder=13, authorNames=沈正坤, 王正超, 尹怀仙, journalName=青岛大学学报(工程技术版), refType=null, unstructuredReference=沈正坤, 王正超, 尹怀仙, 等. 基于VOD-YOLOv5的车辆检测算法研究[J]. 青岛大学学报(工程技术版), 2024, 39(1): 87-94., articleTitle=基于VOD-YOLOv5的车辆检测算法研究, refAbstract=null), Reference(id=1200070674661339812, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, doi=null, pmid=null, pmcid=null, year=2024, volume=39, issue=1, pageStart=87, pageEnd=94, url=null, language=null, rfNumber=[10], rfOrder=14, authorNames=SHEN Z K, WANG Z C, YIN H X, journalName=Journal of Qingdao University (Engineering Technology Edition), refType=null, unstructuredReference=SHEN Z K, WANG Z C, YIN H X, et al. Research on Vehicle Detection Algorithm Based on VOD-YOLOv5[J]. Journal of Qingdao University (Engineering Technology Edition), 2024, 39(1): 87-94., articleTitle=Research on Vehicle Detection Algorithm Based on VOD-YOLOv5, refAbstract=null), Reference(id=1200070674787168937, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, doi=null, pmid=null, pmcid=null, year=2024, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[11], rfOrder=15, authorNames=DILLON R, JORDAN K, JACQUELINE H, journalName=null, refType=null, unstructuredReference=DILLON R, JORDAN K, JACQUELINE H, et al. Real-Time Flying Object Detection with YOLOv8[EB/OL]. (2024-05-22) [2024-06-25]. https://doi.org/10.48550/arXiv.2305.09972., articleTitle=Real-Time Flying Object Detection with YOLOv8, refAbstract=null), Reference(id=1200070674912998061, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, doi=null, pmid=null, pmcid=null, year=2023, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[12], rfOrder=16, authorNames=KANG M, TING C M, TING F F, journalName=null, refType=null, unstructuredReference=KANG M, TING C M, TING F F, et al. RCS-YOLO: A Fast and High-Accuracy Object Detector for Brain Tumor Detection[EB/OL]. (2023-10-03) [2024-06-25]. https://doi.org/10.48550/arXiv.2307.16412., articleTitle=RCS-YOLO: A Fast and High-Accuracy Object Detector for Brain Tumor Detection, refAbstract=null), Reference(id=1200070675030438578, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, doi=null, pmid=null, pmcid=null, year=2019, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[13], rfOrder=17, authorNames=WANG J Q, CHEN K, XU R, journalName=null, refType=null, unstructuredReference=WANG J Q, CHEN K, XU R, et al. CARAFE: Content-Aware ReAssembly of FEatures[EB/OL]. (2019-10-29) [2024-06-25]. https://doi.org/10.48550/arXiv.1905.02188., articleTitle=CARAFE: Content-Aware ReAssembly of FEatures, refAbstract=null)], funds=[Fund(id=1200070671654023748, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, awardId=300102229112, language=CN, fundingSource=*中央高校基金项目(300102229112), fundOrder=null, country=null)], companyList=[AuthorCompany(id=1200070665362567364, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, xref=1, ext=[AuthorCompanyExt(id=1200070665366761670, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, companyId=1200070665362567364, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1 Henan Polytechnic University, Jiaozuo 454000), AuthorCompanyExt(id=1200070665375150279, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, companyId=1200070665362567364, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1 河南理工大学, 焦作 454000)]), AuthorCompany(id=1200070665647780049, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, xref=2, ext=[AuthorCompanyExt(id=1200070665660362964, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, companyId=1200070665647780049, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=2 Chang’an University, Xi’an 710064), AuthorCompanyExt(id=1200070665681334488, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, companyId=1200070665647780049, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=2 长安大学, 西安 710061)])], figs=[ArticleFig(id=1200070669586231739, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, language=EN, label=null, caption=null, figureFileSmall=ieqBJn+RnpqTr7MvcutRgw==, figureFileBig=nBiZLwIvPZ6aUlpZTEbE6A==, tableContent=null), ArticleFig(id=1200070669703672261, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, language=CN, label=图1, caption=YOLOv8-RC网络架构, figureFileSmall=ieqBJn+RnpqTr7MvcutRgw==, figureFileBig=nBiZLwIvPZ6aUlpZTEbE6A==, tableContent=null), ArticleFig(id=1200070669942747607, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, language=EN, label=null, caption=null, figureFileSmall=kw5qX3AVM74n165FxQiLww==, figureFileBig=8Lbtf7e3erzgRUEk5e0enQ==, tableContent=null), ArticleFig(id=1200070670039216608, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, language=CN, label=图2, caption=RCS-OSA原理, figureFileSmall=kw5qX3AVM74n165FxQiLww==, figureFileBig=8Lbtf7e3erzgRUEk5e0enQ==, tableContent=null), ArticleFig(id=1200070670160851431, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, language=EN, label=null, caption=null, figureFileSmall=JefkdKGI0q6yMx9MPqbfMw==, figureFileBig=3e86XYU8uV+7m8RpnQKemw==, tableContent=null), ArticleFig(id=1200070670395732460, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, language=CN, label=图3, caption=轻量级上采样算子CARAFE原理, figureFileSmall=JefkdKGI0q6yMx9MPqbfMw==, figureFileBig=3e86XYU8uV+7m8RpnQKemw==, tableContent=null), ArticleFig(id=1200070670525755893, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, language=EN, label=null, caption=null, figureFileSmall=RjTmW6JbXczrCfm2hDq5Qg==, figureFileBig=lBwg8PMQgGETPk6P9FwFgg==, tableContent=null), ArticleFig(id=1200070670622224894, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, language=CN, label=图4, caption=YOLOv8-RC及对比模型检测结果可视化, figureFileSmall=RjTmW6JbXczrCfm2hDq5Qg==, figureFileBig=lBwg8PMQgGETPk6P9FwFgg==, tableContent=null), ArticleFig(id=1200070670714499590, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, language=EN, label=null, caption=null, figureFileSmall=null, figureFileBig=null, tableContent=
配置名称 版本参数
操作系统 Windows 11
图形处理器 NVIDA GeForce RTX 4080
处理器 13th Gen Intel Core i9-13900H
深度学习框架 PyTorch 2.2.0
Python版本 Python 3.9.18
CUDA版本 CUDA 11.8
), ArticleFig(id=1200070670827745802, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, language=CN, label=表1, caption=

试验软硬件环境

, figureFileSmall=null, figureFileBig=null, tableContent=
配置名称 版本参数
操作系统 Windows 11
图形处理器 NVIDA GeForce RTX 4080
处理器 13th Gen Intel Core i9-13900H
深度学习框架 PyTorch 2.2.0
Python版本 Python 3.9.18
CUDA版本 CUDA 11.8
), ArticleFig(id=1200070670940992019, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, language=EN, label=null, caption=null, figureFileSmall=null, figureFileBig=null, tableContent=
名称 数值
图片大小 480×480
初始学习率 0.01
批量大小(Batch Size) 16
优化器 随机梯度下降(SGD)
权重衰减系数 0.000 5
轮次/轮 200
), ArticleFig(id=1200070671045849626, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, language=CN, label=表2, caption=

各模型超参数

, figureFileSmall=null, figureFileBig=null, tableContent=
名称 数值
图片大小 480×480
初始学习率 0.01
批量大小(Batch Size) 16
优化器 随机梯度下降(SGD)
权重衰减系数 0.000 5
轮次/轮 200
), ArticleFig(id=1200070671163290148, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, language=EN, label=null, caption=null, figureFileSmall=null, figureFileBig=null, tableContent=
模型 P R mAP50 mAP50-95
YOLOv3-tiny 82.7 68.5 78.4 52.0
YOLOv5-N 87.0 81.4 90.2 66.2
YOLOv6-N 88.6 81.1 90.4 66.5
YOLOv8-N 86.2 82.2 90.3 67.7
YOLOv8-RC 87.9 83.4 91.2 68.2
), ArticleFig(id=1200070671268147754, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, language=CN, label=表3, caption=

对比模型试验结果 %

, figureFileSmall=null, figureFileBig=null, tableContent=
模型 P R mAP50 mAP50-95
YOLOv3-tiny 82.7 68.5 78.4 52.0
YOLOv5-N 87.0 81.4 90.2 66.2
YOLOv6-N 88.6 81.1 90.4 66.5
YOLOv8-N 86.2 82.2 90.3 67.7
YOLOv8-RC 87.9 83.4 91.2 68.2
), ArticleFig(id=1200070671381393974, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, language=EN, label=null, caption=null, figureFileSmall=null, figureFileBig=null, tableContent=
模型 YOLOv8-N Model 1 YOLOv8-RC
RCS-OSA ×
CARAFE × ×
P/% 86.2 88.0 87.9
R/% 82.2 82.0 83.4
mAP50/% 90.3 90.8 91.2
mAP50-95/% 67.7 68.1 68.2
), ArticleFig(id=1200070671486251575, tenantId=1146029695717560320, journalId=1189918454225211397, articleId=1200070661998736332, language=CN, label=表4, caption=

消融实验结果

, figureFileSmall=null, figureFileBig=null, tableContent=
模型 YOLOv8-N Model 1 YOLOv8-RC
RCS-OSA ×
CARAFE × ×
P/% 86.2 88.0 87.9
R/% 82.2 82.0 83.4
mAP50/% 90.3 90.8 91.2
mAP50-95/% 67.7 68.1 68.2
)], attaches=null, journal=Journal(id=1189918244568731652, delFlag=0, nameCn=汽车工程师, nameEn=Automotive Engineer, nameHistory1=null, nameHistory2=null, issn=1674-6546, eissn=null, cn=22-1432/U, coden=null, periodic=0, language=CN, oaType=null, ccby=null, superviseOffice=null, ownerOffice=null, pubOffice=null, editorOffice=null, officeType=null, aims=null, clcCode=null, officeProv=null, officeCity=null, officeAddr=null, officeZip=null, officeEmail=null, officePhone=null, editDirector=null, officeDirector=null, officeDirectorPhone=null, officeStaffNum=null, officeEmpNum=null, coverPicUrl=+bJsKkKt/pjz9u6EwhnksQ==, journalPrice=null, startedYear=null, abbrevIsoEn=null, journalRemark=null, publicationField=null, createdTime=1761628217121, updatedTime=1761735708780, createdBy=18614031015, updatedBy=13701087609, firstLetterCn=A, firstLetterEn=A, subjectCode=Engineering, subjectName=Engineering, subjectCodeEn=Engineering, subjectNameEn=null, picCn=+bJsKkKt/pjz9u6EwhnksQ==, picEn=O3Sn3tnYYrh/jm6emnnMWA==, jcr=null, cjcr=null, exts=[JournalExt(id=1190369097415233706, language=CN, name=汽车工程师, nameHistory1=null, nameHistory2=null, managedBy=, sponsoredBy=, publishedBy=, editorOffice=, officeProv=null, officeCity=null, officeAddr=, officeZip=, editDirector=, officeDirector=null, officePhone=null, coverPicUrl=null, journalRemark=, submitArticleUrl=null, websiteUrl=, createdTime=1761735708812, updatedTime=1761735708812, createdBy=13701087609, updatedBy=13701087609, submissionGuidelinesUrl=, submissionAuthorUrl=https://tjqc.cbpt.cnki.net/index.aspx?t=1, submissionEditorUrl=https://tjqc.cbpt.cnki.net/index.aspx?t=3, submissionReviewUrl=https://tjqc.cbpt.cnki.net/index.aspx?t=2, submissionCeEditorUrl=, submissionAeEditorUrl=, option={"copyright":""}), JournalExt(id=1190369097553645739, language=EN, name=Automotive Engineer, nameHistory1=null, nameHistory2=null, managedBy=, sponsoredBy=, publishedBy=, editorOffice=, officeProv=null, officeCity=null, officeAddr=, officeZip=, editDirector=, officeDirector=null, officePhone=null, coverPicUrl=null, journalRemark=, submitArticleUrl=null, websiteUrl=, createdTime=1761735708845, updatedTime=1761735708845, createdBy=13701087609, updatedBy=13701087609, submissionGuidelinesUrl=, submissionAuthorUrl=https://tjqc.cbpt.cnki.net/index.aspx?t=1, submissionEditorUrl=https://tjqc.cbpt.cnki.net/index.aspx?t=3, submissionReviewUrl=https://tjqc.cbpt.cnki.net/index.aspx?t=2, submissionCeEditorUrl=, submissionAeEditorUrl=, option={"copyright":""})], databaseList=null, tenantJournalId=1189918454225211397, websiteList=[Website(id=1189918982430847716, webName=null, webTitle=null, webDomain=null, webCopyrigh=null, webIpcNo=null, seoTitle=null, seoKeywords=null, seoDescription=null, tenantJournalId=null, journalId=1189918454225211397, journalNameCn=null, journalNameEn=null, grayFlag=null, tenantId=1146029695717560320, platformId=null, journalGroupId=null, journalGroupNameCn=null, journalGroupNameEn=null, type=1, domain=https://castjournals.cast.org.cn/joweb/qcgcs/CN, language=CN, createTime=1761628393037, createBy=18614031015, updateTime=1761628422913, updateBy=18614031015, name=汽车工程师-中文, tplId=1146099689490845704, title=汽车工程师, delFlag=0, indexPage=/home, props=[WebsiteProps(id=1189919800185917791, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189918982430847716, code=articleTextType, value=kx, createTime=1761628588005, updateTime=1761628588005, creator=18614031015, updator=18614031015), WebsiteProps(id=1189919800164946268, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189918982430847716, code=banner, value=null, createTime=1761628588000, updateTime=1761628588000, creator=18614031015, updator=18614031015), WebsiteProps(id=1189919800211083618, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189918982430847716, code=grayFlag, value=0, createTime=1761628588011, updateTime=1761628588011, creator=18614031015, updator=18614031015), WebsiteProps(id=1189919800156557659, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189918982430847716, code=logo, value=https://castjournals.cast.org.cn/joweb/qcgcs/CN/file/pic?fileId=yiZ96RYoYcnGnRMuWdmkWA==, createTime=1761628587998, updateTime=1761628587998, creator=18614031015, updator=18614031015), WebsiteProps(id=1189919800223666532, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189918982430847716, code=minRunFlag, value=0, createTime=1761628588014, updateTime=1761628588014, creator=18614031015, updator=18614031015), WebsiteProps(id=1189919800181723486, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189918982430847716, code=picServerUrl, value=https://castjournals.cast.org.cn/joweb/qcgcs/CN/file/pic, createTime=1761628588004, updateTime=1761628588004, creator=18614031015, updator=18614031015), WebsiteProps(id=1189919800215277923, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189918982430847716, code=silenceFlag, value=0, createTime=1761628588012, updateTime=1761628588012, creator=18614031015, updator=18614031015), WebsiteProps(id=1189919800173334877, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189918982430847716, code=staticResourcePath, value=https://castjournals.cast.org.cn/joweb/cast_kjdb_cn_619/, createTime=1761628588002, updateTime=1761628588002, creator=18614031015, updator=18614031015), WebsiteProps(id=1189919800194306400, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189918982430847716, code=themeColor, value=null, createTime=1761628588007, updateTime=1761628588007, creator=18614031015, updator=18614031015), WebsiteProps(id=1189919800202695009, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189918982430847716, code=themeStyle, value=null, createTime=1761628588009, updateTime=1761628588009, creator=18614031015, updator=18614031015)]), Website(id=1189918982527316711, webName=null, webTitle=null, webDomain=null, webCopyrigh=null, webIpcNo=null, seoTitle=null, seoKeywords=null, seoDescription=null, tenantJournalId=null, journalId=1189918454225211397, journalNameCn=null, journalNameEn=null, grayFlag=null, tenantId=1146029695717560320, platformId=null, journalGroupId=null, journalGroupNameCn=null, journalGroupNameEn=null, type=1, domain=https://castjournals.cast.org.cn/joweb/qcgcs/EN, language=EN, createTime=1761628393061, createBy=18614031015, updateTime=1761628543075, updateBy=18614031015, name=汽车工程师-英文, tplId=1146101810881728533, title=Automotive Engineer, delFlag=0, indexPage=/home, props=[WebsiteProps(id=1189919837561352952, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189918982527316711, code=articleTextType, value=kx, createTime=1761628596916, updateTime=1761628596916, creator=18614031015, updator=18614031015), WebsiteProps(id=1189919837540381429, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189918982527316711, code=banner, value=null, createTime=1761628596911, updateTime=1761628596911, creator=18614031015, updator=18614031015), WebsiteProps(id=1189919837582324475, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189918982527316711, code=grayFlag, value=0, createTime=1761628596921, updateTime=1761628596921, creator=18614031015, updator=18614031015), WebsiteProps(id=1189919837527798516, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189918982527316711, code=logo, value=https://castjournals.cast.org.cn/joweb/qcgcs/EN/file/pic?fileId=yiZ96RYoYcnGnRMuWdmkWA==, createTime=1761628596908, updateTime=1761628596908, creator=18614031015, updator=18614031015), WebsiteProps(id=1189919837594907389, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189918982527316711, code=minRunFlag, value=0, createTime=1761628596924, updateTime=1761628596924, creator=18614031015, updator=18614031015), WebsiteProps(id=1189919837557158647, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189918982527316711, code=picServerUrl, value=https://castjournals.cast.org.cn/joweb/qcgcs/EN/file/pic, createTime=1761628596915, updateTime=1761628596915, creator=18614031015, updator=18614031015), WebsiteProps(id=1189919837586518780, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189918982527316711, code=silenceFlag, value=0, createTime=1761628596922, updateTime=1761628596922, creator=18614031015, updator=18614031015), WebsiteProps(id=1189919837548770038, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189918982527316711, code=staticResourcePath, value=https://castjournals.cast.org.cn/joweb/cast_kjdb_en_623/, createTime=1761628596913, updateTime=1761628596913, creator=18614031015, updator=18614031015), WebsiteProps(id=1189919837569741561, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189918982527316711, code=themeColor, value=null, createTime=1761628596918, updateTime=1761628596918, creator=18614031015, updator=18614031015), WebsiteProps(id=1189919837573935866, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189918982527316711, code=themeStyle, value=null, createTime=1761628596919, updateTime=1761628596919, creator=18614031015, updator=18614031015)])], journalTitle=汽车工程师, weixinUrl=null, journalUrl=https://tjqc.cbpt.cnki.net/, iacademicId=null, status=1, seqNo=null, journalTitleEn=Automotive Engineer, journalPhotoCn=+bJsKkKt/pjz9u6EwhnksQ==, journalPhotoEn=O3Sn3tnYYrh/jm6emnnMWA==, journalFirstLetter=A, journalRecommend=null, journalNew=null, journalCollection=null, jcrJf=null, cjcrJf=null, jcrJfStr=null, cjcrJfStr=null, submissionFirstDecision=null, sciSubjectClassification=null, casSubjectClassification=null, citeScore=null, totalCitationFrequency=null, icpCode=null, psCode=null, advertisingLicenseCode=null, copyrightInformation=null, country=null, option=, provinceCode=null, provinceName=null, collectFlag=false), detailUrlCn=https://castjournals.cast.org.cn/joweb/qcgcs/CN/10.20104/j.cnki.1674-6546.20240224, detailUrlEn=https://castjournals.cast.org.cn/joweb/qcgcs/EN/10.20104/j.cnki.1674-6546.20240224, pdfUrlCn=https://castjournals.cast.org.cn/joweb/qcgcs/CN/PDF/10.20104/j.cnki.1674-6546.20240224, pdfUrlEn=https://castjournals.cast.org.cn/joweb/qcgcs/EN/PDF/10.20104/j.cnki.1674-6546.20240224, aliStartDate=null, aliEndDate=null, collectionFlag=false, citedCount=null, citedUrl=null, reference=null)
收藏切换
基于多尺度特征融合的车辆及行人目标检测算法*
收藏切换
PDF下载
李相衡 1 , 方虹苏 2 , 杨娅琳 2 , 杨炜 2
汽车工程师 | 智能车辆环境感知与目标检测技术专刊 2024,(8): 1-7
收起
收藏切换
汽车工程师 | 智能车辆环境感知与目标检测技术专刊 2024, (8): 1-7
基于多尺度特征融合的车辆及行人目标检测算法*
全屏
李相衡1, 方虹苏2, 杨娅琳2, 杨炜2
作者信息
  • 1 河南理工大学, 焦作 454000
  • 2 长安大学, 西安 710061
Vehicle and Pedestrian Target Detection Algorithm Based on Multi-Scale Feature Fusion
Xiangheng Li1, Hongsu Fang2, Yalin Yang2, Wei Yang2
Affiliations
  • 1 Henan Polytechnic University, Jiaozuo 454000
  • 2 Chang’an University, Xi’an 710064
出版时间: 2024-08-15 doi: 10.20104/j.cnki.1674-6546.20240224
文章导航
收藏切换

针对道路交通环境复杂多样,车辆及行人检测易出现错检及漏检的问题,提出一种基于多尺度特征融合的车辆及行人目标检测算法YOLOv8-RC。首先,在基础网络YOLOv8的结构中引入RCS-OSA模块代替原有模块,对所提取的特征信息进行增强及融合,并引入轻量级上采样算子内容感知特征重组(CARAFE)代替原上采样算子,提高网络对全局多尺度信息的融合能力。其次,通过公开数据集及网络收集的方式构建了由6 000张车辆及行人目标图片构成的检测数据集,并采用准确率、召回率、平均精度均值mAP50及mAP50-95对算法检测效果进行定量评价,相比于YOLOv8-N,YOLOv8-RC的精确率提升1.7百分点,召回率提升1.2百分点,mAP50提升0.9百分点,mAP50-95提升0.5百分点,证明了算法的有效性。

深度学习  /  目标检测  /  YOLOv8  /  行人检测

In response to the complex and diverse nature of the road traffic environment, where vehicle and pedestrian detection is prone to false and missed detections, this paper proposes a vehicle and pedestrian target detection algorithm YOLOv8-RC based on multi-scale feature fusion. Initially, the RCS-OSA module is introduced within the structure of the base network YOLOv8 to replace the original module, thereby enhancing and integrating the extracted feature information. Additionally, a lightweight Context-Aware Adaptive Feature Reorganization (CARAFE) is employed to replace the original upsampling operator, enhancing the network’s capability for global multi-scale information fusion. Subsequently, a detection dataset consisting of 6 000 images of vehicle and pedestrian targets is constructed through public datasets and network collection. The algorithm’s detection performance is quantitatively evaluated using accuracy, recall rate, mean Average Precision at a 50% intersection over union threshold (mAP50), and mAP50-95. Compared to YOLOv8-N, YOLOv8-RC demonstrates an improvement of 1.7 percentage in accuracy, 1.2 percentage in recall rate, 0.9 percentage in mAP50, and 0.5 percentage in mAP50-95, thus validating the algorithm’s effectiveness.

Deep learning  /  Target detection  /  YOLOv8  /  Pedestrian detection
李相衡, 方虹苏, 杨娅琳, 杨炜. 基于多尺度特征融合的车辆及行人目标检测算法*. 汽车工程师, 2024 , (8) : 1 -7 . DOI: 10.20104/j.cnki.1674-6546.20240224
Xiangheng Li, Hongsu Fang, Yalin Yang, Wei Yang. Vehicle and Pedestrian Target Detection Algorithm Based on Multi-Scale Feature Fusion[J]. Automotive Engineer, 2024 , (8) : 1 -7 . DOI: 10.20104/j.cnki.1674-6546.20240224
车辆及行人检测作为当前无人驾驶领域的核心技术之一,旨在自动识别图像或视频中的车辆及行人区域。目前,道路车辆及行人检测算法可分为传统检测算法和基于深度学习的检测算法。传统检测算法在无遮挡物、车辆及行人目标清晰的道路上检测效果较好,但难以应用于无人驾驶场景。
深度学习算法应用于图像处理领域,可以实现高精度的车辆及行人目标检测,为辅助驾驶系统提供安全基础。Liu等[1]提出一种用于检测被遮挡行人的关键点和可视部分融合网络,引入人体关键点和可见部分包围盒构建2个注意力模块,分别抑制行人的通道特征和空间特征中的遮挡部分。Liang等[2]提出一种基于增强YOLOv5s的道路目标检测算法,引入加权增强极化自注意力增强特征表达,提高了算法对复杂场景的适应性。Jain等[3]使用集成学习模型和深度卷积神经网络的鲁棒多模态行人检测模型实现了精确的行人检测。Kiran等[4]使用改进YOLOv4进行了不同光照条件下运动车辆的识别。Andika等[5]改进了轻量级YOLOv7模型中的特征提取网络,实现了低成本硬件上的实时车辆检测。胡倩等[6]基于改进YOLOv5算法实现了密集场景下的行人检测。黄丹丹[7]基于Transformer实现了双分支检测和重识别的多行人追踪。李林红[8]等提出一种改进的YOLOv5s模型,用于对站口行人的高效检测。周力等[9]将基于旋转边界框(Rotated Bounding Box,RDB)的YOLOv5模型应用于遥感图像车辆检测领域。沈正坤等[10]提出基于YOLOv5的车辆目标检测算法,提升了模糊图像中小目标的检测精度。
上述文献利用不同检测算法实现了车辆及行人的检测,但在实际检测中,仍然会出现各种错检及漏检问题。基于此,本文提出一种基于改进YOLOv8的车辆及行人目标检测算法YOLOv8-RC,首先引入多尺度特征提取模块RCS-OSA(RepVGG/RepConv ShuffleNet based One-Shot Aggregation)加强网络的特征融合能力,然后将YOLOv8的上采样算子替换为轻量级的内容感知特征重组(Content-Aware ReAssembly of FEatures,CARAFE)模块,从而在网络上采样的过程中再次提高其特征融合能力。最后,构建车辆及行人检测数据集,对改进算法进行综合评估。
YOLOv8目标检测算法[11]总体架构包括3个部分:骨干(Backbone)模块对输入的图像进行下采样操作,可利用不同的网络深度得到不同层次的特征图信息;颈部(Neck)模块包含上采样及多个特征融合层等系列网络模块;头部(Head)模块用于输出检测结果,并对网络所提取的特征信息进行解码,同时负责预测每个候选框的类别及位置。在车辆及行人检测任务中,为对所提取的特征信息进行更好地融合及增强,本文在YOLOv8网络中引入一种多尺度特征融合模块RCS-OSA。此外,为更好地保留及提取特征,进一步加强网络的特征融合能力,将原网络的上采样算子改进为CARAFE模块,保持YOLOv8网络中快速空间金字塔池化(Spatial Pyramid Pooling-Fast,SPPF)模块不变,形成改进算法YOLOv8-RC,其架构如图1所示。图1中所使用的卷积(Conv)模块中卷积核尺寸k为3、移动步长s为2、填充参数p设置为1。
RCS(RepVGG/RepConv ShuffleNet)-YOLO通过增强特征提取及融合能力,提高了目标检测的性能[12]。故本文将该算法的RCS-OSA模块引入YOLOv8网络,其架构如图2所示,其中C为输入通道数量,n为堆叠的RCS模块的数量。
RCS-OSA模块首先对输入的特征图进行均匀分块,其中的一半特征图利用RepVGG网络的3×3卷积进行特征提取,另一半特征图信息用RCS模块进行特征提取。其中RCS模块将输入的特征再次进行分块,再采用通道拆分(Channel Split)及RepVGG操作将特征图相加得到新的特征图,最后利用通道混洗(Channel Shuffle)操作将特征图重新排列,从而使不同组之间的特征更好地交流及融合,以来增强特征的混合和传递效果。总体来说,RCS-OSA模块结合了3×3卷积及丰富的特征融合模块,有助于更有效地捕捉图像特征中的重要信息。将其应用于YOLOv8网络中可对特征图信息进行更加充分的增强及融合,从而捕捉不同大小检测目标的细节信息。
YOLOv8采用上采样(Upsample)算子执行上采样操作,目的在于提高特征图的空间分辨率,以提升网络的感知范围及定位精度。然而,该算子采用插值的方法填充新的像素,可能导致细节信息丢失,且不能准确还原原始图像细节。为解决这一问题,本文引入CARAFE轻量级上采样算子[13]替换原上采样算子,其原理如图3所示。
CARAFE上采样算子包含上采样核预测模块(Kernel Prediction Module)及特征重组模块(Content-aware Reassembly Module)。上采样核预测模块中的HWCm分别为输入特征图的高度、宽度及通道数量,σ为上采样率,σW、σH分别为输出特征图的高度、宽度。内容编码器(Content Encoder)用于编码输入特征图的内容信息,内核归一化器(Kernel Normalizer)用于归一化处理预测的内核,通道压缩器(Channel Compressor)用于减少特征数量。特征重组模块中的(x1,kup)表示该位置的邻域特征,W1为用于重组操作的权重,X′为通过内容感知重组模块后的输出特征图尺寸,其对应大小为(CWH)。图3中,X为输入特征图的尺寸。
该上采样算子的预测模块首先对输入的特征图进行通道压缩,然后利用卷积层进行内容编码及上采样核预测。其中上采样核的尺寸参数设定为:生成重组核的编码器卷积核大小Kenc=3、重组核的大小Kup=5,预测得到的上采样核通过归一化指数函数(Softmax函数)进行归一化,以此确保权重之和为1。此外,特征重组模块将输出特征图的每个位置映射回输入特征图,并提取以该位置为中心的区域。随后,该区域与相应位置的上采样核进行点积操作以获得输出值。最后,为保持特征的一致性及共享性,使不同位置的通道可共享相同的上采样核,在加入CARAFE算子后,可动态调整上采样核以适应不同的上采样需求,从而更好地保留和提取特征信息,在保持计算效率的同时,还可减少信息丢失。此外,以上过程通过融合不同尺度的特征图,还可获得更为丰富的上下文信息,以增强网络的特征表达能力,从而提高算法的感知范围及定位精度。
为评估改进算法在车辆及行人检测任务中的表现,搭建了如表1所示的试验软硬件环境。此外,为控制变量,所有模型的训练均未采用预训练模型,同时均采用混合精度训练模式。
完成试验环境搭建后,本文所构建的所有网络模型在训练过程中也均采取相同的超参数设置,具体配置如表2所示。
为验证本文算法在道路交通环境下对车辆及行人目标的检测性能,选取多目标车辆检测公开数据集UA-DETRAC作为基础数据集,将该数据集中所有的车辆标注进行合并,并对包含行人的图像进行重新标注,得到仅含车辆及行人标注的目标检测数据集。为全面检验及分析所提出算法的效果并平衡数据集中的标签分布,通过网络收集的方式构建了更多包含行人目标的检测数据集,最后将两类数据集进行筛选整合,获得由6 000张检测目标较为均衡的车辆及行人图片构成的检测数据集,并将其划分为训练集、测试集及验证集,分别包含5 000张、500张、500张图片。
本文采用采用精确率(Precision)P、召回率(Recall)R、平均精度(Average Precision,AP)和平均精度均值(mean Average Precision,mAP)对算法性能进行定量评价:
P=NTP/(NTP+NFP)
R=NTP/(NTP+NFN)
${p}_{avg}={\int }_{0}^{1}P\left(R\right)dR$
${m}_{p-avg}=\frac{1}{m}\sum _{i}{p}_{avg}\left(i\right)$
式中:NTPNFPNFN分别为真正例、假正例、假负例的数量,m为检测目标的类别数量,i为类别序号,pavg为平均精度(AP),mp-avg为平均精度均值(mAP)。
AP作为单类别目标检测中常用的评价指标,其值为这一类目标的P-R曲线下的面积;mAP则代表了整个数据集中所有类别的AP均值,一般将交并比(Intersection over Union,IoU)阈值设置为0.5,即IoU大于0.5的预测框有效,用mAP50表示。在IoU的取值为0.50~0.95范围内,每间隔0.05计算一次mAP,并计算所有mAP的均值,记为mAP50-95。
为验证改进模型对车辆及行人目标的检测效果,选取YOLO系列中的典型算法作为对比模型。此外,YOLO算法根据不同的网络尺度可分为N、S、M、L及X版本,考虑到模型在实际检测应用过程中的部署需求,将轻量级算法YOLOv8-N作为基础模型进行改进,同时选取其余对比模型的轻量级算法版本,其中YOLOv3则选取tiny版本。通过200轮的迭代训练,最后在测试集上得到各自模型的检测结果,如表3所示。
表3可知,改进算法YOLOv8-RC在所构建的车辆及行人检测数据集上的评价指标结果均高于基础算法YOLOv8-N。综合而言,YOLOv8-RC的各项评价指标在整体上得到了显著提升,较各对比模型具有更为优异的检测性能。
为验证YOLOv8-RC中改进模块RCS-OSA及CARAFE的提升效果,进行了如表4所示的消融实验。其中Model 1为加入RCS-OSA模块后的YOLOv8-N。
表4可以看出:改进算法YOLOv8-RC在加入RCS-OSA模块后,除平均召回率外,其余指标均得到了显著提高;引入CARAFE后,可更好地融合特征信息,综合指标再次提升,更适用于车辆及行人检测任务。
为验证改进算法YOLOv8-RC相比于其余对比模型的检测效果,从车辆及行人检测数据集的测试集中随机抽取2张典型的场景图片(分别为多车辆少行人场景及行人街道场景)来进行检测效果的可视化对比,其检测效果如图4所示。
图4可知,在行人街道场景下,各算法均未出现漏检及错检情况,仅在预测置信度上存在部分差异,而改进算法YOLOv8-RC在多车辆少行人场景下对行人目标具备更加准确的检测性能,并且在车辆的预测上具备更高的置信度。
本文提出了一种基于改进YOLOv8的车辆及行人检测算法YOLOv8-RC,通过引入多尺度特征融合模块RCS-OSA加强全局特征信息的融合能力,并将基础网络YOLOv8的上采样算子改进为轻量级上采样算子CARAFE,提高了网络的感知范围及定位精度。通过多目标车辆检测公开数据集UA-DETRAC及网络搜集的方式构建了由6 000张含有车辆及行人的图片构成的检测数据集,并选取YOLO系列的多个典型代表算法进行对比验证,结果表明,YOLOv8-RC在车辆及行人检测任务上具备优异的检测性能。
虽然YOLOv8-RC算法相比于基础算法YOLOv8-N具备更为优异的检测效果,但在后续研究中,还可继续对数据集进行优化和扩充,以适应更多的应用场景。此外,车辆及行人被遮挡的情况仍是未来的研究重点,可进一步对网络算法进行优化调整,以实现综合检测效果持续提升。
  • *中央高校基金项目(300102229112)
参考文献 引证文献
排序方式:
[1]
LIU P Y, MA Y X. Key Points and Visible Part Fusion Attention Network for Occluded Pedestrian Detection in Traffic Environments[J]. Optoelectronics Letters, 2024, 20(7): 430-436.
[2]
JAIN D K, ZHAO X D, GARCIA S, et al. Robust Multi-Modal Pedestrian Detection Using Deep Convolutional Neural Network with Ensemble Learning Model[J]. Expert Systems with Applications, 2024, 249.
[3]
LIANG W J. Research on a Vehicle and Pedestrian Detection Algorithm Based on Improved Attention and Feature Fusion[J]. Mathematical Biosciences and Engineering, 2024, 21(4): 5782-5802.
[4]
KIRAN K V, DASH S, PARIDA P. Vehicle Detection in Varied Weather Conditions using Enhanced Deep YOLO with Complex Wavelet[J]. Engineering Research Express, 2024, 6(2).
[5]
ANDIKA J L, KHAIRUDDIN A S M, RAMIAH H, et al. Improved Feature Extraction Network in Lightweight YOLOv7 Model for Real-Time Vehicle Detection on Low-Cost Hardware[J]. Journal of Real-Time Image Processing, 2024, 21(3).
[6]
胡倩, 皮建勇, 胡伟超, 等. 基于改进YOLOv5的密集行人检测算法研究[J/OL]. 计算机工程(2024-04-30)[2024-06-25]. https://doi.org/10.19678/j.issn.1000-3428.0068753.
HU Q, PI J Y, HU W C, et al. Research on Dense Pedestrian Detection Algorithm Based on Improved YOLOv5[J/OL]. Computer Engineering (2024-04-30)[2024-06-25]. https://doi.org/10.19678/j.issn.1000-3428.0068753.
[7]
黄丹丹, 张新茹, 刘智, 等. 基于Transformer的双分支检测和重识别的多行人追踪[J/OL]. 吉林大学学报(工学版)(2024-03-28)[2024-06-25]. https://doi.org/10.13229/j.cnki.jdxbgxb.20231415.
HUANG D D, ZHANG X R, LIU Z, et al. Multi Pedestrian Tracking Based on Transformer’s Dual Branch Detection and Re-Identification[J/OL]. Journal of Jilin University (Engineering Edition) (2024-03-28) [2024-06-25]. https://doi.org/10.13229/j.cnki.jdxbgxb.20231415.
[8]
李林红, 杨杰, 冯志成, 等. 面向站口行人检测的改进型Yolov5s算法[J]. 南京大学学报(自然科学), 2024, 60(1): 87-96.
LI L H, YANG J, FENG Z C, et al. Improved Yolov5s Algorithm for Pedestrian Detection at Station Entrances[J]. Journal of Nanjing University (Natural Science), 2024, 60(1): 87-96.
[9]
周力, 惠飞, 张嘉洋, 等. 基于RDB-YOLOv5的遥感图像车辆检测[J]. 长安大学学报(自然科学版), 2024, 44(3): 149-160.
ZHOU L, HUI F, ZHANG J Y, et al. Remote Sensing Image Vehicle Detection Based on RDB-YOLOv5[J]. Journal of Chang’an University (Natural Science Edition), 2024, 44(3): 149-160.
[10]
沈正坤, 王正超, 尹怀仙, 等. 基于VOD-YOLOv5的车辆检测算法研究[J]. 青岛大学学报(工程技术版), 2024, 39(1): 87-94.
SHEN Z K, WANG Z C, YIN H X, et al. Research on Vehicle Detection Algorithm Based on VOD-YOLOv5[J]. Journal of Qingdao University (Engineering Technology Edition), 2024, 39(1): 87-94.
[11]
DILLON R, JORDAN K, JACQUELINE H, et al. Real-Time Flying Object Detection with YOLOv8[EB/OL]. (2024-05-22) [2024-06-25]. https://doi.org/10.48550/arXiv.2305.09972.
[12]
KANG M, TING C M, TING F F, et al. RCS-YOLO: A Fast and High-Accuracy Object Detector for Brain Tumor Detection[EB/OL]. (2023-10-03) [2024-06-25]. https://doi.org/10.48550/arXiv.2307.16412.
[13]
WANG J Q, CHEN K, XU R, et al. CARAFE: Content-Aware ReAssembly of FEatures[EB/OL]. (2019-10-29) [2024-06-25]. https://doi.org/10.48550/arXiv.1905.02188.
2024年第卷第8期
PDF下载
195
80
引用本文
BibTeX
文章信息
doi: 10.20104/j.cnki.1674-6546.20240224
  • 首发时间:2025-11-25
  • 出版时间:2024-08-15
补充材料
相关文章
文章信息
作者
出版历史
  • 修回日期:2024-07-13
基金
*中央高校基金项目(300102229112)
作者信息
    1 河南理工大学, 焦作 454000
    2 长安大学, 西安 710061
参考文献
分享链接
https://castjournals.cast.org.cn/joweb/qcgcs/CN/10.20104/j.cnki.1674-6546.20240224
分享至
全文二维码

扫描看全文

引用本文
BibTeX
本文的引用情况
2种不同金属材料的力学参数

Family
属数
Number of
genus
种数
Number of
species
占总种数比例
Percentage of
total species (%)

Genus
种数
Number of
species
占总种数比例
Percentage of total
species (%)
鹅膏菌科Amanitaceae 2 11 5.26 鹅膏菌属 Amanita 10 4.78
小菇科 Mycenaceae 2 12 5.74 丝盖伞属 Inocybe 5 2.39
多孔菌科 Polyporaceae 8 14 6.70 蜡蘑属 Laccaria 5 2.39
红菇科 Russulaceae 3 23 11.00 小皮伞属 Marasmius 6 2.87
小菇属 Mycena 11 5.26
光柄菇属 Pluteus 5 2.39
红菇属 Russula 17 8.13
栓菌属 Trametes 5 2.39
关闭全屏