Article(id=1251226683102146681, tenantId=1146029695717560320, journalId=1251194772300279900, issueId=1251226682309423223, articleNumber=null, orderNo=null, doi=10.20079/j.issn.1001-893x.240527001, pmid=null, cstr=null, oa=null, hot=null, price=null, onlineType=0, articleFormat=0, articleType=null, articleTypeStr=null, receivedDate=1716739200000, receivedDateStr=2024-05-27, revisedDate=1723996800000, revisedDateStr=2024-08-19, acceptedDate=null, acceptedDateStr=null, onlineDate=1776245287919, onlineDateStr=2026-04-15, pubDate=1764259200000, pubDateStr=2025-11-28, doiRegisterDate=null, doiRegisterDateStr=null, onlineIssueDate=1776245287919, onlineIssueDateStr=2026-04-15, onlineJustAcceptDate=null, onlineJustAcceptDateStr=null, onlineFirstDate=null, onlineFirstDateStr=null, sourceXml=null, magXml=null, createTime=1776245287919, creator=13041195026, updateTime=1776245287919, updator=13041195026, issue=Issue{id=1251226682309423223, tenantId=1146029695717560320, journalId=1251194772300279900, year='2025', volume='65', issue='11', pageStart='1729', pageEnd='1954', issueExtLink='null', onlineDate='null', pubDate='null', beforeIssueId=null, nextIssueId=null, price=null, status=1, issueComplete=1, articleOrder=1, issueType=1, specialIssue=null, createTime=1776245287729, creator=13041195026, updateTime=1776246742124, updator=13041195026, preIssue=null, nextIssue=null, ext={EN=IssueExt(id=1251232782568080068, tenantId=1146029695717560320, journalId=1251194772300279900, issueId=1251226682309423223, language=EN, specialIssueTitle=, coverIllustrator=null, specialIssueEditor=, specialIssueAbout=), CN=IssueExt(id=1251232782568080069, tenantId=1146029695717560320, journalId=1251194772300279900, issueId=1251226682309423223, language=CN, specialIssueTitle=, coverIllustrator=null, specialIssueEditor=, specialIssueAbout=)}, issueFiles=null}, startPage=1773, endPage=1780, ext={EN=ArticleExt(id=1251226684167499914, articleId=1251226683102146681, tenantId=1146029695717560320, journalId=1251194772300279900, language=EN, title=UAV Target Recognition Based on Multi-scale Features, columnId=1251226683223781499, journalTitle=Telecommunication Engineering, columnName=Application Fundamental Research and Advanced Technology, runingTitle=null, highlight=null, articleAbstract=

In view of the problems of false detection and missed detection when an unmanned aerial vehicle (UAV) detects targets at different scales,a YOLOv8-FDT UAV algorithm model with a multi-scale fusion mechanism is proposed. First, a dynamic upsampling module is added to the Neck layer of the baseline model to reduce the number of model parameters and improve the real-time performance of the model for target recognition. In addition, in order to enable the entire algorithm model to capture different scale semantic information of the target in the feature fusion stage,adaptive downsampling and depth convolution are integrated to design the feature diffusion pyramid network(FDPN). Finally,experiments on the UAV aerial photography dataset VisDrone2019 show that the mean average precision(mAP) of all categories of the improved model is increased by 6.24% compared with that of the baseline model.

, correspAuthors=null, authorNote=null, correspAuthorsNote=null, copyrightStatement=null, copyrightOwner=null, extLink=null, articleAbsUrl=null, sourceXml=null, magXml=null, pdfUrl=null, pdf=null, pdfFileSize=null, pdfExtLink=null, richHtmlUrl=null, mobilePdfUrl=null, reviewReport=null, pdfFirstPage=null, abstractGraph=null, abstractGraphContent=null, abstractVideo=null, citation=null, cebUrl=null, magXmlContent=null, mapNumber=null, authorCompany=null, fund=null, authors=null, authorsList=Bowen ZHANG, Bo XUE), CN=ArticleExt(id=1251226691637555562, articleId=1251226683102146681, tenantId=1146029695717560320, journalId=1251194772300279900, language=CN, title=基于多尺度特征的无人机目标识别算法, columnId=1251226683383165054, journalTitle=电讯技术, columnName=应用基础与前沿技术, runingTitle=null, highlight=null, articleAbstract=

针对无人机对于不同尺度的目标识别存在误检和漏检等问题,提出了一种多尺度融合机制的YOLOv8-FDT无人机目标识别算法。首先,在基线模型Neck层中添加动态上采样模块,旨在降低模型参数量,提高模型对于目标识别的实时性。此外,为了使得整个算法模型在特征融合阶段能够捕捉目标的不同尺度语义信息,融合自适应下采样和深度卷积,设计特征聚焦扩散金字塔网络(Feature Diffusion Pyramid Network,FDPN)。通过无人机航拍数据集VisDrone2019的实验表明,改进后模型所有类别平均精度均值(mean Average Precision,mAP)相较基线模型提升6.24%。

, correspAuthors=null, authorNote=null, correspAuthorsNote=
薛波 Email:
, copyrightStatement=null, copyrightOwner=null, extLink=null, articleAbsUrl=null, sourceXml=Mu5rkvrYSVHeOQAmG3+lVw==, magXml=hF99DOkJHmhiP/eN2mt9Mw==, pdfUrl=null, pdf=0iESrgb5XlhmP3wsS+OrSg==, pdfFileSize=14039645, pdfExtLink=null, richHtmlUrl=null, mobilePdfUrl=null, reviewReport=null, pdfFirstPage=null, abstractGraph=gqz6fnZRvd2J7J5njRIyIw==, abstractGraphContent=null, abstractVideo=null, citation=null, cebUrl=null, magXmlContent=qSYmlt/TY9c1p3E/qYkhBg==, mapNumber=null, authorCompany=null, fund=null, authors=

张博文 男,1999年生于江苏连云港,硕士研究生,主要研究方向机电产品检测与智能控制。

薛波 男,1982年生于江苏徐州,博士,副教授、硕士生导师,目前主要从事机器学习、统计信号处理、无线传感器网络等方面的研究。

, authorsList=张博文, 薛波)}, authors=[Author(id=1251226692069568896, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, orderNo=0, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=null, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1251226692182815109, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, authorId=1251226692069568896, language=EN, stringName=Bowen ZHANG, firstName=Bowen, middleName=null, lastName=ZHANG, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=School of Electrical and Information Engineering,Jiangsu Institute of Technology,Changzhou 213000,China, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1251226692283478411, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, authorId=1251226692069568896, language=CN, stringName=张博文, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=江苏理工学院 电气信息工程学院,江苏 常州 213000, bio={"content":"

张博文 男,1999年生于江苏连云港,硕士研究生,主要研究方向机电产品检测与智能控制。

"}, bioImg=null, bioContent=

张博文 男,1999年生于江苏连云港,硕士研究生,主要研究方向机电产品检测与智能控制。

, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1251226691918573943, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, xref=null, ext=[AuthorCompanyExt(id=1251226691926962552, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, companyId=1251226691918573943, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=School of Electrical and Information Engineering,Jiangsu Institute of Technology,Changzhou 213000,China), AuthorCompanyExt(id=1251226691943739770, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, companyId=1251226691918573943, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=江苏理工学院 电气信息工程学院,江苏 常州 213000)])]), Author(id=1251226692392530321, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, orderNo=1, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=dxxb@jsut.edu.cn, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1251226692535136664, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, authorId=1251226692392530321, language=EN, stringName=Bo XUE, firstName=Bo, middleName=null, lastName=XUE, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=School of Electrical and Information Engineering,Jiangsu Institute of Technology,Changzhou 213000,China, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1251226692635799966, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, authorId=1251226692392530321, language=CN, stringName=薛波, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=江苏理工学院 电气信息工程学院,江苏 常州 213000, bio={"content":"

薛波 男,1982年生于江苏徐州,博士,副教授、硕士生导师,目前主要从事机器学习、统计信号处理、无线传感器网络等方面的研究。

"}, bioImg=null, bioContent=

薛波 男,1982年生于江苏徐州,博士,副教授、硕士生导师,目前主要从事机器学习、统计信号处理、无线传感器网络等方面的研究。

, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1251226691918573943, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, xref=null, ext=[AuthorCompanyExt(id=1251226691926962552, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, companyId=1251226691918573943, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=School of Electrical and Information Engineering,Jiangsu Institute of Technology,Changzhou 213000,China), AuthorCompanyExt(id=1251226691943739770, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, companyId=1251226691918573943, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=江苏理工学院 电气信息工程学院,江苏 常州 213000)])])], keywords=[Keyword(id=1251226692757434792, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, language=EN, orderNo=1, keyword=UAV), Keyword(id=1251226692845515183, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, language=EN, orderNo=2, keyword=small target recognition), Keyword(id=1251226692921012663, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, language=EN, orderNo=3, keyword=focus feature), Keyword(id=1251226692988121531, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, language=EN, orderNo=4, keyword=multi-scale fusion), Keyword(id=1251226693080396228, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, language=CN, orderNo=1, keyword=无人机), Keyword(id=1251226693151699404, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, language=CN, orderNo=2, keyword=目标识别), Keyword(id=1251226694741340626, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, language=CN, orderNo=3, keyword=特征聚焦), Keyword(id=1251226694837809625, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, language=CN, orderNo=4, keyword=多尺度融合)], refs=[Reference(id=1251226699745145621, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, doi=null, pmid=null, pmcid=null, year=2022, volume=62, issue=1, pageStart=138, pageEnd=146, url=null, language=null, rfNumber=[1], rfOrder=0, authorNames=黄巍, 陈俊良, 李犹海, journalName=电讯技术, refType=null, unstructuredReference=黄巍, 陈俊良, 李犹海.无人机自组网技术综述与发展展望[J].电讯技术, 2022, 62(1):138-146., articleTitle=无人机自组网技术综述与发展展望, refAbstract=null), Reference(id=1251226699854197532, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, doi=null, pmid=null, pmcid=null, year=2018, volume=null, issue=null, pageStart=765, pageEnd=781, url=null, language=null, rfNumber=[2], rfOrder=1, authorNames=LAW H, DENG J, journalName=null, refType=null, unstructuredReference=LAW H, DENG J. CornerNet:detecting objects as paired keypoints[C]//2018 European Conference on Computer Vision. Munich:IEEE, 2018:765-781., articleTitle=CornerNet:detecting objects as paired keypoints, refAbstract=null), Reference(id=1251226699946472228, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, doi=null, pmid=null, pmcid=null, year=2018, volume=null, issue=null, pageStart=385, pageEnd=400, url=null, language=null, rfNumber=[3], rfOrder=2, authorNames=LIU S, HUANG D, journalName=null, refType=null, unstructuredReference=LIU S, HUANG D. Receptive field block net for accurate and fast object detection[C]//2018 European Conference on Computer Vision. Cham:Springer, 2018:385-400., articleTitle=Receptive field block net for accurate and fast object detection, refAbstract=null), Reference(id=1251226700017775404, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, doi=null, pmid=null, pmcid=null, year=2015, volume=null, issue=null, pageStart=91, pageEnd=99, url=null, language=null, rfNumber=[4], rfOrder=3, authorNames=REN S Q, HE K M, GIRSHICK R, journalName=null, refType=null, unstructuredReference=REN S Q, HE K M, GIRSHICK R, et al. Faster R-CNN:towards real-time object detection with region proposal networks[C]//The 28th International Conference on Neural Information Processing Systems. Cambridge:MIT Press, 2015:91-99., articleTitle=Faster R-CNN:towards real-time object detection with region proposal networks, refAbstract=null), Reference(id=1251226700097467190, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, doi=null, pmid=null, pmcid=null, year=2022, volume=62, issue=2, pageStart=259, pageEnd=265, url=null, language=null, rfNumber=[5], rfOrder=4, authorNames=杨艳红, 钟宝江, 徐云龙, journalName=电讯技术, refType=null, unstructuredReference=杨艳红, 钟宝江, 徐云龙.改进的SSD算法在智慧交通中的应用[J].电讯技术, 2022, 62(2):259-265., articleTitle=改进的SSD算法在智慧交通中的应用, refAbstract=null), Reference(id=1251226700210713409, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, doi=null, pmid=null, pmcid=null, year=2021, volume=null, issue=null, pageStart=181, pageEnd=186, url=null, language=null, rfNumber=[6], rfOrder=5, authorNames=LIM J S, ASTRID M, YOON H J, journalName=null, refType=null, unstructuredReference=LIM J S, ASTRID M, YOON H J, et al. Small object detection using context and attention[C]//2021 International Conference on Artificial Intelligence in Infor-mation and Communication. Jeju Island: IEEE, 2021:181-186., articleTitle=Small object detection using context and attention, refAbstract=null), Reference(id=1251226700332348234, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, doi=null, pmid=null, pmcid=null, year=2024, volume=32, issue=5, pageStart=727, pageEnd=739, url=null, language=null, rfNumber=[7], rfOrder=6, authorNames=李大湘, 辛嘉妮, 刘颖, journalName=光学精密工程, refType=null, unstructuredReference=李大湘, 辛嘉妮, 刘颖.位置敏感Transformer航拍图像目标识别模型[J].光学精密工程, 2024, 32(5):727-739., articleTitle=位置敏感Transformer航拍图像目标识别模型, refAbstract=null), Reference(id=1251226700407845710, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, doi=null, pmid=null, pmcid=null, year=2018, volume=null, issue=null, pageStart=3, pageEnd=19, url=null, language=null, rfNumber=[8], rfOrder=7, authorNames=WOO S, PARK J, LEE J Y, journalName=null, refType=null, unstructuredReference=WOO S, PARK J, LEE J Y, et al. CBAM: convolutional block attention module[C]//2018 European Conference on Computer Vision. Cham:Springer, 2018:3-19., articleTitle=CBAM: convolutional block attention module, refAbstract=null), Reference(id=1251226700537869143, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, doi=null, pmid=null, pmcid=null, year=2023, volume=53, issue=7, pageStart=1528, pageEnd=1535, url=null, language=null, rfNumber=[9], rfOrder=8, authorNames=罗旭鸿, 刘永春, 楚国铭, journalName=无线电工程, refType=null, unstructuredReference=罗旭鸿, 刘永春, 楚国铭, .基于改进YOLO5无人机图像目标检测算法[J].无线电工程, 2023, 53(7):1528-1535., articleTitle=基于改进YOLO5无人机图像目标检测算法, refAbstract=null), Reference(id=1251226700651115360, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, doi=null, pmid=null, pmcid=null, year=2017, volume=null, issue=null, pageStart=2980, pageEnd=2988, url=null, language=null, rfNumber=[10], rfOrder=9, authorNames=LIN T Y, GOYAL P, GIRSHICK R, journalName=null, refType=null, unstructuredReference=LIN T Y, GOYAL P, GIRSHICK R, et al. Focal loss fordense object detection[C]//2017 IEEE International Conference on Computer Vision. Venice: IEEE, 2017:2980-2988., articleTitle=Focal loss fordense object detection, refAbstract=null), Reference(id=1251226700785333097, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, doi=null, pmid=null, pmcid=null, year=2017, volume=null, issue=null, pageStart=7263, pageEnd=7271, url=null, language=null, rfNumber=[11], rfOrder=10, authorNames=REDMON J, FARHADI A, journalName=null, refType=null, unstructuredReference=REDMON J, FARHADI A. YOLO9000: better, faster, stronger[C]//2017 IEEE Conference on Computer Vision and Pattern Recognition. Los Alamitos: IEEE, 2017:7263-7271., articleTitle=YOLO9000: better, faster, stronger, refAbstract=null), Reference(id=1251226700873413487, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, doi=null, pmid=null, pmcid=null, year=2023, volume=null, issue=null, pageStart=7464, pageEnd=7475, url=null, language=null, rfNumber=[12], rfOrder=11, authorNames=WANG C Y, BOCHKOVSKIY A, LIAO H Y M, journalName=null, refType=null, unstructuredReference=WANG C Y, BOCHKOVSKIY A, LIAO H Y M. YOLOv-7:trainable bag-of-freebies sets new state-of-the-art for real-time object detectors[C]//2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Vancouver:IEEE, 2023:7464-7475., articleTitle=YOLOv-7:trainable bag-of-freebies sets new state-of-the-art for real-time object detectors, refAbstract=null), Reference(id=1251226700995048312, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, doi=null, pmid=null, pmcid=null, year=2019, volume=null, issue=null, pageStart=213, pageEnd=226, url=null, language=null, rfNumber=[13], rfOrder=12, authorNames=DU D, ZHU P, WEN L, journalName=null, refType=null, unstructuredReference=DU D, ZHU P, WEN L, et al. VisDrone-DET2019: the vision meets drone object detection in image challengeresults[C]//2019 IEEE/CVF International Conference on Computer Vision Workshops. Seoul:IEEE, 2019:213-226., articleTitle=VisDrone-DET2019: the vision meets drone object detection in image challengeresults, refAbstract=null), Reference(id=1251226701091517309, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, doi=null, pmid=null, pmcid=null, year=2024, volume=60, issue=3, pageStart=129, pageEnd=137, url=null, language=null, rfNumber=[14], rfOrder=13, authorNames=张利丰, 田莹, journalName=计算机工程与应用, refType=null, unstructuredReference=张利丰, 田莹.改进YOLOv8的多尺度轻量型车辆目标检测算法[J].计算机工程与应用, 2024, 60(3):129-137., articleTitle=改进YOLOv8的多尺度轻量型车辆目标检测算法, refAbstract=null), Reference(id=1251226701179597697, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, doi=null, pmid=null, pmcid=null, year=2021, volume=null, issue=null, pageStart=7746, pageEnd=775, url=null, language=null, rfNumber=[15], rfOrder=14, authorNames=TANG W, SUN J, WANG G, journalName=null, refType=null, unstructuredReference=TANG W, SUN J, WANG G. Horizontal feature pyramid network for object detection in UAV images[C]//2021 China Automation Congress. Beijing:IEEE, 2021:7746-775., articleTitle=Horizontal feature pyramid network for object detection in UAV images, refAbstract=null), Reference(id=1251226701259289478, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, doi=null, pmid=null, pmcid=null, year=2022, volume=null, issue=null, pageStart=122, pageEnd=126, url=null, language=null, rfNumber=[16], rfOrder=15, authorNames=CHENG Q Q, WANG H J, AING X C, journalName=null, refType=null, unstructuredReference=CHENG Q Q, WANG H J, AING X C, et al. A UAV target detection algorithm based on YOLOv4-Tiny and improved WBF[C]//The 14th International Conference on Wireless Communications and Signal Processing. Nanjing: IEEE, 2022:122-126., articleTitle=A UAV target detection algorithm based on YOLOv4-Tiny and improved WBF, refAbstract=null), Reference(id=1251226701364147085, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, doi=null, pmid=null, pmcid=null, year=2020, volume=null, issue=null, pageStart=10778, pageEnd=10787, url=null, language=null, rfNumber=[17], rfOrder=16, authorNames=TAN M X, PANG R M, GUO S L, journalName=null, refType=null, unstructuredReference=TAN M X, PANG R M, GUO S L. EffcientDet:scalable and effcient object detection[C]//2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Seattle:IEEE, 2020:10778-10787., articleTitle=EffcientDet:scalable and effcient object detection, refAbstract=null), Reference(id=1251226701473199001, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, doi=null, pmid=null, pmcid=null, year=2022, volume=null, issue=15, pageStart=1, pageEnd=11, url=null, language=null, rfNumber=[18], rfOrder=17, authorNames=LIU W J, QIANG J, LI X X, journalName=Mobile Information Systems, refType=null, unstructuredReference=LIU W J, QIANG J, LI X X, et al. UAV image small object detection based on composite backbone network[J]. Mobile Information Systems, 2022(15):1-11., articleTitle=UAV image small object detection based on composite backbone network, refAbstract=null), Reference(id=1251226701569667999, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, doi=null, pmid=null, pmcid=null, year=2023, volume=23, issue=16, pageStart=7190, pageEnd=7197, url=null, language=null, rfNumber=[19], rfOrder=18, authorNames=WANG G, CHEN Y, AN P, journalName=Sensors, refType=null, unstructuredReference=WANG G, CHEN Y, AN P. UAV-YOLOv8: a small-object-detection model based on improved YOLOv8 for UAV aerial photography scenarios[J]. Sensors, 2023, 23(16):7190-7197., articleTitle=UAV-YOLOv8: a small-object-detection model based on improved YOLOv8 for UAV aerial photography scenarios, refAbstract=null), Reference(id=1251226701775188903, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, doi=null, pmid=null, pmcid=null, year=2024, volume=8, issue=3, pageStart=84, pageEnd=89, url=null, language=null, rfNumber=[20], rfOrder=19, authorNames=TAHIR N U A, LONG Z, ZHANG Z, journalName=Drones, refType=null, unstructuredReference=TAHIR N U A, LONG Z, ZHANG Z. PVswin-YOLOv8s:UAV-based pedestrian and vehicle detection for traffic management in smart cities using improved YOLOv8[J]. Drones, 2024, 8(3):84-89., articleTitle=PVswin-YOLOv8s:UAV-based pedestrian and vehicle detection for traffic management in smart cities using improved YOLOv8, refAbstract=null), Reference(id=1251226701892629424, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, doi=null, pmid=null, pmcid=null, year=2024, volume=23, issue=8, pageStart=1, pageEnd=13, url=null, language=null, rfNumber=[21], rfOrder=20, authorNames=孙佳宇, 徐民俊, 张俊鹏, journalName=计算机工程与应用, refType=null, unstructuredReference=孙佳宇, 徐民俊, 张俊鹏.优化改进YOLOv8无人机视角下目标检测算法[J].计算机工程与应用, 2024, 23(8):1-13., articleTitle=优化改进YOLOv8无人机视角下目标检测算法, refAbstract=null), Reference(id=1251226701989098426, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, doi=null, pmid=null, pmcid=null, year=2024, volume=46, issue=7, pageStart=30, pageEnd=36, url=null, language=null, rfNumber=[22], rfOrder=21, authorNames=李佳霖, 李卫东, 王廉钧, journalName=现代雷达, refType=null, unstructuredReference=李佳霖, 李卫东, 王廉钧, .基于残差网络高分辨距离像的无人机识别[J].现代雷达, 2024, 46(7):30-36., articleTitle=基于残差网络高分辨距离像的无人机识别, refAbstract=null)], funds=null, companyList=[AuthorCompany(id=1251226691918573943, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, xref=null, ext=[AuthorCompanyExt(id=1251226691926962552, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, companyId=1251226691918573943, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=School of Electrical and Information Engineering,Jiangsu Institute of Technology,Changzhou 213000,China), AuthorCompanyExt(id=1251226691943739770, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, companyId=1251226691918573943, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=江苏理工学院 电气信息工程学院,江苏 常州 213000)])], figs=[ArticleFig(id=1251226695013970408, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, language=EN, label=null, caption=null, figureFileSmall=Ti11xaCR2WneB4Iziarc7A==, figureFileBig=gqz6fnZRvd2J7J5njRIyIw==, tableContent=null), ArticleFig(id=1251226695118828018, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, language=CN, label=图1, caption=YOLOv8算法结构示意, figureFileSmall=Ti11xaCR2WneB4Iziarc7A==, figureFileBig=gqz6fnZRvd2J7J5njRIyIw==, tableContent=null), ArticleFig(id=1251226695408235016, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, language=EN, label=null, caption=null, figureFileSmall=cr/cjrIJBc9fyeSDhaBr6A==, figureFileBig=zMgEQ6x+kEQI0JvmrRLbDg==, tableContent=null), ArticleFig(id=1251226695550841364, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, language=CN, label=图2, caption=YOLOv8-FDT算法结构示意, figureFileSmall=cr/cjrIJBc9fyeSDhaBr6A==, figureFileBig=zMgEQ6x+kEQI0JvmrRLbDg==, tableContent=null), ArticleFig(id=1251226695647310364, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, language=EN, label=null, caption=null, figureFileSmall=0TdUVAgoEybe66u22fuuYA==, figureFileBig=jKPJczolGahnG7LG5jr3IQ==, tableContent=null), ArticleFig(id=1251226695756362281, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, language=CN, label=图3, caption=动态上采样示意, figureFileSmall=0TdUVAgoEybe66u22fuuYA==, figureFileBig=jKPJczolGahnG7LG5jr3IQ==, tableContent=null), ArticleFig(id=1251226695857025589, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, language=EN, label=null, caption=null, figureFileSmall=CVh1wur6KvbVCT+fb3gE7g==, figureFileBig=SU3Y4yXdzOxMyBOQ0iAvhA==, tableContent=null), ArticleFig(id=1251226695945105982, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, language=CN, label=图4, caption=动态采样点生成器示意, figureFileSmall=CVh1wur6KvbVCT+fb3gE7g==, figureFileBig=SU3Y4yXdzOxMyBOQ0iAvhA==, tableContent=null), ArticleFig(id=1251226696037380676, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, language=EN, label=null, caption=null, figureFileSmall=DM2hVTPKDUZ/Idxpa1MANA==, figureFileBig=XchDvx1sHIxcxBHIMLnXPw==, tableContent=null), ArticleFig(id=1251226696142238290, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, language=CN, label=图5, caption=特征聚焦结构示意, figureFileSmall=DM2hVTPKDUZ/Idxpa1MANA==, figureFileBig=XchDvx1sHIxcxBHIMLnXPw==, tableContent=null), ArticleFig(id=1251226696217735770, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, language=EN, label=null, caption=null, figureFileSmall=iFIhUmlcbvBcah48qSRrKA==, figureFileBig=vzzVByfJVHFUqk/AOYhh1g==, tableContent=null), ArticleFig(id=1251226696360342117, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, language=CN, label=图6, caption=自适应下采样结构示意, figureFileSmall=iFIhUmlcbvBcah48qSRrKA==, figureFileBig=vzzVByfJVHFUqk/AOYhh1g==, tableContent=null), ArticleFig(id=1251226696507142773, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, language=EN, label=null, caption=null, figureFileSmall=BwlIpQ45FjTj0KP9VrXD3Q==, figureFileBig=o8WexJgh0ydFyklnF7T+IQ==, tableContent=null), ArticleFig(id=1251226696628777598, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, language=CN, label=图7, caption=不同类别的平均精度对比, figureFileSmall=BwlIpQ45FjTj0KP9VrXD3Q==, figureFileBig=o8WexJgh0ydFyklnF7T+IQ==, tableContent=null), ArticleFig(id=1251226696762995337, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, language=EN, label=null, caption=null, figureFileSmall=fq953Pbs4zqddW16LVl4xA==, figureFileBig=2vADoOqpmdQ2Og/lk50HIA==, tableContent=null), ArticleFig(id=1251226696884630165, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, language=CN, label=图8, caption=混淆矩阵, figureFileSmall=fq953Pbs4zqddW16LVl4xA==, figureFileBig=2vADoOqpmdQ2Og/lk50HIA==, tableContent=null), ArticleFig(id=1251226696985293467, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, language=EN, label=null, caption=null, figureFileSmall=9noLFO00CQHKM/Fs2OvUkA==, figureFileBig=JY/PEIndwBjJrvIXq9rlyw==, tableContent=null), ArticleFig(id=1251226697090151079, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, language=CN, label=图9, caption=模型识别的结果, figureFileSmall=9noLFO00CQHKM/Fs2OvUkA==, figureFileBig=JY/PEIndwBjJrvIXq9rlyw==, tableContent=null), ArticleFig(id=1251226697174037167, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, language=EN, label=null, caption=null, figureFileSmall=XVcAdRHltfEAiR2YFTRIPQ==, figureFileBig=eghTpxz/9YZsCQ4vyNLJNg==, tableContent=null), ArticleFig(id=1251226697299866299, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, language=CN, label=图10, caption=不同场景下的目标识别结果, figureFileSmall=XVcAdRHltfEAiR2YFTRIPQ==, figureFileBig=eghTpxz/9YZsCQ4vyNLJNg==, tableContent=null), ArticleFig(id=1251226697438278341, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, language=EN, label=null, caption=null, figureFileSmall=null, figureFileBig=null, tableContent=
参数名称参数配置
Workers8
Data enhancementMosaic
Patience100
), ArticleFig(id=1251226697543135949, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, language=CN, label=表1, caption=

训练参数设置

, figureFileSmall=null, figureFileBig=null, tableContent=
参数名称参数配置
Workers8
Data enhancementMosaic
Patience100
), ArticleFig(id=1251226697660576473, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, language=EN, label=null, caption=null, figureFileSmall=null, figureFileBig=null, tableContent=
YOLOv8nFDPNDySampleCarafePrecision/%Recall/%mAP50/%参数量/106
50.3136.1336.253.1
54.0836.3339.333.3
53.2338.0640.012.9
54.1736.7939.453.4
58.8839.5442.493.0
50.4735.1636.503.6
), ArticleFig(id=1251226699233440485, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, language=CN, label=表2, caption=

消融实验

, figureFileSmall=null, figureFileBig=null, tableContent=
YOLOv8nFDPNDySampleCarafePrecision/%Recall/%mAP50/%参数量/106
50.3136.1336.253.1
54.0836.3339.333.3
53.2338.0640.012.9
54.1736.7939.453.4
58.8839.5442.493.0
50.4735.1636.503.6
), ArticleFig(id=1251226699346686702, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, language=EN, label=null, caption=null, figureFileSmall=null, figureFileBig=null, tableContent=
模型Precision/%Recall/%mAP50/%mAP50-95/%参数量/106模型大小/MB
YOLOv3-Tiny[20]37.2024.1031.5018.04.524.4
YOLOvX[21]42.7031.3031.2016.002.026.5
YOLOv5s[19]46.4034.6034.4019.007.214.0
YOLOv6[20]39.8029.4029.1017.005.08.7
YOLOv7[20]50.2041.1037.9019.903.072.0
YOLOv8n50.3136.1336.2524.423.122.5
YOLOv8-FDT58.8839.5442.4927.883.020.1
), ArticleFig(id=1251226699447350003, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, language=CN, label=表3, caption=

不同YOLO模型对比

, figureFileSmall=null, figureFileBig=null, tableContent=
模型Precision/%Recall/%mAP50/%mAP50-95/%参数量/106模型大小/MB
YOLOv3-Tiny[20]37.2024.1031.5018.04.524.4
YOLOvX[21]42.7031.3031.2016.002.026.5
YOLOv5s[19]46.4034.6034.4019.007.214.0
YOLOv6[20]39.8029.4029.1017.005.08.7
YOLOv7[20]50.2041.1037.9019.903.072.0
YOLOv8n50.3136.1336.2524.423.122.5
YOLOv8-FDT58.8839.5442.4927.883.020.1
), ArticleFig(id=1251226699531236089, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, language=EN, label=null, caption=null, figureFileSmall=null, figureFileBig=null, tableContent=
模型mAP50/%mAP50-95/%
Faster-RCNN37.2021.90
Cascade R-CNN39.1024.30
RetinaNet19.1010.60
CenterNet33.7018.80
YOLOv8-FDT42.4927.88
), ArticleFig(id=1251226699615122182, tenantId=1146029695717560320, journalId=1251194772300279900, articleId=1251226683102146681, language=CN, label=表4, caption=

传统算法模型对比

, figureFileSmall=null, figureFileBig=null, tableContent=
模型mAP50/%mAP50-95/%
Faster-RCNN37.2021.90
Cascade R-CNN39.1024.30
RetinaNet19.1010.60
CenterNet33.7018.80
YOLOv8-FDT42.4927.88
)], attaches=null, journal=Journal(id=1251193998841266264, delFlag=0, nameCn=电讯技术, nameEn=Telecommunication Engineering, nameHistory1=null, nameHistory2=null, issn=1001-893X, eissn=null, cn=51-1267/TN, coden=null, periodic=0, language=CN, oaType=null, ccby=null, superviseOffice=null, ownerOffice=null, pubOffice=null, editorOffice=null, officeType=null, aims=null, clcCode=null, officeProv=null, officeCity=null, officeAddr=null, officeZip=null, officeEmail=null, officePhone=null, editDirector=null, officeDirector=null, officeDirectorPhone=null, officeStaffNum=null, officeEmpNum=null, coverPicUrl=CpBmHoMzpESavU+iEMTBmw==, journalPrice=null, startedYear=null, abbrevIsoEn=Telecommunication Engineering, journalRemark=null, publicationField=null, createdTime=1776237495387, updatedTime=1776238086301, createdBy=18614031015, updatedBy=13701087609, firstLetterCn=T, firstLetterEn=T, subjectCode=Engineering, subjectName=null, subjectCodeEn=Engineering, subjectNameEn=null, picCn=CpBmHoMzpESavU+iEMTBmw==, picEn=jCOIy2zOaGJZ/y3z2gPZzg==, jcr=null, cjcr=null, exts=[JournalExt(id=1251196477385687352, language=CN, name=电讯技术, nameHistory1=null, nameHistory2=null, managedBy=, sponsoredBy=, publishedBy=, editorOffice=, officeProv=null, officeCity=null, officeAddr=, officeZip=, editDirector=, officeDirector=null, officePhone=null, coverPicUrl=null, journalRemark=, submitArticleUrl=null, websiteUrl=, createdTime=1776238086315, updatedTime=1776238086315, createdBy=13701087609, updatedBy=13701087609, submissionGuidelinesUrl=, submissionAuthorUrl=https://www.teleonline.cn/dxjs/ch/author/login.aspx, submissionEditorUrl=https://www.teleonline.cn/dxjs/ch/login.aspx, submissionReviewUrl=https://www.teleonline.cn/dxjs/ch/auditor/login.aspx, submissionCeEditorUrl=, submissionAeEditorUrl=, option={"copyright":""}), JournalExt(id=1251196477469573433, language=EN, name=Telecommunication Engineering, nameHistory1=null, nameHistory2=null, managedBy=, sponsoredBy=, publishedBy=, editorOffice=, officeProv=null, officeCity=null, officeAddr=, officeZip=, editDirector=, officeDirector=null, officePhone=null, coverPicUrl=null, journalRemark=, submitArticleUrl=null, websiteUrl=, createdTime=1776238086335, updatedTime=1776238086335, createdBy=13701087609, updatedBy=13701087609, submissionGuidelinesUrl=, submissionAuthorUrl=https://www.teleonline.cn/dxjs/ch/author/login.aspx, submissionEditorUrl=https://www.teleonline.cn/dxjs/ch/login.aspx, submissionReviewUrl=https://www.teleonline.cn/dxjs/ch/auditor/login.aspx, submissionCeEditorUrl=, submissionAeEditorUrl=, option={"copyright":""})], databaseList=null, tenantJournalId=1251194772300279900, websiteList=[Website(id=1251197148327522670, webName=null, webTitle=null, webDomain=null, webCopyrigh=null, webIpcNo=null, seoTitle=null, seoKeywords=null, seoDescription=null, tenantJournalId=null, journalId=1251194772300279900, journalNameCn=null, journalNameEn=null, grayFlag=null, tenantId=1146029695717560320, platformId=null, journalGroupId=null, journalGroupNameCn=null, journalGroupNameEn=null, type=1, domain=https://castjournals.cast.org.cn/joweb/dxjs/CN, language=CN, createTime=1776238246280, createBy=18614031015, updateTime=1776238378770, updateBy=18614031015, name=电讯技术-中文, tplId=1146099689490845704, title=电讯技术, delFlag=0, indexPage=/home, props=[WebsiteProps(id=1251197904854135502, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251197148327522670, code=articleTextType, value=kx, createTime=1776238426650, updateTime=1776238426650, creator=18614031015, updator=18614031015), WebsiteProps(id=1251197904833163979, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251197148327522670, code=banner, value=null, createTime=1776238426645, updateTime=1776238426645, creator=18614031015, updator=18614031015), WebsiteProps(id=1251197904870912721, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251197148327522670, code=grayFlag, value=0, createTime=1776238426654, updateTime=1776238426654, creator=18614031015, updator=18614031015), WebsiteProps(id=1251197904824775370, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251197148327522670, code=logo, value=https://castjournals.cast.org.cn/joweb/dxjs/CN/file/pic?fileId=BBd4SC9puESjyaw04bneig==, createTime=1776238426643, updateTime=1776238426643, creator=18614031015, updator=18614031015), WebsiteProps(id=1251197904883495635, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251197148327522670, code=minRunFlag, value=0, createTime=1776238426657, updateTime=1776238426657, creator=18614031015, updator=18614031015), WebsiteProps(id=1251197904845746893, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251197148327522670, code=picServerUrl, value=https://castjournals.cast.org.cn/joweb/dxjs/CN/file/pic, createTime=1776238426648, updateTime=1776238426648, creator=18614031015, updator=18614031015), WebsiteProps(id=1251197904875107026, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251197148327522670, code=silenceFlag, value=0, createTime=1776238426655, updateTime=1776238426655, creator=18614031015, updator=18614031015), WebsiteProps(id=1251197904841552588, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251197148327522670, code=staticResourcePath, value=https://castjournals.cast.org.cn/joweb/cast_kjdb_cn_619/, createTime=1776238426647, updateTime=1776238426647, creator=18614031015, updator=18614031015), WebsiteProps(id=1251197904858329807, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251197148327522670, code=themeColor, value=null, createTime=1776238426651, updateTime=1776238426651, creator=18614031015, updator=18614031015), WebsiteProps(id=1251197904866718416, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251197148327522670, code=themeStyle, value=null, createTime=1776238426653, updateTime=1776238426653, creator=18614031015, updator=18614031015)]), Website(id=1251197148512072052, webName=null, webTitle=null, webDomain=null, webCopyrigh=null, webIpcNo=null, seoTitle=null, seoKeywords=null, seoDescription=null, tenantJournalId=null, journalId=1251194772300279900, journalNameCn=null, journalNameEn=null, grayFlag=null, tenantId=1146029695717560320, platformId=null, journalGroupId=null, journalGroupNameCn=null, journalGroupNameEn=null, type=1, domain=https://castjournals.cast.org.cn/joweb/dxjs/EN, language=EN, createTime=1776238246324, createBy=18614031015, updateTime=1776238398944, updateBy=18614031015, name=电讯技术-英文, tplId=1146101810881728533, title=Telecommunication Engineering, delFlag=0, indexPage=/home, props=[WebsiteProps(id=1251197930175152619, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251197148512072052, code=articleTextType, value=kx, createTime=1776238432687, updateTime=1776238432687, creator=18614031015, updator=18614031015), WebsiteProps(id=1251197930154181096, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251197148512072052, code=banner, value=null, createTime=1776238432682, updateTime=1776238432682, creator=18614031015, updator=18614031015), WebsiteProps(id=1251197930200318446, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251197148512072052, code=grayFlag, value=0, createTime=1776238432693, updateTime=1776238432693, creator=18614031015, updator=18614031015), WebsiteProps(id=1251197930141598183, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251197148512072052, code=logo, value=https://castjournals.cast.org.cn/joweb/dxjs/EN/file/pic?fileId=BBd4SC9puESjyaw04bneig==, createTime=1776238432679, updateTime=1776238432679, creator=18614031015, updator=18614031015), WebsiteProps(id=1251197930212901360, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251197148512072052, code=minRunFlag, value=0, createTime=1776238432696, updateTime=1776238432696, creator=18614031015, updator=18614031015), WebsiteProps(id=1251197930170958314, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251197148512072052, code=picServerUrl, value=https://castjournals.cast.org.cn/joweb/dxjs/EN/file/pic, createTime=1776238432686, updateTime=1776238432686, creator=18614031015, updator=18614031015), WebsiteProps(id=1251197930204512751, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251197148512072052, code=silenceFlag, value=0, createTime=1776238432694, updateTime=1776238432694, creator=18614031015, updator=18614031015), WebsiteProps(id=1251197930162569705, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251197148512072052, code=staticResourcePath, value=https://castjournals.cast.org.cn/joweb/cast_kjdb_en_623/, createTime=1776238432684, updateTime=1776238432684, creator=18614031015, updator=18614031015), WebsiteProps(id=1251197930183541228, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251197148512072052, code=themeColor, value=null, createTime=1776238432689, updateTime=1776238432689, creator=18614031015, updator=18614031015), WebsiteProps(id=1251197930191929837, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251197148512072052, code=themeStyle, value=null, createTime=1776238432691, updateTime=1776238432691, creator=18614031015, updator=18614031015)])], journalTitle=电讯技术, weixinUrl=null, journalUrl=https://www.teleonline.cn/, iacademicId=null, status=1, seqNo=null, journalTitleEn=Telecommunication Engineering, journalPhotoCn=CpBmHoMzpESavU+iEMTBmw==, journalPhotoEn=jCOIy2zOaGJZ/y3z2gPZzg==, journalFirstLetter=T, journalRecommend=null, journalNew=null, journalCollection=null, jcrJf=null, cjcrJf=null, jcrJfStr=null, cjcrJfStr=null, submissionFirstDecision=null, sciSubjectClassification=null, casSubjectClassification=null, citeScore=null, totalCitationFrequency=null, icpCode=null, psCode=null, advertisingLicenseCode=null, copyrightInformation=null, country=null, option=, provinceCode=null, provinceName=null, collectFlag=false), detailUrlCn=https://castjournals.cast.org.cn/joweb/dxjs/CN/10.20079/j.issn.1001-893x.240527001, detailUrlEn=https://castjournals.cast.org.cn/joweb/dxjs/EN/10.20079/j.issn.1001-893x.240527001, pdfUrlCn=https://castjournals.cast.org.cn/joweb/dxjs/CN/PDF/10.20079/j.issn.1001-893x.240527001, pdfUrlEn=https://castjournals.cast.org.cn/joweb/dxjs/EN/PDF/10.20079/j.issn.1001-893x.240527001, aliStartDate=null, aliEndDate=null, collectionFlag=false, citedCount=null, citedUrl=null, reference=null)
收藏切换
基于多尺度特征的无人机目标识别算法
收藏切换
PDF下载
张博文 , 薛波
电讯技术 | 应用基础与前沿技术 2025,65(11): 1773-1780
收起
收藏切换
电讯技术 | 应用基础与前沿技术 2025, 65(11): 1773-1780
基于多尺度特征的无人机目标识别算法
全屏
张博文, 薛波
作者信息
  • 江苏理工学院 电气信息工程学院,江苏 常州 213000
  • 张博文 男,1999年生于江苏连云港,硕士研究生,主要研究方向机电产品检测与智能控制。

    薛波 男,1982年生于江苏徐州,博士,副教授、硕士生导师,目前主要从事机器学习、统计信号处理、无线传感器网络等方面的研究。

通讯作者:

薛波 Email:
UAV Target Recognition Based on Multi-scale Features
Bowen ZHANG, Bo XUE
Affiliations
  • School of Electrical and Information Engineering,Jiangsu Institute of Technology,Changzhou 213000,China
出版时间: 2025-11-28 doi: 10.20079/j.issn.1001-893x.240527001
文章导航
收藏切换

针对无人机对于不同尺度的目标识别存在误检和漏检等问题,提出了一种多尺度融合机制的YOLOv8-FDT无人机目标识别算法。首先,在基线模型Neck层中添加动态上采样模块,旨在降低模型参数量,提高模型对于目标识别的实时性。此外,为了使得整个算法模型在特征融合阶段能够捕捉目标的不同尺度语义信息,融合自适应下采样和深度卷积,设计特征聚焦扩散金字塔网络(Feature Diffusion Pyramid Network,FDPN)。通过无人机航拍数据集VisDrone2019的实验表明,改进后模型所有类别平均精度均值(mean Average Precision,mAP)相较基线模型提升6.24%。

无人机  /  目标识别  /  特征聚焦  /  多尺度融合

In view of the problems of false detection and missed detection when an unmanned aerial vehicle (UAV) detects targets at different scales,a YOLOv8-FDT UAV algorithm model with a multi-scale fusion mechanism is proposed. First, a dynamic upsampling module is added to the Neck layer of the baseline model to reduce the number of model parameters and improve the real-time performance of the model for target recognition. In addition, in order to enable the entire algorithm model to capture different scale semantic information of the target in the feature fusion stage,adaptive downsampling and depth convolution are integrated to design the feature diffusion pyramid network(FDPN). Finally,experiments on the UAV aerial photography dataset VisDrone2019 show that the mean average precision(mAP) of all categories of the improved model is increased by 6.24% compared with that of the baseline model.

UAV  /  small target recognition  /  focus feature  /  multi-scale fusion
张博文, 薛波. 基于多尺度特征的无人机目标识别算法. 电讯技术, 2025 , 65 (11) : 1773 -1780 . DOI: 10.20079/j.issn.1001-893x.240527001
Bowen ZHANG, Bo XUE. UAV Target Recognition Based on Multi-scale Features[J]. Telecommunication Engineering, 2025 , 65 (11) : 1773 -1780 . DOI: 10.20079/j.issn.1001-893x.240527001
近年来,无人机因具备快速启动、降落和重新部署的能力,从而能提高工作效率[1],在各领域得到了迅速普及[2]。将先进的目标识别算法与无人机技术相结合,运用复杂的计算机视觉技术[3],使得无人机能够实时准确地识别和跟踪目标。
目前主流的目标识别方法是引入特征金字塔网络,该网络在算法模型中结合了多级特征表示,以增强对小物体的感知,例如Faster R-CNN[4]、SSD[5]和YOLO[6]等算法。此外,研究人员致力于改进网络架构,同时结合注意力机制、特定损失函数等方法进一步提高目标识别能力。李大湘等人[7]使用位置敏感目标检测模型以应对无人机在复杂场景中的目标识别,较基线模型精度提高4.1%。Woo等人[8]采用卷积注意力模块(Convolutional Block Attention Module,CBAM),将空间注意力机制和通道注意力机制相结合,实现对关键信息的关注和不必要信息的抑制,提高了无人机对于小目标的特征的提取能力。罗旭鸿等人[9]在YOLOv5模型的基础上,添加一层小目标检测层,提高小目标识别的准确性。Lin等人[10]利用加权双向特征金字塔网络(Bidirectional Feature Pyramid Network,BiFPN)实现更复杂的特征融合,进一步捕捉小目标的语义信息。
以上方法虽然能进一步提升目标识别能力,但依然出现识别精度低和模型复杂度高等问题,因此,本文以单阶段目标检测YOLOv8模型为基础,提出一种改进的无人机目标识别算法YOLOv8-FDT。主要工作如下:设计特征聚焦扩散金字塔网络(Focusing Diffusion Pyramid Network,FDPN),将Backbone不同尺度的信息进行融合,使得网络具有丰富的上下文信息;在Neck层加入动态上采样模块,利用动态可调采用因子获取的采样集,提高目标识别的准确性并降低模型参数量。
无人机在目标识别过程中,通常需要同时监测多种类型的目标,而在目标出现重叠和特征信息模糊等情况下,不能有效地识别。目前主流的YOLO算法模型不仅能更为准确地识别小目标,例如人、车辆、建筑物等,而且能够提高目标识别的实时性。融合YOLOv8的无人机目标识别算法,通过实验,处理图片信息的帧速率可达到180.7,提高了无人机在处理大量目标时的效率[11]。YOLOv8模型在Backbone和Neck层使用多个Conv,其结构主要由Conv2d、BatchNorm2d、SiLU组成,多个Conv排列组合使得模型具有较高的检测精度和速度,并且利用特征重用来解决梯度弥散问题,从而保证在很大程度上保留原始数据[12]
图1所示,输入特征具有多尺度信息,存在远小近大情况。首先,模型将图片尺寸调整为640× 640。其次,在Backbone层中所有模块按照顺序进行连接,采用跨级特征融合网络(CSPDarknet53 to 2-Stage FPN,C2f)模块和快速空间金字塔池化(Fast Spatial Pyramid Pooling,SPPF)模块,利用C-2f的梯度分流,并结合SPPF融合大尺度信息,从而增强模型特征提取能力。然后,在Neck层使用自上而下的特征融合方式[13],结合特征金字塔网络和路径聚合网络,有助于增强特征图信息流,提高颈部网络特征融合能力。最后,引入无锚框目标检测(Ancher-Free)检测头[14],检测头由回归和分类分支组成,使用二元交叉熵损失进行分类,使用分布式焦点损失进行定位,更好地权衡预测框和实际框[15]
无人机结合YOLOv8算法能够实现对目标的识别,但存在一定的局限性。在复杂场景中目标存在不同的尺度(目标尺寸大小不统一),如行人、车辆等目标在输入特征图中所占像素比例偏小,并且伴随着目标出现远小近大的问题,单一尺度的特征提取往往无法有效地捕捉到不同尺度目标的重要特征,最终导致模型出现误检或漏检等情况[16]。因此,本文基于YOLOv8n模型,对无人机目标识别算法进行改进,提出了YOLOv8-FDT。改进算法框图如图2所示,在P14、P21将传统卷积替换DySample,在Neck层的P10、P17添加FocusFeature以融合Backbone层P4、P6、P9特征图信息。
上采样的本质属于点采样,动态上采样(Dynamic Sample,DySample)在网络训练过程中并不需要定制CUDA包,并且参数量和CPU内存占用率进一步降低。如图3所示,动态上采样的一般步骤如下:首先,输入特征图XH×W×C,经过动态采样生成器;其次,产生采样集S,采样因子为s;最后,网格采样函数通过采样集的位置,对输入特征进行重采样,生成上采样特征X′,如式(1)所示:
具体实现动态的上采样,需在动态采样生成器中生成动态的偏移量。图4所示为静态采样与动态采样之间的对比,展示了动态采样集是如何产生的。
对于静态采样点生成器,输入特征图XH×W× C,使用输入和输出通道数为C和2s2的线性层,生成2s2×H×W的偏移量O,如式(2)所示:
设置采样因子s为0.25。经过像素重组,重塑为2×sH×sW的偏移量O,表达式如下:
式中:O表示偏移量;G表示原始采样网络;采样集S表示偏移量O与原始采样网络G之和。图3图4中,g为将特征图分为g组。
而动态采样点生成器,结合双线性差值法,依然采用线性投影得到偏移量,通过结合sigmoid函数与0.5静态因子的乘积,得到[0,0.5]可调的动态因子,然后经过像素重塑得到偏移量O,如式(5)所示:
特征聚焦扩散金字塔网络,主要思想是借用特征聚焦(Focus Feature)这一结构,使得在特征融合阶段能够进一步获取目标的不同尺度语义信息。如图5所示,Focus Feature主要由自适应下采样和深度卷积组成。首先,特征聚焦模块接收3个尺度特征的输入,经过自适应下采样和1×1的卷积操作;然后,将3个不同尺度信息进行整合,并且利用一组并行深度卷积和身份映射,且每个卷积核的大小设置为1、3、5、7以获得更多的感受野;最后,使用1×1卷积调整通道数,与Concat中的目标信息进行整合相加输出。在FDPN网络中,如图2所示,将Backbone层中的P4、P6、P9作为3个不同尺度的输入,在Neck层中引入Focus Feature,整合与Backbone中的语义信息,使得浅层与深层信息得以充分融合,以提高模型对于复杂背景下小目标识别的能力。
自适应下采样(Adaptive Downsample,Adown)模块引入平均池化层和最大池化层,如图6所示。平均池化在输入端将特征图分为两部分,将区域内的图像取平均值,如式(6)所示:
式中:khkw表示池化核的高度和宽度;xi,j表示输入特征图(ij)的像素值。最大的池化则是将池化区域最大值作为池化结果,而不考虑具体的位置信息。这使得最大池化具有对于输入数据的平移操作不敏感的特性,即使输入数据发生平移,选取的最大值仍然保持不变,如式(7)所示:
在Focus Feature结构中加入Adown通过减少参数量来降低模型的复杂度,有助于提高模型的运行效率。
深度卷积(Depthwise Convolution,DWConv)是一种通道之间独立进行卷积操作的方式,对输入的每个通道单独进行卷积操作,一个卷积核只处理一个输入通道,若输入特征图具有3个输入通道,则由3个独立的卷积核进行处理,适用于移动设备等资源受限的场景[17]。在Focus Feature结构中使用一组并行的DWConv,并且对整合的图片信息采用不同的卷积核,使得DWConv可以捕获输入数据中的局部空间关系,对不同通道的特征进行独立学习。
VisDrone2019数据集[18]由天津大学和AISKYEYE团队收集。数据集包括训练集6471张,验证集548张,测试集3190张,一共有10个类别,并且数据集中每个类别数量由各种无人机摄像头捕获,覆盖范围广泛,包括行人、车辆、自行车等。
实验环境和硬件平台包括CPU R9-7945HX、GPU RTX4060(8 GB内存)、Ubuntu操作系统、CUDA10.2、Python3.9、Pytorch11.8。
在模型训练过程中,输入图像采用640×640,轮数200,Batch-size为16,优化器使用SGD,并采用数据增强,其他参数信息如表1所示。
为了展示改进算法的识别效果,采用精确度(Precision,P)、召回率(Recall,R)、平均精度均值(mean Average Precision,mAP)、参数量作为参考指标。PR的表达式如下所示:
式中:TP表示预测目标与实际目标都为真;FP表示类别预测为真,实际为假;FN表示类别预测为假,实际为真。
由于在无人机目标识别过程中存在多个类别,例如车、行人、自行车、公交车等,单从PR无法衡量模型性能。在模型的训练过程中,每个类别都可以得到对应的PR曲线,曲线下的面积为平均精度值(Average Precision,AP),mAP表示所有类别的平均精度值,因此模型评估使用mAP来衡量网络运行效率,表达式如下所示:
对所提算法进行消融实验,评估指标为Precision、Recall、mAP50和mAP50-95。相比于基线模型,待检测目标的平均精度提高6.24%。同时,为了验证本文提出DySample性能,对比Carafe下采样,实验表明DySample的参数量比Carafe下采样参数量更低,具体信息如表2所示。
为了验证本文提出的模型对小目标识别的性能优势,将YOLOv8n基线模型、改进模型与相关领域具有代表性的目标检测算法如YOLOv5s、YOLOv6、YOLOv7等进行对比,评估指标主要为PR、mAP50、mAP50-95、参数量、模型大小。实验结果如表3所示,改进的模型在保证目标识别精度的同时使得模型参数量降低,提高了无人机目标识别速率。同时,本文将YOLOv8-FDT与Faster-RCNN[19]、Cascade R-CNN[19]、RetinaNet[19]、CenterNet[19]算法进行对比,如表4所示。相比于传统算法,改进的模型在mAP 50和mAP50-95上提升明显。
将数据集中的10个目标类别的平均精度与改进模型YOLOv8-FDT进行比较,如图7所示,包括Pedestrain、Bicyle、People、Montor等在特征图中占据较少像素点的目标。实验结果表明,改进模型在目标识别方面的平均精度优于基线模型。另外,如Van、Bus、Awning-tricycle等特征图中占据较多像素点的目标,改进模型的mAP 50提高了约10%,证明本文提出的改进模型能够有效处理多尺度目标识别任务场景。
使用混淆矩阵将目标检测问题转化为一个回归问题来预测目标的边界框和类别,如图8所示,(a)表示基线模型训练结果,(b)表示改进模型训练结果。每一列代表真实类别,每一行代表预测类别,对角线表示正确识别样本数量。上三角为假阳性,表示模型将背景或其他类别分类成当前类别,区域数值越高表明模型存在较多误检。下三角为假阴性,表示模型未能正确检测真实存在的样本,区域的数值较高时,表明模型在这些类别上存在漏检问题。可以看出改进的模型较基线模型能够有效地降低漏检误检概率。
图9为在Visdrone2019验证集里的图片训练出来的效果,图9(a)所示基线模型会出现误检情况,基线模型将桶识别成车辆,而利用改进的模型,能够准确识别不同的类别,并未出现误检情况。此外,目标识别过程中会出现重叠覆盖的情况,如图9(b)所示当车辆和行人重叠时,基线模型将其识别在同一目标框内,而改进的模型将行人与车辆分开识别。图9(c)所示基线模型出现漏检情况,而改进的模型一定程度克服了漏检问题,并且每个类别的识别精度均有提升。图9(d)所示为在城市和人口密集区域下的目标识别结果。
改进模型在特殊场景下的识别能力如图10所示。在雾天中,能见度较低,图像被雾气所遮挡,导致目标物体的边缘模糊不清。其次,进行噪声测试。噪声会对目标的识别造成干扰,增加识别难度。然后,进行黑夜场景的测试。在黑夜中,光线不足,图像变得昏暗,目标物体的细节难以区分。最后,进行多尺度、复杂背景和目标被遮挡情况的测试。实验结果表明,改进的模型在特殊场景下均能实现小目标的识别,并能保持良好的识别精度。
无人机在目标识别[22]过程中存在漏检、误检和识别精度低等问题。本文在YOLOv8n基线模型上提出YOLOv8-FDT模型,设计扩散聚焦金字塔网络,获取不同尺度的图片语义信息,以提高算法在密集场景下的表现。同时,将基线模型Neck层传统卷积替换DySample,降低模型参数量以实现检测的实时性。通过Visdrone2019数据集实验表明,改进的模型较基线模型平均精度提高6.24%,使得无人机能够应对复杂场景的目标识别任务。
下一步工作将继续提高算法对于小目标的识别能力,同时降低模型的参数量和推理时间,提高模型在不同数据集上和更为复杂场景的泛化能力。
参考文献 引证文献
排序方式:
[1]
黄巍, 陈俊良, 李犹海.无人机自组网技术综述与发展展望[J].电讯技术, 2022, 62(1):138-146.
[2]
LAW H, DENG J. CornerNet:detecting objects as paired keypoints[C]//2018 European Conference on Computer Vision. Munich:IEEE, 2018:765-781.
[3]
LIU S, HUANG D. Receptive field block net for accurate and fast object detection[C]//2018 European Conference on Computer Vision. Cham:Springer, 2018:385-400.
[4]
REN S Q, HE K M, GIRSHICK R, et al. Faster R-CNN:towards real-time object detection with region proposal networks[C]//The 28th International Conference on Neural Information Processing Systems. Cambridge:MIT Press, 2015:91-99.
[5]
杨艳红, 钟宝江, 徐云龙.改进的SSD算法在智慧交通中的应用[J].电讯技术, 2022, 62(2):259-265.
[6]
LIM J S, ASTRID M, YOON H J, et al. Small object detection using context and attention[C]//2021 International Conference on Artificial Intelligence in Infor-mation and Communication. Jeju Island: IEEE, 2021:181-186.
[7]
李大湘, 辛嘉妮, 刘颖.位置敏感Transformer航拍图像目标识别模型[J].光学精密工程, 2024, 32(5):727-739.
[8]
WOO S, PARK J, LEE J Y, et al. CBAM: convolutional block attention module[C]//2018 European Conference on Computer Vision. Cham:Springer, 2018:3-19.
[9]
罗旭鸿, 刘永春, 楚国铭, .基于改进YOLO5无人机图像目标检测算法[J].无线电工程, 2023, 53(7):1528-1535.
[10]
LIN T Y, GOYAL P, GIRSHICK R, et al. Focal loss fordense object detection[C]//2017 IEEE International Conference on Computer Vision. Venice: IEEE, 2017:2980-2988.
[11]
REDMON J, FARHADI A. YOLO9000: better, faster, stronger[C]//2017 IEEE Conference on Computer Vision and Pattern Recognition. Los Alamitos: IEEE, 2017:7263-7271.
[12]
WANG C Y, BOCHKOVSKIY A, LIAO H Y M. YOLOv-7:trainable bag-of-freebies sets new state-of-the-art for real-time object detectors[C]//2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Vancouver:IEEE, 2023:7464-7475.
[13]
DU D, ZHU P, WEN L, et al. VisDrone-DET2019: the vision meets drone object detection in image challengeresults[C]//2019 IEEE/CVF International Conference on Computer Vision Workshops. Seoul:IEEE, 2019:213-226.
[14]
张利丰, 田莹.改进YOLOv8的多尺度轻量型车辆目标检测算法[J].计算机工程与应用, 2024, 60(3):129-137.
[15]
TANG W, SUN J, WANG G. Horizontal feature pyramid network for object detection in UAV images[C]//2021 China Automation Congress. Beijing:IEEE, 2021:7746-775.
[16]
CHENG Q Q, WANG H J, AING X C, et al. A UAV target detection algorithm based on YOLOv4-Tiny and improved WBF[C]//The 14th International Conference on Wireless Communications and Signal Processing. Nanjing: IEEE, 2022:122-126.
[17]
TAN M X, PANG R M, GUO S L. EffcientDet:scalable and effcient object detection[C]//2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Seattle:IEEE, 2020:10778-10787.
[18]
LIU W J, QIANG J, LI X X, et al. UAV image small object detection based on composite backbone network[J]. Mobile Information Systems, 2022(15):1-11.
[19]
WANG G, CHEN Y, AN P. UAV-YOLOv8: a small-object-detection model based on improved YOLOv8 for UAV aerial photography scenarios[J]. Sensors, 2023, 23(16):7190-7197.
[20]
TAHIR N U A, LONG Z, ZHANG Z. PVswin-YOLOv8s:UAV-based pedestrian and vehicle detection for traffic management in smart cities using improved YOLOv8[J]. Drones, 2024, 8(3):84-89.
[21]
孙佳宇, 徐民俊, 张俊鹏.优化改进YOLOv8无人机视角下目标检测算法[J].计算机工程与应用, 2024, 23(8):1-13.
[22]
李佳霖, 李卫东, 王廉钧, .基于残差网络高分辨距离像的无人机识别[J].现代雷达, 2024, 46(7):30-36.
2025年第65卷第11期
PDF下载
130
63
引用本文
BibTeX
文章信息
doi: 10.20079/j.issn.1001-893x.240527001
  • 接收时间:2024-05-27
  • 首发时间:2026-04-15
  • 出版时间:2025-11-28
补充材料
相关文章
文章信息
作者
出版历史
  • 收稿日期:2024-05-27
  • 修回日期:2024-08-19
基金
作者信息
    江苏理工学院 电气信息工程学院,江苏 常州 213000

通讯作者:

薛波 Email:
参考文献
分享链接
https://castjournals.cast.org.cn/joweb/dxjs/CN/10.20079/j.issn.1001-893x.240527001
分享至
全文二维码

扫描看全文

引用本文
BibTeX
本文的引用情况
2种不同金属材料的力学参数

Family
属数
Number of
genus
种数
Number of
species
占总种数比例
Percentage of
total species (%)

Genus
种数
Number of
species
占总种数比例
Percentage of total
species (%)
鹅膏菌科Amanitaceae 2 11 5.26 鹅膏菌属 Amanita 10 4.78
小菇科 Mycenaceae 2 12 5.74 丝盖伞属 Inocybe 5 2.39
多孔菌科 Polyporaceae 8 14 6.70 蜡蘑属 Laccaria 5 2.39
红菇科 Russulaceae 3 23 11.00 小皮伞属 Marasmius 6 2.87
小菇属 Mycena 11 5.26
光柄菇属 Pluteus 5 2.39
红菇属 Russula 17 8.13
栓菌属 Trametes 5 2.39
关闭全屏