Article(id=1241060182034272948, tenantId=1146029695717560320, journalId=1238841944844054536, issueId=1241060178263601474, articleNumber=null, orderNo=null, doi=10.12347/j.ycyk.20231114001, pmid=null, cstr=null, oa=null, hot=null, price=null, onlineType=0, articleFormat=0, articleType=null, articleTypeStr=null, receivedDate=1699891200000, receivedDateStr=2023-11-14, revisedDate=1703779200000, revisedDateStr=2023-12-29, acceptedDate=null, acceptedDateStr=null, onlineDate=1773821405179, onlineDateStr=2026-03-18, pubDate=null, pubDateStr=null, doiRegisterDate=null, doiRegisterDateStr=null, onlineIssueDate=1773821405179, onlineIssueDateStr=2026-03-18, onlineJustAcceptDate=null, onlineJustAcceptDateStr=null, onlineFirstDate=null, onlineFirstDateStr=null, sourceXml=null, magXml=null, createTime=1773821405179, creator=13701087609, updateTime=1773821405179, updator=13701087609, issue=Issue{id=1241060178263601474, tenantId=1146029695717560320, journalId=1238841944844054536, year='2024', volume='45', issue='2', pageStart='1', pageEnd='123', issueExtLink='null', onlineDate='null', pubDate='null', beforeIssueId=null, nextIssueId=null, price=null, status=1, issueComplete=1, articleOrder=1, issueType=-1, specialIssue=null, createTime=1773821404281, creator=13701087609, updateTime=1773821891324, updator=13701087609, preIssue=null, nextIssue=null, ext={EN=IssueExt(id=1241062221128724613, tenantId=1146029695717560320, journalId=1238841944844054536, issueId=1241060178263601474, language=EN, specialIssueTitle=, coverIllustrator=null, specialIssueEditor=, specialIssueAbout=), CN=IssueExt(id=1241062221128724614, tenantId=1146029695717560320, journalId=1238841944844054536, issueId=1241060178263601474, language=CN, specialIssueTitle=, coverIllustrator=null, specialIssueEditor=, specialIssueAbout=)}, issueFiles=null}, startPage=1, endPage=9, ext={EN=ArticleExt(id=1241060182399177399, articleId=1241060182034272948, tenantId=1146029695717560320, journalId=1238841944844054536, language=EN, title=Nearshore Ship Object Detection Method Based on Appearance Fine-grained Discrimination Network, columnId=1241060179823882575, journalTitle=Journal of Telemetry, Tracking and Command, columnName=Artificial Intelligence Technology, runingTitle=null, highlight=null, articleAbstract=

Offshore ship object detection is a very challenging task and has received widespread attention from scholars and experts. Detectors based on Convolutional Neural Networks (CNN) and attention mechanisms have made significant progress in offshore ship object detection. However, the problem of false detection in the detection process is caused by the apparent similarity and background interference of ship targets. In order to solve this problem, this paper proposes a detection head module for fine-grained appearance discrimination implemented with Faster RCNN. This module includes a category fine-grained branch and an efficient full-dimensional dynamic convolution localization branch. The category fine-grained branch mines and utilizes category fine-grained identification features through global feature modeling and flexible perception range. The efficient omni-dimensional dynamic convolution positioning branch distinguishes objects and backgrounds through the efficient and flexible perception of ship boundary information, thereby reducing false and missed detections. Through experimental verification on the offshore ship public dataset Seaships7000, the proposed algorithm reduces false detections and missed detections and improves detector performance.

, correspAuthors=null, authorNote=null, correspAuthorsNote=null, copyrightStatement=null, copyrightOwner=null, extLink=null, articleAbsUrl=null, sourceXml=null, magXml=null, pdfUrl=null, pdf=null, pdfFileSize=null, pdfExtLink=null, richHtmlUrl=null, mobilePdfUrl=null, reviewReport=null, pdfFirstPage=null, abstractGraph=null, abstractGraphContent=null, abstractVideo=null, citation=null, cebUrl=null, magXmlContent=null, mapNumber=null, authorCompany=null, fund=null, authors=null, authorsList=Lingtong MIN, Ziman FAN, Feiyang DOU, Qinyi LYU, Xin LI), CN=ArticleExt(id=1241060187570754425, articleId=1241060182034272948, tenantId=1146029695717560320, journalId=1238841944844054536, language=CN, title=基于表观细粒度辨别网络的近海船舶目标检测方法, columnId=1241060179987460434, journalTitle=遥测遥控, columnName=人工智能技术, runingTitle=null, highlight=null, articleAbstract=

近海船舶目标检测是一项非常具有挑战性的任务,受到学者专家广泛关注。基于卷积神经网络(CNN)和注意力机制的检测器在近海船舶目标检测方面的应用取得了显著成就。然而,船舶目标检测存在着表观相似和背景干扰导致检测过程中出现误检的问题。为此,本文提出了一种用于Faster RCNN(更快的基于区域的卷积神经网络)的表观细粒度辨别的检测头模块。该模块包括类别细粒度分支和高效全维动态卷积定位分支。其中类别细粒度分支通过全局特征建模和灵活的感知范围来挖掘和利用类别细粒度辨别特征,高效全维动态卷积定位分支通过高效灵活的感知船舶边界信息来区分目标与背景,从而减少误检漏检问题。通过在近海船舶公开数据集Seaships7000 上进行实验验证,本文算法减少了误检漏检,提升了检测器性能。

, correspAuthors=null, authorNote=null, correspAuthorsNote=null, copyrightStatement=null, copyrightOwner=null, extLink=null, articleAbsUrl=null, sourceXml=QNCvnibUbKp5dNcXLkOJKw==, magXml=XvmJr3yz7euUiGkK4pNDjA==, pdfUrl=null, pdf=V37KAEh9VEj/VqjqF050fw==, pdfFileSize=36550205, pdfExtLink=null, richHtmlUrl=null, mobilePdfUrl=null, reviewReport=null, pdfFirstPage=null, abstractGraph=tXblVMbm5YsVT5Q96a2a+Q==, abstractGraphContent=null, abstractVideo=null, citation=null, cebUrl=null, magXmlContent=+Q6VS1qKuQV3a1DIrAmzlw==, mapNumber=null, authorCompany=null, fund=null, authors=

闵令通 1987年生,副教授,硕士生导师。

范子满 2000年生,硕士研究生。

窦飞阳 2001年生,硕士研究生。

吕勤毅 1990年生,副教授,硕士生导师。

李鑫 1984年生,副教授,硕士生导师。

, authorsList=闵令通, 范子满, 窦飞阳, 吕勤毅, 李鑫)}, authors=[Author(id=1241060187897910160, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, orderNo=0, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=null, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1241060187985990553, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, authorId=1241060187897910160, language=EN, stringName=Lingtong MIN, firstName=Lingtong, middleName=null, lastName=MIN, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=School of Electronic Information, Northwestern Polytechnical University, Xi'an 710072, China, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1241060188090848157, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, authorId=1241060187897910160, language=CN, stringName=闵令通, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=西北工业大学电子信息学院 西安 710072, bio={"content":"

闵令通 1987年生,副教授,硕士生导师。

"}, bioImg=null, bioContent=

闵令通 1987年生,副教授,硕士生导师。

, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1241060187776275336, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, xref=null, ext=[AuthorCompanyExt(id=1241060187784663945, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, companyId=1241060187776275336, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=School of Electronic Information, Northwestern Polytechnical University, Xi'an 710072, China), AuthorCompanyExt(id=1241060187788858250, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, companyId=1241060187776275336, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=西北工业大学电子信息学院 西安 710072)])]), Author(id=1241060188187317158, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, orderNo=1, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=null, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1241060188296369072, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, authorId=1241060188187317158, language=EN, stringName=Ziman FAN, firstName=Ziman, middleName=null, lastName=FAN, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=School of Electronic Information, Northwestern Polytechnical University, Xi'an 710072, China, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1241060188409615285, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, authorId=1241060188187317158, language=CN, stringName=范子满, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=西北工业大学电子信息学院 西安 710072, bio={"content":"

范子满 2000年生,硕士研究生。

"}, bioImg=null, bioContent=

范子满 2000年生,硕士研究生。

, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1241060187776275336, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, xref=null, ext=[AuthorCompanyExt(id=1241060187784663945, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, companyId=1241060187776275336, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=School of Electronic Information, Northwestern Polytechnical University, Xi'an 710072, China), AuthorCompanyExt(id=1241060187788858250, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, companyId=1241060187776275336, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=西北工业大学电子信息学院 西安 710072)])]), Author(id=1241060188493501372, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, orderNo=2, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=null, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1241060188581581764, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, authorId=1241060188493501372, language=EN, stringName=Feiyang DOU, firstName=Feiyang, middleName=null, lastName=DOU, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=School of Electronic Information, Northwestern Polytechnical University, Xi'an 710072, China, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1241060188661273545, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, authorId=1241060188493501372, language=CN, stringName=窦飞阳, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=西北工业大学电子信息学院 西安 710072, bio={"content":"

窦飞阳 2001年生,硕士研究生。

"}, bioImg=null, bioContent=

窦飞阳 2001年生,硕士研究生。

, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1241060187776275336, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, xref=null, ext=[AuthorCompanyExt(id=1241060187784663945, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, companyId=1241060187776275336, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=School of Electronic Information, Northwestern Polytechnical University, Xi'an 710072, China), AuthorCompanyExt(id=1241060187788858250, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, companyId=1241060187776275336, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=西北工业大学电子信息学院 西安 710072)])]), Author(id=1241060188745159630, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, orderNo=3, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=null, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1241060188829045722, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, authorId=1241060188745159630, language=EN, stringName=Qinyi LYU, firstName=Qinyi, middleName=null, lastName=LYU, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=School of Electronic Information, Northwestern Polytechnical University, Xi'an 710072, China, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1241060188908737503, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, authorId=1241060188745159630, language=CN, stringName=吕勤毅, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=西北工业大学电子信息学院 西安 710072, bio={"content":"

吕勤毅 1990年生,副教授,硕士生导师。

"}, bioImg=null, bioContent=

吕勤毅 1990年生,副教授,硕士生导师。

, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1241060187776275336, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, xref=null, ext=[AuthorCompanyExt(id=1241060187784663945, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, companyId=1241060187776275336, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=School of Electronic Information, Northwestern Polytechnical University, Xi'an 710072, China), AuthorCompanyExt(id=1241060187788858250, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, companyId=1241060187776275336, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=西北工业大学电子信息学院 西安 710072)])]), Author(id=1241060188996817893, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, orderNo=4, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=null, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1241060190506767344, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, authorId=1241060188996817893, language=EN, stringName=Xin LI, firstName=Xin, middleName=null, lastName=LI, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=School of Electronic Information, Northwestern Polytechnical University, Xi'an 710072, China, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1241060190649373684, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, authorId=1241060188996817893, language=CN, stringName=李鑫, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=西北工业大学电子信息学院 西安 710072, bio={"content":"

李鑫 1984年生,副教授,硕士生导师。

"}, bioImg=null, bioContent=

李鑫 1984年生,副教授,硕士生导师。

, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1241060187776275336, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, xref=null, ext=[AuthorCompanyExt(id=1241060187784663945, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, companyId=1241060187776275336, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=School of Electronic Information, Northwestern Polytechnical University, Xi'an 710072, China), AuthorCompanyExt(id=1241060187788858250, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, companyId=1241060187776275336, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=西北工业大学电子信息学院 西安 710072)])])], keywords=[Keyword(id=1241060190737454072, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, language=EN, orderNo=1, keyword=Ship object detection), Keyword(id=1241060190808757245, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, language=EN, orderNo=2, keyword=Similar feature extraction), Keyword(id=1241060190871670787, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, language=EN, orderNo=3, keyword=Apparent discrimination), Keyword(id=1241060190942973962, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, language=EN, orderNo=4, keyword=Dynamic convolution), Keyword(id=1241060191031054349, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, language=EN, orderNo=5, keyword=Self-attention), Keyword(id=1241060191140106258, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, language=CN, orderNo=1, keyword=船舶目标检测), Keyword(id=1241060191240769560, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, language=CN, orderNo=2, keyword=类别细粒度), Keyword(id=1241060191320461341, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, language=CN, orderNo=3, keyword=表观判别), Keyword(id=1241060191446290470, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, language=CN, orderNo=4, keyword=全维动态卷积), Keyword(id=1241060191530176558, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, language=CN, orderNo=5, keyword=自注意力)], refs=[Reference(id=1241060195397324966, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, doi=null, pmid=null, pmcid=null, year=2022, volume=27, issue=7, pageStart=2078, pageEnd=2093, url=null, language=null, rfNumber=[1], rfOrder=0, authorNames=叶晨, 逯天洋, 肖潏灏, journalName=中国图象图形学报, refType=null, unstructuredReference=叶晨, 逯天洋, 肖潏灏, 等. 海事监控视频舰船目标检测研究现状与展望[J]. 中国图象图形学报, 2022, 27(7):2078-2093., articleTitle=海事监控视频舰船目标检测研究现状与展望, refAbstract=null), Reference(id=1241060195485405353, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, doi=null, pmid=null, pmcid=null, year=2022, volume=27, issue=7, pageStart=2078, pageEnd=2093, url=null, language=null, rfNumber=[1], rfOrder=1, authorNames=YE C, LU T Y, XIAO Y H, journalName=Journal of Image and Graphics, refType=null, unstructuredReference=YE C, LU T Y, XIAO Y H, et al. Maritime surveillance videos based ships detection algorithms: a survey[J]. Journal of Image and Graphics, 2022, 27(7): 2078-2093., articleTitle=Maritime surveillance videos based ships detection algorithms: a survey, refAbstract=null), Reference(id=1241060195581874349, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, doi=null, pmid=null, pmcid=null, year=2021, volume=39, issue=4, pageStart=421, pageEnd=429, url=null, language=null, rfNumber=[2], rfOrder=2, authorNames=文莉莉, 孙苗, 邬满, journalName=吉林大学学报 (信息科学版), refType=null, unstructuredReference=文莉莉, 孙苗, 邬满. 基于Faster R-CNN的海域监管预警方法[J]. 吉林大学学报 (信息科学版), 2021, 39(4):421-429., articleTitle=基于Faster R-CNN的海域监管预警方法, refAbstract=null), Reference(id=1241060195665760433, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, doi=null, pmid=null, pmcid=null, year=2021, volume=39, issue=4, pageStart=421, pageEnd=429, url=null, language=null, rfNumber=[2], rfOrder=3, authorNames=WEN L, SUN M, WU M, journalName=Journal of Jilin University (Information Science Edition), refType=null, unstructuredReference=WEN L, SUN M, WU M. Early warning method of sea area supervision based on faster R-CNN[J]. Journal of Jilin University (Information Science Edition), 2021, 39(4):421-429., articleTitle=Early warning method of sea area supervision based on faster R-CNN, refAbstract=null), Reference(id=1241060195737063603, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, doi=null, pmid=null, pmcid=null, year=2023, volume=10, issue=4, pageStart=131, pageEnd=137, url=null, language=null, rfNumber=[3], rfOrder=4, authorNames=王小虎, journalName=南方能源建设, refType=null, unstructuredReference=王小虎. 海上船舶融合识别算法研究与改进[J]. 南方能源建设, 2023, 10(4): 131-137., articleTitle=海上船舶融合识别算法研究与改进, refAbstract=null), Reference(id=1241060195816755382, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, doi=null, pmid=null, pmcid=null, year=2023, volume=10, issue=4, pageStart=131, pageEnd=137, url=null, language=null, rfNumber=[3], rfOrder=5, authorNames=WANG X, journalName=Southern Energy Constrution, refType=null, unstructuredReference=WANG X. Research and improvement of offshore ship fusion recognition algorithm[J]. Southern Energy Constrution, 2023, 10(4): 131-137., articleTitle=Research and improvement of offshore ship fusion recognition algorithm, refAbstract=null), Reference(id=1241060195913224379, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, doi=null, pmid=null, pmcid=null, year=2022, volume=38, issue=10, pageStart=67, pageEnd=78, url=null, language=null, rfNumber=[4], rfOrder=6, authorNames=周薇娜, 刘露, journalName=电信科学, refType=null, unstructuredReference=周薇娜, 刘露. 复杂场景下多尺度船舶实时检测方法[J].电信科学, 2022, 38(10): 67-78., articleTitle=复杂场景下多尺度船舶实时检测方法, refAbstract=null), Reference(id=1241060195992916159, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, doi=null, pmid=null, pmcid=null, year=2022, volume=38, issue=10, pageStart=67, pageEnd=78, url=null, language=null, rfNumber=[4], rfOrder=7, authorNames=ZHOU W N, LIU L, journalName=Telecommunications Science, refType=null, unstructuredReference=ZHOU W N, LIU L. A real-time detection method for multi-scale ships in complex scenes[J]. Telecommunications Science, 2022, 38(10): 67-78., articleTitle=A real-time detection method for multi-scale ships in complex scenes, refAbstract=null), Reference(id=1241060196072607939, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, doi=null, pmid=null, pmcid=null, year=2023, volume=15, issue=16, pageStart=3970, pageEnd=null, url=null, language=null, rfNumber=[5], rfOrder=8, authorNames=MIN L, FAN Z, LV Q, journalName=Remote Sensing, refType=null, unstructuredReference=MIN L, FAN Z, LV Q, et al. YOLO-DCTI: Small object detection in remote sensing base on contextual transformer enhancement[J]. Remote Sensing, 2023, 15(16):3970., articleTitle=YOLO-DCTI: Small object detection in remote sensing base on contextual transformer enhancement, refAbstract=null), Reference(id=1241060196173271237, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, doi=null, pmid=null, pmcid=null, year=2020, volume=43, issue=4, pageStart=1452, pageEnd=1459, url=null, language=null, rfNumber=[6], rfOrder=9, authorNames=XU Y, FU M, WANG Q, journalName=IEEE Transactions on Pattern Analysis and Machine Intel-ligence, refType=null, unstructuredReference=XU Y, FU M, WANG Q, et al. Gliding vertex on the horizontal bounding box for multi-oriented object detection[J]. IEEE Transactions on Pattern Analysis and Machine Intel-ligence, 2020, 43(4): 1452-1459., articleTitle=Gliding vertex on the horizontal bounding box for multi-oriented object detection, refAbstract=null), Reference(id=1241060196265545926, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, doi=null, pmid=null, pmcid=null, year=2022, volume=45, issue=4, pageStart=735, pageEnd=747, url=null, language=null, rfNumber=[7], rfOrder=10, authorNames=谢星星, 程塨, 姚艳清, journalName=计算机学报, refType=null, unstructuredReference=谢星星, 程塨, 姚艳清, 等. 动态特征融合的遥感图像目标检测[J]. 计算机学报, 2022, 45(4): 735-747., articleTitle=动态特征融合的遥感图像目标检测, refAbstract=null), Reference(id=1241060196332654794, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, doi=null, pmid=null, pmcid=null, year=2018, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[8], rfOrder=11, authorNames=REDMON J, FARHADI A, journalName=null, refType=null, unstructuredReference=REDMON J, FARHADI A. Yolov3: An incremental improvement[J]. arXiv preprint arXiv: 2018:, articleTitle=Yolov3: An incremental improvement, refAbstract=null), Reference(id=1241060196424929485, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, doi=null, pmid=null, pmcid=null, year=2019, volume=null, issue=null, pageStart=9627, pageEnd=9636, url=null, language=null, rfNumber=[9], rfOrder=12, authorNames=TIAN Z, SHEN C, CHEN H, journalName=null, refType=null, unstructuredReference=TIAN Z, SHEN C, CHEN H, et al. Fcos: Fully convolutional one-stage object detection[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision, 2019: 9627-9636., articleTitle=Fcos: Fully convolutional one-stage object detection, refAbstract=null), Reference(id=1241060196525592782, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, doi=null, pmid=null, pmcid=null, year=2023, volume=null, issue=null, pageStart=7464, pageEnd=7475, url=null, language=null, rfNumber=[10], rfOrder=13, authorNames=WANG C Y, BOCHKOVSKIY A, LIAO H Y M, journalName=null, refType=null, unstructuredReference=WANG C Y, BOCHKOVSKIY A, LIAO H Y M. YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023: 7464-7475., articleTitle=YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors, refAbstract=null), Reference(id=1241060196605284561, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, doi=null, pmid=null, pmcid=null, year=2016, volume=null, issue=null, pageStart=21, pageEnd=37, url=null, language=null, rfNumber=[11], rfOrder=14, authorNames=LIU W, ANGUELOV D, ERHAN D, journalName=null, refType=null, unstructuredReference=LIU W, ANGUELOV D, ERHAN D, et al. Ssd: Single shot multibox detector[C]// Computer Vision-ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, October 11-14, 2016, Proceedings, Part I 14. Springer International Publishing, 2016: 21-37., articleTitle=Ssd: Single shot multibox detector, refAbstract=null), Reference(id=1241060196689170645, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, doi=null, pmid=null, pmcid=null, year=2017, volume=39, issue=6, pageStart=1137, pageEnd=1149, url=null, language=null, rfNumber=[12], rfOrder=15, authorNames=REN S, HE K, GIRSHICK R, journalName=IEEE Transactions on Pattern Analysis &Machine Intelligence, refType=null, unstructuredReference=REN S, HE K, GIRSHICK R, et al. Faster R-CNN: Towards real-time object detection with region proposal networks[J]. IEEE Transactions on Pattern Analysis &Machine Intelligence, 2017, 39(6): 1137-1149., articleTitle=Faster R-CNN: Towards real-time object detection with region proposal networks, refAbstract=null), Reference(id=1241060196773056728, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, doi=null, pmid=null, pmcid=null, year=2020, volume=null, issue=null, pageStart=7148, pageEnd=7153, url=null, language=null, rfNumber=[13], rfOrder=16, authorNames=WEI S, CHEN H, ZHU X, journalName=null, refType=null, unstructuredReference=WEI S, CHEN H, ZHU X, et al. Ship detection in remote sensing image based on faster r-cnn with dilated convolution[C]//2020 39th Chinese Control Conference (CCC). IEEE, 2020: 7148-7153., articleTitle=Ship detection in remote sensing image based on faster r-cnn with dilated convolution, refAbstract=null), Reference(id=1241060196869525722, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, doi=null, pmid=null, pmcid=null, year=2020, volume=58, issue=8, pageStart=5772, pageEnd=5781, url=null, language=null, rfNumber=[14], rfOrder=17, authorNames=GUO H, YANG X, WANG N, journalName=IEEE Transactions on Geoscience and Remote Sensing, refType=null, unstructuredReference=GUO H, YANG X, WANG N, et al. A rotational libra R-CNN method for ship detection[J]. IEEE Transactions on Geoscience and Remote Sensing, 2020, 58(8): 5772-5781., articleTitle=A rotational libra R-CNN method for ship detection, refAbstract=null), Reference(id=1241060197003743456, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, doi=null, pmid=null, pmcid=null, year=2021, volume=13, issue=16, pageStart=3192, pageEnd=null, url=null, language=null, rfNumber=[15], rfOrder=18, authorNames=DONG Y, CHEN F, HAN S, journalName=Remote Sensing, refType=null, unstructuredReference=DONG Y, CHEN F, HAN S, et al. Ship object detection of remote sensing image based on visual attention[J]. Remote Sensing, 2021, 13(16): 3192., articleTitle=Ship object detection of remote sensing image based on visual attention, refAbstract=null), Reference(id=1241060197091823843, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, doi=null, pmid=null, pmcid=null, year=2019, volume=11, issue=13, pageStart=1529, pageEnd=null, url=null, language=null, rfNumber=[16], rfOrder=19, authorNames=DONG C, LIU J, XU F, journalName=Remote Sensing, refType=null, unstructuredReference=DONG C, LIU J, XU F, et al. Ship detection from optical remote sensing images using multi-scale analysis and Fourier HOG descriptor[J]. Remote Sensing, 2019, 11(13): 1529., articleTitle=Ship detection from optical remote sensing images using multi-scale analysis and Fourier HOG descriptor, refAbstract=null), Reference(id=1241060197163127011, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, doi=null, pmid=null, pmcid=null, year=2020, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[17], rfOrder=20, authorNames=DOSOVITSKIY A, BEYER L, KOLESNIKOV A, journalName=null, refType=null, unstructuredReference=DOSOVITSKIY A, BEYER L, KOLESNIKOV A, et al. An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale[C]//International Conference on Learning Representations, 2020., articleTitle=An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale, refAbstract=null), Reference(id=1241060197221847269, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, doi=null, pmid=null, pmcid=null, year=2021, volume=null, issue=null, pageStart=10012, pageEnd=10022, url=null, language=null, rfNumber=[18], rfOrder=21, authorNames=LIU Z, LIN Y, CAO Y, journalName=null, refType=null, unstructuredReference=LIU Z, LIN Y, CAO Y, et al. Swin transformer: Hierarchical vision transformer using shifted windows[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021: 10012-10022., articleTitle=Swin transformer: Hierarchical vision transformer using shifted windows, refAbstract=null), Reference(id=1241060197284761832, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, doi=null, pmid=null, pmcid=null, year=2017, volume=null, issue=null, pageStart=6000, pageEnd=6010, url=null, language=null, rfNumber=[19], rfOrder=22, authorNames=VASWANI A, SHAZEER N, PARMAR N, journalName=null, refType=null, unstructuredReference=VASWANI A, SHAZEER N, PARMAR N, et al. Attention is all you need[C]//Proceedings of the 31st International Conference on Neural Information Processing Systems, 2017: 6000-6010., articleTitle=Attention is all you need, refAbstract=null), Reference(id=1241060197377036523, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, doi=null, pmid=null, pmcid=null, year=2019, volume=null, issue=null, pageStart=1307, pageEnd=1318, url=null, language=null, rfNumber=[20], rfOrder=23, authorNames=YANG B, BENDER G, LE Q V, journalName=null, refType=null, unstructuredReference=YANG B, BENDER G, LE Q V, et al. CondConv: conditionally parameterized convolutions for efficient inference[C]//Proceedings of the 33rd International Conference on Neural Information Processing Systems, 2019:1307-1318., articleTitle=CondConv: conditionally parameterized convolutions for efficient inference, refAbstract=null), Reference(id=1241060197465116909, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, doi=null, pmid=null, pmcid=null, year=2020, volume=null, issue=null, pageStart=11030, pageEnd=11039, url=null, language=null, rfNumber=[21], rfOrder=24, authorNames=CHEN Y, DAI X, LIU M, journalName=null, refType=null, unstructuredReference=CHEN Y, DAI X, LIU M, et al. Dynamic convolution:Attention over convolution kernels[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020: 11030-11039., articleTitle=Dynamic convolution:Attention over convolution kernels, refAbstract=null), Reference(id=1241060197578363118, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, doi=null, pmid=null, pmcid=null, year=2021, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[22], rfOrder=25, authorNames=LI C, ZHOU A, YAO A, journalName=null, refType=null, unstructuredReference=LI C, ZHOU A, YAO A. Omni-dimensional dynamic convolution[C]//International Conference on Learning Representations. 2021., articleTitle=Omni-dimensional dynamic convolution, refAbstract=null), Reference(id=1241060197674832112, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, doi=null, pmid=null, pmcid=null, year=2020, volume=null, issue=null, pageStart=213, pageEnd=229, url=null, language=null, rfNumber=[23], rfOrder=26, authorNames=CARION N, MASSA F, SYNNAEVE G, journalName=null, refType=null, unstructuredReference=CARION N, MASSA F, SYNNAEVE G, et al. End-to-end object detection with transformers[C]//European Conference on Computer Vision. Springer International Publishing, 2020: 213-229., articleTitle=End-to-end object detection with transformers, refAbstract=null), Reference(id=1241060197767106803, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, doi=null, pmid=null, pmcid=null, year=2023, volume=61, issue=null, pageStart=1, pageEnd=10, url=null, language=null, rfNumber=[24], rfOrder=27, authorNames=CHENG G, LI Q, WANG G, journalName=IEEE Transactions on Geoscience and Remote Sensing, refType=null, unstructuredReference=CHENG G, LI Q, WANG G, et al. SFRNet: Fine-grained oriented object recognition via separate feature refinement[J]. IEEE Transactions on Geoscience and Remote Sensing, 2023, 61: 1-10., articleTitle=SFRNet: Fine-grained oriented object recognition via separate feature refinement, refAbstract=null), Reference(id=1241060197880353013, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, doi=null, pmid=null, pmcid=null, year=2023, volume=null, issue=null, pageStart=679, pageEnd=683, url=null, language=null, rfNumber=[25], rfOrder=28, authorNames=WEI Q, HU X, HOU Q, journalName=null, refType=null, unstructuredReference=WEI Q, HU X, HOU Q. Dynamic-YOLOv5: An imp-roved aerial small object detector based on YOLOv5[C]//2023 3rd International Conference on Neural Networks, Information and Communication Engineering (NNICE), IEEE, 2023: 679-683., articleTitle=Dynamic-YOLOv5: An imp-roved aerial small object detector based on YOLOv5, refAbstract=null), Reference(id=1241060197951656183, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, doi=null, pmid=null, pmcid=null, year=2016, volume=null, issue=null, pageStart=770, pageEnd=778, url=null, language=null, rfNumber=[26], rfOrder=29, authorNames=HE K, ZHANG X, REN S, journalName=null, refType=null, unstructuredReference=HE K, ZHANG X, REN S, et al. Deep residual learning for image recognition[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2016: 770-778., articleTitle=Deep residual learning for image recognition, refAbstract=null), Reference(id=1241060199008620795, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, doi=null, pmid=null, pmcid=null, year=2017, volume=null, issue=null, pageStart=2117, pageEnd=2125, url=null, language=null, rfNumber=[27], rfOrder=30, authorNames=LIN T Y, DOLLÁR P, GIRSHICK R, journalName=null, refType=null, unstructuredReference=LIN T Y, DOLLÁR P, GIRSHICK R, et al. Feature pyramid networks for object detection[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern recognition, 2017: 2117-2125., articleTitle=Feature pyramid networks for object detection, refAbstract=null), Reference(id=1241060199084118270, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, doi=null, pmid=null, pmcid=null, year=2018, volume=20, issue=10, pageStart=2593, pageEnd=2604, url=null, language=null, rfNumber=[28], rfOrder=31, authorNames=SHAO Z, WU W, WANG Z, journalName=IEEE Transactions on Multimedia, refType=null, unstructuredReference=SHAO Z, WU W, WANG Z, et al. Seaships: A large-scale precisely annotated dataset for ship detection[J]. IEEE Transactions on Multimedia, 2018, 20(10): 2593-2604., articleTitle=Seaships: A large-scale precisely annotated dataset for ship detection, refAbstract=null), Reference(id=1241060199163810048, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, doi=null, pmid=null, pmcid=null, year=2017, volume=null, issue=null, pageStart=2961, pageEnd=2969, url=null, language=null, rfNumber=[29], rfOrder=32, authorNames=HE K, GKIOXARI G, DOLLÁR P, journalName=null, refType=null, unstructuredReference=HE K, GKIOXARI G, DOLLÁR P, et al. Mask r-cnn[C]//Proceedings of the IEEE International Conference on Computer Vision. 2017: 2961-2969., articleTitle=Mask r-cnn, refAbstract=null), Reference(id=1241060199285444866, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, doi=null, pmid=null, pmcid=null, year=2020, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[30], rfOrder=33, authorNames=BOCHKOVSKIY A, WANG C Y, LIAO H Y M, journalName=null, refType=null, unstructuredReference=BOCHKOVSKIY A, WANG C Y, LIAO H Y M. Yolov4: Optimal speed and accuracy of object detection[J].arXiv preprint arXiv: 2020:, articleTitle=Yolov4: Optimal speed and accuracy of object detection, refAbstract=null), Reference(id=1241060199365136646, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, doi=null, pmid=null, pmcid=null, year=2020, volume=null, issue=null, pageStart=11563, pageEnd=11572, url=null, language=null, rfNumber=[31], rfOrder=34, authorNames=SONG G, LIU Y, WANG X, journalName=null, refType=null, unstructuredReference=SONG G, LIU Y, WANG X. Revisiting the sibling head in object detector[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020: 11563-11572., articleTitle=Revisiting the sibling head in object detector, refAbstract=null), Reference(id=1241060199453217033, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, doi=null, pmid=null, pmcid=null, year=2019, volume=null, issue=null, pageStart=821, pageEnd=830, url=null, language=null, rfNumber=[32], rfOrder=35, authorNames=PANG J, CHEN K, SHI J, journalName=null, refType=null, unstructuredReference=PANG J, CHEN K, SHI J, et al. Libra r-cnn: Towards balanced learning for object detection[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019: 821-830., articleTitle=Libra r-cnn: Towards balanced learning for object detection, refAbstract=null), Reference(id=1241060199528714507, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, doi=null, pmid=null, pmcid=null, year=2019, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[33], rfOrder=36, authorNames=CHEN K, WANG J, PANG J, journalName=null, refType=null, unstructuredReference=CHEN K, WANG J, PANG J, et al. MMDetection: Open mmlab detection toolbox and benchmark[J]. arXiv preprint arXiv: 2019:, articleTitle=MMDetection: Open mmlab detection toolbox and benchmark, refAbstract=null), Reference(id=1241060199600017678, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, doi=null, pmid=null, pmcid=null, year=2014, volume=null, issue=null, pageStart=740, pageEnd=755, url=null, language=null, rfNumber=[34], rfOrder=37, authorNames=LIN T Y, MAIRE M, BELONGIE S, journalName=null, refType=null, unstructuredReference=LIN T Y, MAIRE M, BELONGIE S, et al. Microsoft coco: Common objects in context[C]//Computer Vision-ECCV 2014: 13th European Conference, Zurich, Switzerland, September 6-12, 2014, Proceedings, Part V 13. Springer International Publishing, 2014: 740-755., articleTitle=Microsoft coco: Common objects in context, refAbstract=null)], funds=[Fund(id=1241060195225358496, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, awardId=62206221, language=CN, fundingSource=国家自然科学基金项目(62206221), fundOrder=null, country=null), Fund(id=1241060195296661666, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, awardId=2021JM-074, language=CN, fundingSource=陕西省自然科学基础研究计划资助项目(2021JM-074), fundOrder=null, country=null)], companyList=[AuthorCompany(id=1241060187776275336, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, xref=null, ext=[AuthorCompanyExt(id=1241060187784663945, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, companyId=1241060187776275336, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=School of Electronic Information, Northwestern Polytechnical University, Xi'an 710072, China), AuthorCompanyExt(id=1241060187788858250, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, companyId=1241060187776275336, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=西北工业大学电子信息学院 西安 710072)])], figs=[ArticleFig(id=1241060191664394295, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, language=EN, label=Fig.1, caption=The ground truth labels (in red) and Faster RCNN detection results (in yellow), figureFileSmall=knoSvDjHTSGjk7tCPJ7SaA==, figureFileBig=tXblVMbm5YsVT5Q96a2a+Q==, tableContent=null), ArticleFig(id=1241060191777640507, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, language=CN, label=图1, caption=真实标签(红色)和Faster RCNN检测结果(黄色), figureFileSmall=knoSvDjHTSGjk7tCPJ7SaA==, figureFileBig=tXblVMbm5YsVT5Q96a2a+Q==, tableContent=null), ArticleFig(id=1241060191987355722, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, language=EN, label=Fig.2, caption=The overview detection framework of FDNet, figureFileSmall=wDQzjl0vQdNnZaI+6N/ytQ==, figureFileBig=zFqJiaVRTwrgtoH0dcsT+w==, tableContent=null), ArticleFig(id=1241060192071241809, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, language=CN, label=图2, caption=FDNet的总体检测框架, figureFileSmall=wDQzjl0vQdNnZaI+6N/ytQ==, figureFileBig=zFqJiaVRTwrgtoH0dcsT+w==, tableContent=null), ArticleFig(id=1241060192180293716, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, language=EN, label=Fig.3, caption=Omni-dimensional Dynamic Convolution (ODConv), figureFileSmall=oAWtOcN8d9km5zXK6l3Dlw==, figureFileBig=sBWYLHboCOyzzceE8p8Ugg==, tableContent=null), ArticleFig(id=1241060192306122840, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, language=CN, label=图3, caption=全维动态卷积, figureFileSmall=oAWtOcN8d9km5zXK6l3Dlw==, figureFileBig=sBWYLHboCOyzzceE8p8Ugg==, tableContent=null), ArticleFig(id=1241060192427757661, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, language=EN, label=Fig.4, caption=Transformer and Self-attention mechanism, figureFileSmall=pSw7mpgf96ESNCzZW6mUeA==, figureFileBig=DWcMuaUZowINKQIVau25Ag==, tableContent=null), ArticleFig(id=1241060192511643745, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, language=CN, label=图4, caption=Transformer和自我注意力机制, figureFileSmall=pSw7mpgf96ESNCzZW6mUeA==, figureFileBig=DWcMuaUZowINKQIVau25Ag==, tableContent=null), ArticleFig(id=1241060192633278567, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, language=EN, label=Fig.5, caption=Illustration of the Seaship7000 dataset, figureFileSmall=rzluwM5qEgJ9bH/vpjQijQ==, figureFileBig=oqkU/Bk1smXKGLVmm64LPw==, tableContent=null), ArticleFig(id=1241060192712970347, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, language=CN, label=图5, caption=Seaship7000数据集示意图, figureFileSmall=rzluwM5qEgJ9bH/vpjQijQ==, figureFileBig=oqkU/Bk1smXKGLVmm64LPw==, tableContent=null), ArticleFig(id=1241060192822022259, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, language=EN, label=Fig.6, caption=Comparison of detection results on the Seaship7000 dataset, with red indicating true labels, green representing our FDNet model, and yellow denoting Faster-RCNN, figureFileSmall=q8FxttGPh+SowuClGzkvoQ==, figureFileBig=Xd6AwkNgDgISrRXabbohKA==, tableContent=null), ArticleFig(id=1241060192914296951, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, language=CN, label=图6, caption=Seaship7000数据集检测结果对比,红色为真实标签,绿色为本文的模型FDNet,黄色为Faster-RCNN, figureFileSmall=q8FxttGPh+SowuClGzkvoQ==, figureFileBig=Xd6AwkNgDgISrRXabbohKA==, tableContent=null), ArticleFig(id=1241060192998183034, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, language=EN, label=Fig.7, caption=Comparison results of real image detection, figureFileSmall=L/mrlmIpCJKSE+S/LRemaA==, figureFileBig=JKM/RilDpXBqPQ83aQeXdg==, tableContent=null), ArticleFig(id=1241060193103040641, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, language=CN, label=图7, caption=真实图像检测对比结果, figureFileSmall=L/mrlmIpCJKSE+S/LRemaA==, figureFileBig=JKM/RilDpXBqPQ83aQeXdg==, tableContent=null), ArticleFig(id=1241060193203703943, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, language=EN, label=Table 1, caption=

Comparison of different models on the Seaship7000 dataset

, figureFileSmall=null, figureFileBig=null, tableContent=
ModelsmAPAP50AP75APSAPMAPLFPSParametersFLOPs
YOLOv358.893.667.34.234.960.349.43.71.7
FCOS57.590.665.35.137.558.426.132.1202.8
YOLOv758.493.364.817.040.259.483.336.5103.2
SSD64.894.975.23.140.866.461.224.430.7
Libra-RCNN59.292.568.15.135.460.631.041.2226.3
Faster-RCNN64.495.375.711.245.465.617.541.6210.9
FDNet(Ours)67.495.980.815.049.068.314.265.1497.7
), ArticleFig(id=1241060193346310284, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, language=CN, label=表1, caption=

不同模型在Seaship7000上的对比

, figureFileSmall=null, figureFileBig=null, tableContent=
ModelsmAPAP50AP75APSAPMAPLFPSParametersFLOPs
YOLOv358.893.667.34.234.960.349.43.71.7
FCOS57.590.665.35.137.558.426.132.1202.8
YOLOv758.493.364.817.040.259.483.336.5103.2
SSD64.894.975.23.140.866.461.224.430.7
Libra-RCNN59.292.568.15.135.460.631.041.2226.3
Faster-RCNN64.495.375.711.245.465.617.541.6210.9
FDNet(Ours)67.495.980.815.049.068.314.265.1497.7
), ArticleFig(id=1241060193455362191, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, language=EN, label=Table 2, caption=

Ablation experiment

, figureFileSmall=null, figureFileBig=null, tableContent=
ModelsBackbone分类分支(CFGB)定位分支(LBDC)mAPAP50AP75APSAPMAPL
BaselineResNet5064.495.375.711.245.465.6
FDNet(Ours)ResNet5067.696.379.712.349.868.6
ResNet5067.395.880.114.050.068.3
ResNet5067.895.880.624.247.868.8
), ArticleFig(id=1241060194986283155, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, language=CN, label=表2, caption=

消融实验

, figureFileSmall=null, figureFileBig=null, tableContent=
ModelsBackbone分类分支(CFGB)定位分支(LBDC)mAPAP50AP75APSAPMAPL
BaselineResNet5064.495.375.711.245.465.6
FDNet(Ours)ResNet5067.696.379.712.349.868.6
ResNet5067.395.880.114.050.068.3
ResNet5067.895.880.624.247.868.8
), ArticleFig(id=1241060195074363546, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, language=EN, label=Table 3, caption=

mAP for each category

, figureFileSmall=null, figureFileBig=null, tableContent=
ModelsBackbone散货
集装
箱船
渔船杂货
矿砂
客船
BaselineResNet5063.475.760.367.260.059.9
+ LBDCResNet5067.877.764.068.662.964.4
+ CFGBResNet5066.977.064.468.763.063.9
OursResNet5067.578.565.568.463.463.6
), ArticleFig(id=1241060195137278109, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241060182034272948, language=CN, label=表3, caption=

各个类别mAP

, figureFileSmall=null, figureFileBig=null, tableContent=
ModelsBackbone散货
集装
箱船
渔船杂货
矿砂
客船
BaselineResNet5063.475.760.367.260.059.9
+ LBDCResNet5067.877.764.068.662.964.4
+ CFGBResNet5066.977.064.468.763.063.9
OursResNet5067.578.565.568.463.463.6
)], attaches=null, journal=Journal(id=1238822691516493831, delFlag=0, nameCn=遥测遥控, nameEn=Journal of Telemetry, Tracking and Command, nameHistory1=null, nameHistory2=null, issn=2095-1000, eissn=null, cn=11-1780/TP, coden=null, periodic=1, language=CN, oaType=null, ccby=null, superviseOffice=null, ownerOffice=null, pubOffice=null, editorOffice=null, officeType=null, aims=null, clcCode=null, officeProv=null, officeCity=null, officeAddr=null, officeZip=null, officeEmail=null, officePhone=null, editDirector=null, officeDirector=null, officeDirectorPhone=null, officeStaffNum=null, officeEmpNum=null, coverPicUrl=uUqWBk5pV4VWPULNoAHkuQ==, journalPrice=null, startedYear=null, abbrevIsoEn=Journal of Telemetry, Tracking and Command, journalRemark=null, publicationField=null, createdTime=1773287945871, updatedTime=1773299995823, createdBy=18614031015, updatedBy=13701087609, firstLetterCn=J, firstLetterEn=J, subjectCode=Engineering, subjectName=null, subjectCodeEn=Engineering, subjectNameEn=null, picCn=uUqWBk5pV4VWPULNoAHkuQ==, picEn=RDDslHHvDBDNnJMjk6BxdA==, jcr=null, cjcr=null, exts=[JournalExt(id=1238873232732909900, language=CN, name=遥测遥控, nameHistory1=null, nameHistory2=null, managedBy=, sponsoredBy=, publishedBy=, editorOffice=, officeProv=null, officeCity=null, officeAddr=, officeZip=, editDirector=, officeDirector=null, officePhone=null, coverPicUrl=null, journalRemark=, submitArticleUrl=null, websiteUrl=, createdTime=1773299995835, updatedTime=1773299995835, createdBy=13701087609, updatedBy=13701087609, submissionGuidelinesUrl=, submissionAuthorUrl=https://manuscript.spacejournal.cn/ycyk, submissionEditorUrl=https://manuscript.spacejournal.cn/ycyk, submissionReviewUrl=https://manuscript.spacejournal.cn/ycyk, submissionCeEditorUrl=, submissionAeEditorUrl=, option={"copyright":""}), JournalExt(id=1238873232779047245, language=EN, name=Journal of Telemetry, Tracking and Command, nameHistory1=null, nameHistory2=null, managedBy=, sponsoredBy=, publishedBy=, editorOffice=, officeProv=null, officeCity=null, officeAddr=, officeZip=, editDirector=, officeDirector=null, officePhone=null, coverPicUrl=null, journalRemark=, submitArticleUrl=null, websiteUrl=, createdTime=1773299995846, updatedTime=1773299995846, createdBy=13701087609, updatedBy=13701087609, submissionGuidelinesUrl=, submissionAuthorUrl=https://manuscript.spacejournal.cn/ycyk, submissionEditorUrl=https://manuscript.spacejournal.cn/ycyk, submissionReviewUrl=https://manuscript.spacejournal.cn/ycyk, submissionCeEditorUrl=, submissionAeEditorUrl=, option={"copyright":""})], databaseList=null, tenantJournalId=1238841944844054536, websiteList=[Website(id=1238843830011744952, webName=null, webTitle=null, webDomain=null, webCopyrigh=null, webIpcNo=null, seoTitle=null, seoKeywords=null, seoDescription=null, tenantJournalId=null, journalId=1238841944844054536, journalNameCn=null, journalNameEn=null, grayFlag=null, tenantId=1146029695717560320, platformId=null, journalGroupId=null, journalGroupNameCn=null, journalGroupNameEn=null, type=1, domain=https://castjournals.cast.org.cn/joweb/ycyk/CN, language=CN, createTime=1773292985680, createBy=18614031015, updateTime=1773293577303, updateBy=18614031015, name=遥测遥控-中文, tplId=1146099689490845704, title=遥测遥控, delFlag=0, indexPage=/home, props=[WebsiteProps(id=1238846390617568150, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1238843830011744952, code=articleTextType, value=kx, createTime=1773293596176, updateTime=1773293596176, creator=18614031015, updator=18614031015), WebsiteProps(id=1238846390596596627, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1238843830011744952, code=banner, value=null, createTime=1773293596171, updateTime=1773293596171, creator=18614031015, updator=18614031015), WebsiteProps(id=1238846390638539673, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1238843830011744952, code=grayFlag, value=0, createTime=1773293596181, updateTime=1773293596181, creator=18614031015, updator=18614031015), WebsiteProps(id=1238846390588208018, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1238843830011744952, code=logo, value=https://castjournals.cast.org.cn/joweb/ycyk/CN/file/pic?fileId=g9S2GGWx7iZjf51o9vpmmA==, createTime=1773293596169, updateTime=1773293596169, creator=18614031015, updator=18614031015), WebsiteProps(id=1238846390651122587, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1238843830011744952, code=minRunFlag, value=0, createTime=1773293596184, updateTime=1773293596184, creator=18614031015, updator=18614031015), WebsiteProps(id=1238846390613373845, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1238843830011744952, code=picServerUrl, value=https://castjournals.cast.org.cn/joweb/ycyk/CN/file/pic, createTime=1773293596175, updateTime=1773293596175, creator=18614031015, updator=18614031015), WebsiteProps(id=1238846390646928282, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1238843830011744952, code=silenceFlag, value=0, createTime=1773293596183, updateTime=1773293596183, creator=18614031015, updator=18614031015), WebsiteProps(id=1238846390604985236, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1238843830011744952, code=staticResourcePath, value=https://castjournals.cast.org.cn/joweb/cast_kjdb_cn_619/, createTime=1773293596173, updateTime=1773293596173, creator=18614031015, updator=18614031015), WebsiteProps(id=1238846390625956759, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1238843830011744952, code=themeColor, value=null, createTime=1773293596178, updateTime=1773293596178, creator=18614031015, updator=18614031015), WebsiteProps(id=1238846390634345368, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1238843830011744952, code=themeStyle, value=null, createTime=1773293596180, updateTime=1773293596180, creator=18614031015, updator=18614031015)]), Website(id=1238843830078853819, webName=null, webTitle=null, webDomain=null, webCopyrigh=null, webIpcNo=null, seoTitle=null, seoKeywords=null, seoDescription=null, tenantJournalId=null, journalId=1238841944844054536, journalNameCn=null, journalNameEn=null, grayFlag=null, tenantId=1146029695717560320, platformId=null, journalGroupId=null, journalGroupNameCn=null, journalGroupNameEn=null, type=1, domain=https://castjournals.cast.org.cn/joweb/ycyk/EN, language=EN, createTime=1773292985696, createBy=18614031015, updateTime=1773293566698, updateBy=18614031015, name=遥测遥控-英文, tplId=1146101810881728533, title=Journal of Telemetry, Tracking and Command, delFlag=0, indexPage=/home, props=[WebsiteProps(id=1238846419235304352, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1238843830078853819, code=articleTextType, value=kx, createTime=1773293602999, updateTime=1773293602999, creator=18614031015, updator=18614031015), WebsiteProps(id=1238846419214332829, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1238843830078853819, code=banner, value=null, createTime=1773293602994, updateTime=1773293602994, creator=18614031015, updator=18614031015), WebsiteProps(id=1238846419256275875, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1238843830078853819, code=grayFlag, value=0, createTime=1773293603004, updateTime=1773293603004, creator=18614031015, updator=18614031015), WebsiteProps(id=1238846419205944220, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1238843830078853819, code=logo, value=https://castjournals.cast.org.cn/joweb/ycyk/EN/file/pic?fileId=g9S2GGWx7iZjf51o9vpmmA==, createTime=1773293602992, updateTime=1773293602992, creator=18614031015, updator=18614031015), WebsiteProps(id=1238846419268858789, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1238843830078853819, code=minRunFlag, value=0, createTime=1773293603007, updateTime=1773293603007, creator=18614031015, updator=18614031015), WebsiteProps(id=1238846419231110047, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1238843830078853819, code=picServerUrl, value=https://castjournals.cast.org.cn/joweb/ycyk/EN/file/pic, createTime=1773293602998, updateTime=1773293602998, creator=18614031015, updator=18614031015), WebsiteProps(id=1238846419264664484, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1238843830078853819, code=silenceFlag, value=0, createTime=1773293603006, updateTime=1773293603006, creator=18614031015, updator=18614031015), WebsiteProps(id=1238846419222721438, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1238843830078853819, code=staticResourcePath, value=https://castjournals.cast.org.cn/joweb/cast_kjdb_en_623/, createTime=1773293602996, updateTime=1773293602996, creator=18614031015, updator=18614031015), WebsiteProps(id=1238846419243692961, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1238843830078853819, code=themeColor, value=null, createTime=1773293603001, updateTime=1773293603001, creator=18614031015, updator=18614031015), WebsiteProps(id=1238846419252081570, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1238843830078853819, code=themeStyle, value=null, createTime=1773293603003, updateTime=1773293603003, creator=18614031015, updator=18614031015)])], journalTitle=遥测遥控, weixinUrl=null, journalUrl=https://ycyk.spacejournal.cn/, iacademicId=null, status=1, seqNo=null, journalTitleEn=Journal of Telemetry, Tracking and Command, journalPhotoCn=uUqWBk5pV4VWPULNoAHkuQ==, journalPhotoEn=RDDslHHvDBDNnJMjk6BxdA==, journalFirstLetter=J, journalRecommend=null, journalNew=null, journalCollection=null, jcrJf=null, cjcrJf=null, jcrJfStr=null, cjcrJfStr=null, submissionFirstDecision=null, sciSubjectClassification=null, casSubjectClassification=null, citeScore=null, totalCitationFrequency=null, icpCode=null, psCode=null, advertisingLicenseCode=null, copyrightInformation=null, country=null, option=, provinceCode=null, provinceName=null, collectFlag=false), detailUrlCn=https://castjournals.cast.org.cn/joweb/ycyk/CN/10.12347/j.ycyk.20231114001, detailUrlEn=https://castjournals.cast.org.cn/joweb/ycyk/EN/10.12347/j.ycyk.20231114001, pdfUrlCn=https://castjournals.cast.org.cn/joweb/ycyk/CN/PDF/10.12347/j.ycyk.20231114001, pdfUrlEn=https://castjournals.cast.org.cn/joweb/ycyk/EN/PDF/10.12347/j.ycyk.20231114001, aliStartDate=null, aliEndDate=null, collectionFlag=false, citedCount=null, citedUrl=null, reference=null)
收藏切换
基于表观细粒度辨别网络的近海船舶目标检测方法
收藏切换
PDF下载
闵令通 , 范子满 , 窦飞阳 , 吕勤毅 , 李鑫
遥测遥控 | 人工智能技术 2024,45(2): 1-9
收起
收藏切换
遥测遥控 | 人工智能技术 2024, 45(2): 1-9
基于表观细粒度辨别网络的近海船舶目标检测方法
全屏
闵令通, 范子满, 窦飞阳, 吕勤毅, 李鑫
作者信息
  • 西北工业大学电子信息学院 西安 710072
  • 闵令通 1987年生,副教授,硕士生导师。

    范子满 2000年生,硕士研究生。

    窦飞阳 2001年生,硕士研究生。

    吕勤毅 1990年生,副教授,硕士生导师。

    李鑫 1984年生,副教授,硕士生导师。

Nearshore Ship Object Detection Method Based on Appearance Fine-grained Discrimination Network
Lingtong MIN, Ziman FAN, Feiyang DOU, Qinyi LYU, Xin LI
Affiliations
  • School of Electronic Information, Northwestern Polytechnical University, Xi'an 710072, China
doi: 10.12347/j.ycyk.20231114001
文章导航
收藏切换

近海船舶目标检测是一项非常具有挑战性的任务,受到学者专家广泛关注。基于卷积神经网络(CNN)和注意力机制的检测器在近海船舶目标检测方面的应用取得了显著成就。然而,船舶目标检测存在着表观相似和背景干扰导致检测过程中出现误检的问题。为此,本文提出了一种用于Faster RCNN(更快的基于区域的卷积神经网络)的表观细粒度辨别的检测头模块。该模块包括类别细粒度分支和高效全维动态卷积定位分支。其中类别细粒度分支通过全局特征建模和灵活的感知范围来挖掘和利用类别细粒度辨别特征,高效全维动态卷积定位分支通过高效灵活的感知船舶边界信息来区分目标与背景,从而减少误检漏检问题。通过在近海船舶公开数据集Seaships7000 上进行实验验证,本文算法减少了误检漏检,提升了检测器性能。

船舶目标检测  /  类别细粒度  /  表观判别  /  全维动态卷积  /  自注意力

Offshore ship object detection is a very challenging task and has received widespread attention from scholars and experts. Detectors based on Convolutional Neural Networks (CNN) and attention mechanisms have made significant progress in offshore ship object detection. However, the problem of false detection in the detection process is caused by the apparent similarity and background interference of ship targets. In order to solve this problem, this paper proposes a detection head module for fine-grained appearance discrimination implemented with Faster RCNN. This module includes a category fine-grained branch and an efficient full-dimensional dynamic convolution localization branch. The category fine-grained branch mines and utilizes category fine-grained identification features through global feature modeling and flexible perception range. The efficient omni-dimensional dynamic convolution positioning branch distinguishes objects and backgrounds through the efficient and flexible perception of ship boundary information, thereby reducing false and missed detections. Through experimental verification on the offshore ship public dataset Seaships7000, the proposed algorithm reduces false detections and missed detections and improves detector performance.

Ship object detection  /  Similar feature extraction  /  Apparent discrimination  /  Dynamic convolution  /  Self-attention
闵令通, 范子满, 窦飞阳, 吕勤毅, 李鑫. 基于表观细粒度辨别网络的近海船舶目标检测方法. 遥测遥控, 2024 , 45 (2) : 1 -9 . DOI: 10.12347/j.ycyk.20231114001
Lingtong MIN, Ziman FAN, Feiyang DOU, Qinyi LYU, Xin LI. Nearshore Ship Object Detection Method Based on Appearance Fine-grained Discrimination Network[J]. Journal of Telemetry, Tracking and Command, 2024 , 45 (2) : 1 -9 . DOI: 10.12347/j.ycyk.20231114001
随着我国在海洋强国建设方面的不断推进,智慧航运和智慧海洋工程领域迅速蓬勃发展。在这一背景下,借助海事图像或视频进行高效的近海舰船目标检测,以确保航运和海洋工程的安全,已经成为一项急切的需求[1]。近海船舶目标检测在海域监控[2]、近海风电平台[3]和海上交通控制[4]等多个领域中均有重要应用。
深度学习的不断进步为目标检测领域开辟了崭新的路径。一系列基于卷积神经网络(Convolutional Neural Network,CNN)的检测器[5-7]持续提高了自然图像的检测准确性和推理速度。例如,Liu等人[8-11]提出了单阶段目标检测网络,极大地提升了推理速度。Ren 等人[12]提出了生成高效候选提议的Faster RCNN双阶段目标检测网络,提升了检测器的准确性。针对近海船舶图像,基于Faster RCNN的双阶段目标检测网络为准确检测奠定了基础。Wei等人[13]通过改进Faster RCNN的卷积层提升了舰船目标的检测精度。Guo等人[14]通过设计旋转不变的候选框提议网络提升了舰船目标的检测性能。Dong等人[15]基于Faster RCNN设计了视觉注意力机制,提升了舰船目标的准确率。Dong等人[16]通过改进候选提议生成网络提升了船舶目标检测性能。这些工作有效地推动了舰船目标检测的发展,但是忽视了舰船目标存在表观相似和背景干扰的问题。如图1所示,普通货船与渔船之间存在外观一致,尺寸相似等特点。Faster RCNN检测结果显示图中的渔船被检测为普通货船,岸边也出现了将背景误检为普通货船的情况。
注意力机制是解决表观相似和背景干扰的近海船舶目标检测的有效方法,可以提升船舶目标的检测准确率。注意力机制中尤其是Transformer[17-19]和动态卷积[20-22]在全局信息建模方面和灵活感知范围方面展现出卓越的性能,专家学者做出了大量研究工作。例如,Carion等人[23]基于Transformer提出了端到端的目标检测器,有效推动了Transformer在目标检测的应用。Cheng等人[24]基于Transformer设计了检测头部,全局特征的注意力机制能够捕获类别信息显著特征,提升检测器的准确性。Wei等人[25]基于动态卷积设计了检测头部,灵活感知的注意力机制能够提升复杂检测环境下的检测器准确性。与上述使用注意力机制的方法不同,本文聚焦于全局特征和灵活感知注意力机制的联合设计。
近海船舶图像存在表观相似和背景干扰导致的误检漏检问题。为了解决上述问题,本文采用联合全局特征和灵活感知注意力机制的方法,重新设计了Faster RCNN的头部网络。该网络包括两部分:类别细粒度分支(Class Fine-Grained Branch,CFGB)和高效全维动态卷积定位分支(Location Branch with Dynamic Convolution,LBDC),能够增强对检测头输入特征对细粒度辨别能力和定位能力。其中,CFGB模块旨在学习到表观相似目标的细粒度差异,增强类别特征的判别能力,降低表观相似导致的误检。LBDC模块旨在灵活捕获目标边界特征信息以区分目标与背景,增强回归框的定位能力,降低背景干扰造成误检的可能。最后,本文采用较为常用的ResNet50[26]和FPN[27]的Faster RCNN构建了细粒度辨别网络(Fine-Grained Discrimination Network,FDNet),并在公开数据集Seaships7000[28]上验证了FDNet网络的有效性。
FDNet网络框架如图2所示,包括骨干网络、特征金字塔网络(Feature Pyramid Network,FPN)、区域建议网络(Region Proposal Network,RPN)、感兴趣区域(Region of Interest,RoI)Align[29](对齐)模块与检测头网络。工作流程如下:首先,输入图像依次经过骨干网络、FPN、RPN和RoI相关的操作,包括提取RoIs和RoI Align后获得的特征向量FєℝC×H×W;其次,检测器头部的输入特征FєℝC×H×W分别送入到高效全维动态卷积定位分支LBDC模块和类别细粒度分支CFGB模块,得到模块输出目标定位框特征向量YO和目标类别特征向量YC。最后YCYO分别经过全连接层(Fully Connected Layer,FCL)得到船舶目标的类别信息和定位框信息。训练损失计算与Faster R-CNN保持一致,其中回归损失采用Smooth L1,类别损失采用交叉熵损失(Cross Entropy Loss,CEL)。接下来分别对LBDC模块和CFGB模块进行介绍。
面对背景干扰下船舶目标的定位任务,近海船舶目标跟与沿岸环境融为一体,使得目标边界难以清晰地被检测器捕捉,如图1所示,导致误检现象的发生。受到CSPDarknet53[30](一种深度卷积网络)的启发,采用了多层残差结构和全维动态卷积,联合了全局特征信息和灵活感知注意力机制,构建了对船舶目标边界信息高效提取的动态卷积定位分支。其中,全维动态卷积能够对目标边界进行灵活建模,多层残差结构有助于特征的重复利用,从而对边界信息多次建模,有助于目标精准定位。
首先,LBDC模块对输入特征F采用1×1卷积进行处理;然后,对处理之后的特征沿通道维度划分,即T1єℝC'×H×WT2єℝC'×H×W。为了灵活感知目标边界信息,采用了基于全维动态卷积的Bottlenck(瓶颈)模块(同时为了减少参数规模,本文只将T2作为Bottlenck模块的输入,得到该模块的输出特征B,而T1保持不变);最后,为了增加对特征信息的多重利用和保持特征维度不变,将B,T1,T2在通道维度进行拼接之后,经过1×1卷积处理得到输出YO,可表示为:
YO=fc1(Cat(B,T1,T2))
其中,fc1(·), Cat(·)分别表示1×1卷积运算和通道维度拼接。
在Bottlenck模块中,如图2所示,采用了两个3×3的全维动态卷积和残差结构。全维动态卷积可以灵活地感知船舶目标边界信息,根据不同目标自适应的学习细微差异信息。如图3(e)所示,全维动态卷积会将作用于输入的卷积核进行四个维度加权求和,可用公式(2)表示:
y=(αw1·αf1·αc1·αs1·W1+···+αwn·αfn·αcn·αsn·Wn)*x
其中,xy分别为全维动态卷积的输入输出,Wi为卷积核权重。四个维度权重对于Wi的作用如图3中(a)、(b)、(c)、(d)所示,αfi表示卷积核的注意力标量,αciαsiαwi分别表示沿着卷积核的空间维度、输入通道维度和输出通道维度的三种注意力机制。αfiαciαsiαwi的计算流程如下:首先全维动态卷积的输入向量x统一经过全局平均池化(Global Ave-rage Pooling,GAP)、全连接层(FCL)和线性激活函数(Rectified Linear Unit,ReLU)得到中间向量,随后该中间向量分别通过四个FC和Sigmoid函数计算得到。
LBDC模块可以提升目标边界信息的判别能力,减少背景干扰导致的误检。另外,Bottleneck中的残差结构的设计有助于深层网络更好地收敛。最后,与固定卷积核不同,全维动态卷积的引入,让网络可以灵活地捕获船舶目标边界信息的细微差异。
面对船舶目标表观相似模块的分类任务,只关注目标的显著区域特征[31]是无法全面捕捉到目标的特定细微差异,导致表观相似的船舶目标出现漏检误检。因此,本文设计了类别细粒度分支(CFGB)模块,如图3所示,采用Transformer[19]、全维动态卷积[22]和残差结构,旨在同时考虑显著特征和目标细微差异,联合全局建模注意力机制和灵活感知注意力机制。具体而言,本文选择使用Transformer作为模块的一部分,以便能够有效地提取图像中的显著特征。同时,为了更细致地关注船舶目标的特定细微差异,本文还引入了全维动态卷积。全维动态卷积在处理目标之间的微小差异时具有优势,通过动态调整卷积核的权重,能够灵活地捕捉目标的局部特征,尤其是在目标表观相似性较高的情况下。首先,对输入特征FєℝC×H×W的空间维度展平得到向量F′єℝC×HW;其次,向量F′经过Transformer,从全局信息建模来捕获显著特征信息的长远依赖关系得到输出向量YTєℝC×HW;最后,为了灵活地感知以提取类别细微特征差异,对向量YT进行空间维度还原操作后再经过公式(2)的全维动态卷积,并与输入特征F经过残差结构,得到类别细粒度分支的输出YC
YC=F+f(YT)
其中,f(·)表示公式(1)中全维动态卷积操作,省略了空间维度还原操作。
在Transformer结构中,经过空间维度展平的输入特征F′通过第一个残差模块,即层归一化操作(LayerNorm)、自注意力操作(Self-Attention)、DropPath得到中间特征єℝC×H×W。其中,Self-Attention操作会将输入沿着通道维度分为N部分,每个部分会被复制为三份,如图4所示,即查询Q、键K、值VQ,K,V通过简单的线性投影,然后经过缩放的点积运算得到特征矩阵,可用公式(4)表示为:
其中,是由第i个部分导出的特征矩阵,d为缩放参数,一般取为K的维度。为了方便表示,公式中省略了LayerNorm和DropPath。和输入特征F′构成了Transformer第一个残差模块的输出向量P。第二个残差模块由LayerNorm、多层感知机(Multilayer Perceptron,MLP)和DropPath构成,MLP包含两个线性层和一个非线性激活函数,可以用如下公式(5)表示:
其中,表示第i个元素经由MLP导出的输出值,w1,b1表示第一个线性变换层的权重矩阵和偏置向量,w2,b2表示第二个线性变换层的权重矩阵和偏置向量,fl(·)表示非线性激活函数。MLP模块的输出向量和第一个残差模块的输出向量P构成了第二个残差模块的输出向量YT
CFGB模块基于Transformer的结构,能够对整个图像进行全局上下文建模,而不仅仅是在局部感受野内操作,这有助于模型更好地理解不同船舶目标的显著信息与其周围背景环境的关系。另外,采用了全维动态卷积,能够灵活地增加感知范围,捕捉船舶相似外观特征之间微弱的差异性,有助于模型增强对表观相似类别之间的判别能力。
为了验证FDNet算法的有效性,本文在公开数据集Seaships7000上进行训练和测试,如图5所示。首先本文对比了YOLOv3[8]、FCOS[9]、YOLOv7[10]、SSD[11]、Libra-RCNN[32]和Faster-RCNN[12]七个模型与本文模型FDNet,从而验证了FDNet的有效性。之后分别对CFGB和LBDC模块进行了消融试验,验证了本文的类别细粒度分支对类别细粒度特征提取的有效性和高效全维动态卷积对边界信息的挖掘优势。
近海船舶检测公开数据集Seaship7000总共包含7 000张图像6个类别(矿砂船、散货船、杂货船、集装箱船、渔船和客船),分辨率为1 920×1 080。本文使用Python3.9、Pytorch2.0、Cuda11.8构建实验环境,采用一块显存为24 GB的GeForce RTX 4090 GPU,所有实验在目标检测开源工具mmdetection[33]上运行。评价标准采用COCO2014[34],分别为mAP、AP50、AP75、APS、APM和APL。
表1展示了本文模型FDNet与其他模型在Seaship7000的对比结果。FDNet相较于基准模型Faster-RCNN来说,在mAP上提升了3%,在AP50上提升了0.6%,在AP75上提升了5.1%,说明整体表现优秀。值得注意的是,针对小目标的指标APS也提升了3.8%,说明FDNet模型对小目标的检测也有良好的效果。对比实验结果,说明了FDNet模型有助于特征的提取与挖掘,对表观相似的细粒度判别有效。
本文针对算法效率问题分析了不同模型推理速度FPS,参数量Parameters(M)和FLOPs(G)的比较。其中,FPS是使用空间分辨率大小为1 024×1 024的图像,在GeForce RTX 4090 GPU的实验平台上测试。具体对比结果见表1,可以看出本文的模型相较于基准模型Faster-RCNN,推理速度FPS下降了3.3 img/s、Parameters和FLOPs分别增加了23.5 M和286.8 G,这说明引入类别细粒度分支和动态卷积定位分支牺牲了部分推理速度。
本文给出了Faster-RCNN和FDNet检测结果的对比图,如图6所示。Faster-RCNN在面对较小目标和岸边出现干扰时容易出现误检,在面对表观重合时也容易发生漏检。与之不同的是,在面对上述情况时FDNet模型在一定程度上缓解了误检漏检的发生。
为了验证本文模型的泛化能力,本文从互联网上选取了一些真实图片进行实验,置信度阈值为0.6,检测结果如图7所示。在真实图像上,SSD在多目标的复杂背景下会出现漏检(如图7中第4张图片)。Libra-RCNN[32]、Faster-RCNN[12]在面对船舶表观不清晰时会出现不同程度的漏检和误检(如图7中第1张、第3张和第5张图)。本文的模型相较于其他三个模型降低了误检漏检的发生概率,提升了检测的准确率。在一定程度上解决了表观相似和背景干扰导致的误检漏检问题。
为了验证CFGB和LBDC模块的有效性,本文在基准模型上单独加入CFGB模块和LBDC模块,并选择ResNet 50作为骨干网络。表2展示了在Seaship7000数据集上的实验结果。
基准模型在单独引入LBDC模块后,mAP、AP50和AP75分别增长了3.2%、1.0%和4.0%。这表明LBDC在提取船舶目标的边界信息上效果显著,有助于船舶目标定位。LBDC模块在APS、APM和APL上分别增长了1.1%、4.4%和3.0%,表明在面对尺度不同时定位依然准确有效。
基准模型单独引入CFGB后,mAP、AP50和AP75分别增长了2.9%、0.5%和4.4%,这表明CFGB模块在提取船舶目标的显著性类别信息上效果优秀,有助于船舶表观细粒度的提取和挖掘。在APS、APM和APL上分别增长了2.8%、4.6%和2.7%,证明了CFGB模块在不同尺度上都可以对显著性特征和表观进行高效的提取和判别。
为了直观地看出CFGB和LBDC模块对各个类别船舶的检测效果,本文在表3中展示了散货船、集装箱船、渔船、杂货船、矿砂船和客船六个类别的mAP值。可以看到,对渔船和杂货船的mAP分别增长了5.2%和1.2%,说明CFGB和LBDC模块在一定程度上缓解了这两个类别中表观重合导致的漏检误检问题。
针对近海船舶目标中表观相似导致的误检问题,本文提出了一种FDNet网络。该网络包括类别细粒度分支和高效全维动态卷积定位分支。类别细粒度分支对表观细粒度差异进行精细学习,挖掘和利用船舶目标的类别信息。高效全维动态卷积定位分支灵活的感知目标的边界信息,增强定位框的准确性。最后,本文采用ResNet50和FPN的Faster R-CNN构建了表观相似细粒度辨别网络,并在公开数据集Seaships7000验证了FDNet网络的有效性。值得注意的是,近海船舶目标还存在着容易受湖面和天气影响的问题,将在未来的研究中对此问题进行深入的探讨。另一方面,本文的模型不能对光学视频进行实时处理,这是因为本文的模型基于经典两阶段检测网络Faster RCNN,采用候选框的形式牺牲了推理速度,这是目前二阶段检测算法的普遍问题。未来会致力于提高二阶段检测方法的推理速度,使其满足实时性的要求。
  • 国家自然科学基金项目(62206221)
  • 陕西省自然科学基础研究计划资助项目(2021JM-074)
参考文献 引证文献
排序方式:
[1]
叶晨, 逯天洋, 肖潏灏, 等. 海事监控视频舰船目标检测研究现状与展望[J]. 中国图象图形学报, 2022, 27(7):2078-2093.
YE C, LU T Y, XIAO Y H, et al. Maritime surveillance videos based ships detection algorithms: a survey[J]. Journal of Image and Graphics, 2022, 27(7): 2078-2093.
[2]
文莉莉, 孙苗, 邬满. 基于Faster R-CNN的海域监管预警方法[J]. 吉林大学学报 (信息科学版), 2021, 39(4):421-429.
WEN L, SUN M, WU M. Early warning method of sea area supervision based on faster R-CNN[J]. Journal of Jilin University (Information Science Edition), 2021, 39(4):421-429.
[3]
王小虎. 海上船舶融合识别算法研究与改进[J]. 南方能源建设, 2023, 10(4): 131-137.
WANG X. Research and improvement of offshore ship fusion recognition algorithm[J]. Southern Energy Constrution, 2023, 10(4): 131-137.
[4]
周薇娜, 刘露. 复杂场景下多尺度船舶实时检测方法[J].电信科学, 2022, 38(10): 67-78.
ZHOU W N, LIU L. A real-time detection method for multi-scale ships in complex scenes[J]. Telecommunications Science, 2022, 38(10): 67-78.
[5]
MIN L, FAN Z, LV Q, et al. YOLO-DCTI: Small object detection in remote sensing base on contextual transformer enhancement[J]. Remote Sensing, 2023, 15(16):3970.
[6]
XU Y, FU M, WANG Q, et al. Gliding vertex on the horizontal bounding box for multi-oriented object detection[J]. IEEE Transactions on Pattern Analysis and Machine Intel-ligence, 2020, 43(4): 1452-1459.
[7]
谢星星, 程塨, 姚艳清, 等. 动态特征融合的遥感图像目标检测[J]. 计算机学报, 2022, 45(4): 735-747.
[8]
REDMON J, FARHADI A. Yolov3: An incremental improvement[J]. arXiv preprint arXiv: 2018:
[9]
TIAN Z, SHEN C, CHEN H, et al. Fcos: Fully convolutional one-stage object detection[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision, 2019: 9627-9636.
[10]
WANG C Y, BOCHKOVSKIY A, LIAO H Y M. YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023: 7464-7475.
[11]
LIU W, ANGUELOV D, ERHAN D, et al. Ssd: Single shot multibox detector[C]// Computer Vision-ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, October 11-14, 2016, Proceedings, Part I 14. Springer International Publishing, 2016: 21-37.
[12]
REN S, HE K, GIRSHICK R, et al. Faster R-CNN: Towards real-time object detection with region proposal networks[J]. IEEE Transactions on Pattern Analysis &Machine Intelligence, 2017, 39(6): 1137-1149.
[13]
WEI S, CHEN H, ZHU X, et al. Ship detection in remote sensing image based on faster r-cnn with dilated convolution[C]//2020 39th Chinese Control Conference (CCC). IEEE, 2020: 7148-7153.
[14]
GUO H, YANG X, WANG N, et al. A rotational libra R-CNN method for ship detection[J]. IEEE Transactions on Geoscience and Remote Sensing, 2020, 58(8): 5772-5781.
[15]
DONG Y, CHEN F, HAN S, et al. Ship object detection of remote sensing image based on visual attention[J]. Remote Sensing, 2021, 13(16): 3192.
[16]
DONG C, LIU J, XU F, et al. Ship detection from optical remote sensing images using multi-scale analysis and Fourier HOG descriptor[J]. Remote Sensing, 2019, 11(13): 1529.
[17]
DOSOVITSKIY A, BEYER L, KOLESNIKOV A, et al. An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale[C]//International Conference on Learning Representations, 2020.
[18]
LIU Z, LIN Y, CAO Y, et al. Swin transformer: Hierarchical vision transformer using shifted windows[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021: 10012-10022.
[19]
VASWANI A, SHAZEER N, PARMAR N, et al. Attention is all you need[C]//Proceedings of the 31st International Conference on Neural Information Processing Systems, 2017: 6000-6010.
[20]
YANG B, BENDER G, LE Q V, et al. CondConv: conditionally parameterized convolutions for efficient inference[C]//Proceedings of the 33rd International Conference on Neural Information Processing Systems, 2019:1307-1318.
[21]
CHEN Y, DAI X, LIU M, et al. Dynamic convolution:Attention over convolution kernels[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020: 11030-11039.
[22]
LI C, ZHOU A, YAO A. Omni-dimensional dynamic convolution[C]//International Conference on Learning Representations. 2021.
[23]
CARION N, MASSA F, SYNNAEVE G, et al. End-to-end object detection with transformers[C]//European Conference on Computer Vision. Springer International Publishing, 2020: 213-229.
[24]
CHENG G, LI Q, WANG G, et al. SFRNet: Fine-grained oriented object recognition via separate feature refinement[J]. IEEE Transactions on Geoscience and Remote Sensing, 2023, 61: 1-10.
[25]
WEI Q, HU X, HOU Q. Dynamic-YOLOv5: An imp-roved aerial small object detector based on YOLOv5[C]//2023 3rd International Conference on Neural Networks, Information and Communication Engineering (NNICE), IEEE, 2023: 679-683.
[26]
HE K, ZHANG X, REN S, et al. Deep residual learning for image recognition[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2016: 770-778.
[27]
LIN T Y, DOLLÁR P, GIRSHICK R, et al. Feature pyramid networks for object detection[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern recognition, 2017: 2117-2125.
[28]
SHAO Z, WU W, WANG Z, et al. Seaships: A large-scale precisely annotated dataset for ship detection[J]. IEEE Transactions on Multimedia, 2018, 20(10): 2593-2604.
[29]
HE K, GKIOXARI G, DOLLÁR P, et al. Mask r-cnn[C]//Proceedings of the IEEE International Conference on Computer Vision. 2017: 2961-2969.
[30]
BOCHKOVSKIY A, WANG C Y, LIAO H Y M. Yolov4: Optimal speed and accuracy of object detection[J].arXiv preprint arXiv: 2020:
[31]
SONG G, LIU Y, WANG X. Revisiting the sibling head in object detector[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020: 11563-11572.
[32]
PANG J, CHEN K, SHI J, et al. Libra r-cnn: Towards balanced learning for object detection[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019: 821-830.
[33]
CHEN K, WANG J, PANG J, et al. MMDetection: Open mmlab detection toolbox and benchmark[J]. arXiv preprint arXiv: 2019:
[34]
LIN T Y, MAIRE M, BELONGIE S, et al. Microsoft coco: Common objects in context[C]//Computer Vision-ECCV 2014: 13th European Conference, Zurich, Switzerland, September 6-12, 2014, Proceedings, Part V 13. Springer International Publishing, 2014: 740-755.
2024年第45卷第2期
PDF下载
86
36
引用本文
BibTeX
文章信息
doi: 10.12347/j.ycyk.20231114001
  • 接收时间:2023-11-14
  • 首发时间:2026-03-18
补充材料
相关文章
文章信息
作者
出版历史
  • 收稿日期:2023-11-14
  • 修回日期:2023-12-29
基金
国家自然科学基金项目(62206221)
陕西省自然科学基础研究计划资助项目(2021JM-074)
作者信息
    西北工业大学电子信息学院 西安 710072
参考文献
分享链接
https://castjournals.cast.org.cn/joweb/ycyk/CN/10.12347/j.ycyk.20231114001
分享至
全文二维码

扫描看全文

引用本文
BibTeX
本文的引用情况
2种不同金属材料的力学参数

Family
属数
Number of
genus
种数
Number of
species
占总种数比例
Percentage of
total species (%)

Genus
种数
Number of
species
占总种数比例
Percentage of total
species (%)
鹅膏菌科Amanitaceae 2 11 5.26 鹅膏菌属 Amanita 10 4.78
小菇科 Mycenaceae 2 12 5.74 丝盖伞属 Inocybe 5 2.39
多孔菌科 Polyporaceae 8 14 6.70 蜡蘑属 Laccaria 5 2.39
红菇科 Russulaceae 3 23 11.00 小皮伞属 Marasmius 6 2.87
小菇属 Mycena 11 5.26
光柄菇属 Pluteus 5 2.39
红菇属 Russula 17 8.13
栓菌属 Trametes 5 2.39
关闭全屏