Article(id=1204385486027596021, tenantId=1146029695717560320, journalId=1189621681917173762, issueId=1204385484878356718, articleNumber=null, orderNo=null, doi=10.19620/j.cnki.1000-3703.20220668, pmid=null, cstr=null, oa=null, hot=null, price=null, onlineType=0, articleFormat=0, articleType=null, articleTypeStr=research-article, receivedDate=null, receivedDateStr=null, revisedDate=1658160000000, revisedDateStr=2022-07-19, acceptedDate=null, acceptedDateStr=null, onlineDate=1765077476269, onlineDateStr=2025-12-07, pubDate=1679587200000, pubDateStr=2023-03-24, doiRegisterDate=null, doiRegisterDateStr=null, onlineIssueDate=1765077476269, onlineIssueDateStr=2025-12-07, onlineJustAcceptDate=null, onlineJustAcceptDateStr=null, onlineFirstDate=null, onlineFirstDateStr=null, sourceXml=null, magXml=null, createTime=1765077476269, creator=13701087609, updateTime=1765077476269, updator=13701087609, issue=Issue{id=1204385484878356718, tenantId=1146029695717560320, journalId=1189621681917173762, year='2023', volume='', issue='3', pageStart='1', pageEnd='62', issueExtLink='null', onlineDate='null', pubDate='null', beforeIssueId=null, nextIssueId=null, price=null, status=1, issueComplete=1, articleOrder=1, issueType=-1, specialIssue=null, createTime=1765077475995, creator=13701087609, updateTime=1765079076236, updator=13701087609, preIssue=null, nextIssue=null, ext={EN=IssueExt(id=1204392196872053307, tenantId=1146029695717560320, journalId=1189621681917173762, issueId=1204385484878356718, language=EN, specialIssueTitle=, coverIllustrator=null, specialIssueEditor=, specialIssueAbout=), CN=IssueExt(id=1204392196872053308, tenantId=1146029695717560320, journalId=1189621681917173762, issueId=1204385484878356718, language=CN, specialIssueTitle=, coverIllustrator=null, specialIssueEditor=, specialIssueAbout=)}, issueFiles=null}, startPage=19, endPage=25, ext={EN=ArticleExt(id=1204385486195368184, articleId=1204385486027596021, tenantId=1146029695717560320, journalId=1189621681917173762, language=EN, title=Visual Place Recognition with One Level Feature Fusing Coordinate Attention, columnId=null, journalTitle=Automobile Technology, columnName=null, runingTitle=null, highlight=null, articleAbstract=

To solve the problems of matching omission and poor real-time performance of existing visual place recognition methods in scenes with changing viewpoints and environments, this paper proposed visual place recognition method based on one level feature fused with coordinate attention. Firstly, the relative place information of features was captured by coordinate attention. Secondly, an encoder for multi-scale feature fusion was constructed using dilated convolution and NetVLAD. Finally, the network was trained based on triplet loss. Validated by Pitts30k and Nordland datasets, the proposed method achieves the same recall accuracy and 19% faster retrieval speed compared with the state-of-the-art method Patch-NetVLAD of the same baseline in the test of position recognition. In the test of loop detection, the proposed method achieves a reasonable balance between robustness and retrieval speed.

, correspAuthors=null, authorNote=null, correspAuthorsNote=null, copyrightStatement=null, copyrightOwner=null, extLink=null, articleAbsUrl=null, sourceXml=null, magXml=null, pdfUrl=null, pdf=null, pdfFileSize=null, pdfExtLink=null, richHtmlUrl=null, mobilePdfUrl=null, reviewReport=null, pdfFirstPage=null, abstractGraph=null, abstractGraphContent=null, abstractVideo=null, citation=null, cebUrl=null, magXmlContent=null, mapNumber=null, authorCompany=null, fund=null, authors=null, authorsList=Zijian Liu, Jun Zhang, Yuansheng Liu, Ming Lu, Qingpeng Song), CN=ArticleExt(id=1204385488925860136, articleId=1204385486027596021, tenantId=1146029695717560320, journalId=1189621681917173762, language=CN, title=单级特征图融合坐标注意力的视觉位置识别方法*, columnId=0, journalTitle=汽车技术, columnName=, runingTitle=null, highlight=null, articleAbstract=

针对视角和环境变化的场景中现有视觉位置识别方法存在的匹配遗漏和实时性差的问题,提出基于单级特征图融合坐标注意力的视觉位置识别方法。首先通过坐标注意力捕获特征的相对位置信息,然后利用扩张卷积和局部聚合向量网络(NetVLAD)构造多尺度特征融合的编码器,最后基于三元组损失训练网络。经Pitts30k和Nordland数据集验证,在位置识别试验中,与同基线的先进方法Patch-NetVLAD相比,所提出的方法能够获得同等的召回精度且检索速度提高19%。在回环检测试验中,所提出的方法达到了合理平衡鲁棒性和检索速度的目标。

, correspAuthors=null, authorNote=null, correspAuthorsNote=
张军(1971—),女,山东临沂人,副教授,博士,研究方向为自动驾驶感知与决策技术,
, copyrightStatement=null, copyrightOwner=null, extLink=null, articleAbsUrl=null, sourceXml=eQ7q1Jz3M50s7Zf8UbmQhg==, magXml=Ko36kNbP9XKZK4NJIbb76Q==, pdfUrl=null, pdf=jDuS70iQfk9w/vZRvZTgwA==, pdfFileSize=1470024, pdfExtLink=null, richHtmlUrl=null, mobilePdfUrl=null, reviewReport=null, pdfFirstPage=null, abstractGraph=2Xel/oGC/1YrVaNBXVtRtw==, abstractGraphContent=null, abstractVideo=null, citation=null, cebUrl=null, magXmlContent=LL+g75YE1YA6LoUh3TPLwQ==, mapNumber=null, authorCompany=null, fund=null, authors=null, authorsList=刘子健, 张军, 刘元盛, 路铭, 宋庆鹏)}, authors=[Author(id=1204452616068640847, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, orderNo=0, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=null, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1204452616412573808, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, authorId=1204452616068640847, language=EN, stringName=Zijian Liu, firstName=Zijian, middleName=null, lastName=Liu, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=Beijing Key Laboratory of Information Service Engineering, Beijing Union University, Beijing 100101, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1204452616576151673, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, authorId=1204452616068640847, language=CN, stringName=刘子健, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=北京联合大学,北京市信息服务工程重点实验室,北京 100101, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1204452615930228799, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, xref=null, ext=[AuthorCompanyExt(id=1204452615938617408, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, companyId=1204452615930228799, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=Beijing Key Laboratory of Information Service Engineering, Beijing Union University, Beijing 100101), AuthorCompanyExt(id=1204452615947006016, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, companyId=1204452615930228799, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=北京联合大学,北京市信息服务工程重点实验室,北京 100101)])]), Author(id=1204452616731340939, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, orderNo=1, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=xxtzhangjun@buu.edu.cn, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1204452616928473243, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, authorId=1204452616731340939, language=EN, stringName=Jun Zhang, firstName=Jun, middleName=null, lastName=Zhang, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=Beijing Key Laboratory of Information Service Engineering, Beijing Union University, Beijing 100101, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1204452617062690985, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, authorId=1204452616731340939, language=CN, stringName=张军, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=北京联合大学,北京市信息服务工程重点实验室,北京 100101, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1204452615930228799, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, xref=null, ext=[AuthorCompanyExt(id=1204452615938617408, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, companyId=1204452615930228799, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=Beijing Key Laboratory of Information Service Engineering, Beijing Union University, Beijing 100101), AuthorCompanyExt(id=1204452615947006016, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, companyId=1204452615930228799, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=北京联合大学,北京市信息服务工程重点实验室,北京 100101)])]), Author(id=1204452617222074556, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, orderNo=2, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=null, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1204452617415012556, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, authorId=1204452617222074556, language=EN, stringName=Yuansheng Liu, firstName=Yuansheng, middleName=null, lastName=Liu, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=Beijing Key Laboratory of Information Service Engineering, Beijing Union University, Beijing 100101, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1204452617624727772, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, authorId=1204452617222074556, language=CN, stringName=刘元盛, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=北京联合大学,北京市信息服务工程重点实验室,北京 100101, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1204452615930228799, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, xref=null, ext=[AuthorCompanyExt(id=1204452615938617408, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, companyId=1204452615930228799, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=Beijing Key Laboratory of Information Service Engineering, Beijing Union University, Beijing 100101), AuthorCompanyExt(id=1204452615947006016, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, companyId=1204452615930228799, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=北京联合大学,北京市信息服务工程重点实验室,北京 100101)])]), Author(id=1204452617796694259, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, orderNo=3, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=null, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1204452617901551875, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, authorId=1204452617796694259, language=EN, stringName=Ming Lu, firstName=Ming, middleName=null, lastName=Lu, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=Beijing Key Laboratory of Information Service Engineering, Beijing Union University, Beijing 100101, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1204452618027381009, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, authorId=1204452617796694259, language=CN, stringName=路铭, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=北京联合大学,北京市信息服务工程重点实验室,北京 100101, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1204452615930228799, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, xref=null, ext=[AuthorCompanyExt(id=1204452615938617408, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, companyId=1204452615930228799, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=Beijing Key Laboratory of Information Service Engineering, Beijing Union University, Beijing 100101), AuthorCompanyExt(id=1204452615947006016, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, companyId=1204452615930228799, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=北京联合大学,北京市信息服务工程重点实验室,北京 100101)])]), Author(id=1204452618153210140, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, orderNo=4, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=null, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1204452618279039276, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, authorId=1204452618153210140, language=EN, stringName=Qingpeng Song, firstName=Qingpeng, middleName=null, lastName=Song, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=Beijing Key Laboratory of Information Service Engineering, Beijing Union University, Beijing 100101, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1204452618413257012, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, authorId=1204452618153210140, language=CN, stringName=宋庆鹏, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=北京联合大学,北京市信息服务工程重点实验室,北京 100101, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1204452615930228799, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, xref=null, ext=[AuthorCompanyExt(id=1204452615938617408, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, companyId=1204452615930228799, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=Beijing Key Laboratory of Information Service Engineering, Beijing Union University, Beijing 100101), AuthorCompanyExt(id=1204452615947006016, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, companyId=1204452615930228799, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=北京联合大学,北京市信息服务工程重点实验室,北京 100101)])])], keywords=[Keyword(id=1204452618752995659, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, language=EN, orderNo=1, keyword=Autopilot), Keyword(id=1204452618891407702, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, language=EN, orderNo=2, keyword=Visual place recognition), Keyword(id=1204452619059179874, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, language=EN, orderNo=3, keyword=Loop detection), Keyword(id=1204452619260506486, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, language=EN, orderNo=4, keyword=Coordinate attention), Keyword(id=1204452619411501443, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, language=EN, orderNo=5, keyword=NetVLAD), Keyword(id=1204452619587662233, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, language=EN, orderNo=6, keyword=Triplet loss), Keyword(id=1204452619721879975, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, language=CN, orderNo=1, keyword=自动驾驶), Keyword(id=1204452619839320506, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, language=CN, orderNo=2, keyword=视觉位置识别), Keyword(id=1204452619960955332, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, language=CN, orderNo=3, keyword=回环检测), Keyword(id=1204452620099367380, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, language=CN, orderNo=4, keyword=坐标注意力), Keyword(id=1204452620216807902, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, language=CN, orderNo=5, keyword=局部聚合向量网络), Keyword(id=1204452620317471212, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, language=CN, orderNo=6, keyword=三元组损失)], refs=[Reference(id=1204452624654382021, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, doi=null, pmid=null, pmcid=null, year=2019, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[1], rfOrder=0, authorNames=高翔, 张涛, 刘毅, journalName=视觉SLAM十四讲: 从理论到实践, refType=null, unstructuredReference=高翔, 张涛, 刘毅, 等. 视觉SLAM十四讲: 从理论到实践[M]. 2版. 北京: 电子工业出版社, 2019., articleTitle=null, refAbstract=null), Reference(id=1204452624843125713, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, doi=null, pmid=null, pmcid=null, year=2019, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[1], rfOrder=1, authorNames=GAO X, ZHANG T, LIU Y, journalName=14 Lectures on Visual SLAM: From Theory to Practice, refType=null, unstructuredReference=GAO X, ZHANG T, LIU Y, et al. 14 Lectures on Visual SLAM: From Theory to Practice[M]. Beijing: Publishing House of Electronics Industry, 2019., articleTitle=null, refAbstract=null), Reference(id=1204452624989926361, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, doi=null, pmid=null, pmcid=null, year=2016, volume=32, issue=1, pageStart=1, pageEnd=19, url=null, language=null, rfNumber=[2], rfOrder=2, authorNames=LOWRY S, SÜNDERHAUF N, NEWMAN P, journalName=IEEE Transactions on Robotics, refType=null, unstructuredReference=LOWRY S, SÜNDERHAUF N, NEWMAN P, et al. Visual Place Recognition: A Survey[J]. IEEE Transactions on Robotics, 2016, 32(1): 1-19., articleTitle=Visual Place Recognition: A Survey, refAbstract=null), Reference(id=1204452625128338409, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, doi=null, pmid=null, pmcid=null, year=2019, volume=41, issue=1, pageStart=112, pageEnd=123, url=null, language=null, rfNumber=[3], rfOrder=3, authorNames=刘强, 段富海, 桑勇, journalName=机器人, refType=null, unstructuredReference=刘强, 段富海, 桑勇, 等. 复杂环境下视觉SLAM闭环检测方法综述[J]. 机器人, 2019, 41(1): 112-123+136., articleTitle=复杂环境下视觉SLAM闭环检测方法综述, refAbstract=null), Reference(id=1204452625266750453, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, doi=null, pmid=null, pmcid=null, year=2019, volume=41, issue=1, pageStart=112, pageEnd=123, url=null, language=null, rfNumber=[3], rfOrder=4, authorNames=LIU Q, DUAN F H, SANG Y, journalName=Robot, refType=null, unstructuredReference=LIU Q, DUAN F H, SANG Y, et al. A Survey of Loop-Closure Detection Method of Visual SLAM in Complex Environments[J]. Robot, 2019, 41(1): 112-123+136., articleTitle=A Survey of Loop-Closure Detection Method of Visual SLAM in Complex Environments, refAbstract=null), Reference(id=1204452625426133001, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, doi=null, pmid=null, pmcid=null, year=2021, volume=null, issue=9, pageStart=55, pageEnd=62, url=null, language=null, rfNumber=[4], rfOrder=5, authorNames=李兆凯, 李龙勇, 李泽晖, journalName=汽车技术, refType=null, unstructuredReference=李兆凯, 李龙勇, 李泽晖, 等. 基于RGB-D SLAM的智能车自主避障与路径规划试验研究[J]. 汽车技术, 2021(9): 55-62., articleTitle=基于RGB-D SLAM的智能车自主避障与路径规划试验研究, refAbstract=null), Reference(id=1204452625539379218, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, doi=null, pmid=null, pmcid=null, year=2021, volume=null, issue=9, pageStart=55, pageEnd=62, url=null, language=null, rfNumber=[4], rfOrder=6, authorNames=LI Z K, LI L Y, LI Z H, journalName=Automobile Technology, refType=null, unstructuredReference=LI Z K, LI L Y, LI Z H, et al. Experimental Research on Autonomous Obstacle Avoidance and Path Planning of Intelligent Vehicle Based on RGB-D SLAM Technology[J]. Automobile Technology, 2021(9): 55-62., articleTitle=Experimental Research on Autonomous Obstacle Avoidance and Path Planning of Intelligent Vehicle Based on RGB-D SLAM Technology, refAbstract=null), Reference(id=1204452625673596959, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, doi=null, pmid=null, pmcid=null, year=2021, volume=null, issue=1, pageStart=1, pageEnd=10, url=null, language=null, rfNumber=[5], rfOrder=7, authorNames=陈涛, 范林坤, 李旭川, journalName=汽车技术, refType=null, unstructuredReference=陈涛, 范林坤, 李旭川, 等. 基于深度学习的智能车辆视觉里程计技术发展综述[J]. 汽车技术, 2021(1): 1-10., articleTitle=基于深度学习的智能车辆视觉里程计技术发展综述, refAbstract=null), Reference(id=1204452625786843177, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, doi=null, pmid=null, pmcid=null, year=2021, volume=null, issue=1, pageStart=1, pageEnd=10, url=null, language=null, rfNumber=[5], rfOrder=8, authorNames=CHEN T, FAN L K, LI X C, journalName=Automobile Technology, refType=null, unstructuredReference=CHEN T, FAN L K, LI X C, et al. Review on the Development of Deep Learning-Based Vision Odometer Technologies for Intelligent Vehicles[J]. Automobile Technology, 2021(1): 1-10., articleTitle=Review on the Development of Deep Learning-Based Vision Odometer Technologies for Intelligent Vehicles, refAbstract=null), Reference(id=1204452625887506480, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, doi=null, pmid=null, pmcid=null, year=2021, volume=44, issue=2, pageStart=275, pageEnd=291, url=null, language=null, rfNumber=[6], rfOrder=9, authorNames=段立娟, 孙启超, 乔元华, journalName=计算机学报, refType=null, unstructuredReference=段立娟, 孙启超, 乔元华, 等. 基于注意力感知和语义感知的RGB-D室内图像语义分割算法[J]. 计算机学报, 2021, 44(2): 275-291., articleTitle=基于注意力感知和语义感知的RGB-D室内图像语义分割算法, refAbstract=null), Reference(id=1204452626046890045, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, doi=null, pmid=null, pmcid=null, year=2021, volume=44, issue=2, pageStart=275, pageEnd=291, url=null, language=null, rfNumber=[6], rfOrder=10, authorNames=DUAN L J, SUN Q C, QIAO Y H, journalName=Chinese Journal of Computers, refType=null, unstructuredReference=DUAN L J, SUN Q C, QIAO Y H, et al. Attention-Aware and Semantic-Aware Network for RGB-D Indoor Semantic Segmentation[J]. Chinese Journal of Computers, 2021, 44(2): 275-291., articleTitle=Attention-Aware and Semantic-Aware Network for RGB-D Indoor Semantic Segmentation, refAbstract=null), Reference(id=1204452626189496387, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, doi=null, pmid=null, pmcid=null, year=2021, volume=null, issue=null, pageStart=13713, pageEnd=13722, url=null, language=null, rfNumber=[7], rfOrder=11, authorNames=HOU Q B, ZHOU D Q, FENG J S, journalName=Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville, TN, refType=null, unstructuredReference=HOU Q B, ZHOU D Q, FENG J S. Coordinate Attention for Efficient Mobile Network Design[C]// Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville, TN, USA: IEEE, 2021: 13713-13722., articleTitle=Coordinate Attention for Efficient Mobile Network Design, refAbstract=null), Reference(id=1204452626336297037, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, doi=null, pmid=null, pmcid=null, year=2021, volume=53, issue=2, pageStart=175, pageEnd=183, url=null, language=null, rfNumber=[8], rfOrder=12, authorNames=李琳辉, 张溪桐, 连静, journalName=哈尔滨工业大学学报, refType=null, unstructuredReference=李琳辉, 张溪桐, 连静, 等. 结合道路结构化特征的语义SLAM算法[J]. 哈尔滨工业大学学报, 2021, 53(2): 175-183., articleTitle=结合道路结构化特征的语义SLAM算法, refAbstract=null), Reference(id=1204452626474709084, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, doi=null, pmid=null, pmcid=null, year=2021, volume=53, issue=2, pageStart=175, pageEnd=183, url=null, language=null, rfNumber=[8], rfOrder=13, authorNames=LI L H, ZHANG X T, LIAN J, journalName=Journal of Harbin Institute of Technology, refType=null, unstructuredReference=LI L H, ZHANG X T, LIAN J, et al. Semantic SLAM Algorithm Combined with Road Structured Features[J]. Journal of Harbin Institute of Technology, 2021, 53(2): 175-183., articleTitle=Semantic SLAM Algorithm Combined with Road Structured Features, refAbstract=null), Reference(id=1204452626613121132, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, doi=null, pmid=null, pmcid=null, year=2016, volume=null, issue=null, pageStart=5297, pageEnd=5307, url=null, language=null, rfNumber=[9], rfOrder=14, authorNames=ARANDJELOVIC R, GRONAT P, TORII A, journalName=Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Las Vegas, NV, refType=null, unstructuredReference=ARANDJELOVIC R, GRONAT P, TORII A, et al. NetVLAD: CNN Architecture for Weakly Supervised Place Recognition[C]// Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Las Vegas, NV, USA: IEEE, 2016: 5297-5307., articleTitle=NetVLAD: CNN Architecture for Weakly Supervised Place Recognition, refAbstract=null), Reference(id=1204452626738950264, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, doi=null, pmid=null, pmcid=null, year=2021, volume=47, issue=12, pageStart=2742, pageEnd=2760, url=null, language=null, rfNumber=[10], rfOrder=15, authorNames=姚足, 龚勋, 陈锐, journalName=自动化学报, refType=null, unstructuredReference=姚足, 龚勋, 陈锐, 等. 面向行人重识别的局部特征研究进展、挑战与展望[J]. 自动化学报, 2021, 47(12): 2742-2760., articleTitle=面向行人重识别的局部特征研究进展、挑战与展望, refAbstract=null), Reference(id=1204452626885750916, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, doi=null, pmid=null, pmcid=null, year=2021, volume=47, issue=12, pageStart=2742, pageEnd=2760, url=null, language=null, rfNumber=[10], rfOrder=16, authorNames=YAO Z, GONG X, CHEN R, Research Progress, journalName=Acta Automatica Sinica, refType=null, unstructuredReference=YAO Z, GONG X, CHEN R, et al. Research Progress, Challenge and Prospect of Local Features for Person Re-Identification[J]. Acta Automatica Sinica, 2021, 47(12): 2742-2760., articleTitle=Challenge and Prospect of Local Features for Person Re-Identification, refAbstract=null), Reference(id=1204452627045134481, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, doi=null, pmid=null, pmcid=null, year=2013, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[11], rfOrder=17, authorNames=SÜNDERHAUF N, NEUBERT P, PROTZEL P, journalName=Proceedings of Workshop on Long-Term Autonomy, IEEE International Conference on Robotics and Automation (ICRA). Karlsruhe, refType=null, unstructuredReference=SÜNDERHAUF N, NEUBERT P, PROTZEL P. Are We There Yet? Challenging SeqSLAM on a 3000 km Journey Across all Four Seasons[C]// Proceedings of Workshop on Long-Term Autonomy, IEEE International Conference on Robotics and Automation (ICRA). Karlsruhe, Germany: IEEE, 2013., articleTitle=Are We There Yet? Challenging SeqSLAM on a 3000 km Journey Across all Four Seasons, refAbstract=null), Reference(id=1204452627221295271, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, doi=null, pmid=null, pmcid=null, year=2021, volume=null, issue=null, pageStart=14141, pageEnd=14152, url=null, language=null, rfNumber=[12], rfOrder=18, authorNames=HAUSLER S, GARG S, XU M, journalName=Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville, TN, refType=null, unstructuredReference=HAUSLER S, GARG S, XU M, et al. Patch-Netvlad: Multi-Scale Fusion of Locally-Global Descriptors for Place Recognition[C]// Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville, TN, USA: IEEE, 2021: 14141-14152., articleTitle=Patch-Netvlad: Multi-Scale Fusion of Locally-Global Descriptors for Place Recognition, refAbstract=null), Reference(id=1204452627384873141, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, doi=null, pmid=null, pmcid=null, year=2021, volume=null, issue=8, pageStart=1, pageEnd=6, url=null, language=null, rfNumber=[13], rfOrder=19, authorNames=常飞翔, 刘元盛, 李中道, journalName=汽车技术, refType=null, unstructuredReference=常飞翔, 刘元盛, 李中道, 等. 一种基于TX2平台的改进YOLOv3红绿灯检测算法[J]. 汽车技术, 2021(8): 1-6., articleTitle=一种基于TX2平台的改进YOLOv3红绿灯检测算法, refAbstract=null), Reference(id=1204452627493925057, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, doi=null, pmid=null, pmcid=null, year=2021, volume=null, issue=8, pageStart=1, pageEnd=6, url=null, language=null, rfNumber=[13], rfOrder=20, authorNames=CHANG F X, LIU Y S, LI Z D, journalName=Automobile Technology, refType=null, unstructuredReference=CHANG F X, LIU Y S, LI Z D, et al. An Improved YOLOv3 Traffic Light Detection Algorithm Based on TX2 Platform[J]. Automobile Technology, 2021(8): 1-6., articleTitle=An Improved YOLOv3 Traffic Light Detection Algorithm Based on TX2 Platform, refAbstract=null), Reference(id=1204452627602976973, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, doi=null, pmid=null, pmcid=null, year=2017, volume=33, issue=5, pageStart=1255, pageEnd=1262, url=null, language=null, rfNumber=[14], rfOrder=21, authorNames=MURARTAL R, TARDOS J D, journalName=IEEE Transactions on Robotics, refType=null, unstructuredReference=MURARTAL R, TARDOS J D. ORB-SLAM2: An Open-Source SLAM System for Monocular, Stereo, and RGB-D Cameras[J]. IEEE Transactions on Robotics, 2017, 33(5): 1255-1262., articleTitle=ORB-SLAM2: An Open-Source SLAM System for Monocular, Stereo, and RGB-D Cameras, refAbstract=null)], funds=[Fund(id=1204452624012653439, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, awardId=2021YFC3001300, language=CN, fundingSource=* 国家重点研发计划项目(2021YFC3001300), fundOrder=null, country=null), Fund(id=1204452624117511053, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, awardId=61931012, language=CN, fundingSource=国家自然科学基金重点项目合作项目(61931012), fundOrder=null, country=null), Fund(id=1204452624243340188, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, awardId=4222025, language=CN, fundingSource=北京市自然科学基金面上项目(4222025), fundOrder=null, country=null), Fund(id=1204452624385946535, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, awardId=ZK10202208, language=CN, fundingSource=北京联合大学高水平孵化项目和新进博士孵化项目(ZK10202208), fundOrder=null, country=null)], companyList=[AuthorCompany(id=1204452615930228799, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, xref=null, ext=[AuthorCompanyExt(id=1204452615938617408, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, companyId=1204452615930228799, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=Beijing Key Laboratory of Information Service Engineering, Beijing Union University, Beijing 100101), AuthorCompanyExt(id=1204452615947006016, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, companyId=1204452615930228799, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=北京联合大学,北京市信息服务工程重点实验室,北京 100101)])], figs=[ArticleFig(id=1204452620543963655, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, language=EN, label=null, caption=null, figureFileSmall=jj9Y0T+zp+fOtuscJDAGQQ==, figureFileBig=duKsJVlZ60eNuDO+eL40Mw==, tableContent=null), ArticleFig(id=1204452620728513049, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, language=CN, label=图1, caption=系统流程, figureFileSmall=jj9Y0T+zp+fOtuscJDAGQQ==, figureFileBig=duKsJVlZ60eNuDO+eL40Mw==, tableContent=null), ArticleFig(id=1204452620959199791, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, language=EN, label=null, caption=null, figureFileSmall=GOd+heMZLkOiGUYwxwJtew==, figureFileBig=cw3OdmTKokcGyd9vyubw0Q==, tableContent=null), ArticleFig(id=1204452621110194750, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, language=CN, label=图2, caption=残差结构, figureFileSmall=GOd+heMZLkOiGUYwxwJtew==, figureFileBig=cw3OdmTKokcGyd9vyubw0Q==, tableContent=null), ArticleFig(id=1204452621265384018, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, language=EN, label=null, caption=null, figureFileSmall=DmCNIaZHrSflkGgROgkhfQ==, figureFileBig=Xoi+/WaK/NgpybjFm2eIPg==, tableContent=null), ArticleFig(id=1204452621370241631, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, language=CN, label=图3, caption=坐标注意力网络结构, figureFileSmall=DmCNIaZHrSflkGgROgkhfQ==, figureFileBig=Xoi+/WaK/NgpybjFm2eIPg==, tableContent=null), ArticleFig(id=1204452621512847982, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, language=EN, label=null, caption=null, figureFileSmall=jTWYswl2nrDEC3yEwj8ZeQ==, figureFileBig=bie7s2vxV75z4u6bUOrc0Q==, tableContent=null), ArticleFig(id=1204452621621899897, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, language=CN, label=图4, caption=改进的坐标-残差结构, figureFileSmall=jTWYswl2nrDEC3yEwj8ZeQ==, figureFileBig=bie7s2vxV75z4u6bUOrc0Q==, tableContent=null), ArticleFig(id=1204452621709980295, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, language=EN, label=null, caption=null, figureFileSmall=WBS/Mp4WArvS7siXrF3g2w==, figureFileBig=MB78uFm7LTdiU2Rn3CVbfw==, tableContent=null), ArticleFig(id=1204452621852586646, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, language=CN, label=图5, caption=本文改进的扩张卷积-局部聚合向量网络编码器, figureFileSmall=WBS/Mp4WArvS7siXrF3g2w==, figureFileBig=MB78uFm7LTdiU2Rn3CVbfw==, tableContent=null), ArticleFig(id=1204452621970027169, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, language=EN, label=null, caption=null, figureFileSmall=JrM6JeBxjMsdHhagh0qbqQ==, figureFileBig=qjkYlu5VCMvJVDJOzkzPeQ==, tableContent=null), ArticleFig(id=1204452622121022128, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, language=CN, label=图6, caption=三元组损失, figureFileSmall=JrM6JeBxjMsdHhagh0qbqQ==, figureFileBig=qjkYlu5VCMvJVDJOzkzPeQ==, tableContent=null), ArticleFig(id=1204452622272017087, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, language=EN, label=null, caption=null, figureFileSmall=ePTswPW4ZXcoYJxlpR0iSw==, figureFileBig=yeN/nNgwJR6M1OcUDj81Yw==, tableContent=null), ArticleFig(id=1204452622360097482, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, language=CN, label=图7, caption=不同方法在2种数据集上位置识别任务中的召回率-查准率曲线, figureFileSmall=ePTswPW4ZXcoYJxlpR0iSw==, figureFileBig=yeN/nNgwJR6M1OcUDj81Yw==, tableContent=null), ArticleFig(id=1204452622544646876, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, language=EN, label=null, caption=null, figureFileSmall=8x738SaPwgShuTAZKaYLgQ==, figureFileBig=10YbqgpEZKsICdCyI8wM6w==, tableContent=null), ArticleFig(id=1204452622691447532, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, language=CN, label=图8, caption=旋风智能车试验平台, figureFileSmall=8x738SaPwgShuTAZKaYLgQ==, figureFileBig=10YbqgpEZKsICdCyI8wM6w==, tableContent=null), ArticleFig(id=1204452622821470968, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, language=EN, label=null, caption=null, figureFileSmall=iT2L+X8jEHdmnjiJbhr/FQ==, figureFileBig=Cz17o3rUi0azWhovDUwL0w==, tableContent=null), ArticleFig(id=1204452622938911493, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, language=CN, label=图9, caption=季节变化下的回环检测, figureFileSmall=iT2L+X8jEHdmnjiJbhr/FQ==, figureFileBig=Cz17o3rUi0azWhovDUwL0w==, tableContent=null), ArticleFig(id=1204452623056352020, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, language=EN, label=null, caption=null, figureFileSmall=null, figureFileBig=null, tableContent=
数据集 特征维度 召回率/%
Q=1 Q=5 Q=10
Nordland 原始特征 37.41 44.42 47.54
512维特征 43.17 47.71 50.63
Pitts30k 原始特征 81.63 91.16 93.22
512维特征 86.73 94.18 94.93
), ArticleFig(id=1204452623211541284, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, language=CN, label=表1, caption=

本文方法的召回率

, figureFileSmall=null, figureFileBig=null, tableContent=
数据集 特征维度 召回率/%
Q=1 Q=5 Q=10
Nordland 原始特征 37.41 44.42 47.54
512维特征 43.17 47.71 50.63
Pitts30k 原始特征 81.63 91.16 93.22
512维特征 86.73 94.18 94.93
), ArticleFig(id=1204452623383507773, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, language=EN, label=null, caption=null, figureFileSmall=null, figureFileBig=null, tableContent=
算法 平均闭环
准确率/%
平均闭环检测
时间/ms
DBoW2 71.34 2 634
NetVLAD 82.66 2 753
Patch-NetVLAD 90.11 4 746
本文方法 87.42 2 832
), ArticleFig(id=1204452623488365383, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, language=CN, label=表2, caption=

对比方法在回环检测任务中的性能

, figureFileSmall=null, figureFileBig=null, tableContent=
算法 平均闭环
准确率/%
平均闭环检测
时间/ms
DBoW2 71.34 2 634
NetVLAD 82.66 2 753
Patch-NetVLAD 90.11 4 746
本文方法 87.42 2 832
), ArticleFig(id=1204452623639360345, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, language=EN, label=null, caption=null, figureFileSmall=null, figureFileBig=null, tableContent=
网络结构 平均查准率/% 检索时间/ms
ResNet+NetVLAD(基线) 90.18 30.7
ResNet+CA+NetVLAD 94.05 33.8
ResNet+DC-NetVLAD 93.52 34.4
ResNet+CA+DC-NetVLAD 96.22 37.5
), ArticleFig(id=1204452623790355304, tenantId=1146029695717560320, journalId=1189621681917173762, articleId=1204385486027596021, language=CN, label=表3, caption=

消融试验结果

, figureFileSmall=null, figureFileBig=null, tableContent=
网络结构 平均查准率/% 检索时间/ms
ResNet+NetVLAD(基线) 90.18 30.7
ResNet+CA+NetVLAD 94.05 33.8
ResNet+DC-NetVLAD 93.52 34.4
ResNet+CA+DC-NetVLAD 96.22 37.5
)], attaches=null, journal=Journal(id=1149693407745847311, delFlag=0, nameCn=汽车技术, nameEn=Automobile Technology, nameHistory1=null, nameHistory2=null, issn=1000-3703, eissn=null, cn=22-1113/U, coden=null, periodic=0, language=CN, oaType=null, ccby=null, superviseOffice=null, ownerOffice=null, pubOffice=null, editorOffice=null, officeType=null, aims=null, clcCode=null, officeProv=null, officeCity=null, officeAddr=null, officeZip=null, officeEmail=null, officePhone=null, editDirector=null, officeDirector=null, officeDirectorPhone=null, officeStaffNum=null, officeEmpNum=null, coverPicUrl=rYFtDx/CU9+iX8QTM0ckbw==, journalPrice=null, startedYear=null, abbrevIsoEn=null, journalRemark=null, publicationField=null, createdTime=1752037868679, updatedTime=1761735668047, createdBy=18614031015, updatedBy=13701087609, firstLetterCn=A, firstLetterEn=A, subjectCode=Engineering, subjectName=Engineering, subjectCodeEn=Engineering, subjectNameEn=null, picCn=rYFtDx/CU9+iX8QTM0ckbw==, picEn=oFT2NmUwKPUjZ27C1+d9pw==, jcr=null, cjcr=null, exts=[JournalExt(id=1190368926564450443, language=CN, name=汽车技术, nameHistory1=null, nameHistory2=null, managedBy=, sponsoredBy=, publishedBy=, editorOffice=, officeProv=null, officeCity=null, officeAddr=, officeZip=, editDirector=, officeDirector=null, officePhone=null, coverPicUrl=null, journalRemark=, submitArticleUrl=null, websiteUrl=, createdTime=1761735668078, updatedTime=1761735668078, createdBy=13701087609, updatedBy=13701087609, submissionGuidelinesUrl=, submissionAuthorUrl=https://qcjs.cbpt.cnki.net/index.aspx?t=1, submissionEditorUrl=https://qcjs.cbpt.cnki.net/index.aspx?t=3, submissionReviewUrl=https://qcjs.cbpt.cnki.net/index.aspx?t=2, submissionCeEditorUrl=, submissionAeEditorUrl=, option={"copyright":""}), JournalExt(id=1190368926618976396, language=EN, name=Automobile Technology, nameHistory1=null, nameHistory2=null, managedBy=, sponsoredBy=, publishedBy=, editorOffice=, officeProv=null, officeCity=null, officeAddr=, officeZip=, editDirector=, officeDirector=null, officePhone=null, coverPicUrl=null, journalRemark=, submitArticleUrl=null, websiteUrl=, createdTime=1761735668091, updatedTime=1761735668091, createdBy=13701087609, updatedBy=13701087609, submissionGuidelinesUrl=, submissionAuthorUrl=https://qcjs.cbpt.cnki.net/index.aspx?t=1, submissionEditorUrl=https://qcjs.cbpt.cnki.net/index.aspx?t=3, submissionReviewUrl=https://qcjs.cbpt.cnki.net/index.aspx?t=2, submissionCeEditorUrl=, submissionAeEditorUrl=, option={"copyright":""})], databaseList=null, tenantJournalId=1189621681917173762, websiteList=[Website(id=1189624193747526544, webName=null, webTitle=null, webDomain=null, webCopyrigh=null, webIpcNo=null, seoTitle=null, seoKeywords=null, seoDescription=null, tenantJournalId=null, journalId=1189621681917173762, journalNameCn=null, journalNameEn=null, grayFlag=null, tenantId=1146029695717560320, platformId=null, journalGroupId=null, journalGroupNameCn=null, journalGroupNameEn=null, type=1, domain=https://castjournals.cast.org.cn/joweb/qcjs/CN, language=CN, createTime=1761558109939, createBy=18614031015, updateTime=1761558140534, updateBy=18614031015, name=汽车技术-中, tplId=1146099689490845704, title=汽车技术, delFlag=0, indexPage=/home, props=[WebsiteProps(id=1189625424704451180, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189624193747526544, code=articleTextType, value=kx, createTime=1761558403421, updateTime=1761558403421, creator=18614031015, updator=18614031015), WebsiteProps(id=1189625424675091049, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189624193747526544, code=banner, value=null, createTime=1761558403414, updateTime=1761558403414, creator=18614031015, updator=18614031015), WebsiteProps(id=1189625424733811311, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189624193747526544, code=grayFlag, value=0, createTime=1761558403428, updateTime=1761558403428, creator=18614031015, updator=18614031015), WebsiteProps(id=1189625424658313832, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189624193747526544, code=logo, value=https://castjournals.cast.org.cn/joweb/qcjs/CN/file/pic?fileId=7En9rzX2QCa/1J8NnKt/Fg==, createTime=1761558403410, updateTime=1761558403410, creator=18614031015, updator=18614031015), WebsiteProps(id=1189625424746394225, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189624193747526544, code=minRunFlag, value=0, createTime=1761558403431, updateTime=1761558403431, creator=18614031015, updator=18614031015), WebsiteProps(id=1189625424691868267, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189624193747526544, code=picServerUrl, value=https://castjournals.cast.org.cn/joweb/qcjs/CN/file/pic, createTime=1761558403418, updateTime=1761558403418, creator=18614031015, updator=18614031015), WebsiteProps(id=1189625424742199920, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189624193747526544, code=silenceFlag, value=0, createTime=1761558403430, updateTime=1761558403430, creator=18614031015, updator=18614031015), WebsiteProps(id=1189625424683479658, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189624193747526544, code=staticResourcePath, value=https://castjournals.cast.org.cn/joweb/cast_kjdb_cn_619/, createTime=1761558403416, updateTime=1761558403416, creator=18614031015, updator=18614031015), WebsiteProps(id=1189625424712839789, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189624193747526544, code=themeColor, value=null, createTime=1761558403423, updateTime=1761558403423, creator=18614031015, updator=18614031015), WebsiteProps(id=1189625424725422702, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189624193747526544, code=themeStyle, value=null, createTime=1761558403426, updateTime=1761558403426, creator=18614031015, updator=18614031015)]), Website(id=1189624193869161363, webName=null, webTitle=null, webDomain=null, webCopyrigh=null, webIpcNo=null, seoTitle=null, seoKeywords=null, seoDescription=null, tenantJournalId=null, journalId=1189621681917173762, journalNameCn=null, journalNameEn=null, grayFlag=null, tenantId=1146029695717560320, platformId=null, journalGroupId=null, journalGroupNameCn=null, journalGroupNameEn=null, type=1, domain=https://castjournals.cast.org.cn/joweb/qcjs/EN, language=EN, createTime=1761558109967, createBy=18614031015, updateTime=1761558340679, updateBy=18614031015, name=汽车技术-英文, tplId=1146101810881728533, title=Automobile Technology, delFlag=0, indexPage=/home, props=[WebsiteProps(id=1189625550722311064, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189624193869161363, code=articleTextType, value=kx, createTime=1761558433466, updateTime=1761558433466, creator=18614031015, updator=18614031015), WebsiteProps(id=1189625550688756629, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189624193869161363, code=banner, value=null, createTime=1761558433458, updateTime=1761558433458, creator=18614031015, updator=18614031015), WebsiteProps(id=1189625550739088283, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189624193869161363, code=grayFlag, value=0, createTime=1761558433470, updateTime=1761558433470, creator=18614031015, updator=18614031015), WebsiteProps(id=1189625550676173716, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189624193869161363, code=logo, value=https://castjournals.cast.org.cn/joweb/qcjs/EN/file/pic?fileId=7En9rzX2QCa/1J8NnKt/Fg==, createTime=1761558433455, updateTime=1761558433455, creator=18614031015, updator=18614031015), WebsiteProps(id=1189625550751671197, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189624193869161363, code=minRunFlag, value=0, createTime=1761558433473, updateTime=1761558433473, creator=18614031015, updator=18614031015), WebsiteProps(id=1189625550713922455, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189624193869161363, code=picServerUrl, value=https://castjournals.cast.org.cn/joweb/qcjs/EN/file/pic, createTime=1761558433464, updateTime=1761558433464, creator=18614031015, updator=18614031015), WebsiteProps(id=1189625550743282588, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189624193869161363, code=silenceFlag, value=0, createTime=1761558433471, updateTime=1761558433471, creator=18614031015, updator=18614031015), WebsiteProps(id=1189625550705533846, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189624193869161363, code=staticResourcePath, value=https://castjournals.cast.org.cn/joweb/cast_kjdb_en_623/, createTime=1761558433462, updateTime=1761558433462, creator=18614031015, updator=18614031015), WebsiteProps(id=1189625550726505369, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189624193869161363, code=themeColor, value=null, createTime=1761558433467, updateTime=1761558433467, creator=18614031015, updator=18614031015), WebsiteProps(id=1189625550734893978, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1189624193869161363, code=themeStyle, value=null, createTime=1761558433469, updateTime=1761558433469, creator=18614031015, updator=18614031015)])], journalTitle=汽车技术, weixinUrl=null, journalUrl=null, iacademicId=null, status=1, seqNo=null, journalTitleEn=Automobile Technology, journalPhotoCn=rYFtDx/CU9+iX8QTM0ckbw==, journalPhotoEn=oFT2NmUwKPUjZ27C1+d9pw==, journalFirstLetter=A, journalRecommend=null, journalNew=null, journalCollection=null, jcrJf=null, cjcrJf=null, jcrJfStr=null, cjcrJfStr=null, submissionFirstDecision=null, sciSubjectClassification=null, casSubjectClassification=null, citeScore=null, totalCitationFrequency=null, icpCode=null, psCode=null, advertisingLicenseCode=null, copyrightInformation=null, country=null, option=, provinceCode=null, provinceName=null, collectFlag=false), detailUrlCn=https://castjournals.cast.org.cn/joweb/qcjs/CN/10.19620/j.cnki.1000-3703.20220668, detailUrlEn=https://castjournals.cast.org.cn/joweb/qcjs/EN/10.19620/j.cnki.1000-3703.20220668, pdfUrlCn=https://castjournals.cast.org.cn/joweb/qcjs/CN/PDF/10.19620/j.cnki.1000-3703.20220668, pdfUrlEn=https://castjournals.cast.org.cn/joweb/qcjs/EN/PDF/10.19620/j.cnki.1000-3703.20220668, aliStartDate=null, aliEndDate=null, collectionFlag=false, citedCount=null, citedUrl=null, reference=null)
收藏切换
单级特征图融合坐标注意力的视觉位置识别方法*
收藏切换
PDF下载
刘子健 , 张军 , 刘元盛 , 路铭 , 宋庆鹏
汽车技术 | 2023,(3): 19-25
收起
收藏切换
汽车技术 | 2023, (3): 19-25
单级特征图融合坐标注意力的视觉位置识别方法*
全屏
刘子健, 张军 , 刘元盛, 路铭, 宋庆鹏
作者信息
  • 北京联合大学,北京市信息服务工程重点实验室,北京 100101

通讯作者:

张军(1971—),女,山东临沂人,副教授,博士,研究方向为自动驾驶感知与决策技术,
Visual Place Recognition with One Level Feature Fusing Coordinate Attention
Zijian Liu, Jun Zhang , Yuansheng Liu, Ming Lu, Qingpeng Song
Affiliations
  • Beijing Key Laboratory of Information Service Engineering, Beijing Union University, Beijing 100101
出版时间: 2023-03-24 doi: 10.19620/j.cnki.1000-3703.20220668
文章导航
收藏切换

针对视角和环境变化的场景中现有视觉位置识别方法存在的匹配遗漏和实时性差的问题,提出基于单级特征图融合坐标注意力的视觉位置识别方法。首先通过坐标注意力捕获特征的相对位置信息,然后利用扩张卷积和局部聚合向量网络(NetVLAD)构造多尺度特征融合的编码器,最后基于三元组损失训练网络。经Pitts30k和Nordland数据集验证,在位置识别试验中,与同基线的先进方法Patch-NetVLAD相比,所提出的方法能够获得同等的召回精度且检索速度提高19%。在回环检测试验中,所提出的方法达到了合理平衡鲁棒性和检索速度的目标。

自动驾驶  /  视觉位置识别  /  回环检测  /  坐标注意力  /  局部聚合向量网络  /  三元组损失

To solve the problems of matching omission and poor real-time performance of existing visual place recognition methods in scenes with changing viewpoints and environments, this paper proposed visual place recognition method based on one level feature fused with coordinate attention. Firstly, the relative place information of features was captured by coordinate attention. Secondly, an encoder for multi-scale feature fusion was constructed using dilated convolution and NetVLAD. Finally, the network was trained based on triplet loss. Validated by Pitts30k and Nordland datasets, the proposed method achieves the same recall accuracy and 19% faster retrieval speed compared with the state-of-the-art method Patch-NetVLAD of the same baseline in the test of position recognition. In the test of loop detection, the proposed method achieves a reasonable balance between robustness and retrieval speed.

Autopilot  /  Visual place recognition  /  Loop detection  /  Coordinate attention  /  NetVLAD  /  Triplet loss
刘子健, 张军, 刘元盛, 路铭, 宋庆鹏. 单级特征图融合坐标注意力的视觉位置识别方法*. 汽车技术, 2023 , (3) : 19 -25 . DOI: 10.19620/j.cnki.1000-3703.20220668
Zijian Liu, Jun Zhang, Yuansheng Liu, Ming Lu, Qingpeng Song. Visual Place Recognition with One Level Feature Fusing Coordinate Attention[J]. Automobile Technology, 2023 , (3) : 19 -25 . DOI: 10.19620/j.cnki.1000-3703.20220668
自动驾驶汽车基于同步定位与建图(Simultaneous Localization and Mapping,SLAM)[1]构建自主导航系统。在长期运行中,定位系统会产生累积误差,导致定位失效。实践中,自动驾驶汽车先存储已访问过场景的表征向量,再在汽车运动时匹配当前环境的表征向量解决定位问题,即回环检测与重定位。随着自动驾驶汽车的发展,以视觉位置识别(Visual Place Recognition,VPR)[2]技术为代表的回环检测与重定位方法受到关注。VPR以环境中的视觉特征为基础,计算形成环境表征向量[3]。目前应用于VPR技术的视觉特征主要分为基于规则设计的手工特征和基于数据驱动的学习特征。
基于规则设计的角点和描述子组合的ORB(Oriented FAST and Rotated BRIEF)[4]局部特征具有尺度和旋转不变性,对图像噪声及透视变换具有鲁棒性。因此,ORB特征能准确地描述和检索稳定不变的环境信息。但此类方法可能受到视角转换、光照和季节变换等因素影响,产生当前观测的环境信息与数据库中已知的环境信息存在较大差异的现象,最终导致位置识别结果的准确率降低等问题。
基于数据驱动的卷积神经网络(Convolutional Neural Networks,CNN)[5]具有强大的图像表示能力。为在现有VPR方法的识别精度下提高定位系统的鲁棒性和实时性,本文针对环境中的视觉特征空间关系学习和高层特征学习两方面进行改进:在骨干网络中融合坐标注意力,获得视觉特征的空间位置关系和显著的空间位置信息;利用多重扩张卷积和局部聚合向量网络,构造多尺度特征融合的特征编码器,获得单级特征图在不同尺度下的视觉特征向量。最后,将该方法在公开数据集上进行验证。
本文基于残差网络(Residual Networks,ResNet)[6]提出一种弱监督训练的VPR方法,实现高效的视觉位置识别:输入图像集或图像序列,通过本文搭建的神经网络得到特征匹配结果,最终输出检索结果。系统流程如图1所示。
深层CNN存在参数爆炸、梯度消失等问题。ResNet通过短路连接解决深层网络梯度消失带来的学习退化问题。当理论残差为零时,残差结构只做恒等映射以确保实际残差不会为零,同时保持当前网络性能,结构如图2所示。
本文采用ResNet-50作为提取图像特征的骨干网络。自动驾驶汽车采集的视觉信息在ResNet中经过卷积(Convolutional,Conv)层、批归一化(Batch Normalization,BN)层、激活函数和池化层完成预处理,随后再经过4组瓶颈(Bottleneck)层提取特征信息。
坐标注意力(Coordinate Attention,CA)[7]网络将输入的二维全局池化转化为单个特征向量的过程,分解为水平方向和垂直方向一维特征编码过程,将聚合的方向信息嵌入特征图中。通过级联捕获的2个空间方向的依赖关系和位置信息,经BN层和非线性变换生成空间信息的特征图。沿着水平和垂直方向切分特征图得到注意力权重,使网络捕获到特征的位置关系。网络结构如图3所示。
视觉特征的相对位置关系对于位置识别结果至关重要。为了在开放环境中保持定位系统的鲁棒性,需要网络模型具备筛选重要的环境特征和获取视觉特征坐标的能力。为弥补ResNet对图像特征的空间关系不敏感的缺陷,本文在其残差结构中引入CA,构建新的特征提取网络。通过捕获环境特征的空间位置关系,获得特征间的依赖关系和精确的位置信息。
CNN的浅层输出边缘和色彩等具体的图像特征,随着网络层数的加深,输出的特征图尺寸逐渐减小,视觉信息逐渐抽象和语义化。ResNet的Conv5特征图包含对图像变化足够鲁棒的视觉特征,同时具有丰富的语义信息。因此,本文在Conv5的短路连接和瓶颈层之间引入CA,将生成的特征图编码为一组方向感知和位置敏感的注意力热图,促使网络学习到特征间隐含的空间结构关系,帮助模型准确定位感兴趣的目标,增强特征图表达能力。改进后的网络利用CNN提取图像的局部信息,发挥了CA对全局信息建模的能力,有效提高视觉特征的表达能力。改进后的网络结构如图4所示。
对环境的局部视觉特征进行建模,不足以区分局部特征相似的场景,需要联系上下文信息和环境的整体信息对场景进行同一性判断。特征金字塔(Feature Pyramid Network,FPN)是获取多尺度特征的经典网络,但车载设备的算力普遍较低,导致基于FPN的VPR方法移植到车载设备存在实时性差的问题。扩张卷积(Dilated Convolution,DC)[8]具有仅增加1个参数即可获取更广视野范围特征图的特点。因此,本文基于DC和局部聚合向量网络(Vector of Locally Aggregated Descriptors Network,NetVLAD)构造图像编码器,利用DC-NetVLAD对Conv5卷积层输出的单级特征图进行视野扩张,然后对不同视野下的图像特征进行聚合得到图像的描述向量,在不失精度的前提下节约计算成本,提高系统的实时性。
基于局部特征解决VPR问题,需要对特征矩阵进行聚类处理以压缩数据维度。相比视觉词袋(Bag of Word,BoW)编码,局部聚合描述符向量(Vector of Locally Aggregated Descriptors,VLAD)不仅保存每个特征点到离它最近的聚类中心的距离,还记录了特征点在不同维度的取值,避免了信息损失问题。VLAD的计算公式为:
V ( J , K ) = i = 1 N a k ( X i ) ( x i (j) - c k (j) )
式中,Xi为输入的数量为N的局部特征向量;V(J,K)为输出的聚类后的特征向量;K为聚类中心的数量;J为第k个特征向量的维度;xi(j)为第i个局部特征的第j个特征值;ck(j)为第k个聚类中心的第j个特征值;ak(Xi)为二值的符号函数,对每个属于聚类中心ck的特征向量xiak的值取为1,否则为0。
原始的VLAD为不可导函数,为得到端到端的特征聚类结果,Arandjelovic等[9]在VLAD算法的基础上,用归一化指数函数(Softmax)代替原始方法中的最近邻二值函数ak,提出NetVLAD算法。NetVLAD设定的聚类的数量为K,计算局部特征在这些聚类的差值分布得到全局特征,使VLAD算法中需要手工聚类获得的参数改为通过网络训练获取。系数 a - k(Xi)的计算公式为:
a - k ( X i ) = e 2 α c - k x i - α c - k 2 k = 1 K e 2 α c - k x i - α c - k 2
式中, a - k为基于Softmax计算每个局部特征到聚类中心的距离时的权重,其取值与距离成反比,特征离聚类簇中心越近,则其权重分配越高,远则权重分配越低,即当该特征属于聚类中心时,把局部特征描述子Xi在每个聚类簇下的权重调整为1,否则为0;xi为特征向量; c - k为网络学习的聚类中心;α为权重系数。
首先去掉ResNet-50网络最后的全局平均池化和全连接层,利用1×1卷积对残差网络输出的特征图进行通道压缩减少计算量,然后经过3个并联的具有不同扩张率参数的扩张卷积,得到不同视野下的图像特征。为避免扩张卷积中心点不连续,本文的扩张率参数分别设定为6、12和18。在NetVLAD中,经过激活函数Softmax和局部聚合层的池化获得VLAD特征,通过类内正则化对每个聚类中心的所有残差分别进行正则化,最后将所有VLAD特征共同进行欧式范数正则化得到图像全局表示的特征向量。改进后的DC-NetVLAD编码器结构如图5所示。
NetVLAD算法首先将图像的局部特征聚类压缩,得到以聚类中心表示的全局特征图,然后将待查询图像的特征向量与数据库中的图像向量进行相似度计算,以余弦相似度为度量,距离越小的图像之间越相似。余弦距离的计算公式为:
d ( G , H ) = 1 - i = 1 n ( g i × h i ) i = 1 n ( g i ) 2 × i = 1 n ( h i ) 2
式中,GH分别为输入的2个特征向量;n为向量维度;gihi分别为GHi个维度的特征值。
针对环境中普遍存在着视觉特征相似场景的情况,本文使用三元组损失(Triplet Loss,TL)[10]训练网络来提高系统的查准能力。TL的输入是一组编码后的图像特征,包括正例、负例和基准图像。基于TL训练的网络能够优化图像与特征向量之间的映射关系,在新的特征空间中使负例到基准图像的距离尽可能大于正例到基准图像的距离,从而提升网络识别正例的能力,效果如图6所示。
根据模型训练的需要设定边缘阈值β控制正、负样本的距离。TL目标函数的表达式为:
L = i N f ( x i a ) - f ( x i p ) 2 2 - f ( x i a ) - f ( x i n ) 2 2 + β +
式中, f ( x i a )为输入的数量为N的三元组中的第i个基准样例的特征值; f ( x i p )为第i个正例的特征值; f ( x i n )为第i个负例的特征值。
超参数β控制损失函数的结果:L>0时取该值作为损失;L<0时,表示负例与基准样本的距离大于正例与基准样本的距离,因此损失函数的结果记为0。
本文的试验环境为Ubuntu 18.04 LTS操作系统,硬件平台采用11代英特尔i7处理器、32 GB内存和英伟达RTX 2080Ti显卡。
采用Pitts30k和Nordland[11]2个公开数据集进行测试。Pitts30k是谷歌公司制作的城市街景数据集,该数据集基于地理位置进行划分,包括训练、验证和测试部分,分别含有约7 000张查询图像和10 000张参考图像。Nordland数据集采集自挪威北部地区,包含4个帧率为25 帧/s的视频影像,展示长约728 km的环路场景在四季变化下不同的视觉外观。
针对Nordland数据集,通过插值GPS数据实现时间同步,保证每个视频中任意帧的车辆位置都能够对应到其他3个视频中的同一帧。
以Pitts30k的训练集作为训练样本,设定10 m以内的图像为正例,25 m以外的图像为负例。训练模型的输入图像组包括基准图像、1个正例和10个负例。
本文以召回率-查准率作为评价指标,与典型的VPR方法进行比较,包括手工聚类特征的算法DBoW2、本文工作的基线方法NetVLAD和同基线方法的先进成果Patch-NetVLAD[12]。具体规则如下:
a. 基于Pitts30k数据集的验证,规定如果召回的前Q张图像中,至少存在1张图像距离用于测试的基准图像在地理范围的10 m内,则认为图像已被正确检索。
b. 基于Nordland数据集的验证,规定如果召回的前Q张图像中,至少存在1张图像距离用于测试的基准图像在10帧内,则认为图像已被正确检索。
c. 试验的评价指标为召回率(Recall)和查准率(Precision):
R=TP/(TP+FN)
P=TP/(TP+FP)
式中,R为召回率;P为查准率;TP为真正例数量;FP为假正例数量;FN为假负例数量。
首先对本文方法在位置识别任务中的召回率进行测试,Q的不同取值条件下试验结果如表1所示。
相比原始特征向量,经过主成分分析(Principal Components Analysis,PCA)降维的特征向量在位置识别任务中的召回率表现更好。在Nordland数据集上的平均召回率提高4.05百分点,在Pitts30k数据集上的平均召回率提高3.28百分点。提高平均召回率的主要原因是降维后的图像特征忽略了没有辨识度的特征区域,突出了图像中具有区分度的部分,产生更好的召回结果。
为验证本文改进方法的有效性,将本文方法与近年来具有代表性的VPR方法和同基线的方法进行对比。对不同方法得到的特征向量均降维到512维,使用召回率-查准率评价指标进行分析。本文方法和对比方法在Pitts30k数据集和Nordland数据集上的试验结果如图7所示。
在侧重视角变化的Pitts30k数据集上,NetVLAD算法在召回率为30%时开始出现精度下降,DBoW2和Patch-NetVLAD算法在召回率为40%时开始出现精度的快速下降,本文方法在召回率为50%时才出现明显的精度下降。与DBoW2和NetVLAD算法相比,本文方法更快地检索到了所有正例图像;仅在召回率达到80%后,本文方法的召回精度与Patch-NetVLAD算法相比略有不足。
在具有季节变化和光照变化的Nordland数据集上,本文方法始终优于DBoW2和NetVLAD算法,与先进的Patch-NetVLAD算法相比,在召回率达到70%前,本文方法也保持着更好的召回精度。
可以看出,相比DBoW2和NetVLAD算法,本文方法在试验中始终保持着更好的召回精度。这是由于融合的坐标注意力能够记录明显的空间特征位置关系,在视角变化的情况下,特征间的相对位置关系保持稳定,而不同视野特征图的组合编码能够覆盖不同视距下的观测结果。同时也可以观察到,本文方法在高召回率的场景中召回精度相比同基线的Patch-NetVLAD算法略低。这是由于Patch-NetVLAD算法采用基于区块尺度的图像特征和基于一致性评分的重新排序策略实现了召回精度的提升。相应地,此策略降低了Patch-NetVLAD算法的检索速度,因此本文方法的检索速度与Patch-NetVLAD算法相比有19%的提升。视觉SLAM的实际测试中,常常需要快速实现场景重识别和回环检测,很少产生需要全部召回的现象,因此本文方法的实用性更强。
经上述试验验证,本文方法在召回率-查准率的评价体系下明显优于DBoW2和NetVLAD算法,与先进的Patch-NetVLAD相比,本文方法牺牲较少的召回精度获得了检索速度的大幅提升,证明本文改进方法的有效性。
回环检测是视觉定位中校正全局地图累积误差的重要手段。为验证本文方法在回环检测任务中的表现,设计一组基于旋风智能车[13]试验平台的应用测试。试验平台如图8所示,其处理器为Jetson TX2,包含6核CPU、256核帕斯卡(Pascal)架构GPU和8 GB内存。
利用ORB-SLAM2[14]算法在Nordland数据集的原始影像上提取用于建立回环地图的关键帧序列,以关键帧序列为数据集进行回环测试。将本文方法与DBoW2、NetVLAD和Patch-NetVLAD算法在回环检测任务中的表现进行对比,试验结果如图9所示。
图9可以看出,Nordland数据集夏季序列和冬季序列之间存在较大的外观差异。相比DBoW2和NetVLAD算法,本文方法和Patch-NetVLAD算法用于回环检测任务的结果更接近真实的回环位置,进一步说明本文改进方法的鲁棒性。统计几种方法在回环检测任务中平均闭环准确率和检测时间,结果如表2所示。
表2可知:相比DBoW2和NetVLAD算法,本文方法在回环检测任务中的平均闭环准确率有所提高;与Patch-NetVLAD相比,本文方法的平均闭环准确率略低,但是在时间性能上具有明显优势。
为探究本文所提出的改进点对试验结果的影响,在Pitts30k数据集上通过消融试验对两方面的改进进行验证,如表3所示。
相比基线方法,本文改进方法获得了更好的平均精度,检索时间仅延长6.8 ms。其中,融合CA对网络的精度提升3.87百分点,使用DC-NetVLAD编码器对网络的精度提升3.34百分点。通过融合CA使特征间的位置关系得到利用,利用并联的DC-NetVLAD编码器发挥不同视野下特征图信息的互补性。最终的试验结果表明,同时采用2个模块对网络进行改进,平均精度提升6.04百分点,进一步提升了网络的位置识别能力,证明本文的2个改进点具有一定的互补性。
针对开放场景中的自动驾驶汽车的视觉定位和位置识别问题,本文在现有方法的基础上提出两点改进:基于残差网络融合坐标注意力提取图像特征,提高原始网络对图像特征位置关系的捕获能力;通过多重扩张卷积和NetVLAD实现轻量的多尺度特征融合,编码图像的表征向量。试验结果表明:在季节、光照和视角等变化导致环境外观发生改变的场景中,本文提出的方法取得了96.22%的召回精度,单帧检索时间仅需37.5 ms,与同基线方法相比,显著提高了检索速度和召回精度。
针对视觉传感器捕获的信息模态较为单一,在视觉受限的场景容易产生系统失效的问题,后续将开展多源传感器融合定位的研究以及通过云端计算的方式缓解边缘设备的算力问题等相关工作,实现自动驾驶车辆的高精度定位。
  • * 国家重点研发计划项目(2021YFC3001300)
  • 国家自然科学基金重点项目合作项目(61931012)
  • 北京市自然科学基金面上项目(4222025)
  • 北京联合大学高水平孵化项目和新进博士孵化项目(ZK10202208)
参考文献 引证文献
排序方式:
[1]
高翔, 张涛, 刘毅, 等. 视觉SLAM十四讲: 从理论到实践[M]. 2版. 北京: 电子工业出版社, 2019.
GAO X, ZHANG T, LIU Y, et al. 14 Lectures on Visual SLAM: From Theory to Practice[M]. Beijing: Publishing House of Electronics Industry, 2019.
[2]
LOWRY S, SÜNDERHAUF N, NEWMAN P, et al. Visual Place Recognition: A Survey[J]. IEEE Transactions on Robotics, 2016, 32(1): 1-19.
[3]
刘强, 段富海, 桑勇, 等. 复杂环境下视觉SLAM闭环检测方法综述[J]. 机器人, 2019, 41(1): 112-123+136.
LIU Q, DUAN F H, SANG Y, et al. A Survey of Loop-Closure Detection Method of Visual SLAM in Complex Environments[J]. Robot, 2019, 41(1): 112-123+136.
[4]
李兆凯, 李龙勇, 李泽晖, 等. 基于RGB-D SLAM的智能车自主避障与路径规划试验研究[J]. 汽车技术, 2021(9): 55-62.
LI Z K, LI L Y, LI Z H, et al. Experimental Research on Autonomous Obstacle Avoidance and Path Planning of Intelligent Vehicle Based on RGB-D SLAM Technology[J]. Automobile Technology, 2021(9): 55-62.
[5]
陈涛, 范林坤, 李旭川, 等. 基于深度学习的智能车辆视觉里程计技术发展综述[J]. 汽车技术, 2021(1): 1-10.
CHEN T, FAN L K, LI X C, et al. Review on the Development of Deep Learning-Based Vision Odometer Technologies for Intelligent Vehicles[J]. Automobile Technology, 2021(1): 1-10.
[6]
段立娟, 孙启超, 乔元华, 等. 基于注意力感知和语义感知的RGB-D室内图像语义分割算法[J]. 计算机学报, 2021, 44(2): 275-291.
DUAN L J, SUN Q C, QIAO Y H, et al. Attention-Aware and Semantic-Aware Network for RGB-D Indoor Semantic Segmentation[J]. Chinese Journal of Computers, 2021, 44(2): 275-291.
[7]
HOU Q B, ZHOU D Q, FENG J S. Coordinate Attention for Efficient Mobile Network Design[C]// Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville, TN, USA: IEEE, 2021: 13713-13722.
[8]
李琳辉, 张溪桐, 连静, 等. 结合道路结构化特征的语义SLAM算法[J]. 哈尔滨工业大学学报, 2021, 53(2): 175-183.
LI L H, ZHANG X T, LIAN J, et al. Semantic SLAM Algorithm Combined with Road Structured Features[J]. Journal of Harbin Institute of Technology, 2021, 53(2): 175-183.
[9]
ARANDJELOVIC R, GRONAT P, TORII A, et al. NetVLAD: CNN Architecture for Weakly Supervised Place Recognition[C]// Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Las Vegas, NV, USA: IEEE, 2016: 5297-5307.
[10]
姚足, 龚勋, 陈锐, 等. 面向行人重识别的局部特征研究进展、挑战与展望[J]. 自动化学报, 2021, 47(12): 2742-2760.
YAO Z, GONG X, CHEN R, et al. Research Progress, Challenge and Prospect of Local Features for Person Re-Identification[J]. Acta Automatica Sinica, 2021, 47(12): 2742-2760.
[11]
SÜNDERHAUF N, NEUBERT P, PROTZEL P. Are We There Yet? Challenging SeqSLAM on a 3000 km Journey Across all Four Seasons[C]// Proceedings of Workshop on Long-Term Autonomy, IEEE International Conference on Robotics and Automation (ICRA). Karlsruhe, Germany: IEEE, 2013.
[12]
HAUSLER S, GARG S, XU M, et al. Patch-Netvlad: Multi-Scale Fusion of Locally-Global Descriptors for Place Recognition[C]// Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville, TN, USA: IEEE, 2021: 14141-14152.
[13]
常飞翔, 刘元盛, 李中道, 等. 一种基于TX2平台的改进YOLOv3红绿灯检测算法[J]. 汽车技术, 2021(8): 1-6.
CHANG F X, LIU Y S, LI Z D, et al. An Improved YOLOv3 Traffic Light Detection Algorithm Based on TX2 Platform[J]. Automobile Technology, 2021(8): 1-6.
[14]
MURARTAL R, TARDOS J D. ORB-SLAM2: An Open-Source SLAM System for Monocular, Stereo, and RGB-D Cameras[J]. IEEE Transactions on Robotics, 2017, 33(5): 1255-1262.
2023年第卷第3期
PDF下载
201
76
引用本文
BibTeX
文章信息
doi: 10.19620/j.cnki.1000-3703.20220668
  • 首发时间:2025-12-07
  • 出版时间:2023-03-24
补充材料
相关文章
文章信息
作者
出版历史
  • 修回日期:2022-07-19
基金
* 国家重点研发计划项目(2021YFC3001300)
国家自然科学基金重点项目合作项目(61931012)
北京市自然科学基金面上项目(4222025)
北京联合大学高水平孵化项目和新进博士孵化项目(ZK10202208)
作者信息
    北京联合大学,北京市信息服务工程重点实验室,北京 100101

通讯作者:

张军(1971—),女,山东临沂人,副教授,博士,研究方向为自动驾驶感知与决策技术,
参考文献
分享链接
https://castjournals.cast.org.cn/joweb/qcjs/CN/10.19620/j.cnki.1000-3703.20220668
分享至
全文二维码

扫描看全文

引用本文
BibTeX
本文的引用情况
2种不同金属材料的力学参数

Family
属数
Number of
genus
种数
Number of
species
占总种数比例
Percentage of
total species (%)

Genus
种数
Number of
species
占总种数比例
Percentage of total
species (%)
鹅膏菌科Amanitaceae 2 11 5.26 鹅膏菌属 Amanita 10 4.78
小菇科 Mycenaceae 2 12 5.74 丝盖伞属 Inocybe 5 2.39
多孔菌科 Polyporaceae 8 14 6.70 蜡蘑属 Laccaria 5 2.39
红菇科 Russulaceae 3 23 11.00 小皮伞属 Marasmius 6 2.87
小菇属 Mycena 11 5.26
光柄菇属 Pluteus 5 2.39
红菇属 Russula 17 8.13
栓菌属 Trametes 5 2.39
关闭全屏