Article(id=1149781737145852506, tenantId=1146029695717560320, journalId=1146120084050784272, issueId=1149781735614935465, articleNumber=null, orderNo=null, doi=10.19562/j.chinasae.qcgc.2025.ep.001, pmid=null, cstr=null, oa=null, hot=null, price=null, onlineType=0, articleFormat=0, articleType=null, articleTypeStr=null, receivedDate=1722960000000, receivedDateStr=2024-08-07, revisedDate=1727366400000, revisedDateStr=2024-09-27, acceptedDate=null, acceptedDateStr=null, onlineDate=1752058928046, onlineDateStr=2025-07-09, pubDate=1742832000000, pubDateStr=2025-03-25, doiRegisterDate=null, doiRegisterDateStr=null, onlineIssueDate=1752058928046, onlineIssueDateStr=2025-07-09, onlineJustAcceptDate=null, onlineJustAcceptDateStr=null, onlineFirstDate=null, onlineFirstDateStr=null, sourceXml=null, magXml=null, createTime=1752058928046, creator=13701087609, updateTime=1752058928046, updator=13701087609, issue=Issue{id=1149781735614935465, tenantId=1146029695717560320, journalId=1146120084050784272, year='2025', volume='47', issue='3', pageStart='1', pageEnd='586', issueExtLink='null', onlineDate='null', pubDate='null', beforeIssueId=null, nextIssueId=null, price=null, status=1, issueComplete=1, articleOrder=1, issueType=-1, specialIssue=0, createTime=1752058927681, creator=13701087609, updateTime=1753780706926, updator=13701087609, preIssue=null, nextIssue=null, ext={EN=IssueExt(id=1157003401239290378, tenantId=1146029695717560320, journalId=1146120084050784272, issueId=1149781735614935465, language=EN, specialIssueTitle=, coverIllustrator=, specialIssueEditor=, specialIssueAbout=), CN=IssueExt(id=1157003401239290379, tenantId=1146029695717560320, journalId=1146120084050784272, issueId=1149781735614935465, language=CN, specialIssueTitle=, coverIllustrator=, specialIssueEditor=, specialIssueAbout=)}, issueFiles=null}, startPage=430, endPage=439, ext={EN=ArticleExt(id=1149781737342984795, articleId=1149781737145852506, tenantId=1146029695717560320, journalId=1146120084050784272, language=EN, title=PolarDet: An End-to-End 3D Object Detection Algorithm in Polar Coordinates Based on Position and Semantic Information Weighting, columnId=1149809888211198868, journalTitle=Automotive Engineering, columnName=Feature Topic:Key Technologies on Intelligent and Connected Vehicles, runingTitle=null, highlight=

Traditional 3D object detection methods in Cartesian coordinate systems often overlook the symmetry and continuity of the target from different perspectives to some extent during camera image encoding due to the fixed wedge-shaped imaging geometry of in-vehicle cameras. To address this,in this paper,PolarDet,an innovative end-to-end 3D object detection method in polar coordinates based on position and semantic information weighting is proposed. This method generates BEV (Bird's Eye View) position and semantic information in polar coordinates through polar coordinate queries and predefined polar grid,which then interacts with the BEV information from the previous frame to incorporate temporal information. When outputting the final detection results,PolarDet performs a weighted sum of position and semantic information to enhance information utilization efficiency,allowing the network to achieve higher detection accuracy. Extensive experiments on the challenging BEV object detection nuScenes dataset show that the optimal model of PolarDet achieves a mAP (mean average precision) of 0.469 and an NDS (nuScenes detection score) of 0.56,significantly outperforming Cartesian coordinate-based BEV detection methods.

, articleAbstract=

Traditional 3D object detection methods in Cartesian coordinate systems often overlook the symmetry and continuity of the target from different perspectives to some extent during camera image encoding due to the fixed wedgeshaped imaging geometry of invehicle cameras. To address this, in this paper, PolarDet, an innovative endtoend 3D object detection method in polar coordinates based on position and semantic information weighting is proposed. This method generates BEV (Bird's Eye View) position and semantic information in polar coordinates through polar coordinate queries and predefined polar grid, which then interacts with the BEV information from the previous frame to incorporate temporal information. When outputting the final detection results, PolarDet performs a weighted sum of position and semantic information to enhance information utilization efficiency, allowing the network to achieve higher detection accuracy. Extensive experiments on the challenging BEV object detection nuScenes dataset show that the optimal model of PolarDet achieves a mAP (mean average precision) of 0.469 and an NDS (nuScenes detection score) of 0.56, significantly outperforming Cartesian coordinatebased BEV detection methods.

, correspAuthors=Peicheng Shi, authorNote=null, correspAuthorsNote=null, copyrightStatement=null, copyrightOwner=null, extLink=null, articleAbsUrl=null, sourceXml=null, magXml=null, pdfUrl=null, pdf=null, pdfFileSize=null, pdfExtLink=null, richHtmlUrl=null, mobilePdfUrl=null, reviewReport=null, pdfFirstPage=null, abstractGraph=null, abstractGraphContent=null, abstractVideo=null, citation=null, cebUrl=null, magXmlContent=null, mapNumber=null, authorCompany=null, fund=null, authors=null, authorsList=Peicheng Shi, Runshuai Ge, Chakir Chadia, Xinlong Dong, Taonian Liang, Aixi Yang), CN=ArticleExt(id=1149781748910875287, articleId=1149781737145852506, tenantId=1146029695717560320, journalId=1146120084050784272, language=CN, title=PolarDet:基于位置与语义信息加权的极坐标BEV端到端3D目标检测算法*, columnId=1149809888341222293, journalTitle=汽车工程, columnName=专题:汽车智能化关键技术, runingTitle=null, highlight=

传统的笛卡尔坐标系下的3D目标检测方法因车载相机的固定楔形成像几何,导致相机图像编码时在一定程度上忽视了目标在不同视角下的对称性和连续性。鉴于此,本文提出一种基于位置与语义信息加权的极坐标BEV端到端3D目标检测方法—PolarDet。该方法通过极坐标查询与预定义的极坐标网格生成极坐标下的BEV位置与语义信息,再与前一帧的BEV信息进行特征交互以融入时间信息;在输出最终检测结果时,PolarDet再对位置与语义信息进行加权求和,以提高信息的利用效率,使网络能够达到更高的检测精度。本文在具有挑战性的BEV目标检测nuScenes数据集上进行了广泛的实验,结果表明,PolarDet最优模型的mAP(平均精度)达到0.469,NDS(nuScenes检测得分)达到0.56,显著优于基于笛卡尔坐标的BEV目标检测方法。

, articleAbstract=

传统的笛卡尔坐标系下的3D目标检测方法因车载相机的固定楔形成像几何,导致相机图像编码时在一定程度上忽视了目标在不同视角下的对称性和连续性。鉴于此,本文提出一种基于位置与语义信息加权的极坐标BEV端到端3D目标检测方法—PolarDet。该方法通过极坐标查询与预定义的极坐标网格生成极坐标下的BEV位置与语义信息,再与前一帧的BEV信息进行特征交互以融入时间信息;在输出最终检测结果时,PolarDet再对位置与语义信息进行加权求和,以提高信息的利用效率,使网络能够达到更高的检测精度。本文在具有挑战性的BEV目标检测 nuScenes数据集上进行了广泛的实验,结果表明,PolarDet最优模型的mAP(平均精度)达到0.469, NDS(nuScenes 检测得分)达到0.56,显著优于基于笛卡尔坐标的BEV目标检测方法。

, correspAuthors=时培成, authorNote=null, correspAuthorsNote=
时培成,教授,博士,E-mail:
, copyrightStatement=null, copyrightOwner=null, extLink=null, articleAbsUrl=null, sourceXml=XCWqz6xB0qOL+cYvm5BRRA==, magXml=VNLNhU5j+4wojigaeb14/w==, pdfUrl=null, pdf=znA4j7hxsOMYPEtoWJJB2A==, pdfFileSize=null, pdfExtLink=null, richHtmlUrl=null, mobilePdfUrl=null, reviewReport=null, pdfFirstPage=null, abstractGraph=null, abstractGraphContent=null, abstractVideo=null, citation=null, cebUrl=null, magXmlContent=IDXIvdzMXkB2bfeFxX3tKA==, mapNumber=null, authorCompany=null, fund=null, authors=null, authorsList=时培成, 戈润帅, Chadia Chakir, 董心龙, 梁涛年, 杨爱喜)}, authors=[Author(id=1170296961904161074, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, orderNo=0, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=shipeicheng@ahpu.edu.cn, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1170296962013212980, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, authorId=1170296961904161074, language=EN, stringName=Peicheng Shi, firstName=Peicheng, middleName=null, lastName=Shi, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, address=1 School of Mechanical and Automotive Engineering,Anhui Polytechnic University,Wuhu 241000, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1170296962092904757, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, authorId=1170296961904161074, language=CN, stringName=时培成, firstName=培成, middleName=null, lastName=时, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, address=1 安徽工程大学机械与汽车工程学院,芜湖 241000, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1170296961434399014, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, xref=1, ext=[AuthorCompanyExt(id=1170296961455370535, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, companyId=1170296961434399014, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1 School of Mechanical and Automotive Engineering,Anhui Polytechnic University,Wuhu 241000), AuthorCompanyExt(id=1170296961459564840, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, companyId=1170296961434399014, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1 安徽工程大学机械与汽车工程学院,芜湖 241000)])]), Author(id=1170296962151625015, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, orderNo=1, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=null, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1170296962264871225, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, authorId=1170296962151625015, language=EN, stringName=Runshuai Ge, firstName=Runshuai, middleName=null, lastName=Ge, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, address=1 School of Mechanical and Automotive Engineering,Anhui Polytechnic University,Wuhu 241000, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1170296962361340218, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, authorId=1170296962151625015, language=CN, stringName=戈润帅, firstName=润帅, middleName=null, lastName=戈, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, address=1 安徽工程大学机械与汽车工程学院,芜湖 241000, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1170296961434399014, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, xref=1, ext=[AuthorCompanyExt(id=1170296961455370535, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, companyId=1170296961434399014, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1 School of Mechanical and Automotive Engineering,Anhui Polytechnic University,Wuhu 241000), AuthorCompanyExt(id=1170296961459564840, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, companyId=1170296961434399014, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1 安徽工程大学机械与汽车工程学院,芜湖 241000)])]), Author(id=1170296962424254780, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, orderNo=2, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=null, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1170296962508140862, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, authorId=1170296962424254780, language=EN, stringName=Chakir Chadia, firstName=Chakir, middleName=null, lastName=Chadia, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, address=1 School of Mechanical and Automotive Engineering,Anhui Polytechnic University,Wuhu 241000, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1170296962596221247, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, authorId=1170296962424254780, language=CN, stringName=Chadia Chakir, firstName=Chadia, middleName=null, lastName=Chakir, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, address=1 安徽工程大学机械与汽车工程学院,芜湖 241000, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1170296961434399014, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, xref=1, ext=[AuthorCompanyExt(id=1170296961455370535, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, companyId=1170296961434399014, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1 School of Mechanical and Automotive Engineering,Anhui Polytechnic University,Wuhu 241000), AuthorCompanyExt(id=1170296961459564840, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, companyId=1170296961434399014, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1 安徽工程大学机械与汽车工程学院,芜湖 241000)])]), Author(id=1170296962667524417, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, orderNo=3, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=null, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1170296962768187715, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, authorId=1170296962667524417, language=EN, stringName=Xinlong Dong, firstName=Xinlong, middleName=null, lastName=Dong, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, address=1 School of Mechanical and Automotive Engineering,Anhui Polytechnic University,Wuhu 241000, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1170296962856268100, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, authorId=1170296962667524417, language=CN, stringName=董心龙, firstName=心龙, middleName=null, lastName=董, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, address=1 安徽工程大学机械与汽车工程学院,芜湖 241000, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1170296961434399014, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, xref=1, ext=[AuthorCompanyExt(id=1170296961455370535, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, companyId=1170296961434399014, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1 School of Mechanical and Automotive Engineering,Anhui Polytechnic University,Wuhu 241000), AuthorCompanyExt(id=1170296961459564840, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, companyId=1170296961434399014, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1 安徽工程大学机械与汽车工程学院,芜湖 241000)])]), Author(id=1170296962914988358, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, orderNo=4, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=null, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1170296962977902920, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, authorId=1170296962914988358, language=EN, stringName=Taonian Liang, firstName=Taonian, middleName=null, lastName=Liang, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=2, address=2 Chery New Energy Automobile Co.,Ltd.,Wuhu 241000, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1170296963049206089, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, authorId=1170296962914988358, language=CN, stringName=梁涛年, firstName=涛年, middleName=null, lastName=梁, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=2, address=2 奇瑞新能源汽车股份有限公司,芜湖 241000, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1170296961556033833, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, xref=2, ext=[AuthorCompanyExt(id=1170296961564422442, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, companyId=1170296961556033833, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=2 Chery New Energy Automobile Co.,Ltd.,Wuhu 241000), AuthorCompanyExt(id=1170296961572811051, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, companyId=1170296961556033833, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=2 奇瑞新能源汽车股份有限公司,芜湖 241000)])]), Author(id=1170296963254726987, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, orderNo=5, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=null, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1170296963388944717, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, authorId=1170296963254726987, language=EN, stringName=Aixi Yang, firstName=Aixi, middleName=null, lastName=Yang, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=3, address=3 Polytechnic Institute,Zhejiang University,Hangzhou 310015, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1170296963518968142, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, authorId=1170296963254726987, language=CN, stringName=杨爱喜, firstName=爱喜, middleName=null, lastName=杨, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=3, address=3 浙江大学工程师学院,杭州 310015, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1170296961694445869, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, xref=3, ext=[AuthorCompanyExt(id=1170296961732194606, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, companyId=1170296961694445869, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=3 Polytechnic Institute,Zhejiang University,Hangzhou 310015), AuthorCompanyExt(id=1170296961769943343, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, companyId=1170296961694445869, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=3 浙江大学工程师学院,杭州 310015)])])], keywords=[Keyword(id=1170296963695128911, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, language=EN, orderNo=1, keyword=polar coordinates), Keyword(id=1170296963808375120, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, language=EN, orderNo=2, keyword=BEV object detection), Keyword(id=1170296963867095377, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, language=EN, orderNo=3, keyword=position and semantic information), Keyword(id=1170296963917427026, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, language=EN, orderNo=4, keyword=feature weighting), Keyword(id=1170296963976147283, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, language=EN, orderNo=5, keyword=cross-plane encoder), Keyword(id=1170296964026478932, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, language=CN, orderNo=1, keyword=极坐标), Keyword(id=1170296964118753621, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, language=CN, orderNo=2, keyword=BEV目标检测), Keyword(id=1170296964231999830, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, language=CN, orderNo=3, keyword=位置和语义信息), Keyword(id=1170296964320080215, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, language=CN, orderNo=4, keyword=特征加权), Keyword(id=1170296964483658072, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, language=CN, orderNo=5, keyword=跨平面编码器)], refs=[Reference(id=1170296967021212017, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, doi=null, pmid=null, pmcid=null, year=null, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[1], rfOrder=0, authorNames=null, journalName=null, refType=null, unstructuredReference=LI Z,WANG W,LI H,et al. BEVFormer:learning bird’s-eye-view representation from multi-camera images via spatiotemporal transformers[J/OL]. arXiv,2022[2023-10-17]. http://arxiv.org/abs/2203.17270., articleTitle=null, refAbstract=null), Reference(id=1170296967079932274, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, doi=null, pmid=null, pmcid=null, year=null, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[2], rfOrder=1, authorNames=null, journalName=null, refType=null, unstructuredReference=JIANG Y,ZHANG L,MIAO Z,et al. PolarFormer:multi-camera 3D object detection with polar transformer[J/OL]. Proceedings of the AAAI Conference on Artificial Intelligence,2023,37(1):1042-1050[2024-01-16]. https://ojs.aaai.org/index.php/AAAI/article/view/25185. DOI:10.1609/aaai.v37i1.25185., articleTitle=null, refAbstract=null), Reference(id=1170296967134458227, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, doi=null, pmid=null, pmcid=null, year=null, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[3], rfOrder=2, authorNames=null, journalName=null, refType=null, unstructuredReference=PHILION J,FIDLER S. Lift,Splat,Shoot:encoding images from arbitrary camera rigs by implicitly unprojecting to 3D[J/OL]. arXiv,2020[2023-10-25]. http://arxiv.org/abs/2008.05711., articleTitle=null, refAbstract=null), Reference(id=1170296967247704436, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, doi=null, pmid=null, pmcid=null, year=null, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[4], rfOrder=3, authorNames=null, journalName=null, refType=null, unstructuredReference=HUANG J,HUANG G,ZHU Z,et al. BEVDet:high-performance multi-camera 3D object detection in bird-eye-view[J/OL]. arXiv,2022[2023-10-17]. http://arxiv.org/abs/2112.11790., articleTitle=null, refAbstract=null), Reference(id=1170296967369339253, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, doi=null, pmid=null, pmcid=null, year=null, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[5], rfOrder=4, authorNames=null, journalName=null, refType=null, unstructuredReference=READING C,HARAKEH A,CHAE J,et al. Categorical depth distribution network for monocular 3D object detection[J/OL]. arXiv,2021[2023-12-16]. http://arxiv.org/abs/2103.01100., articleTitle=null, refAbstract=null), Reference(id=1170296967444836726, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, doi=null, pmid=null, pmcid=null, year=null, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[6], rfOrder=5, authorNames=null, journalName=null, refType=null, unstructuredReference=VASWANI A,SHAZEER N,PARMAR N,et al. Attention is all you need[J/OL]. arXiv,2023[2023-10-06]. http://arxiv.org/abs/1706.03762. DOI:10.48550/arXiv.1706.03762., articleTitle=null, refAbstract=null), Reference(id=1170296967524528503, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, doi=null, pmid=null, pmcid=null, year=null, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[7], rfOrder=6, authorNames=null, journalName=null, refType=null, unstructuredReference=ZHU X,SU W,LU L,et al. Deformable DETR:deformable transformers for end-to-end object detection[J/OL]. arXiv,2021[2024-03-07]. http://arxiv.org/abs/2010.04159., articleTitle=null, refAbstract=null), Reference(id=1170296967625191800, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, doi=null, pmid=null, pmcid=null, year=null, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[8], rfOrder=7, authorNames=null, journalName=null, refType=null, unstructuredReference=CARION N,MASSA F,SYNNAEVE G,et al. End-to-end object detection with transformers[C/OL]//VEDALDI A,BISCHOF H,BROX T,et al. Computer Vision-ECCV 2020. Cham:Springer International Publishing,2020:213-229. DOI:10.1007/978-3-030-58452-8_13., articleTitle=null, refAbstract=null), Reference(id=1170296967813935481, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, doi=null, pmid=null, pmcid=null, year=null, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[9], rfOrder=8, authorNames=null, journalName=null, refType=null, unstructuredReference=BEAL J,KIM E,TZENG E,et al. Toward transformer-based object detection[J/OL]. arXiv,2020[2023-11-20]. http://arxiv.org/abs/2012.09958. DOI:10.48550/arXiv.2012.09958., articleTitle=null, refAbstract=null), Reference(id=1170296967960736122, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, doi=null, pmid=null, pmcid=null, year=null, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[10], rfOrder=9, authorNames=null, journalName=null, refType=null, unstructuredReference=LIU S,LI F,ZHANG H,et al. DAB-DETR:dynamic anchor boxes are better queries for DETR[J/OL]. arXiv,2022[2023-11-25]. http://arxiv.org/abs/2201.12329., articleTitle=null, refAbstract=null), Reference(id=1170296968057205115, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, doi=null, pmid=null, pmcid=null, year=null, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[11], rfOrder=10, authorNames=null, journalName=null, refType=null, unstructuredReference=DOSOVITSKIY A,BEYER L,KOLESNIKOV A,et al. An image is worth 16x16 words:transformers for image recognition at scale[J]. 2020.DOI:10.48550/arXiv.2010.11929. ., articleTitle=null, refAbstract=null), Reference(id=1170296968149479804, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, doi=null, pmid=null, pmcid=null, year=null, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[12], rfOrder=11, authorNames=null, journalName=null, refType=null, unstructuredReference=LIU Y,WANG T,ZHANG X,et al. PETR:position embedding transformation for multi-view 3D object detection[J/OL]. arXiv,2022[2023-10-25]. http://arxiv.org/abs/2203.05625., articleTitle=null, refAbstract=null), Reference(id=1170296968254337405, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, doi=null, pmid=null, pmcid=null, year=null, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[13], rfOrder=12, authorNames=null, journalName=null, refType=null, unstructuredReference=WANG Y,GUIZILINI V,ZHANG T,et al. DETR3D:3D object detection from multi-view images via 3D-to-2D queries[J/OL]. arXiv,2021[2024-01-16]. http://arxiv.org/abs/2110.06922., articleTitle=null, refAbstract=null), Reference(id=1170296968359195006, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, doi=null, pmid=null, pmcid=null, year=null, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[14], rfOrder=13, authorNames=null, journalName=null, refType=null, unstructuredReference=CHEN S,WANG X,CHENG T,et al. Polar parametrization for vision-based surround-view 3D detection[J/OL]. arXiv,2022[2024-02-28]. http://arxiv.org/abs/2206.10965., articleTitle=null, refAbstract=null), Reference(id=1170296968447275391, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, doi=null, pmid=null, pmcid=null, year=null, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[15], rfOrder=14, authorNames=null, journalName=null, refType=null, unstructuredReference=RODDICK T,KENDALL A,CIPOLLA R. Orthographic feature transform for monocular 3D object detection[J/OL]. arXiv,2018[2023-12-16]. http://arxiv.org/abs/1811.08188., articleTitle=null, refAbstract=null), Reference(id=1170296968535355776, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, doi=null, pmid=null, pmcid=null, year=null, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[16], rfOrder=15, authorNames=null, journalName=null, refType=null, unstructuredReference=HUANG J,HUANG G. BEVDet4D:exploit temporal cues in multi-camera 3D object detection[J/OL]. arXiv,2022[2024-07-09]. http://arxiv.org/abs/2203.17054., articleTitle=null, refAbstract=null), Reference(id=1170296968602464641, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, doi=null, pmid=null, pmcid=null, year=null, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[17], rfOrder=16, authorNames=null, journalName=null, refType=null, unstructuredReference=YANG C,CHEN Y,TIAN H,et al. BEVFormer v2:adapting modern image backbones to bird’s-eye-view recognition via perspective supervision[J/OL]. arXiv,2022[2024-04-04]. http://arxiv.org/abs/2211.10439., articleTitle=null, refAbstract=null), Reference(id=1170296968728293762, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, doi=null, pmid=null, pmcid=null, year=null, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[18], rfOrder=17, authorNames=null, journalName=null, refType=null, unstructuredReference=LI Y,GE Z,YU G,et al. BEVDepth:acquisition of reliable depth for multi-view 3D object detection[J/OL]. Proceedings of the AAAI Conference on Artificial Intelligence,2023,37(2):1477-1485[2023-11-16]. https://ojs.aaai.org/index.php/AAAI/article/view/25233. DOI:10.1609/aaai.v37i2.25233., articleTitle=null, refAbstract=null), Reference(id=1170296968782819715, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, doi=null, pmid=null, pmcid=null, year=null, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[19], rfOrder=18, authorNames=null, journalName=null, refType=null, unstructuredReference=LIN X,LIN T,PEI Z,et al. Sparse4D:multi-view 3D object detection with sparse spatial-temporal fusion[J/OL]. arXiv,2023[2024-04-11]. http://arxiv.org/abs/2211.10581., articleTitle=null, refAbstract=null), Reference(id=1170296968896065924, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, doi=null, pmid=null, pmcid=null, year=null, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[20], rfOrder=19, authorNames=null, journalName=null, refType=null, unstructuredReference=LIN X,LIN T,PEI Z,et al. Sparse4D v2:recurrent temporal fusion with sparse model[J/OL]. arXiv,2023[2024-01-16]. http://arxiv.org/abs/2305.14018., articleTitle=null, refAbstract=null), Reference(id=1170296969026089349, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, doi=null, pmid=null, pmcid=null, year=null, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[21], rfOrder=20, authorNames=null, journalName=null, refType=null, unstructuredReference=LIN T Y,DOLLÁR P,GIRSHICK R,et al. Feature pyramid networks for object detection[J/OL]. arXiv,2017[2023-11-15]. http://arxiv.org/abs/1612.03144., articleTitle=null, refAbstract=null), Reference(id=1170296969156112774, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, doi=null, pmid=null, pmcid=null, year=null, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[22], rfOrder=21, authorNames=null, journalName=null, refType=null, unstructuredReference=CAESAR H,BANKITI V,LANG A H,et al. nuScenes:a multimodal dataset for autonomous driving[J/OL]. arXiv,2020[2024-05-08]. http://arxiv.org/abs/1903.11027. DOI:10.48550/arXiv.1903.11027., articleTitle=null, refAbstract=null), Reference(id=1170296969273553287, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, doi=null, pmid=null, pmcid=null, year=null, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[23], rfOrder=22, authorNames=null, journalName=null, refType=null, unstructuredReference=DOLL S,SCHULZ R,SCHNEIDER L,et al. SpatialDETR:robust scalable transformer-based 3D object detection from multi-view camera images with global cross-sensor attention[M/OL]//AVIDAN S,BROSTOW G,CISSÉ M,et al. Computer vision-ECCV 2022:Vol. 13699. Cham:Springer Nature Switzerland,2022:230-245[2024-06-23]. https://link.springer.com/10.1007/978-3-031-19842-7_14. DOI:10.1007/978-3-031-19842-7_14., articleTitle=null, refAbstract=null), Reference(id=1170296969378410888, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, doi=null, pmid=null, pmcid=null, year=null, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[24], rfOrder=23, authorNames=null, journalName=null, refType=null, unstructuredReference=LIU Y,YAN J,JIA F,et al. PETRv2:a unified framework for 3D perception from multi-camera images[C/OL]//2023 IEEE/CVF International Conference on Computer Vision (ICCV). Paris,France:IEEE,2023:3239-3249[2024-02-27]. https://ieeexplore.ieee.org/document/10377268/. DOI:10.1109/ICCV51070.2023.00302., articleTitle=null, refAbstract=null), Reference(id=1170296969470685577, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, doi=null, pmid=null, pmcid=null, year=null, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[25], rfOrder=24, authorNames=null, journalName=null, refType=null, unstructuredReference=ROH W,CHANG G,MOON S,et al. ORA3D:overlap region aware multi-view 3D object detection[J/OL]. arXiv,2023[2024-06-23]. http://arxiv.org/abs/2207.00865., articleTitle=null, refAbstract=null), Reference(id=1170296969550377354, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, doi=null, pmid=null, pmcid=null, year=null, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[26], rfOrder=25, authorNames=null, journalName=null, refType=null, unstructuredReference=HE K,ZHANG X,REN S,et al. Deep residual learning for image recognition[J/OL]. arXiv,2015[2024-05-08]. http://arxiv.org/abs/1512.03385. DOI:10.48550/arXiv.1512.03385., articleTitle=null, refAbstract=null), Reference(id=1170296969730732427, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, doi=null, pmid=null, pmcid=null, year=null, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[27], rfOrder=26, authorNames=null, journalName=null, refType=null, unstructuredReference=WANG T,ZHU X,PANG J,et al. FCOS3D:fully convolutional one-stage monocular 3D object detection[J/OL]. arXiv,2021[2023-10-17]. http://arxiv.org/abs/2104.10956., articleTitle=null, refAbstract=null), Reference(id=1170296969802035596, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, doi=null, pmid=null, pmcid=null, year=null, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[28], rfOrder=27, authorNames=null, journalName=null, refType=null, unstructuredReference=LOSHCHILOV I,HUTTER F. Decoupled weight decay regularization[J/OL]. arXiv,2019[2024-06-14]. http://arxiv.org/abs/1711.05101., articleTitle=null, refAbstract=null)], funds=[Fund(id=1170296966358511981, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, awardId=2023CSJGG1600, language=CN, fundingSource=*中央引导地方科技专项-长三角科技创新共同体联合攻关计划项目(2023CSJGG1600), fundOrder=null, country=null), Fund(id=1170296966400455022, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, awardId=2208085MF173, language=CN, fundingSource=安徽省自然科学基金面上项目(2208085MF173), fundOrder=null, country=null), Fund(id=1170296966459175279, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, awardId=2023zd01, language=CN, fundingSource=芜湖市“赤铸之光”重大科技项目资助。(2023zd01), fundOrder=null, country=null), Fund(id=1170296966610170224, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, awardId=2023zd03, language=CN, fundingSource=芜湖市“赤铸之光”重大科技项目资助。(2023zd03), fundOrder=null, country=null)], companyList=[AuthorCompany(id=1170296961434399014, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, xref=1, ext=[AuthorCompanyExt(id=1170296961455370535, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, companyId=1170296961434399014, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1 School of Mechanical and Automotive Engineering,Anhui Polytechnic University,Wuhu 241000), AuthorCompanyExt(id=1170296961459564840, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, companyId=1170296961434399014, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1 安徽工程大学机械与汽车工程学院,芜湖 241000)]), AuthorCompany(id=1170296961556033833, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, xref=2, ext=[AuthorCompanyExt(id=1170296961564422442, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, companyId=1170296961556033833, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=2 Chery New Energy Automobile Co.,Ltd.,Wuhu 241000), AuthorCompanyExt(id=1170296961572811051, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, companyId=1170296961556033833, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=2 奇瑞新能源汽车股份有限公司,芜湖 241000)]), AuthorCompany(id=1170296961694445869, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, xref=3, ext=[AuthorCompanyExt(id=1170296961732194606, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, companyId=1170296961694445869, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=3 Polytechnic Institute,Zhejiang University,Hangzhou 310015), AuthorCompanyExt(id=1170296961769943343, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, companyId=1170296961694445869, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=3 浙江大学工程师学院,杭州 310015)])], figs=[ArticleFig(id=1170296964710150489, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, language=EN, label=null, caption=null, figureFileSmall=DJ01l40w1XMSJjuvL+IriQ==, figureFileBig=XSrotrjJyr2PCVZykvSdNA==, tableContent=null), ArticleFig(id=1170296964764676442, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, language=CN, label=图1, caption=PolarFormer[2]与PolarDet的检测结果对比(黄色实线圈出的是PolarFormer[2]的误检结果,黄色虚线圈出的是PolarDet在相同位置处的检测结果), figureFileSmall=DJ01l40w1XMSJjuvL+IriQ==, figureFileBig=XSrotrjJyr2PCVZykvSdNA==, tableContent=null), ArticleFig(id=1170296964840173915, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, language=EN, label=null, caption=null, figureFileSmall=w70CBCZVXv3yJDX2pYMc/A==, figureFileBig=JrMVU4nNfdhwjekG0+BypA==, tableContent=null), ArticleFig(id=1170296964903088476, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, language=CN, label=图2, caption=PolarDet网络结构, figureFileSmall=w70CBCZVXv3yJDX2pYMc/A==, figureFileBig=JrMVU4nNfdhwjekG0+BypA==, tableContent=null), ArticleFig(id=1170296964995363165, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, language=EN, label=null, caption=null, figureFileSmall=834N68Yt4WsAS79trGm85Q==, figureFileBig=c7xKpA6sF3pvdXzWaWI3uw==, tableContent=null), ArticleFig(id=1170296965066666334, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, language=CN, label=图3, caption=PolarDet的跨平面编码器、多层级BEV特征编码器与多层级BEV特征解码器结构, figureFileSmall=834N68Yt4WsAS79trGm85Q==, figureFileBig=c7xKpA6sF3pvdXzWaWI3uw==, tableContent=null), ArticleFig(id=1170296965129580895, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, language=EN, label=null, caption=null, figureFileSmall=l8lXpQbEjQLnS23FtEK3ww==, figureFileBig=fjPqlcV+f5ndTB+T8uz3tQ==, tableContent=null), ArticleFig(id=1170296965217661280, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, language=CN, label=图4, caption=不同场景下的PolarDet检测结果(为便于查看,BEV使用笛卡尔坐标系绘制;蓝色框代表检测结果,绿色框代表真实标签), figureFileSmall=l8lXpQbEjQLnS23FtEK3ww==, figureFileBig=fjPqlcV+f5ndTB+T8uz3tQ==, tableContent=null), ArticleFig(id=1170296965272187233, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, language=EN, label=null, caption=null, figureFileSmall=gNJjJNX9XlsXPuvZ29OdyA==, figureFileBig=6WVHlOCv6foKCqpZSq87xQ==, tableContent=null), ArticleFig(id=1170296965318324578, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, language=CN, label=图5, caption=PolarDet不同权重对性能的影响, figureFileSmall=gNJjJNX9XlsXPuvZ29OdyA==, figureFileBig=6WVHlOCv6foKCqpZSq87xQ==, tableContent=null), ArticleFig(id=1170296965402210659, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, language=EN, label=null, caption=null, figureFileSmall=null, figureFileBig=null, tableContent=
方法 坐标系 骨干网络 mAP NDS mATE mASE mAOE mAVE mAAE
BEVFormer[1] 笛卡尔 ResNet101 0.445 0.535 0.631 0.257 0.405 0.435 0.143
BEVDet-Beta[4] 笛卡尔 VovNet 0.422 0.482 0.529 0.236 0.396 0.979 0.152
SpatialDETR[23] 笛卡尔 ResNet101 0.424 0.486 0.613 0.253 0.402 0.857 0.131
PETR[12] 笛卡尔 ResNet101 0.366 0.441 0.717 0.267 0.412 0.834 0.190
DETR3D[13] 笛卡尔 ResNet101 0.349 0.434 0.716 0.268 0.379 0.842 0.200
PETRv2[24] 笛卡尔 ResNet101 0.421 0.524 0.681 0.267 0.357 0.377 0.186
ORA3D[25] 笛卡尔 VovNet 0.423 0.489 0.595 0.254 0.392 0.851 0.128
PolarFormer-T[2] 极坐标 ResNet101 0.457 0.543 0.612 0.257 0.392 0.467 0.129
PolarDETR-T[14] 极坐标 ResNet101 0.383 0.488 0.707 0.269 0.344 0.518 0.196
PolarDet 极坐标 ResNet101 0.469 0.560 0.593 0.218 0.313 0.470 0.113
), ArticleFig(id=1170296965494485348, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, language=CN, label=表1, caption=

PolarDet与先进网络的对比

, figureFileSmall=null, figureFileBig=null, tableContent=
方法 坐标系 骨干网络 mAP NDS mATE mASE mAOE mAVE mAAE
BEVFormer[1] 笛卡尔 ResNet101 0.445 0.535 0.631 0.257 0.405 0.435 0.143
BEVDet-Beta[4] 笛卡尔 VovNet 0.422 0.482 0.529 0.236 0.396 0.979 0.152
SpatialDETR[23] 笛卡尔 ResNet101 0.424 0.486 0.613 0.253 0.402 0.857 0.131
PETR[12] 笛卡尔 ResNet101 0.366 0.441 0.717 0.267 0.412 0.834 0.190
DETR3D[13] 笛卡尔 ResNet101 0.349 0.434 0.716 0.268 0.379 0.842 0.200
PETRv2[24] 笛卡尔 ResNet101 0.421 0.524 0.681 0.267 0.357 0.377 0.186
ORA3D[25] 笛卡尔 VovNet 0.423 0.489 0.595 0.254 0.392 0.851 0.128
PolarFormer-T[2] 极坐标 ResNet101 0.457 0.543 0.612 0.257 0.392 0.467 0.129
PolarDETR-T[14] 极坐标 ResNet101 0.383 0.488 0.707 0.269 0.344 0.518 0.196
PolarDet 极坐标 ResNet101 0.469 0.560 0.593 0.218 0.313 0.470 0.113
), ArticleFig(id=1170296965616120165, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, language=EN, label=null, caption=null, figureFileSmall=null, figureFileBig=null, tableContent=
方法

是否使用

多层级

特征

特征所在

坐标系

预测所在

坐标系

mAP NDS
PolarFormer-CC-s 笛卡尔 笛卡尔 0.381 0.449
PolarDet-CC-s 笛卡尔 笛卡尔 0.426 0.531
PolarFormer-PC-s 极坐标 笛卡尔 0.388 0.450
PolarDet-PC-s 极坐标 笛卡尔 0.430 0.535
PolarFormer-s 极坐标 极坐标 0.391 0.458
PolarDet-s 极坐标 极坐标 0.433 0.537
PolarFormer-CC 笛卡尔 笛卡尔 0.381 0.450
PolarDet-CC 笛卡尔 笛卡尔 0.453 0.550
PolarFormer-PC 极坐标 笛卡尔 0.385 0.455
PolarDet-PC 极坐标 笛卡尔 0.461 0.557
PolarFormer 极坐标 极坐标 0.396 0.458
PolarDet 极坐标 极坐标 0.469 0.560
), ArticleFig(id=1170296965704200550, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, language=CN, label=表2, caption=

PolarDet与PolarFormer检测结果对比

, figureFileSmall=null, figureFileBig=null, tableContent=
方法

是否使用

多层级

特征

特征所在

坐标系

预测所在

坐标系

mAP NDS
PolarFormer-CC-s 笛卡尔 笛卡尔 0.381 0.449
PolarDet-CC-s 笛卡尔 笛卡尔 0.426 0.531
PolarFormer-PC-s 极坐标 笛卡尔 0.388 0.450
PolarDet-PC-s 极坐标 笛卡尔 0.430 0.535
PolarFormer-s 极坐标 极坐标 0.391 0.458
PolarDet-s 极坐标 极坐标 0.433 0.537
PolarFormer-CC 笛卡尔 笛卡尔 0.381 0.450
PolarDet-CC 笛卡尔 笛卡尔 0.453 0.550
PolarFormer-PC 极坐标 笛卡尔 0.385 0.455
PolarDet-PC 极坐标 笛卡尔 0.461 0.557
PolarFormer 极坐标 极坐标 0.396 0.458
PolarDet 极坐标 极坐标 0.469 0.560
), ArticleFig(id=1170296965775503719, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, language=EN, label=null, caption=null, figureFileSmall=null, figureFileBig=null, tableContent=
方法

是否

加权

维度

数量

是否嵌入

时间信息

mAP NDS
PolarFormer 256 0.396 0.458
PolarDet-E-NT 512 0.402 0.460
PolarDet-E 512 0.455 0.539
PolarDet-NT 512 0.437 0.479
PolarDet 512 0.469 0.560
), ArticleFig(id=1170296965892944232, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, language=CN, label=表3, caption=

PolarDet是否施加权重与是否使用时间信息的消融实验

, figureFileSmall=null, figureFileBig=null, tableContent=
方法

是否

加权

维度

数量

是否嵌入

时间信息

mAP NDS
PolarFormer 256 0.396 0.458
PolarDet-E-NT 512 0.402 0.460
PolarDet-E 512 0.455 0.539
PolarDet-NT 512 0.437 0.479
PolarDet 512 0.469 0.560
), ArticleFig(id=1170296965964247401, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, language=EN, label=null, caption=null, figureFileSmall=null, figureFileBig=null, tableContent=

语义

信息-C

位置

信息-C

语义

信息-B

位置

信息-B

mAP NDS
0.8 0.2 0.5 0.5 0.461 0.550
0.8 0.2 0.4 0.6 0.466 0.558
0.8 0.2 0.3 0.7 0.463 0.550
0.8 0.2 0.6 0.4 0.453 0.536
0.8 0.2 0.7 0.3 0.441 0.526
0.7 0.3 0.5 0.5 0.463 0.557
0.7 0.3 0.4 0.6 0.469 0.560
0.7 0.3 0.3 0.7 0.464 0.552
0.7 0.3 0.6 0.4 0.454 0.533
0.7 0.3 0.7 0.3 0.444 0.529
0.6 0.4 0.5 0.5 0.460 0.553
0.6 0.4 0.4 0.6 0.464 0.556
0.6 0.4 0.3 0.7 0.463 0.555
0.6 0.4 0.6 0.4 0.452 0.540
0.6 0.4 0.7 0.3 0.440 0.523
0.5 0.5 0.5 0.5 0.455 0.539
0.5 0.5 0.4 0.6 0.458 0.548
0.5 0.5 0.3 0.7 0.456 0.541
0.5 0.5 0.6 0.4 0.443 0.530
0.5 0.5 0.7 0.3 0.440 0.520
0.4 0.6 0.5 0.5 0.442 0.528
0.4 0.6 0.4 0.6 0.445 0.530
0.4 0.6 0.3 0.7 0.436 0.527
0.4 0.6 0.6 0.4 0.423 0.518
0.4 0.6 0.7 0.3 0.421 0.509
), ArticleFig(id=1170296966039744874, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, language=CN, label=表4, caption=

PolarDet不同权重对性能的影响

, figureFileSmall=null, figureFileBig=null, tableContent=

语义

信息-C

位置

信息-C

语义

信息-B

位置

信息-B

mAP NDS
0.8 0.2 0.5 0.5 0.461 0.550
0.8 0.2 0.4 0.6 0.466 0.558
0.8 0.2 0.3 0.7 0.463 0.550
0.8 0.2 0.6 0.4 0.453 0.536
0.8 0.2 0.7 0.3 0.441 0.526
0.7 0.3 0.5 0.5 0.463 0.557
0.7 0.3 0.4 0.6 0.469 0.560
0.7 0.3 0.3 0.7 0.464 0.552
0.7 0.3 0.6 0.4 0.454 0.533
0.7 0.3 0.7 0.3 0.444 0.529
0.6 0.4 0.5 0.5 0.460 0.553
0.6 0.4 0.4 0.6 0.464 0.556
0.6 0.4 0.3 0.7 0.463 0.555
0.6 0.4 0.6 0.4 0.452 0.540
0.6 0.4 0.7 0.3 0.440 0.523
0.5 0.5 0.5 0.5 0.455 0.539
0.5 0.5 0.4 0.6 0.458 0.548
0.5 0.5 0.3 0.7 0.456 0.541
0.5 0.5 0.6 0.4 0.443 0.530
0.5 0.5 0.7 0.3 0.440 0.520
0.4 0.6 0.5 0.5 0.442 0.528
0.4 0.6 0.4 0.6 0.445 0.530
0.4 0.6 0.3 0.7 0.436 0.527
0.4 0.6 0.6 0.4 0.423 0.518
0.4 0.6 0.7 0.3 0.421 0.509
), ArticleFig(id=1170296966115242347, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, language=EN, label=null, caption=null, figureFileSmall=null, figureFileBig=null, tableContent=
权重分配

分类预测结果

NDS平均值

回归预测结果

NDS平均值

(0.8,0.2) 0.545 4
(0.7,0.3) 0.546 2 0.521 4
(0.6,0.4) 0.545 4 0.531 4
(0.5,0.5) 0.535 6 0.545 4
(0.4,0.6) 0.522 4 0.550 4
(0.3,0.7) 0.545 0
), ArticleFig(id=1170296966194934124, tenantId=1146029695717560320, journalId=1146120084050784272, articleId=1149781737145852506, language=CN, label=表5, caption=

各种权重分配下的不同任务中的NDS平均值

, figureFileSmall=null, figureFileBig=null, tableContent=
权重分配

分类预测结果

NDS平均值

回归预测结果

NDS平均值

(0.8,0.2) 0.545 4
(0.7,0.3) 0.546 2 0.521 4
(0.6,0.4) 0.545 4 0.531 4
(0.5,0.5) 0.535 6 0.545 4
(0.4,0.6) 0.522 4 0.550 4
(0.3,0.7) 0.545 0
)], attaches=null, journal=Journal(id=1146119049450201092, delFlag=0, nameCn=汽车工程, nameEn=Automotive Engineering, nameHistory1=null, nameHistory2=null, issn=1000-680X, eissn=, cn=11-2221/U, coden=null, periodic=0, language=CN, oaType=否, ccby=null, superviseOffice=null, ownerOffice=null, pubOffice=null, editorOffice=null, officeType=null, aims=null, clcCode=null, officeProv=null, officeCity=null, officeAddr=null, officeZip=null, officeEmail=null, officePhone=null, editDirector=null, officeDirector=null, officeDirectorPhone=null, officeStaffNum=null, officeEmpNum=null, coverPicUrl=QBBRQev7wkMVPuUPGz0mFw==, journalPrice=null, startedYear=null, abbrevIsoEn=Auto Eng, journalRemark=null, publicationField=null, createdTime=null, updatedTime=1755587219741, createdBy=null, updatedBy=15831073675, firstLetterCn=A, firstLetterEn=A, subjectCode=Engineering, subjectName=工程, subjectCodeEn=Engineering, subjectNameEn=null, picCn=QBBRQev7wkMVPuUPGz0mFw==, picEn=p+MsLQKu3DZkDibBsTBu1Q==, jcr=null, cjcr=null, exts=[JournalExt(id=1164580465202643295, language=CN, name=汽车工程, nameHistory1=null, nameHistory2=null, managedBy=, sponsoredBy=, publishedBy=, editorOffice=, officeProv=null, officeCity=null, officeAddr=, officeZip=, editDirector=null, officeDirector=null, officePhone=null, coverPicUrl=null, journalRemark=, submitArticleUrl=null, websiteUrl=https://www.qichegongcheng.com/CN/1000-680X/home.shtml, createdTime=1755587219763, updatedTime=1755587219763, createdBy=15831073675, updatedBy=15831073675, submissionGuidelinesUrl=https://www.qichegongcheng.com/CN/column/column6.shtml, submissionAuthorUrl=https://journal03.magtechjournal.com/journalx_qcgc/authorLogOn.action, submissionEditorUrl=https://journal03.magtechjournal.com/journalx_qcgc/editorLogOn.action, submissionReviewUrl=https://journal03.magtechjournal.com/journalx_qcgc/expertLogOn.action, submissionCeEditorUrl=https://journal03.magtechjournal.com/journalx_qcgc/editorInChiefLogOn.action, submissionAeEditorUrl=, option={"copyright":""}), JournalExt(id=1164580465248780640, language=EN, name=Automotive Engineering, nameHistory1=null, nameHistory2=null, managedBy=, sponsoredBy=, publishedBy=, editorOffice=, officeProv=null, officeCity=null, officeAddr=, officeZip=, editDirector=null, officeDirector=null, officePhone=null, coverPicUrl=null, journalRemark=, submitArticleUrl=null, websiteUrl=https://www.qichegongcheng.com/EN/1000-680X/home.shtml, createdTime=1755587219774, updatedTime=1755587219774, createdBy=15831073675, updatedBy=15831073675, submissionGuidelinesUrl=https://www.qichegongcheng.com/EN/column/column6.shtml, submissionAuthorUrl=https://journal03.magtechjournal.com/journalx_qcgc/authorLogOn.action, submissionEditorUrl=https://journal03.magtechjournal.com/journalx_qcgc/editorLogOn.action, submissionReviewUrl=https://journal03.magtechjournal.com/journalx_qcgc/expertLogOn.action, submissionCeEditorUrl=https://journal03.magtechjournal.com/journalx_qcgc/editorInChiefLogOn.action, submissionAeEditorUrl=, option={"copyright":""})], databaseList=null, tenantJournalId=1146120084050784272, websiteList=[Website(id=1148243202387206565, webName=null, webTitle=null, webDomain=null, webCopyrigh=null, webIpcNo=null, seoTitle=null, seoKeywords=null, seoDescription=null, tenantJournalId=null, journalId=1146120084050784272, journalNameCn=null, journalNameEn=null, grayFlag=null, tenantId=1146029695717560320, platformId=null, journalGroupId=null, journalGroupNameCn=null, journalGroupNameEn=null, type=1, domain=https://castjournals.cast.org.cn/joweb/qcygc/CN, language=CN, createTime=1751692112776, createBy=18614031015, updateTime=1753500958911, updateBy=18614031015, name=《汽车工程》中文站点, tplId=1146099689490845704, title=汽车工程, delFlag=0, indexPage=/home, props=[WebsiteProps(id=1148622315115540535, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1148243202387206565, code=articleTextType, value=kx, createTime=1751782500294, updateTime=1751782500294, creator=18614031015, updator=18614031015), WebsiteProps(id=1148622315094569012, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1148243202387206565, code=banner, value=null, createTime=1751782500289, updateTime=1751782500289, creator=18614031015, updator=18614031015), WebsiteProps(id=1148622315081986099, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1148243202387206565, code=logo, value=https://castjournals.cast.org.cn/joweb/kjdb/CN/file/pic?fileId=+W0ZN6/p6N8AvZxnX71krg==, createTime=1751782500286, updateTime=1751782500286, creator=18614031015, updator=18614031015), WebsiteProps(id=1148622315107151926, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1148243202387206565, code=picServerUrl, value=https://castjournals.cast.org.cn/joweb/kjdb/CN/file/pic, createTime=1751782500292, updateTime=1751782500292, creator=18614031015, updator=18614031015), WebsiteProps(id=1148622315102957621, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1148243202387206565, code=staticResourcePath, value=https://castjournals.cast.org.cn/joweb/cast_kjdb_cn_619/, createTime=1751782500291, updateTime=1751782500291, creator=18614031015, updator=18614031015)]), Website(id=1155829970321686531, webName=null, webTitle=null, webDomain=null, webCopyrigh=null, webIpcNo=null, seoTitle=null, seoKeywords=null, seoDescription=null, tenantJournalId=null, journalId=1146120084050784272, journalNameCn=null, journalNameEn=null, grayFlag=null, tenantId=1146029695717560320, platformId=null, journalGroupId=null, journalGroupNameCn=null, journalGroupNameEn=null, type=1, domain=https://castjournals.cast.org.cn/joweb/qcygc/EN, language=EN, createTime=1753500939211, createBy=18614031015, updateTime=1753500939211, updateBy=18614031015, name=《汽车工程》英文站点, tplId=1146101810881728533, title=Automotive Engineering, delFlag=0, indexPage=/home, props=[WebsiteProps(id=1155830904879702095, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1155829970321686531, code=articleTextType, value=kx, createTime=1753501162023, updateTime=1753501162023, creator=18614031015, updator=18614031015), WebsiteProps(id=1155830904858730572, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1155829970321686531, code=banner, value=null, createTime=1753501162018, updateTime=1753501162018, creator=18614031015, updator=18614031015), WebsiteProps(id=1155830904837759051, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1155829970321686531, code=logo, value=https://castjournals.cast.org.cn/joweb/kjdb/CN/file/pic?fileId=+W0ZN6/p6N8AvZxnX71krg==, createTime=1753501162013, updateTime=1753501162013, creator=18614031015, updator=18614031015), WebsiteProps(id=1155830904875507790, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1155829970321686531, code=picServerUrl, value=https://castjournals.cast.org.cn/joweb/kjdb/CN/file/pic, createTime=1753501162022, updateTime=1753501162022, creator=18614031015, updator=18614031015), WebsiteProps(id=1155830904867119181, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1155829970321686531, code=staticResourcePath, value=https://castjournals.cast.org.cn/joweb/cast_kjdb_cn_619/, createTime=1753501162020, updateTime=1753501162020, creator=18614031015, updator=18614031015)])], journalTitle=汽车工程, weixinUrl=null, journalUrl=null, iacademicId=null, status=0, seqNo=null, journalTitleEn=Automotive Engineering, journalPhotoCn=QBBRQev7wkMVPuUPGz0mFw==, journalPhotoEn=p+MsLQKu3DZkDibBsTBu1Q==, journalFirstLetter=A, journalRecommend=null, journalNew=null, journalCollection=null, jcrJf=null, cjcrJf=null, jcrJfStr=null, cjcrJfStr=null, submissionFirstDecision=null, sciSubjectClassification=null, casSubjectClassification=null, citeScore=null, totalCitationFrequency=null, icpCode=null, psCode=null, advertisingLicenseCode=null, copyrightInformation=null, country=null, option=, provinceCode=null, provinceName=null, collectFlag=false), detailUrlCn=https://castjournals.cast.org.cn/joweb/qcygc/CN/10.19562/j.chinasae.qcgc.2025.ep.001, detailUrlEn=https://castjournals.cast.org.cn/joweb/qcygc/EN/10.19562/j.chinasae.qcgc.2025.ep.001, pdfUrlCn=https://castjournals.cast.org.cn/joweb/qcygc/CN/PDF/10.19562/j.chinasae.qcgc.2025.ep.001, pdfUrlEn=https://castjournals.cast.org.cn/joweb/qcygc/EN/PDF/10.19562/j.chinasae.qcgc.2025.ep.001, aliStartDate=null, aliEndDate=null, collectionFlag=false, citedCount=null, citedUrl=null, reference=null)
收藏切换
PolarDet:基于位置与语义信息加权的极坐标BEV端到端3D目标检测算法*
收藏切换
PDF下载
时培成 1 , 戈润帅 1 , Chadia Chakir 1 , 董心龙 1 , 梁涛年 2 , 杨爱喜 3
汽车工程 | 专题:汽车智能化关键技术 2025,47(3): 430-439
收起
收藏切换
汽车工程 | 专题:汽车智能化关键技术 2025, 47(3): 430-439
PolarDet:基于位置与语义信息加权的极坐标BEV端到端3D目标检测算法*
全屏
时培成1 , 戈润帅1, Chadia Chakir1, 董心龙1, 梁涛年2, 杨爱喜3
作者信息
  • 1 安徽工程大学机械与汽车工程学院,芜湖 241000
  • 2 奇瑞新能源汽车股份有限公司,芜湖 241000
  • 3 浙江大学工程师学院,杭州 310015

通讯作者:

时培成,教授,博士,E-mail:
PolarDet: An End-to-End 3D Object Detection Algorithm in Polar Coordinates Based on Position and Semantic Information Weighting
Peicheng Shi1 , Runshuai Ge1, Chakir Chadia1, Xinlong Dong1, Taonian Liang2, Aixi Yang3
Affiliations
  • 1 School of Mechanical and Automotive Engineering,Anhui Polytechnic University,Wuhu 241000
  • 2 Chery New Energy Automobile Co.,Ltd.,Wuhu 241000
  • 3 Polytechnic Institute,Zhejiang University,Hangzhou 310015
出版时间: 2025-03-25 doi: 10.19562/j.chinasae.qcgc.2025.ep.001
文章导航
收藏切换

传统的笛卡尔坐标系下的3D目标检测方法因车载相机的固定楔形成像几何,导致相机图像编码时在一定程度上忽视了目标在不同视角下的对称性和连续性。鉴于此,本文提出一种基于位置与语义信息加权的极坐标BEV端到端3D目标检测方法—PolarDet。该方法通过极坐标查询与预定义的极坐标网格生成极坐标下的BEV位置与语义信息,再与前一帧的BEV信息进行特征交互以融入时间信息;在输出最终检测结果时,PolarDet再对位置与语义信息进行加权求和,以提高信息的利用效率,使网络能够达到更高的检测精度。本文在具有挑战性的BEV目标检测 nuScenes数据集上进行了广泛的实验,结果表明,PolarDet最优模型的mAP(平均精度)达到0.469, NDS(nuScenes 检测得分)达到0.56,显著优于基于笛卡尔坐标的BEV目标检测方法。

极坐标  /  BEV目标检测  /  位置和语义信息  /  特征加权  /  跨平面编码器

Traditional 3D object detection methods in Cartesian coordinate systems often overlook the symmetry and continuity of the target from different perspectives to some extent during camera image encoding due to the fixed wedgeshaped imaging geometry of invehicle cameras. To address this, in this paper, PolarDet, an innovative endtoend 3D object detection method in polar coordinates based on position and semantic information weighting is proposed. This method generates BEV (Bird's Eye View) position and semantic information in polar coordinates through polar coordinate queries and predefined polar grid, which then interacts with the BEV information from the previous frame to incorporate temporal information. When outputting the final detection results, PolarDet performs a weighted sum of position and semantic information to enhance information utilization efficiency, allowing the network to achieve higher detection accuracy. Extensive experiments on the challenging BEV object detection nuScenes dataset show that the optimal model of PolarDet achieves a mAP (mean average precision) of 0.469 and an NDS (nuScenes detection score) of 0.56, significantly outperforming Cartesian coordinatebased BEV detection methods.

polar coordinates  /  BEV object detection  /  position and semantic information  /  feature weighting  /  cross-plane encoder
时培成, 戈润帅, Chadia Chakir, 董心龙, 梁涛年, 杨爱喜. PolarDet:基于位置与语义信息加权的极坐标BEV端到端3D目标检测算法*. 汽车工程, 2025 , 47 (3) : 430 -439 . DOI: 10.19562/j.chinasae.qcgc.2025.ep.001
Peicheng Shi, Runshuai Ge, Chakir Chadia, Xinlong Dong, Taonian Liang, Aixi Yang. PolarDet: An End-to-End 3D Object Detection Algorithm in Polar Coordinates Based on Position and Semantic Information Weighting[J]. Automotive Engineering, 2025 , 47 (3) : 430 -439 . DOI: 10.19562/j.chinasae.qcgc.2025.ep.001
鸟瞰图(bird’s eye view,BEV)感知技术通过整合多传感器数据,将这些数据统一映射至单一视图,提供一个由上而下观察的俯视视角,有效解决了传统视图中物体被遮挡以及物体尺度不一致的问题。此集成视图不仅为后续的规划与控制模块提供了清晰的信息基础,还显著降低了系统部署的难度,对于自动驾驶技术的落地应用具有重要价值。在此技术背景下,相机因其低廉的成本以及其能够捕捉丰富纹理信息的优势,受到了学术界的广泛关注,因此基于相机的BEV感知技术[1-2]近几年得到了迅速发展。
在早期研究[3-5]中,基于相机的BEV感知任务通常依赖于物体的深度信息,因此须针对图像信息进行深度估计。这种方法不仅对深度估计的准确性要求高,而且计算成本也较高。为克服这些挑战,近年来研究转向采用Transformer[6]模型。Transformer模型最初用于自然语言处理,能够高度并行化且擅长捕捉长距离依赖关系,因此Transformer模型被引入到计算机视觉领域,并取得了一些显著成果[7-11]。在BEV目标检测领域中,一些方法[112-13]通过在笛卡尔坐标系中引入Transformer模型的注意力机制进行3D目标检测,并结合体素化栅格和双线性插值等技术生成BEV视图,巧妙地规避了对深度的直接估计,简化了传统检测流程,实现了高效的目标检测。然而,这些方法[112-13]绝大部分基于笛卡尔坐标系,其利用信息的方式并不符合传感器的工作模式。事实上,在极坐标下每个像素的位置均能用距离和角度来表示,与相机捕获图像的方式非常相似,而每个车载相机以径向轴成像几何感知世界,这导致在笛卡尔坐标系下感知世界相比于极坐标更加复杂。因此,本文基于以前的工作[1-2],提出了一种在极坐标系下进行端到端BEV目标检测的算法模型——PolarDet。
此外,在过往的研究[1-214]中,基于相机的BEV目标检测方法往往直接依赖于单一特征图来进行分类预测和边界框与速度等回归预测,这种做法忽视了RGB图像无法直接表示深度的本质特性。由于3D目标检测的最终任务须估计物体的3D位置以进行边界框预测,即使许多基于Transformer的方案不直接预测深度信息,也会使提取到的丰富图像语义特征中混杂大量位置信息。这种方法在训练过程中不仅存在收敛困难的问题,也会导致模型在最后检测阶段辨别信息的负担过重,使误差风险增加。鉴于此,本文提出了一种位置信息与语义信息加权方法,强调不同信息在不同任务中的作用大小,使模型在目标检测过程中有针对性地利用不同信息。如图1所示,PolarFormer在如图场景中出现了多处误检,而使用位置与语义信息加权策略的PolarDet则解决了该问题。
综上所述,这项工作的创新点可总结如下。
(1)极坐标下的端到端3D目标检测。本文首次提出了一种在极坐标系下直接对汽车不同视角的输入图像进行端到端3D目标检测的新模型—PolarDet。相比于以前的方法,PolarDet通过在极坐标下进行3D目标检测,降低了坐标转换的难度,能够更自然地处理来自不同视角的数据,有效提升了3D目标检测的准确率。
(2)基于注意力机制的多信息学习方法。本文设计了一种新颖的多信息学习方法。该方法通过在注意力机制中复制、拼接和切割极坐标查询,分别提取位置与语义信息。通过分别学习不同信息,提升对相机数据中丰富纹理细节的利用率,这种创新的方法能够大大提升3D目标检测模型对图像信息的理解。
(3)基于信息加权的3D目标检测。本文提出了一种根据不同任务对信息施加不同权重的方法。通过利用神经网络在训练过程中学习到的语义信息和目标的3D位置信息,在最终预测时根据任务类型赋予权重,使模型在目标检测的不同任务中有针对性地利用信息。
早期BEV目标检测主要基于深度信息进行透视视角到BEV视角的变换。 LSS[3]在OFT[15]的基础上,使用多个相机确定透视射线每个点的特征以减少深度预测偏差。BEVDet[4]直接将密集预测的深度信息投影到BEV视角下提取特征。CaDDN[5]使用激光雷达数据监督深度估计。BEVDet4D[16]保留过去帧的BEV特征并融合当前帧的BEV特征,以减小速度的预测误差。
近年来的工作更多致力于使用注意力机制,DETR3D[13]用参考点附近的2D特征优化3D物体表征。PETR[12]则受DETR[8]的影响,对3D位置信息进行编码,直接在3D空间内更新目标查询。BEVFormer[1]利用时间自注意机制与空间交叉注意力机制实现3D目标检测与语义分割任务。BEVFormerv2[17]进一步引入了透视空间监督来增强 BEVFormer[1]的性能。BEVDepth[18]引入外部参数与点云数据提高深度估计准确性。Sparse4D[19]通过采用多帧采样的方式实现时序融合。Sparse4D v2[20]在Sparse4D[19]的基础上改进了时间融合模块,以降低计算复杂度。
然而,这些工作均基于笛卡尔坐标系进行目标检测,未能遵循真实世界中传感器的工作模式。本文提出在极坐标下进行的BEV目标检测方案,以更好地利用传感器的工作模式,提高检测的准确性与计算效率。
近两年出现了一些基于极坐标的BEV目标检测方案,PolarDETR[14]遵循DETR[8]的做法,使用目标查询迭代更新极坐标下的BEV特征。PolarFormer[2]引入一种多尺度极坐标表征策略,以解决极坐标中极射线方向上目标尺度随距离变化的问题。
其中,PolarFormer[2]虽然解决了目标尺度随远近而变化的问题,但它与PolarDETR[14]仍然受到位置信息与语义信息混杂而加重网络负担的影响。
图2所示,PolarDet首先通过骨干网络对汽车N个不同视角相机拍摄的图片I={IiR3×H×W,  i=1, 2, 3, , N}进行2D特征提取,再使用特征金字塔[21]网络在2D特征中提取多层级特征;在跨平面编码器处理后,特征通过投影与插值得到极坐标下的BEV特征;随后多层级BEV特征编码器将历史BEV信息融合到当前帧的极坐标BEV特征中;接下来在多层级BEV特征解码器中使用信息查询得到目标信息;最后,将特征切割,得到极坐标BEV位置与语义信息,通过后续网络得到分类与回归的结果。
图3(a)所示,跨平面编码器将特征金字塔[21]输出的多层级特征转为极坐标射线,根据相机坐标系中坐标(x, y, z),用以下公式得到BEV视角下的极坐标(φ, ρ)
φ=arctanxz
ρ=x2+z2
式中:x代表相机像素中的水平位置;y代表相机像素中的竖直位置;z代表深度;φ代表与极点连线相对于极轴的旋转角度;ρ代表与极点的距离。
PolarDet采用Transformer解码器获取列像素特征,多层级特征须进行一次复制拼接再进入跨平面编码器,通过特征的复制拼接与查询的拼接,可以利用分块矩阵的乘法,分别学习同一个图像特征内部的不同信息。
图像特征在该阶段首先经历的处理过程如下:
pn,u,w=MultiHeadp˙n,u,w,fn,u,w,fn,u,w=
              Concat(head1,head2,,headh)Wuo
headi=Attention(p˙n,u,wWi,uQ,fn,u,wWi,uK,fn,u,wWi,uV)
式中:p˙n,u,w代表极射线查询; fn,u,w代表第u个尺度中,第n个相机图片的第w列像素;WuORhdmodel×dkWuQRdmodel×dqWuKRdmodel×dkWuVRdmodel×dv代表注意力机制权值矩阵,其中dq=dk=dv=dmodel/h
特征的复制拼接与查询的拼接公式如下:
p˙n,u,w=Concat(p˙n,u,wl,p˙n,u,wt)
fn,u,w=Concat(f˙n,u,w,f˙n,u,w)
式中:p˙n,u,wl代表位置信息极射线;p˙n,u,wt代表语义信息极射线;f˙n,u,w代表第u个尺度中,第n个相机图片的第w列像素。
随后将pn,u,w沿方位角展开,得到单个相机语义信息与位置信息的极射线:
Pn,u=Stack(pn,u,1,pn,u,2,,pn,u,wu,
dim=1)RRu×Wu×C
下一步将极射线融合,得到全局坐标系下的BEV特征图。在圆柱坐标上生成一组3D点𝒢P={(ρiP,ϕjP,ZkP)|i=1, ,u;j=1,,𝒩u;k=1, , 𝒵u},其中:u代表圆周上生成的点数;𝒩u代表一条极射线上生成的点数;𝒵u代表在圆柱坐标范围内,沿圆柱高方向上生成的点数。
下列公式对圆柱点进行极坐标与笛卡尔坐标的转换:
xi,j=ρi(P)sin  ϕj(P)
yi,j=ρi(P)cos  ϕj(P)
zk(P)=zk(P)
接下来使用相机参数进行视角变换,用s代表缩放因子,得到极射线的索引:
sxi,j,k,n(I)syi,j,k,n(I)s1=Πn001Εnxi,jyi,jzk(P)1
再使用下列公式得到同一世界坐标系下的极坐标BEV特征图:
GuρiP,ϕjP=1n=1Nk=1ZλnρiP,ϕjP,zkP
n=1Nk=1ZλnρiP,ϕjP,zkP
B(Pn,u,(x¯i,j,k,nI,r¯i,j,n))
式中:λn(ρiP,ϕjP,zk(P))是二进制加权因子; B(Pn,u,(x,y))表示极射线Pn,u在坐标 (x,y)处双线性插值采样; (x¯i,j,k,n,r¯i,j,n)表示已经归一化的极坐标角度索引与半径索引。
图3(b)所示,多层级BEV特征编码器使用可变形注意力[7]机制融入时间信息,并进行更细致的特征提取。该机制在特征中学习参考点,使用双线性插值得到局部区域特征,相比于多头注意力机制,该机制对小目标的检测能力更强。通过融入时间信息,PolarDet对被遮挡物体以及运动物体的检测能力也更加出色。
{Gu}u=1U表示当前时刻的BEV语义信息与位置信息,可变形注意力机制处理过程如下:
MSDeformAttnGuu=1U,BEVprev=
m=1MWm[u=1Uk=1KAmuqkWm'
sample(BEVprev,ζuxq+Δxmuqk)]
式中:sample(BEVprev,ζuxq+Δxmuqk)表示利用偏移Δxmuqk与初始位置ζuxqBEVprev上进行采样;mk表示注意力头与采样点的索引;BEVprev表示上一帧的BEV;WmWm'都是可学习的权重矩阵;Amuqk表示第m个注意力头的第u个尺度上第k个点的注意力权重。
图3(c)所示,多层级BEV特征解码器中信息查询进行多头注意力机制避免学习相同特征,随后与上游特征通过多层级可变形注意力机制[7]进行特征交互。
为了对检测结果进行加权,该模块后续还须对检测结果进行分割,假设输出特征为pd1,,dnd1,,dn代表p在各个维度上的元素个数,PolarDet进行如下处理:
p1=d1,,dn2
p2=dn2,,dn
式中:p1代表位置信息;p2代表语义信息。模块中使用线性层对p1p2进行处理,并分别在分类与回归计算过程中进行加权:
Class=A1Cp1+A2Cp2
Location=A1Lp1+A2Lp2
通过上述步骤,得到目标分类的置信度cRo,目标的位置与速度信息θori,dρ,dϕ,θvo表示被分类的类别数;θori代表边界框的偏航角;dρ表示沿极射线方向的位置;dϕ表示极射线角度;θv表示目标的绝对速度角。
目标的位置与速度信息用来细化参考点(ρ,ϕ,z),该模块使用目标的位置与速度信息生成参考点的偏移,得到θori的正交分量θϕθρv的正交分量vϕvρ
θ¯ori=θori-ϕ
θϕ=sin θ¯ori
θρ=cos θ¯ori
θ¯v=θv-ϕ
vϕ=vabssin θ¯v
vρ=vabscos θ¯v
式中vabs代表目标的绝对速度。最后使用焦点损失与L1损失用来衡量分类与回归的误差。
本节在nuScenes[22]数据集上对PolarDet进行了全面测试,nuScenes[22]数据集包含在汽车周围构成360°全景环绕的6个相机在每个场景下以每0.5 s拍摄一张图片的频率持续采样20 s的1 000个场景,其中700个场景被划分为训练集,150个场景被划分为验证集,150个场景被划分为测试集。
本节采用nuScenes[22]数据集的评价指标,包括nuScenes检测得分(NDS)和平均精度(mAP)。mAP是根据鸟瞰图中心距离D={0.5,1,2,4}m和类别集C的匹配阈值计算的平均值,NDS是mAP与其他物体属性检测结果(如平均平移误差ATE、平均尺度误差ASE、平均方向误差AOE、平均速度误差AVE和平均属性误差AAE)的加权组合。
PolarDet采用ResNet101-DCN[26]作为骨干网络,使用预训练的FCOS3D[27]作为骨干网络的检查点,其余数据随机初始化。特征尺度与跨平面编码器数量设置为3,多尺度极坐标BEV特征图的半径与分辨率分别设置为(64,256)、(32,128)、(16,64),设置6个多层级BEV特征编码器层与6个解码器层。
PolarDet使用AdamW[28]优化器训练,权重衰减为0.075,初始学习率为2×10-5,并采用余弦退火策略。模型在一个NVIDIA A40 GPU进行了24轮的训练,6个摄像头的总批次大小为6,采用同步归一化。
表1所示,本节在nuScenes[22]验证集上将PolarDet与目前最先进的模型进行比较。由于PolarDet包含时间信息,为了公平比较,对于PolarFormer[2]与PolarDETR[14]的结果,本节采用融入时间信息的PolarFormer-T[2]与PolarDETR-T[14]。结果显示PolarDet在NDS和mAP等多项指标上均取得了最佳性能。
为了进一步证明PolarDet的有效性,本节使用训练完成的网络对nuScenes[22]数据集内包含的复杂道路、夜晚场景进行了检测结果可视化。如图4 (a)所示,即使在夜晚,对于远处、被截断、模糊物体,PolarDet均能准确地检测到,证明了PolarDet融入时序特征方法的有效性。从BEV图中可以看出,PolarDet生成的检测框都十分精准,证明了本文提出的位置信息与语义信息加权方法的有效性。图4 (c)展示了夜晚场景,即使在有大量行人和光污染的交叉路口,PolarDet 仍表现出卓越性能,能够在密集人群检测中取得良好的检测效果。
图4 (b)所示,在复杂街道场景中,PolarDet 依然表现出色,特别是在边界框的精准度上性能卓越。实验证明,PolarDet的检测方案能够充分利用图像信息,提升检测精度并增强网络的鲁棒性。
为更进一步证明在极坐标下进行3D目标检测相比于笛卡尔坐标系的优越性,本节设置了多组对比实验 ,与PolarFormer[2]在笛卡尔坐标系和极坐标系下进行了全面对比。
表2展示了PolarDet网络在不同配置下的性能,包括是否采用多层级特征、是否使用极坐标进行特征采样、是否使用极坐标表示的参数进行预测。其中,“s”代表未使用多层级特征; “CC”代表在特征采样和结果检测中均采用笛卡尔坐标系;“PC”代表在特征采样时采用极坐标系,但在预测中使用笛卡尔坐标系。
表2可见,全局使用极坐标系的PolarDet相比于全局使用笛卡尔坐标系,mAP提升了大约3.5%,NDS提升了大约1.8%。
本文选取的基线模型是PolarFormer[2],其特征维度是256维,由于PolarDet须同时学习位置信息与语义信息,使用了512维特征,因此无法直接对比。为了排除特征维度对模型性能的影响,如表3所示,本节设置了多组实验,为确保PolarDet的性能提升并非均来自特征维度的增加,部分实验将语义信息与位置信息的权重设计为0.5,确保模型仅增加了一倍维度而未对信息进行加权,表格中E表示权重均为0.5;NT表示没有输入时间信息,此时使用当前帧信息代替历史信息。
实验结果显示,PolarDet通过信息加权,mAP提升约3%,NDS提升约3.9%; PolarDet融入时间信息,mAP提升约7.3%,NDS提升约17%。
为探究不同信息权重对网络检测效果的影响,确定最优权重配置,本节采用正交实验法进行了一系列实验。
图5所示,横坐标使用“(语义信息权重,位置信息权重)”来表示两种信息的权重。对于分类预测,本节选取了以下几组具有代表性的权重设置:(0.8,0.2);(0.7,0.3);(0.6,0.4);(0.5,0.5);(0.4,0.6)。对于边界框与速度信息等回归预测,实验权重设置的范围为:(0.7,0.3);(0.6,0.4);(0.5,0.5);(0.4,0.6);(0.3,0.7)。这些组合覆盖了从偏向语义信息到偏向位置信息的各种情况,使实验结果能够全面显示不同权重分配对网络性能的影响。
在实验中,采用标准的L25(52)正交表,每次实验测试两个因子中的一个组合,确保每个组合都被充分测试。
本节进行了25次实验,图5将实验数据可视化成点线图,以直观展示不同权重配比下PolarDet的性能;表4列出了不同权重配比下的实验数据,以方便进行进一步分析。本节使用极差分析法来分析实验数据,从而选出最优参数,下列公式计算因子A与因子B中各水平的平均值:
A¯i=1njYij
B¯i=1njYij
式中:Yij表示第i次实验中的因子AB的实验结果;n是重复次数。
计算得到因子A与因子B中各水平的平均值后,通过下列公式计算极差:
RA=max(A¯i)-min (A¯i)
RB=max(B¯i)-min (B¯i)
本节使用NDS作为评价指标,通过式(24)式(25)计算得到如表5所示的分类预测与回归预测中各种权重分配结果的NDS平均值。
随后使用式(26)式(27)计算分类预测权重分配的极差与回归预测权重分配的极差,选取差值最大的因子对应的水平作为最优水平,得到最佳权重分配:对于分类预测,最佳权重分配为(0.7,0.3),对于回归预测,最佳权重分配为(0.4,0.6)。
(1)本文提出了一种基于位置与语义信息加权的极坐标BEV端到端目标检测方法—PolarDet,其可以生成极坐标下的BEV位置与语义信息,降低坐标转换难度,可更自然地处理来自不同视角的数据,有效提升3D目标检测的准确率,结果表明其mAP达到0.469,NDS达到0.56,显著优于基于笛卡尔坐标的BEV目标检测方法。
(2)基于可变形注意力机制能够灵活关注不同局部区域的优势,PolarDet将上一帧的BEV信息融入当前帧,使其能够更准确地捕捉目标的运动轨迹与速度变化,结果表明融入上一帧时间信息后,可使PolarDet的mAP提升7.3%,NDS提升17%。
(3)在结果输出时,对位置与语义信息进行加权求和,可提高信息的利用效率,使网络能够达到更高的检测精度,并减少误检和漏检的情况,结果表明使用信息加权的PolarDet能够提升3%的mAP与3.9%的NDS。
  • *中央引导地方科技专项-长三角科技创新共同体联合攻关计划项目(2023CSJGG1600)
  • 安徽省自然科学基金面上项目(2208085MF173)
  • 芜湖市“赤铸之光”重大科技项目资助。(2023zd01)
  • 芜湖市“赤铸之光”重大科技项目资助。(2023zd03)
参考文献 引证文献
排序方式:
[1]
LI Z,WANG W,LI H,et al. BEVFormer:learning bird’s-eye-view representation from multi-camera images via spatiotemporal transformers[J/OL]. arXiv,2022[2023-10-17]. http://arxiv.org/abs/2203.17270.
[2]
JIANG Y,ZHANG L,MIAO Z,et al. PolarFormer:multi-camera 3D object detection with polar transformer[J/OL]. Proceedings of the AAAI Conference on Artificial Intelligence,2023,37(1):1042-1050[2024-01-16]. https://ojs.aaai.org/index.php/AAAI/article/view/25185. DOI:10.1609/aaai.v37i1.25185.
[3]
PHILION J,FIDLER S. Lift,Splat,Shoot:encoding images from arbitrary camera rigs by implicitly unprojecting to 3D[J/OL]. arXiv,2020[2023-10-25]. http://arxiv.org/abs/2008.05711.
[4]
HUANG J,HUANG G,ZHU Z,et al. BEVDet:high-performance multi-camera 3D object detection in bird-eye-view[J/OL]. arXiv,2022[2023-10-17]. http://arxiv.org/abs/2112.11790.
[5]
READING C,HARAKEH A,CHAE J,et al. Categorical depth distribution network for monocular 3D object detection[J/OL]. arXiv,2021[2023-12-16]. http://arxiv.org/abs/2103.01100.
[6]
VASWANI A,SHAZEER N,PARMAR N,et al. Attention is all you need[J/OL]. arXiv,2023[2023-10-06]. http://arxiv.org/abs/1706.03762. DOI:10.48550/arXiv.1706.03762.
[7]
ZHU X,SU W,LU L,et al. Deformable DETR:deformable transformers for end-to-end object detection[J/OL]. arXiv,2021[2024-03-07]. http://arxiv.org/abs/2010.04159.
[8]
CARION N,MASSA F,SYNNAEVE G,et al. End-to-end object detection with transformers[C/OL]//VEDALDI A,BISCHOF H,BROX T,et al. Computer Vision-ECCV 2020. Cham:Springer International Publishing,2020:213-229. DOI:10.1007/978-3-030-58452-8_13.
[9]
BEAL J,KIM E,TZENG E,et al. Toward transformer-based object detection[J/OL]. arXiv,2020[2023-11-20]. http://arxiv.org/abs/2012.09958. DOI:10.48550/arXiv.2012.09958.
[10]
LIU S,LI F,ZHANG H,et al. DAB-DETR:dynamic anchor boxes are better queries for DETR[J/OL]. arXiv,2022[2023-11-25]. http://arxiv.org/abs/2201.12329.
[11]
DOSOVITSKIY A,BEYER L,KOLESNIKOV A,et al. An image is worth 16x16 words:transformers for image recognition at scale[J]. 2020.DOI:10.48550/arXiv.2010.11929. .
[12]
LIU Y,WANG T,ZHANG X,et al. PETR:position embedding transformation for multi-view 3D object detection[J/OL]. arXiv,2022[2023-10-25]. http://arxiv.org/abs/2203.05625.
[13]
WANG Y,GUIZILINI V,ZHANG T,et al. DETR3D:3D object detection from multi-view images via 3D-to-2D queries[J/OL]. arXiv,2021[2024-01-16]. http://arxiv.org/abs/2110.06922.
[14]
CHEN S,WANG X,CHENG T,et al. Polar parametrization for vision-based surround-view 3D detection[J/OL]. arXiv,2022[2024-02-28]. http://arxiv.org/abs/2206.10965.
[15]
RODDICK T,KENDALL A,CIPOLLA R. Orthographic feature transform for monocular 3D object detection[J/OL]. arXiv,2018[2023-12-16]. http://arxiv.org/abs/1811.08188.
[16]
HUANG J,HUANG G. BEVDet4D:exploit temporal cues in multi-camera 3D object detection[J/OL]. arXiv,2022[2024-07-09]. http://arxiv.org/abs/2203.17054.
[17]
YANG C,CHEN Y,TIAN H,et al. BEVFormer v2:adapting modern image backbones to bird’s-eye-view recognition via perspective supervision[J/OL]. arXiv,2022[2024-04-04]. http://arxiv.org/abs/2211.10439.
[18]
LI Y,GE Z,YU G,et al. BEVDepth:acquisition of reliable depth for multi-view 3D object detection[J/OL]. Proceedings of the AAAI Conference on Artificial Intelligence,2023,37(2):1477-1485[2023-11-16]. https://ojs.aaai.org/index.php/AAAI/article/view/25233. DOI:10.1609/aaai.v37i2.25233.
[19]
LIN X,LIN T,PEI Z,et al. Sparse4D:multi-view 3D object detection with sparse spatial-temporal fusion[J/OL]. arXiv,2023[2024-04-11]. http://arxiv.org/abs/2211.10581.
[20]
LIN X,LIN T,PEI Z,et al. Sparse4D v2:recurrent temporal fusion with sparse model[J/OL]. arXiv,2023[2024-01-16]. http://arxiv.org/abs/2305.14018.
[21]
LIN T Y,DOLLÁR P,GIRSHICK R,et al. Feature pyramid networks for object detection[J/OL]. arXiv,2017[2023-11-15]. http://arxiv.org/abs/1612.03144.
[22]
CAESAR H,BANKITI V,LANG A H,et al. nuScenes:a multimodal dataset for autonomous driving[J/OL]. arXiv,2020[2024-05-08]. http://arxiv.org/abs/1903.11027. DOI:10.48550/arXiv.1903.11027.
[23]
DOLL S,SCHULZ R,SCHNEIDER L,et al. SpatialDETR:robust scalable transformer-based 3D object detection from multi-view camera images with global cross-sensor attention[M/OL]//AVIDAN S,BROSTOW G,CISSÉ M,et al. Computer vision-ECCV 2022:Vol. 13699. Cham:Springer Nature Switzerland,2022:230-245[2024-06-23]. https://link.springer.com/10.1007/978-3-031-19842-7_14. DOI:10.1007/978-3-031-19842-7_14.
[24]
LIU Y,YAN J,JIA F,et al. PETRv2:a unified framework for 3D perception from multi-camera images[C/OL]//2023 IEEE/CVF International Conference on Computer Vision (ICCV). Paris,France:IEEE,2023:3239-3249[2024-02-27]. https://ieeexplore.ieee.org/document/10377268/. DOI:10.1109/ICCV51070.2023.00302.
[25]
ROH W,CHANG G,MOON S,et al. ORA3D:overlap region aware multi-view 3D object detection[J/OL]. arXiv,2023[2024-06-23]. http://arxiv.org/abs/2207.00865.
[26]
HE K,ZHANG X,REN S,et al. Deep residual learning for image recognition[J/OL]. arXiv,2015[2024-05-08]. http://arxiv.org/abs/1512.03385. DOI:10.48550/arXiv.1512.03385.
[27]
WANG T,ZHU X,PANG J,et al. FCOS3D:fully convolutional one-stage monocular 3D object detection[J/OL]. arXiv,2021[2023-10-17]. http://arxiv.org/abs/2104.10956.
[28]
LOSHCHILOV I,HUTTER F. Decoupled weight decay regularization[J/OL]. arXiv,2019[2024-06-14]. http://arxiv.org/abs/1711.05101.
2025年第47卷第3期
PDF下载
356
141
引用本文
BibTeX
文章信息
doi: 10.19562/j.chinasae.qcgc.2025.ep.001
  • 接收时间:2024-08-07
  • 首发时间:2025-07-09
  • 出版时间:2025-03-25
补充材料
相关文章
文章信息
作者
出版历史
  • 收稿日期:2024-08-07
  • 修回日期:2024-09-27
基金
*中央引导地方科技专项-长三角科技创新共同体联合攻关计划项目(2023CSJGG1600)
安徽省自然科学基金面上项目(2208085MF173)
芜湖市“赤铸之光”重大科技项目资助。(2023zd01)
芜湖市“赤铸之光”重大科技项目资助。(2023zd03)
作者信息
    1 安徽工程大学机械与汽车工程学院,芜湖 241000
    2 奇瑞新能源汽车股份有限公司,芜湖 241000
    3 浙江大学工程师学院,杭州 310015

通讯作者:

时培成,教授,博士,E-mail:
参考文献
分享链接
https://castjournals.cast.org.cn/joweb/qcygc/CN/10.19562/j.chinasae.qcgc.2025.ep.001
分享至
全文二维码

扫描看全文

引用本文
BibTeX
本文的引用情况
2种不同金属材料的力学参数

Family
属数
Number of
genus
种数
Number of
species
占总种数比例
Percentage of
total species (%)

Genus
种数
Number of
species
占总种数比例
Percentage of total
species (%)
鹅膏菌科Amanitaceae 2 11 5.26 鹅膏菌属 Amanita 10 4.78
小菇科 Mycenaceae 2 12 5.74 丝盖伞属 Inocybe 5 2.39
多孔菌科 Polyporaceae 8 14 6.70 蜡蘑属 Laccaria 5 2.39
红菇科 Russulaceae 3 23 11.00 小皮伞属 Marasmius 6 2.87
小菇属 Mycena 11 5.26
光柄菇属 Pluteus 5 2.39
红菇属 Russula 17 8.13
栓菌属 Trametes 5 2.39
关闭全屏