Article(id=1241757971894563721, tenantId=1146029695717560320, journalId=1238841944844054536, issueId=1241757966894945217, articleNumber=null, orderNo=null, doi=10.12347/j.ycyk.20240408001, pmid=null, cstr=null, oa=null, hot=null, price=null, onlineType=0, articleFormat=0, articleType=null, articleTypeStr=null, receivedDate=1712505600000, receivedDateStr=2024-04-08, revisedDate=1714233600000, revisedDateStr=2024-04-28, acceptedDate=null, acceptedDateStr=null, onlineDate=1773987771247, onlineDateStr=2026-03-20, pubDate=1720972800000, pubDateStr=2024-07-15, doiRegisterDate=null, doiRegisterDateStr=null, onlineIssueDate=1773987771247, onlineIssueDateStr=2026-03-20, onlineJustAcceptDate=null, onlineJustAcceptDateStr=null, onlineFirstDate=null, onlineFirstDateStr=null, sourceXml=null, magXml=null, createTime=1773987771247, creator=13701087609, updateTime=1773987771247, updator=13701087609, issue=Issue{id=1241757966894945217, tenantId=1146029695717560320, journalId=1238841944844054536, year='2024', volume='45', issue='4', pageStart='1', pageEnd='136', issueExtLink='null', onlineDate='null', pubDate='null', beforeIssueId=null, nextIssueId=null, price=null, status=1, issueComplete=1, articleOrder=1, issueType=-1, specialIssue=null, createTime=1773987770054, creator=13701087609, updateTime=1773991490516, updator=13701087609, preIssue=null, nextIssue=null, ext={EN=IssueExt(id=1241773571714909111, tenantId=1146029695717560320, journalId=1238841944844054536, issueId=1241757966894945217, language=EN, specialIssueTitle=, coverIllustrator=null, specialIssueEditor=, specialIssueAbout=), CN=IssueExt(id=1241773571714909112, tenantId=1146029695717560320, journalId=1238841944844054536, issueId=1241757966894945217, language=CN, specialIssueTitle=, coverIllustrator=null, specialIssueEditor=, specialIssueAbout=)}, issueFiles=null}, startPage=22, endPage=30, ext={EN=ArticleExt(id=1241757972318188434, articleId=1241757971894563721, tenantId=1146029695717560320, journalId=1238841944844054536, language=EN, title=Siamese Network Tracking Algorithm Based on Diverse Branch Block and Reparameterization, columnId=1241060179823882575, journalTitle=Journal of Telemetry, Tracking and Command, columnName=Artificial Intelligence Technology, runingTitle=null, highlight=null, articleAbstract=

Aiming at the problem that the Siamese network has insufficient ability to express the features of scale-varying targets, a multi-branch structure is constructed by using convolution, pooling branches and pruning operations of different sizes to improve the robustness of features and ensure the translation invariance of the Siamese network. Aiming at the problem that the multi-branch structure brings too many parameters, the multi-branch structure is reparameterized into a single convolution in the tracking stage, which effectively reduces the time cost in the tracking stage. The experimental results show that compared with SiamFC, the accuracy, success rate and tracking speed of the proposed algorithm on the OTB100 datasets are improved by 5.1%, 3% and 30 FPS,respectively. The tracking accuracy and success rate are improved on the UAV123 and Temple-Color-128 datasets.

, correspAuthors=null, authorNote=null, correspAuthorsNote=null, copyrightStatement=null, copyrightOwner=null, extLink=null, articleAbsUrl=null, sourceXml=null, magXml=null, pdfUrl=null, pdf=null, pdfFileSize=null, pdfExtLink=null, richHtmlUrl=null, mobilePdfUrl=null, reviewReport=null, pdfFirstPage=null, abstractGraph=null, abstractGraphContent=null, abstractVideo=null, citation=null, cebUrl=null, magXmlContent=null, mapNumber=null, authorCompany=null, fund=null, authors=null, authorsList=Ming JIN, Yu TANG, Yong HAN, Shuai LIU, Feng Gang YAN), CN=ArticleExt(id=1241757978756444311, articleId=1241757971894563721, tenantId=1146029695717560320, journalId=1238841944844054536, language=CN, title=基于多分支和重参数化的孪生网络跟踪算法, columnId=1241060179987460434, journalTitle=遥测遥控, columnName=人工智能技术, runingTitle=null, highlight=null, articleAbstract=

针对孪生网络对尺度变化目标特征表达能力不足的问题,本文使用不同尺寸的卷积、池化分支和剪枝操作构成多分支结构,以提高特征的鲁棒性并保证孪生网络的平移不变性。针对多分支结构带来参数量过多的问题,本文在跟踪阶段将多分支结构重参数化为单一的卷积,有效减少跟踪阶段时间成本。实验结果表明:本文提出的算法相比于SiamFC,在OTB100数据集上,其精度、成功率和跟踪速度分别提高了5.1%、3%、30 FPS,在UAV123和Temple-Color-128数据集上跟踪精度和成功率均有所提高。

, correspAuthors=null, authorNote=null, correspAuthorsNote=null, copyrightStatement=null, copyrightOwner=null, extLink=null, articleAbsUrl=null, sourceXml=SMKqsSM5c56z2XNQvVIACA==, magXml=n0Y50dbVN4xp6UeJhG2cWg==, pdfUrl=null, pdf=R9BIJldsQZwE5gsn1Jou8A==, pdfFileSize=7965898, pdfExtLink=null, richHtmlUrl=null, mobilePdfUrl=null, reviewReport=null, pdfFirstPage=null, abstractGraph=qNLwLkCfTvJ2/RTsaVd8Hw==, abstractGraphContent=null, abstractVideo=null, citation=null, cebUrl=null, magXmlContent=erRHlD65M1y9fw7Ll7UeMA==, mapNumber=null, authorCompany=null, fund=null, authors=

金铭 1968年生,博士,教授,博士生导师。

唐宇 2000年生,硕士研究生。

韩勇 1976年生,博士,讲师,硕士生导师。

刘帅 1980年生,博士,教授,博士生导师。

闫锋刚 1983年生,博士,教授,博士生导师,泰山学者青年专家。

, authorsList=金铭, 唐宇, 韩勇, 刘帅, 闫锋刚)}, authors=[Author(id=1241788815677915885, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, orderNo=0, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=null, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1241788815749219056, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, authorId=1241788815677915885, language=EN, stringName=Ming JIN, firstName=Ming, middleName=null, lastName=JIN, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=Harbin Institute of Technology, Weihai 264200, China, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1241788815816327923, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, authorId=1241788815677915885, language=CN, stringName=金铭, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=哈尔滨工业大学(威海) 威海 264200, bio={"content":"

金铭 1968年生,博士,教授,博士生导师。

"}, bioImg=null, bioContent=

金铭 1968年生,博士,教授,博士生导师。

, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1241788815577252583, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, xref=null, ext=[AuthorCompanyExt(id=1241788815585641191, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, companyId=1241788815577252583, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=Harbin Institute of Technology, Weihai 264200, China), AuthorCompanyExt(id=1241788815589835496, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, companyId=1241788815577252583, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=哈尔滨工业大学(威海) 威海 264200)])]), Author(id=1241788815891825400, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, orderNo=1, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=null, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1241788816005071611, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, authorId=1241788815891825400, language=EN, stringName=Yu TANG, firstName=Yu, middleName=null, lastName=TANG, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=Harbin Institute of Technology, Weihai 264200, China, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1241788816093151997, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, authorId=1241788815891825400, language=CN, stringName=唐宇, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=哈尔滨工业大学(威海) 威海 264200, bio={"content":"

唐宇 2000年生,硕士研究生。

"}, bioImg=null, bioContent=

唐宇 2000年生,硕士研究生。

, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1241788815577252583, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, xref=null, ext=[AuthorCompanyExt(id=1241788815585641191, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, companyId=1241788815577252583, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=Harbin Institute of Technology, Weihai 264200, China), AuthorCompanyExt(id=1241788815589835496, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, companyId=1241788815577252583, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=哈尔滨工业大学(威海) 威海 264200)])]), Author(id=1241788816185426689, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, orderNo=2, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=null, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1241788816256729861, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, authorId=1241788816185426689, language=EN, stringName=Yong HAN, firstName=Yong, middleName=null, lastName=HAN, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=Harbin Institute of Technology, Weihai 264200, China, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1241788816349004552, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, authorId=1241788816185426689, language=CN, stringName=韩勇, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=哈尔滨工业大学(威海) 威海 264200, bio={"content":"

韩勇 1976年生,博士,讲师,硕士生导师。

"}, bioImg=null, bioContent=

韩勇 1976年生,博士,讲师,硕士生导师。

, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1241788815577252583, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, xref=null, ext=[AuthorCompanyExt(id=1241788815585641191, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, companyId=1241788815577252583, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=Harbin Institute of Technology, Weihai 264200, China), AuthorCompanyExt(id=1241788815589835496, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, companyId=1241788815577252583, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=哈尔滨工业大学(威海) 威海 264200)])]), Author(id=1241788817825399563, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, orderNo=3, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=null, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1241788817917674255, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, authorId=1241788817825399563, language=EN, stringName=Shuai LIU, firstName=Shuai, middleName=null, lastName=LIU, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=Harbin Institute of Technology, Weihai 264200, China, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1241788818001560337, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, authorId=1241788817825399563, language=CN, stringName=刘帅, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=哈尔滨工业大学(威海) 威海 264200, bio={"content":"

刘帅 1980年生,博士,教授,博士生导师。

"}, bioImg=null, bioContent=

刘帅 1980年生,博士,教授,博士生导师。

, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1241788815577252583, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, xref=null, ext=[AuthorCompanyExt(id=1241788815585641191, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, companyId=1241788815577252583, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=Harbin Institute of Technology, Weihai 264200, China), AuthorCompanyExt(id=1241788815589835496, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, companyId=1241788815577252583, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=哈尔滨工业大学(威海) 威海 264200)])]), Author(id=1241788818060280596, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, orderNo=4, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=null, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1241788818148360983, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, authorId=1241788818060280596, language=EN, stringName=Feng Gang YAN, firstName=Feng Gang, middleName=null, lastName=YAN, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=Harbin Institute of Technology, Weihai 264200, China, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1241788818228052761, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, authorId=1241788818060280596, language=CN, stringName=闫锋刚, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=null, address=哈尔滨工业大学(威海) 威海 264200, bio={"content":"

闫锋刚 1983年生,博士,教授,博士生导师,泰山学者青年专家。

"}, bioImg=null, bioContent=

闫锋刚 1983年生,博士,教授,博士生导师,泰山学者青年专家。

, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1241788815577252583, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, xref=null, ext=[AuthorCompanyExt(id=1241788815585641191, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, companyId=1241788815577252583, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=Harbin Institute of Technology, Weihai 264200, China), AuthorCompanyExt(id=1241788815589835496, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, companyId=1241788815577252583, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=哈尔滨工业大学(威海) 威海 264200)])])], keywords=[Keyword(id=1241788818379047713, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, language=EN, orderNo=1, keyword=Visual tracking), Keyword(id=1241788818471322406, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, language=EN, orderNo=2, keyword=Siamese network), Keyword(id=1241788818551014185, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, language=EN, orderNo=3, keyword=Feature extraction), Keyword(id=1241788818605540138, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, language=EN, orderNo=4, keyword=Structural reparameterization), Keyword(id=1241788818681037612, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, language=CN, orderNo=1, keyword=视觉跟踪), Keyword(id=1241788818756535086, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, language=CN, orderNo=2, keyword=孪生网络), Keyword(id=1241788818857198385, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, language=CN, orderNo=3, keyword=特征提取), Keyword(id=1241788818949473076, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, language=CN, orderNo=4, keyword=结构重参数化)], refs=[Reference(id=1241788823093445511, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, doi=null, pmid=null, pmcid=null, year=2022, volume=23, issue=5, pageStart=3943, pageEnd=3968, url=null, language=null, rfNumber=[1], rfOrder=0, authorNames=MARVASTI-ZADEH S M, CHENG L, GHANEI-YAKHDAN H, journalName=IEEE Transactions on Intelligent Transportation Systems, refType=null, unstructuredReference=MARVASTI-ZADEH S M, CHENG L, GHANEI-YAKHDAN H, et al. Deep learning for visual tracking: A comprehensive survey[J]. IEEE Transactions on Intelligent Transportation Systems, 2022, 23(5): 3943-3968., articleTitle=Deep learning for visual tracking: A comprehensive survey, refAbstract=null), Reference(id=1241788823177331593, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, doi=null, pmid=null, pmcid=null, year=2022, volume=24, issue=null, pageStart=301, pageEnd=312, url=null, language=null, rfNumber=[2], rfOrder=1, authorNames=ZHU X F, WU X J, XU T, journalName=IEEE Transactions on Multimedia, refType=null, unstructuredReference=ZHU X F, WU X J, XU T, et al. Robust visual object tracking via adaptive attribute aware discriminative correlation filters[J]. IEEE Transactions on Multimedia, 2022, 24: 301-312., articleTitle=Robust visual object tracking via adaptive attribute aware discriminative correlation filters, refAbstract=null), Reference(id=1241788823303160715, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, doi=null, pmid=null, pmcid=null, year=2022, volume=null, issue=null, pageStart=1499, pageEnd=1503, url=null, language=null, rfNumber=[3], rfOrder=2, authorNames=KUMARI L K, RAMALAKSHMI K, SRINIVASA-RAGHAVAN V, journalName=null, refType=null, unstructuredReference=KUMARI L K, RAMALAKSHMI K, SRINIVASA-RAGHAVAN V. Robust visual object tracking in clustered environment[C]// 2022 5th International Conference on Contemporary Computing and Informatics(IC3I), Uttar Pradesh, India. 2022: 1499-1503., articleTitle=Robust visual object tracking in clustered environment, refAbstract=null), Reference(id=1241788823491904397, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, doi=null, pmid=null, pmcid=null, year=2023, volume=11, issue=null, pageStart=32895, pageEnd=32906, url=null, language=null, rfNumber=[4], rfOrder=3, authorNames=REN H, XIANG L, SHI T, journalName=IEEE Access, refType=null, unstructuredReference=REN H, XIANG L, SHI T. Research on background learning correlation filtering algorithm with multifeature fusion[J]. IEEE Access, 2023, 11: 32895-32906., articleTitle=Research on background learning correlation filtering algorithm with multifeature fusion, refAbstract=null), Reference(id=1241788823613539216, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, doi=null, pmid=null, pmcid=null, year=2023, volume=38, issue=7, pageStart=4, pageEnd=19, url=null, language=null, rfNumber=[5], rfOrder=4, authorNames=KUMAR R, DEB A K, journalName=IEEE Aerospace and Electronic Systems Magazine, refType=null, unstructuredReference=KUMAR R, DEB A K. Pedestrian tracking in UAV images with Kalman filter motion estimator and correlation filter[J]. IEEE Aerospace and Electronic Systems Magazine, 2023, 38(7): 4-19., articleTitle=Pedestrian tracking in UAV images with Kalman filter motion estimator and correlation filter, refAbstract=null), Reference(id=1241788823726785426, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, doi=null, pmid=null, pmcid=null, year=2023, volume=null, issue=null, pageStart=505, pageEnd=510, url=null, language=null, rfNumber=[6], rfOrder=5, authorNames=WU L, FANG H, CUI X, journalName=null, refType=null, unstructuredReference=WU L, FANG H, CUI X, et al. Improved partbased Kernel correlation filter for object tracking and robot grasping[C]// 2023 China Automation Congress(CAC), Chongqing, China. 2023: 505-510., articleTitle=Improved partbased Kernel correlation filter for object tracking and robot grasping, refAbstract=null), Reference(id=1241788823814865811, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, doi=null, pmid=null, pmcid=null, year=2022, volume=32, issue=5, pageStart=3085, pageEnd=3098, url=null, language=null, rfNumber=[7], rfOrder=6, authorNames=CAO Y, SHI G, ZHANG T, journalName=IEEE Transactions on Circuits and Systems for Video Technology, refType=null, unstructuredReference=CAO Y, SHI G, ZHANG T, et al. Bayesian correlation filter learning with Gaussian scale mixture model for visual tracking[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2022, 32(5): 3085-3098., articleTitle=Bayesian correlation filter learning with Gaussian scale mixture model for visual tracking, refAbstract=null), Reference(id=1241788823944889238, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, doi=null, pmid=null, pmcid=null, year=2016, volume=null, issue=null, pageStart=850, pageEnd=865, url=null, language=null, rfNumber=[8], rfOrder=7, authorNames=BERTINETTO L, VALMADRE J, HENRIQUES J F, journalName=null, refType=null, unstructuredReference=BERTINETTO L, VALMADRE J, HENRIQUES J F,et al. Fully-convolutional Siamese networks for object tracking[C]//European Conference on Computer Vision,Amsterdam, The Netherlands. 2016: 850-865., articleTitle=Fully-convolutional Siamese networks for object tracking, refAbstract=null), Reference(id=1241788824024581016, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, doi=null, pmid=null, pmcid=null, year=2018, volume=null, issue=null, pageStart=8971, pageEnd=8980, url=null, language=null, rfNumber=[9], rfOrder=8, authorNames=LI B, YAN J, WU W, journalName=null, refType=null, unstructuredReference=LI B, YAN J, WU W, et al. High performance visual tracking with Siamese region proposal network[C]// 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA. 2018: 8971-8980., articleTitle=High performance visual tracking with Siamese region proposal network, refAbstract=null), Reference(id=1241788824129438618, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, doi=null, pmid=null, pmcid=null, year=2020, volume=null, issue=null, pageStart=6269, pageEnd=6277, url=null, language=null, rfNumber=[10], rfOrder=9, authorNames=GUO D, WANG J, CUI Y, journalName=null, refType=null, unstructuredReference=GUO D, WANG J, CUI Y, et al. SiamCAR: Siamese fully convolutional classification and regression for visual tracking[C]// Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2020: 6269-6277., articleTitle=SiamCAR: Siamese fully convolutional classification and regression for visual tracking, refAbstract=null), Reference(id=1241788824234296219, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, doi=null, pmid=null, pmcid=null, year=2020, volume=null, issue=null, pageStart=771, pageEnd=787, url=null, language=null, rfNumber=[11], rfOrder=10, authorNames=ZHANG Z, PENG H, FU J, journalName=null, refType=null, unstructuredReference=ZHANG Z, PENG H, FU J, et al. Ocean: Object-aware anchor-free tracking[C]// Proceedings of the European Conference on Computer Vision. 2020: 771-787., articleTitle=Ocean: Object-aware anchor-free tracking, refAbstract=null), Reference(id=1241788824318182301, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, doi=null, pmid=null, pmcid=null, year=2020, volume=null, issue=null, pageStart=6668, pageEnd=6677, url=null, language=null, rfNumber=[12], rfOrder=11, authorNames=CHEN Z, ZHONG B, LI G, journalName=null, refType=null, unstructuredReference=CHEN Z, ZHONG B, LI G, et al. Siamese box adaptive network for visual tracking[C]// Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2020: 6668-6677., articleTitle=Siamese box adaptive network for visual tracking, refAbstract=null), Reference(id=1241788824389485471, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, doi=null, pmid=null, pmcid=null, year=2016, volume=null, issue=null, pageStart=749, pageEnd=765, url=null, language=null, rfNumber=[13], rfOrder=12, authorNames=HELD D, THRUN S, SAVARESE S, journalName=null, refType=null, unstructuredReference=HELD D, THRUN S, SAVARESE S. Learning to track at 100 FPS with deep regression networks[C]// European Conference on Computer Vision, Amsterdam, The Netherlands. 2016: 749-765., articleTitle=Learning to track at 100 FPS with deep regression networks, refAbstract=null), Reference(id=1241788824456594337, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, doi=null, pmid=null, pmcid=null, year=2017, volume=null, issue=null, pageStart=1781, pageEnd=1789, url=null, language=null, rfNumber=[14], rfOrder=13, authorNames=GUO Q, FENG W, ZHOU C, journalName=null, refType=null, unstructuredReference=GUO Q, FENG W, ZHOU C, et al. Learning dynamic siamese network for visual object tracking[C]//2017 IEEE International Conference on Computer Vision(ICCV), Venice, Italy. 2017: 1781-1789., articleTitle=Learning dynamic siamese network for visual object tracking, refAbstract=null), Reference(id=1241788824515314595, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, doi=null, pmid=null, pmcid=null, year=2023, volume=15, issue=3, pageStart=1239, pageEnd=1251, url=null, language=null, rfNumber=[15], rfOrder=14, authorNames=HAN G, YANG R, GAO H, journalName=IEEE Transactions on Cognitive and Developmental Systems,, refType=null, unstructuredReference=HAN G, YANG R, GAO H, et al. Deep decoupling classification and regression for visual tracking[J]. IEEE Transactions on Cognitive and Developmental Systems, 2023, 15(3): 1239-1251., articleTitle=Deep decoupling classification and regression for visual tracking, refAbstract=null), Reference(id=1241788824599200677, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, doi=null, pmid=null, pmcid=null, year=2022, volume=31, issue=null, pageStart=2190, pageEnd=2200, url=null, language=null, rfNumber=[16], rfOrder=15, authorNames=LI Z, ZHANG J, LI Y, journalName=IEEE Transactions on Image Processing, refType=null, unstructuredReference=LI Z, ZHANG J, LI Y, et al. Learning feature channel weighting for real-time visual tracking[J]. IEEE Transactions on Image Processing, 2022, 31: 2190-2200., articleTitle=Learning feature channel weighting for real-time visual tracking, refAbstract=null), Reference(id=1241788824666309542, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, doi=null, pmid=null, pmcid=null, year=2022, volume=null, issue=null, pageStart=95, pageEnd=99, url=null, language=null, rfNumber=[17], rfOrder=16, authorNames=SUN N, ZHANG Z, XUE W, journalName=null, refType=null, unstructuredReference=SUN N, ZHANG Z, XUE W. Contrastive-Siamese collaborative networks for visual object tracking[C]// 2022 International Conference on High Performance Big Data and Intelligent Systems(HDIS), Tianjin, China. 2022:95-99., articleTitle=Contrastive-Siamese collaborative networks for visual object tracking, refAbstract=null), Reference(id=1241788824746001321, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, doi=null, pmid=null, pmcid=null, year=2015, volume=null, issue=null, pageStart=1, pageEnd=9, url=null, language=null, rfNumber=[18], rfOrder=17, authorNames=SZEGEDY C, LIU W, JIA Y, journalName=null, refType=null, unstructuredReference=SZEGEDY C, LIU W, JIA Y, et al. Going deeper with convolutions[C]// 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA,USA. 2015: 1-9., articleTitle=Going deeper with convolutions, refAbstract=null), Reference(id=1241788824829887402, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, doi=null, pmid=null, pmcid=null, year=2021, volume=43, issue=5, pageStart=1562, pageEnd=1577, url=null, language=null, rfNumber=[19], rfOrder=18, authorNames=HUANG L, ZHAO X, HUANG K, journalName=IEEE Transactions on Pattern Analysis and Machine Intelligence, refType=null, unstructuredReference=HUANG L, ZHAO X, HUANG K. GOT-10k: A large high-diversity benchmark for generic object tracking in the wild[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2021, 43(5): 1562-1577., articleTitle=GOT-10k: A large high-diversity benchmark for generic object tracking in the wild, refAbstract=null), Reference(id=1241788824913773483, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, doi=null, pmid=null, pmcid=null, year=2015, volume=37, issue=9, pageStart=1834, pageEnd=1848, url=null, language=null, rfNumber=[20], rfOrder=19, authorNames=WU Y, LIM J, YANG M H, journalName=IEEE Transactions on Pattern Analysis and Machine Intelligence, refType=null, unstructuredReference=WU Y, LIM J, YANG M H. Object tracking benchmark[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2015, 37(9): 1834–1848., articleTitle=Object tracking benchmark, refAbstract=null), Reference(id=1241788824989270957, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, doi=null, pmid=null, pmcid=null, year=2016, volume=null, issue=null, pageStart=445, pageEnd=461, url=null, language=null, rfNumber=[21], rfOrder=20, authorNames=MUELLER M, SMITH N, GHANEM B, journalName=null, refType=null, unstructuredReference=MUELLER M, SMITH N, GHANEM B. A benchmark and simulator for UAV tracking[C]// Proceedings of the European Conference on Computer Vision. 2016:445-461., articleTitle=A benchmark and simulator for UAV tracking, refAbstract=null), Reference(id=1241788825047991215, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, doi=null, pmid=null, pmcid=null, year=2015, volume=24, issue=12, pageStart=5630, pageEnd=5644, url=null, language=null, rfNumber=[22], rfOrder=21, authorNames=LIANG P, BLASCH E, LING H, journalName=IEEE Transactions on Image Processing., refType=null, unstructuredReference=LIANG P, BLASCH E, LING H. Encodingcolor information for visual tracking: Algorithms and benchmark[J]. IEEE Transactions on Image Processing., 2015, 24(12): 5630-5644., articleTitle=Encodingcolor information for visual tracking: Algorithms and benchmark, refAbstract=null), Reference(id=1241788825119294384, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, doi=null, pmid=null, pmcid=null, year=2017, volume=null, issue=null, pageStart=5000, pageEnd=5008, url=null, language=null, rfNumber=[23], rfOrder=22, authorNames=VALMADRE J, BERTINETTO L, HENRIQUES J, journalName=null, refType=null, unstructuredReference=VALMADRE J, BERTINETTO L, HENRIQUES J,et al. End-to-end representation learning for correlation filter based tracking[C]// Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, USA. 2017: 5000–5008., articleTitle=End-to-end representation learning for correlation filter based tracking, refAbstract=null), Reference(id=1241788825186403250, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, doi=null, pmid=null, pmcid=null, year=2017, volume=39, issue=8, pageStart=1561, pageEnd=1575, url=null, language=null, rfNumber=[24], rfOrder=23, authorNames=DANELLJAN M, HÄGER G, KHAN F S, journalName=IEEE Transactions on Pattern Analysis and Machine Intelligence., refType=null, unstructuredReference=DANELLJAN M, HÄGER G, KHAN F S, et al. Discriminative scale space tracking[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence. 2017, 39(8): 1561-1575., articleTitle=Discriminative scale space tracking, refAbstract=null), Reference(id=1241788825253512115, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, doi=null, pmid=null, pmcid=null, year=2015, volume=null, issue=null, pageStart=749, pageEnd=758, url=null, language=null, rfNumber=[25], rfOrder=24, authorNames=HONG Z, CHEN Z, WANG C, journalName=null, refType=null, unstructuredReference=HONG Z, CHEN Z, WANG C, et al. MUlti-Store Tracker (MUSTer): A cognitive psychology inspired approach to object tracking[C]// 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA. 2015: 749-758., articleTitle=MUlti-Store Tracker (MUSTer): A cognitive psychology inspired approach to object tracking, refAbstract=null), Reference(id=1241788825337398196, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, doi=null, pmid=null, pmcid=null, year=2016, volume=null, issue=null, pageStart=1401, pageEnd=1409, url=null, language=null, rfNumber=[26], rfOrder=25, authorNames=BERTINETTO L, VALMADRE J, GOLODETZ S, journalName=null, refType=null, unstructuredReference=BERTINETTO L, VALMADRE J, GOLODETZ S, et al. Staple: Complementary learners for real-time tracking[C]// 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.2016: 1401-1409., articleTitle=Staple: Complementary learners for real-time tracking, refAbstract=null), Reference(id=1241788826797015991, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, doi=null, pmid=null, pmcid=null, year=2014, volume=null, issue=null, pageStart=188, pageEnd=203, url=null, language=null, rfNumber=[27], rfOrder=26, authorNames=ZHANG Jianming, MA Shugao, SCLAROFF S, journalName=null, refType=null, unstructuredReference=ZHANG Jianming, MA Shugao, SCLAROFF S. MEEM: Robust tracking via multiple experts using entropy minimization[C]// Proceedings of the 13th European Conference on Computer Vision, Zurich, Switzerland. 2014: 188-203., articleTitle=MEEM: Robust tracking via multiple experts using entropy minimization, refAbstract=null), Reference(id=1241788826893484985, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, doi=null, pmid=null, pmcid=null, year=2018, volume=null, issue=null, pageStart=459, pageEnd=474, url=null, language=null, rfNumber=[28], rfOrder=27, authorNames=DONG Xingping, SHEN Jianbing, journalName=null, refType=null, unstructuredReference=DONG Xingping, SHEN Jianbing. Triplet loss in Siamese network for object tracking[C]// European Conference on Computer Vision, Munich, Germany. 2018:459-474., articleTitle=Triplet loss in Siamese network for object tracking, refAbstract=null), Reference(id=1241788826968982457, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, doi=null, pmid=null, pmcid=null, year=2015, volume=37, issue=3, pageStart=583, pageEnd=596, url=null, language=null, rfNumber=[29], rfOrder=28, authorNames=HENRIQUES J F, CASEIRO R, MARTINS P, journalName=IEEE Transactions on Pattern Analysis and Machine Intelligence, refType=null, unstructuredReference=HENRIQUES J F, CASEIRO R, MARTINS P, et al. High-speed tracking with kernelized correlation filters[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2015, 37(3): 583-596., articleTitle=High-speed tracking with kernelized correlation filters, refAbstract=null), Reference(id=1241788827052868539, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, doi=null, pmid=null, pmcid=null, year=2016, volume=38, issue=10, pageStart=2096, pageEnd=2109, url=null, language=null, rfNumber=[30], rfOrder=29, authorNames=HARE S, GOLODETZ S, SAFFARI A, journalName=IEEE Transactions on Pattern Analysis and Machine Intelligence,, refType=null, unstructuredReference=HARE S, GOLODETZ S, SAFFARI A, et al. Struck:Structured output tracking with kernels[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2016, 38(10): 2096-2109., articleTitle=Struck:Structured output tracking with kernels, refAbstract=null)], funds=[Fund(id=1241788822674015103, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, awardId=61971158,62171150, language=CN, fundingSource=国家自然科学基金(61971158,62171150), fundOrder=null, country=null), Fund(id=1241788822753706881, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, awardId=tsqn202211087, language=CN, fundingSource=泰山学者工程专项经费资助(tsqn202211087), fundOrder=null, country=null), Fund(id=1241788822833398659, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, awardId=62071144, language=CN, fundingSource=国家自然基金面上项目资助(62071144), fundOrder=null, country=null), Fund(id=1241788822917284740, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, awardId=ZR2023MF091, language=CN, fundingSource=山东省自然科学基金项目(ZR2023MF091), fundOrder=null, country=null)], companyList=[AuthorCompany(id=1241788815577252583, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, xref=null, ext=[AuthorCompanyExt(id=1241788815585641191, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, companyId=1241788815577252583, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=Harbin Institute of Technology, Weihai 264200, China), AuthorCompanyExt(id=1241788815589835496, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, companyId=1241788815577252583, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=哈尔滨工业大学(威海) 威海 264200)])], figs=[ArticleFig(id=1241788819134022459, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, language=EN, label=Fig. 1, caption=New diverse branch block, figureFileSmall=Xg8AW3x3U7oNe8GcPGRBfA==, figureFileBig=eczhhWUMLQo6Q1Qa2jON1g==, tableContent=null), ArticleFig(id=1241788819209519935, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, language=CN, label=图1, caption=新型多分支结构, figureFileSmall=Xg8AW3x3U7oNe8GcPGRBfA==, figureFileBig=eczhhWUMLQo6Q1Qa2jON1g==, tableContent=null), ArticleFig(id=1241788819305988930, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, language=EN, label=Fig. 2, caption=The three structures are reparameterized into a single convolution, figureFileSmall=FEWdC8kZ7H19RRSdf+8S/Q==, figureFileBig=trGwYHsuivT/FDMz4A4bjA==, tableContent=null), ArticleFig(id=1241788819406652229, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, language=CN, label=图2, caption=三种结构重参数化为单一卷积, figureFileSmall=FEWdC8kZ7H19RRSdf+8S/Q==, figureFileBig=trGwYHsuivT/FDMz4A4bjA==, tableContent=null), ArticleFig(id=1241788819473761096, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, language=EN, label=Fig. 3, caption=Algorithm flow diagram, figureFileSmall=c63JqK7VXRjJ8+lVFcb3lA==, figureFileBig=scMDS5JWkPtc1T9WBCzmXg==, tableContent=null), ArticleFig(id=1241788819540869962, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, language=CN, label=图3, caption=算法流程示意图, figureFileSmall=c63JqK7VXRjJ8+lVFcb3lA==, figureFileBig=scMDS5JWkPtc1T9WBCzmXg==, tableContent=null), ArticleFig(id=1241788819633144652, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, language=EN, label=Fig. 4, caption=Qualitative analysis comparison diagram of algorithm, figureFileSmall=ZZ3/0YiyOZOH1D7S6venYg==, figureFileBig=lA+MQcMEswd8KGqwWk2ZaQ==, tableContent=null), ArticleFig(id=1241788819717030735, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, language=CN, label=图4, caption=算法定性分析对比图, figureFileSmall=ZZ3/0YiyOZOH1D7S6venYg==, figureFileBig=lA+MQcMEswd8KGqwWk2ZaQ==, tableContent=null), ArticleFig(id=1241788819796722515, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, language=EN, label=Fig. 5, caption=The tracking accuracy and tracking success rate of the algorithm on OTB100, figureFileSmall=LQQXABVwkUn0+44XzbrN0A==, figureFileBig=K4INkg68VS2npn9vbuFdyg==, tableContent=null), ArticleFig(id=1241788819859637078, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, language=CN, label=图5, caption=算法在OTB100上的跟踪精度和跟踪成功率, figureFileSmall=LQQXABVwkUn0+44XzbrN0A==, figureFileBig=K4INkg68VS2npn9vbuFdyg==, tableContent=null), ArticleFig(id=1241788819930940249, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, language=EN, label=Fig. 6, caption=The tracking accuracy and tracking success rate of the algorithm on UAV123, figureFileSmall=yayRSSbPSApEHw0Avyt39g==, figureFileBig=Go31PtNhfEQi58Dtgj1A6A==, tableContent=null), ArticleFig(id=1241788820002243419, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, language=CN, label=图6, caption=算法在UAV123上的跟踪精度和跟踪成功率, figureFileSmall=yayRSSbPSApEHw0Avyt39g==, figureFileBig=Go31PtNhfEQi58Dtgj1A6A==, tableContent=null), ArticleFig(id=1241788820086129501, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, language=EN, label=Fig. 7, caption=The tracking accuracy and tracking success rate of the algorithm on Temple-Color-128, figureFileSmall=tEGvpd3sH5BlXP1v8vW2IA==, figureFileBig=RYWNFvweXJ1UZRgSOFL3lw==, tableContent=null), ArticleFig(id=1241788820178404192, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, language=CN, label=图7, caption=算法在Temple-Color-128上的跟踪精度和跟踪成功率, figureFileSmall=tEGvpd3sH5BlXP1v8vW2IA==, figureFileBig=RYWNFvweXJ1UZRgSOFL3lw==, tableContent=null), ArticleFig(id=1241788820245513059, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, language=EN, label=Table 1, caption=

The backbone network structure of the training phase

, figureFileSmall=null, figureFileBig=null, tableContent=
卷积层卷积结构步长通道数模板尺寸搜索尺寸
3127×127255×255
Conv111×1129659×59123×123
Pool13×329629×2961×61
DBB11×1,1×1-3×3,1×1-AVG,3×3125627×2759×59
DBB21×1,1×1-3×3,1×1-AVG,3×3125625×2557×57
Pool23×3225612×1228×28
DBB31×1,1×1-3×3,1×1-AVG,3×3138410×1026×26
DBB41×1,1×1-3×3,1×1-AVG,3×313848×824×24
Conv23×312566×622×22
), ArticleFig(id=1241788820312621925, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, language=CN, label=表1, caption=

训练阶段的骨干网络结构

, figureFileSmall=null, figureFileBig=null, tableContent=
卷积层卷积结构步长通道数模板尺寸搜索尺寸
3127×127255×255
Conv111×1129659×59123×123
Pool13×329629×2961×61
DBB11×1,1×1-3×3,1×1-AVG,3×3125627×2759×59
DBB21×1,1×1-3×3,1×1-AVG,3×3125625×2557×57
Pool23×3225612×1228×28
DBB31×1,1×1-3×3,1×1-AVG,3×3138410×1026×26
DBB41×1,1×1-3×3,1×1-AVG,3×313848×824×24
Conv23×312566×622×22
), ArticleFig(id=1241788820379730791, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, language=EN, label=Table 2, caption=

The backbone network structure of the tracking phase

, figureFileSmall=null, figureFileBig=null, tableContent=
卷积层卷积结构步长通道数模板尺寸搜索尺寸
3127×127255×255
Conv111×1129659×59123×123
Pool13×329629×2962×61
Conv23×3125627×2759×59
Conv33×3125625×2557×57
Pool23×3225612×1228×28
Conv43×3138410×1026×26
Conv53×313848×824×24
Conv63×312566×622×22
), ArticleFig(id=1241788820455228265, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, language=CN, label=表2, caption=

跟踪阶段的骨干网络结构

, figureFileSmall=null, figureFileBig=null, tableContent=
卷积层卷积结构步长通道数模板尺寸搜索尺寸
3127×127255×255
Conv111×1129659×59123×123
Pool13×329629×2962×61
Conv23×3125627×2759×59
Conv33×3125625×2557×57
Pool23×3225612×1228×28
Conv43×3138410×1026×26
Conv53×313848×824×24
Conv63×312566×622×22
), ArticleFig(id=1241788820530725739, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, language=EN, label=Table 3, caption=

Ablation experiment

, figureFileSmall=null, figureFileBig=null, tableContent=
骨干网络3×31×11×1-3×31×1-平均池化特征剪枝精确度成功率
0.7590.557
↑0.7%↑0.9%
↑2.2%↑0.7%
↑0.9%↑0.6%
AlexNet+DBB↑4.1%↑2.3%
↑3.1%↑2.4%
↑3.3%↑2.0%
↑3.8%↑3.3%
↑4.8%↑4.6%
), ArticleFig(id=1241788820606223213, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, language=CN, label=表3, caption=

消融实验

, figureFileSmall=null, figureFileBig=null, tableContent=
骨干网络3×31×11×1-3×31×1-平均池化特征剪枝精确度成功率
0.7590.557
↑0.7%↑0.9%
↑2.2%↑0.7%
↑0.9%↑0.6%
AlexNet+DBB↑4.1%↑2.3%
↑3.1%↑2.4%
↑3.3%↑2.0%
↑3.8%↑3.3%
↑4.8%↑4.6%
), ArticleFig(id=1241788820698497904, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, language=EN, label=Table 4, caption=

Tracking speed comparison

, figureFileSmall=null, figureFileBig=null, tableContent=
算法SiamDBBSiamDBB(未重参数化)SiamFCCFNet
速度(FPS)110638043
), ArticleFig(id=1241788820782383987, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, language=CN, label=表4, caption=

跟踪速度对比

, figureFileSmall=null, figureFileBig=null, tableContent=
算法SiamDBBSiamDBB(未重参数化)SiamFCCFNet
速度(FPS)110638043
), ArticleFig(id=1241788820853687157, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, language=EN, label=Table 5, caption=

Tracking accuracy in different scenarios

, figureFileSmall=null, figureFileBig=null, tableContent=
算法SVOPRIPROCCDEFFMIVBCMBOVLR
SiamDBB0.8090.8160.8220.7800.7850.8080.8140.7520.8180.7410.873
SiamFC0.7320.7780.7590.7390.7480.7390.7120.7060.7280.6550.762
MEEM0.7510.8100.8140.7740.7320.7560.7660.7590.7660.7390.678
MUSTER0.7100.7440.7730.7340.6890.6830.7820.7840.6780.5900.673
SRDCF0.7470.7400.7420.7320.7360.7690.7860.7750.7670.6020.663
Staple0.7270.7380.7680.7270.7520.7090.7830.7490.7000.6680.610
SiamTri0.7480.7630.7740.7260.6800.7630.7460.7150.7270.7230.897
CFNet0.7320.7600.7860.7000.7140.7070.7070.7560.6810.6040.750
DSST0.6580.6650.7110.6090.5680.5750.7260.7040.5700.4800.602
), ArticleFig(id=1241788822317499257, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, language=CN, label=表5, caption=

不同场景下的跟踪精度

, figureFileSmall=null, figureFileBig=null, tableContent=
算法SVOPRIPROCCDEFFMIVBCMBOVLR
SiamDBB0.8090.8160.8220.7800.7850.8080.8140.7520.8180.7410.873
SiamFC0.7320.7780.7590.7390.7480.7390.7120.7060.7280.6550.762
MEEM0.7510.8100.8140.7740.7320.7560.7660.7590.7660.7390.678
MUSTER0.7100.7440.7730.7340.6890.6830.7820.7840.6780.5900.673
SRDCF0.7470.7400.7420.7320.7360.7690.7860.7750.7670.6020.663
Staple0.7270.7380.7680.7270.7520.7090.7830.7490.7000.6680.610
SiamTri0.7480.7630.7740.7260.6800.7630.7460.7150.7270.7230.897
CFNet0.7320.7600.7860.7000.7140.7070.7070.7560.6810.6040.750
DSST0.6580.6650.7110.6090.5680.5750.7260.7040.5700.4800.602
), ArticleFig(id=1241788822422356859, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, language=EN, label=Table 6, caption=

Tracking success in different scenarios

, figureFileSmall=null, figureFileBig=null, tableContent=
算法SVOPRIPROCCDEFFMIVBCMBOVLR
SiamDBB0.5920.5810.5920.5690.5530.6090.5930.5560.6320.5580.635
SiamFC0.5420.5620.5600.5570.5430.5790.5340.5240.5800.5100.532
MEEM0.4750.5340.5390.5260.4820.5430.5310.5360.5750.5150.376
MUSTER0.5100.5350.5510.5520.5210.5310.5990.5810.5440.4620.454
SRDCF0.5630.5490.5430.5570.5440.5970.6090.5830.5940.4610.495
Staple0.5210.5330.5480.5430.5510.5400.5920.5600.5410.4750.400
SiamTri0.5630.5630.5800.5490.5010.5850.5790.5420.5670.5430.634
CFNet0.5460.5530.5680.5270.5260.5540.5410.5610.5400.4540.554
DSST0.4030.4410.4750.4160.4060.4270.4900.4810.4390.3640.301
), ArticleFig(id=1241788822506242940, tenantId=1146029695717560320, journalId=1238841944844054536, articleId=1241757971894563721, language=CN, label=表6, caption=

不同场景下的跟踪成功率

, figureFileSmall=null, figureFileBig=null, tableContent=
算法SVOPRIPROCCDEFFMIVBCMBOVLR
SiamDBB0.5920.5810.5920.5690.5530.6090.5930.5560.6320.5580.635
SiamFC0.5420.5620.5600.5570.5430.5790.5340.5240.5800.5100.532
MEEM0.4750.5340.5390.5260.4820.5430.5310.5360.5750.5150.376
MUSTER0.5100.5350.5510.5520.5210.5310.5990.5810.5440.4620.454
SRDCF0.5630.5490.5430.5570.5440.5970.6090.5830.5940.4610.495
Staple0.5210.5330.5480.5430.5510.5400.5920.5600.5410.4750.400
SiamTri0.5630.5630.5800.5490.5010.5850.5790.5420.5670.5430.634
CFNet0.5460.5530.5680.5270.5260.5540.5410.5610.5400.4540.554
DSST0.4030.4410.4750.4160.4060.4270.4900.4810.4390.3640.301
)], attaches=null, journal=Journal(id=1238822691516493831, delFlag=0, nameCn=遥测遥控, nameEn=Journal of Telemetry, Tracking and Command, nameHistory1=null, nameHistory2=null, issn=2095-1000, eissn=null, cn=11-1780/TP, coden=null, periodic=1, language=CN, oaType=null, ccby=null, superviseOffice=null, ownerOffice=null, pubOffice=null, editorOffice=null, officeType=null, aims=null, clcCode=null, officeProv=null, officeCity=null, officeAddr=null, officeZip=null, officeEmail=null, officePhone=null, editDirector=null, officeDirector=null, officeDirectorPhone=null, officeStaffNum=null, officeEmpNum=null, coverPicUrl=uUqWBk5pV4VWPULNoAHkuQ==, journalPrice=null, startedYear=null, abbrevIsoEn=Journal of Telemetry, Tracking and Command, journalRemark=null, publicationField=null, createdTime=1773287945871, updatedTime=1773299995823, createdBy=18614031015, updatedBy=13701087609, firstLetterCn=J, firstLetterEn=J, subjectCode=Engineering, subjectName=null, subjectCodeEn=Engineering, subjectNameEn=null, picCn=uUqWBk5pV4VWPULNoAHkuQ==, picEn=RDDslHHvDBDNnJMjk6BxdA==, jcr=null, cjcr=null, exts=[JournalExt(id=1238873232732909900, language=CN, name=遥测遥控, nameHistory1=null, nameHistory2=null, managedBy=, sponsoredBy=, publishedBy=, editorOffice=, officeProv=null, officeCity=null, officeAddr=, officeZip=, editDirector=, officeDirector=null, officePhone=null, coverPicUrl=null, journalRemark=, submitArticleUrl=null, websiteUrl=, createdTime=1773299995835, updatedTime=1773299995835, createdBy=13701087609, updatedBy=13701087609, submissionGuidelinesUrl=, submissionAuthorUrl=https://manuscript.spacejournal.cn/ycyk, submissionEditorUrl=https://manuscript.spacejournal.cn/ycyk, submissionReviewUrl=https://manuscript.spacejournal.cn/ycyk, submissionCeEditorUrl=, submissionAeEditorUrl=, option={"copyright":""}), JournalExt(id=1238873232779047245, language=EN, name=Journal of Telemetry, Tracking and Command, nameHistory1=null, nameHistory2=null, managedBy=, sponsoredBy=, publishedBy=, editorOffice=, officeProv=null, officeCity=null, officeAddr=, officeZip=, editDirector=, officeDirector=null, officePhone=null, coverPicUrl=null, journalRemark=, submitArticleUrl=null, websiteUrl=, createdTime=1773299995846, updatedTime=1773299995846, createdBy=13701087609, updatedBy=13701087609, submissionGuidelinesUrl=, submissionAuthorUrl=https://manuscript.spacejournal.cn/ycyk, submissionEditorUrl=https://manuscript.spacejournal.cn/ycyk, submissionReviewUrl=https://manuscript.spacejournal.cn/ycyk, submissionCeEditorUrl=, submissionAeEditorUrl=, option={"copyright":""})], databaseList=null, tenantJournalId=1238841944844054536, websiteList=[Website(id=1238843830011744952, webName=null, webTitle=null, webDomain=null, webCopyrigh=null, webIpcNo=null, seoTitle=null, seoKeywords=null, seoDescription=null, tenantJournalId=null, journalId=1238841944844054536, journalNameCn=null, journalNameEn=null, grayFlag=null, tenantId=1146029695717560320, platformId=null, journalGroupId=null, journalGroupNameCn=null, journalGroupNameEn=null, type=1, domain=https://castjournals.cast.org.cn/joweb/ycyk/CN, language=CN, createTime=1773292985680, createBy=18614031015, updateTime=1773293577303, updateBy=18614031015, name=遥测遥控-中文, tplId=1146099689490845704, title=遥测遥控, delFlag=0, indexPage=/home, props=[WebsiteProps(id=1238846390617568150, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1238843830011744952, code=articleTextType, value=kx, createTime=1773293596176, updateTime=1773293596176, creator=18614031015, updator=18614031015), WebsiteProps(id=1238846390596596627, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1238843830011744952, code=banner, value=null, createTime=1773293596171, updateTime=1773293596171, creator=18614031015, updator=18614031015), WebsiteProps(id=1238846390638539673, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1238843830011744952, code=grayFlag, value=0, createTime=1773293596181, updateTime=1773293596181, creator=18614031015, updator=18614031015), WebsiteProps(id=1238846390588208018, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1238843830011744952, code=logo, value=https://castjournals.cast.org.cn/joweb/ycyk/CN/file/pic?fileId=g9S2GGWx7iZjf51o9vpmmA==, createTime=1773293596169, updateTime=1773293596169, creator=18614031015, updator=18614031015), WebsiteProps(id=1238846390651122587, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1238843830011744952, code=minRunFlag, value=0, createTime=1773293596184, updateTime=1773293596184, creator=18614031015, updator=18614031015), WebsiteProps(id=1238846390613373845, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1238843830011744952, code=picServerUrl, value=https://castjournals.cast.org.cn/joweb/ycyk/CN/file/pic, createTime=1773293596175, updateTime=1773293596175, creator=18614031015, updator=18614031015), WebsiteProps(id=1238846390646928282, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1238843830011744952, code=silenceFlag, value=0, createTime=1773293596183, updateTime=1773293596183, creator=18614031015, updator=18614031015), WebsiteProps(id=1238846390604985236, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1238843830011744952, code=staticResourcePath, value=https://castjournals.cast.org.cn/joweb/cast_kjdb_cn_619/, createTime=1773293596173, updateTime=1773293596173, creator=18614031015, updator=18614031015), WebsiteProps(id=1238846390625956759, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1238843830011744952, code=themeColor, value=null, createTime=1773293596178, updateTime=1773293596178, creator=18614031015, updator=18614031015), WebsiteProps(id=1238846390634345368, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1238843830011744952, code=themeStyle, value=null, createTime=1773293596180, updateTime=1773293596180, creator=18614031015, updator=18614031015)]), Website(id=1238843830078853819, webName=null, webTitle=null, webDomain=null, webCopyrigh=null, webIpcNo=null, seoTitle=null, seoKeywords=null, seoDescription=null, tenantJournalId=null, journalId=1238841944844054536, journalNameCn=null, journalNameEn=null, grayFlag=null, tenantId=1146029695717560320, platformId=null, journalGroupId=null, journalGroupNameCn=null, journalGroupNameEn=null, type=1, domain=https://castjournals.cast.org.cn/joweb/ycyk/EN, language=EN, createTime=1773292985696, createBy=18614031015, updateTime=1773293566698, updateBy=18614031015, name=遥测遥控-英文, tplId=1146101810881728533, title=Journal of Telemetry, Tracking and Command, delFlag=0, indexPage=/home, props=[WebsiteProps(id=1238846419235304352, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1238843830078853819, code=articleTextType, value=kx, createTime=1773293602999, updateTime=1773293602999, creator=18614031015, updator=18614031015), WebsiteProps(id=1238846419214332829, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1238843830078853819, code=banner, value=null, createTime=1773293602994, updateTime=1773293602994, creator=18614031015, updator=18614031015), WebsiteProps(id=1238846419256275875, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1238843830078853819, code=grayFlag, value=0, createTime=1773293603004, updateTime=1773293603004, creator=18614031015, updator=18614031015), WebsiteProps(id=1238846419205944220, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1238843830078853819, code=logo, value=https://castjournals.cast.org.cn/joweb/ycyk/EN/file/pic?fileId=g9S2GGWx7iZjf51o9vpmmA==, createTime=1773293602992, updateTime=1773293602992, creator=18614031015, updator=18614031015), WebsiteProps(id=1238846419268858789, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1238843830078853819, code=minRunFlag, value=0, createTime=1773293603007, updateTime=1773293603007, creator=18614031015, updator=18614031015), WebsiteProps(id=1238846419231110047, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1238843830078853819, code=picServerUrl, value=https://castjournals.cast.org.cn/joweb/ycyk/EN/file/pic, createTime=1773293602998, updateTime=1773293602998, creator=18614031015, updator=18614031015), WebsiteProps(id=1238846419264664484, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1238843830078853819, code=silenceFlag, value=0, createTime=1773293603006, updateTime=1773293603006, creator=18614031015, updator=18614031015), WebsiteProps(id=1238846419222721438, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1238843830078853819, code=staticResourcePath, value=https://castjournals.cast.org.cn/joweb/cast_kjdb_en_623/, createTime=1773293602996, updateTime=1773293602996, creator=18614031015, updator=18614031015), WebsiteProps(id=1238846419243692961, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1238843830078853819, code=themeColor, value=null, createTime=1773293603001, updateTime=1773293603001, creator=18614031015, updator=18614031015), WebsiteProps(id=1238846419252081570, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1238843830078853819, code=themeStyle, value=null, createTime=1773293603003, updateTime=1773293603003, creator=18614031015, updator=18614031015)])], journalTitle=遥测遥控, weixinUrl=null, journalUrl=https://ycyk.spacejournal.cn/, iacademicId=null, status=1, seqNo=null, journalTitleEn=Journal of Telemetry, Tracking and Command, journalPhotoCn=uUqWBk5pV4VWPULNoAHkuQ==, journalPhotoEn=RDDslHHvDBDNnJMjk6BxdA==, journalFirstLetter=J, journalRecommend=null, journalNew=null, journalCollection=null, jcrJf=null, cjcrJf=null, jcrJfStr=null, cjcrJfStr=null, submissionFirstDecision=null, sciSubjectClassification=null, casSubjectClassification=null, citeScore=null, totalCitationFrequency=null, icpCode=null, psCode=null, advertisingLicenseCode=null, copyrightInformation=null, country=null, option=, provinceCode=null, provinceName=null, collectFlag=false), detailUrlCn=https://castjournals.cast.org.cn/joweb/ycyk/CN/10.12347/j.ycyk.20240408001, detailUrlEn=https://castjournals.cast.org.cn/joweb/ycyk/EN/10.12347/j.ycyk.20240408001, pdfUrlCn=https://castjournals.cast.org.cn/joweb/ycyk/CN/PDF/10.12347/j.ycyk.20240408001, pdfUrlEn=https://castjournals.cast.org.cn/joweb/ycyk/EN/PDF/10.12347/j.ycyk.20240408001, aliStartDate=null, aliEndDate=null, collectionFlag=false, citedCount=null, citedUrl=null, reference=null)
收藏切换
基于多分支和重参数化的孪生网络跟踪算法
收藏切换
PDF下载
金铭 , 唐宇 , 韩勇 , 刘帅 , 闫锋刚
遥测遥控 | 人工智能技术 2024,45(4): 22-30
收起
收藏切换
遥测遥控 | 人工智能技术 2024, 45(4): 22-30
基于多分支和重参数化的孪生网络跟踪算法
全屏
金铭, 唐宇, 韩勇, 刘帅, 闫锋刚
作者信息
  • 哈尔滨工业大学(威海) 威海 264200
  • 金铭 1968年生,博士,教授,博士生导师。

    唐宇 2000年生,硕士研究生。

    韩勇 1976年生,博士,讲师,硕士生导师。

    刘帅 1980年生,博士,教授,博士生导师。

    闫锋刚 1983年生,博士,教授,博士生导师,泰山学者青年专家。

Siamese Network Tracking Algorithm Based on Diverse Branch Block and Reparameterization
Ming JIN, Yu TANG, Yong HAN, Shuai LIU, Feng Gang YAN
Affiliations
  • Harbin Institute of Technology, Weihai 264200, China
出版时间: 2024-07-15 doi: 10.12347/j.ycyk.20240408001
文章导航
收藏切换

针对孪生网络对尺度变化目标特征表达能力不足的问题,本文使用不同尺寸的卷积、池化分支和剪枝操作构成多分支结构,以提高特征的鲁棒性并保证孪生网络的平移不变性。针对多分支结构带来参数量过多的问题,本文在跟踪阶段将多分支结构重参数化为单一的卷积,有效减少跟踪阶段时间成本。实验结果表明:本文提出的算法相比于SiamFC,在OTB100数据集上,其精度、成功率和跟踪速度分别提高了5.1%、3%、30 FPS,在UAV123和Temple-Color-128数据集上跟踪精度和成功率均有所提高。

视觉跟踪  /  孪生网络  /  特征提取  /  结构重参数化

Aiming at the problem that the Siamese network has insufficient ability to express the features of scale-varying targets, a multi-branch structure is constructed by using convolution, pooling branches and pruning operations of different sizes to improve the robustness of features and ensure the translation invariance of the Siamese network. Aiming at the problem that the multi-branch structure brings too many parameters, the multi-branch structure is reparameterized into a single convolution in the tracking stage, which effectively reduces the time cost in the tracking stage. The experimental results show that compared with SiamFC, the accuracy, success rate and tracking speed of the proposed algorithm on the OTB100 datasets are improved by 5.1%, 3% and 30 FPS,respectively. The tracking accuracy and success rate are improved on the UAV123 and Temple-Color-128 datasets.

Visual tracking  /  Siamese network  /  Feature extraction  /  Structural reparameterization
金铭, 唐宇, 韩勇, 刘帅, 闫锋刚. 基于多分支和重参数化的孪生网络跟踪算法. 遥测遥控, 2024 , 45 (4) : 22 -30 . DOI: 10.12347/j.ycyk.20240408001
Ming JIN, Yu TANG, Yong HAN, Shuai LIU, Feng Gang YAN. Siamese Network Tracking Algorithm Based on Diverse Branch Block and Reparameterization[J]. Journal of Telemetry, Tracking and Command, 2024 , 45 (4) : 22 -30 . DOI: 10.12347/j.ycyk.20240408001
目标跟踪作为计算机视觉领域富有挑战性的研究方向之一,是高层计算机视觉任务的基石[1],在公共安防、无人驾驶、人机交互等领域有着广泛的应用。但在复杂的现实环境下,要实现实时且准确地跟踪,仍需要进一步深入研究。
近年来,基于相关滤波的跟踪算法和基于孪生网络的跟踪算法是目标跟踪领域中两个主要研究方向。前者大多采用单一特征或融合特征[25],但相较于孪生网络跟踪算法所使用的深度特征,以上特征鲁棒性较差,从而导致跟踪器的性能不佳。对此,研究人员在边界效应[6]、尺度估计[7]等方面进行一系列的研究以提高跟踪器性能。部分研究发现,相关滤波算法结合深度特征进行跟踪时,跟踪速度会大幅下降,难以满足实时性。而孪生网络跟踪算法在保证实时性的同时取得了良好的跟踪精度,因此基于孪生网络的跟踪框架逐渐成为研究的主流。
SiamFC是首个孪生网络跟踪算法[8],针对尺度变化问题,它通过搜索多个比例的搜索图像以适应跟踪目标,但带来了成倍的计算量。对此Li等人引入区域推荐方法[9],提高了模型对不同尺度目标的适应能力,但该方法会引入尺度的先验和超参数,影响模型的泛化性。对此GUO、PENG、CHEN等人采用分类子网络直接预测目标中心点与边界的距离[1012]。为减少跟踪阶段的计算负担并避免目标模板被污染,SiamFC仅采用第一帧目标信息作为跟踪模板,后续不执行任何的模型更新,但这限制了算法的性能,当目标发生形变、遮挡时,可能发生跟踪漂移。对此,David采用线性更新策略[13]、Feng采用记忆模型[14]、Yang[15]和Zhang[16]采用边框回归等方法对模型进行更新,提高了跟踪性能。SiamFC采用AlexNet作为骨干网络,该网络不能提取表达能力非常强的目标特征表示,但针对该问题的研究较少,主要集中在增加骨干网络的深度[17]。该方法在提高跟踪精度的同时增加了计算量,影响跟踪速度,并且需要在超大规模的数据集上进行训练,所需算力和时间成本高。不同于此前研究,本文针对在扩展骨干网络宽度的同时不增加参数这一问题,从多分支结构出发,基于训练和跟踪阶段可分离性质,提出了一种不增加跟踪阶段额外时间成本的多分支结构,通过应用该结构来优化骨干网络,提高跟踪器对目标特征的表达能力,在多个测试数据集上测试可知,相比于基准算法,本文提出的算法在跟踪精度、成功率、速度上均有较大幅度提升。
孪生网络将跟踪问题当作相似性学习问题,它的骨干网络提取到的深度特征会通过度量函数f得到模板图像与搜索图像的相似程度,如式(1)所示,其中z表示模板分支的输入,x表示搜索分支的输入,φ(·)作为骨干网络提取图像的特征表示,*表示互相关运算,b是一个偏置项。
f(zx)=φ(z)*φ(x)+b
虽然孪生网络取得了不错的跟踪性能,但其骨干网络为单一的卷积,在宽度上仍有改进空间。前人在图像分类任务中,通过使用多分支结构搭建的骨干网络提取的特征能更好地应对尺度变化问题[18],但将相同的多分支结构迁移到孪生网络中反而使跟踪性能出现下降。一方面是因为多分支结构不关注环境信息,另一方面则是因为其破坏了孪生网络的平移不变性。对此,本文提出了一种适用于孪生网络的新型多分支结构。
新型多分支结构如图1所示,为了利用环境信息对目标进行定位,新型多分支结构中包含了平均池化分支,以提高跟踪器在目标尺度等特征发生变化时的精确度和成功率。为了使每条分支输出的特征图得以融合,K×K卷积需要进行填充操作,而填充操作会破坏全卷积孪生网络的平移不变性,导致目标容易发生跟踪漂移现象。对此,新型多分支结构在特征融合后通过对特征进行剪枝操作,去除受填充影响的特征,需要剪枝的大小为K/2,剪枝后的特征可以保证孪生网络的平移不变性。同时,为了提高跟踪器对不同尺度目标的特征表达能力,新型多分支结构使用不同尺寸的卷积,并且多次使用1×1卷积进行跨通道信息融合,使输出的特征中既关注了全局特征也关注了局部特征。但多分支会引入过多参数,因此需要对其进行重参数化以压缩参数量。
为降低跟踪阶段时孪生网络的参数量,基于训练和跟踪可分离的性质,在跟踪阶段将训练阶段的新型多分支结构重参数化为单一卷积结构,如图2所示,具体方法如下。
卷积核是四阶张量,具有C个输入通道、D个输出通道和核尺寸为K的卷积核表示为F∈ℝD×C×K×K,可选的偏置表示为b∈ℝD,这样的卷积核以C通道的特征图I∈ℝC×H×W作为输入,输出D通道的特征图O∈ℝD×HW',其中H'和W'由K、填充和步长配置确定。将偏置复制到REP(b)∈ℝD×H’×W,并将其加到卷积结果上。卷积公式用下式表示:
O=I*F+REP(b)
卷积连接归一化层的输出可以用下式表示:
其中,j为输出通道的索引,μj是累积的通道平均值,σj是累积的通道标准差,γj是要学习的缩放因子,βj是要学习的偏置项。由上式,用卷积核F'和偏置b'表示重参数化后的卷积。
假设1×1卷积的输入和输出通道分别为CDK×K卷积的输入和输出通道分别为DE,使用式(4)将卷积连接归一化层重参数化得到F(1)∈ℝD×C×1×1b(1)∈ℝDF(2)∈ℝE×D×K×Kb(2)∈ℝE,输出可以表示为:
O'=I*F(1)*F(2)+REP(b(1))*F(2)+REP(b(2))
由于I*F(1)是1×1卷积,它只进行通道线性组合而不进行空间聚合,因此可以通过线性重组将其参数融入进K×K核中,即:
TRANS(F(1))∈ℝC×D×1×1F(1)的转置。式(5)的第二项是常数矩阵上的卷积,因此输出也是常数矩阵。
据此,可以构建
从而使
对于C个通道,卷积核尺寸为K、步长为s的平均池化相当于Ks相同的卷积。这样的卷积F'∈ℝC×C×K×K可以由下式构造:
综上,对于新型多分支结构中的分支1和分支4,使用式(4)即可重参数化为单一卷积,分别表示为F(1)b(1)F(4)b(4)。对于分支2,使用式(6)与(9)即可重参数化为单一卷积分支,表示为F(2)b(2)。对于分支3,使用式(10)将平均池化重参数化为单一卷积后与分支2操作一致,分支3表示为F(3)b(3)。基于卷积的可加性,将以上四个卷积重参数化为单一卷积:
以SiamFC为基准,对其骨干网络进行改进,将它的5×5卷积拆分成两个3×3卷积来降低参数量,并将除最后一层外的3×3卷积替换为新型多分支结构,提高对尺度变化目标的特征表达能力。训练阶段的骨干网络结构如表1所示,表中DBB表示新型多分支结构,AVG表示平均池化。在跟踪阶段通过重参数化压缩网络参数量,具体结构如表2所示。
算法整体流程如图3所示,通过骨干网络来提取模板图像和搜索图像的模板特征和搜索特征,对模板特征和搜索特征进行互相关操作得到响应图,响应值最大处即为目标位置。该方法可以提取鲁棒性更强的特征,从而有效地应对目标在尺度变化、遮挡等复杂环境下的跟踪问题。
实验在Windows10操作系统下采用PyTorch 2.0.1深度学习框架进行,使用GPU(NVIDIA GTX 2080Ti)进行加速。使用GOT-10K[19]训练集离线训练本文算法SiamDBB,采用Xavier均匀分布对骨干网络进行初始化,学习率采用10-2到10-5的线性衰减策略,共训练50个周期,并采用动量为0.9的随机梯度下降方法作为优化器。选择OTB100、UAV123和Temple-Color-128[2022]作为测试集。
为了验证新型多分支结构中每条分支和特征剪枝的有效性,本节在OTB100数据集上进行消融实验。实验以SiamFC为基准,在其骨干网络AlexNet中添加一层K=3的新型多分支结构(DBB,Diverse Branch Block),每次试验仅改变该层分支结构,实验结果如表1所示。
表3可知,与单一卷积相比,增加了不同分支的骨干网络使跟踪精确度和成功率有不同程度的提升。这是因为:1×1卷积可以实现信息的跨通道整合与交互,且与3×3卷积形成了不同尺度的卷积组合,丰富了特征空间;1×1~3×3序列卷积在感受野与单一卷积相同的情况下叠加了一层卷积,可以引入更多非线性;平均池化使提取的特征中包含了部分背景信息,可以帮助跟踪器进行目标定位;特征剪枝操作去除了受填充影响的特征,恢复了孪生网络的平移不变性,提高了跟踪性能。通过消融实验,证明了新型多分支结构和特征剪枝的有效性。
为了直观呈现算法的跟踪效果,如图4所示,在Soccer、Motor Rolling和Skiing高难度视频序列上,将本文算法SiamDBB与SiamFC、CFNet[23]进行对比分析。
① 相似目标干扰:在“Soccer”视频中,目标不断变换位置和尺度的同时,环境中充斥着相似物体的干扰。由于SiamFC和CFNet的骨干网络均采用单一卷积,特征表达能力不足,都出现了跟踪漂移问题。本文算法通过引入多分支结构,将不同复杂度的分支组合,丰富特征空间,使提取的特征更具鲁棒性,在面对相似目标干扰时跟踪效果更好。
② 尺度变化:在“Motor Rolling”和“Skiing”视频中,目标旋转的同时,摄像的角度和距离也在变化,SiamFC和CFNet由于每层卷积的尺度固定,难以适应这样的复杂场景,而本文提出的多分支结构使用不同尺度的卷积,增强了特征对尺度变化目标的适应能力。
在OTB100数据集中,将本文算法与其他8个算法进行了比较,包括:CFNet、DSST[24]、MUSTER[25]、Staple[26]、SRDCF[6]、MEEEM[27]、SiamFC[8]、SiamTri[28]图5显示了本文算法与8个比较算法在跟踪精度和成功率两方面的对比图。得益于多分支结构的应用,本文算法相较于SiamFC,跟踪精度提高了5.1%,跟踪成功率提高了3%。相比于同样基于SiamFC算法进行改进的CFNet和SiamTri,本文算法在跟踪性能上更具优势。
为了证明重参数化对跟踪性能的影响,将重参数化的SiamDBB算法与未重参数化的SiamDBB算法进行对比发现:在跟踪精度和跟踪成功率上两者表现出相同的性能,而在跟踪速度方面,如表4所示,前者的跟踪速度较后者加快了47 FPS。这是因为未重参数化的SiamDBB在跟踪阶段时的骨干网络依旧是多分支结构,因此花费大量时间在特征提取上,使得跟踪速度远慢于重参数化的SiamDBB算法。而后者优化了SiamFC的骨干网络结构,进一步减少了参数量,因此跟踪速度可以达到110 FPS。
在UAV123数据集上,将SiamDBB与UAV123数据集提供的31个基础算法和MUSTER、SRDCF、KCF[29]、DSST、DCF、MEEM、Struck[30]等7个主流跟踪算法进行比较。图6展示了在UAV123数据集上9个性能最佳的跟踪算法。Siam-DBB取得了最佳的跟踪效果,相较于SiamFC在跟踪精度上提高了1.8个百分点,在跟踪成功率上提高了1.5个百分点。
在Temple-Color-128数据集上,将SiamDBB与SiamFC、MEEM、Struck、KCF等跟踪算法进行比较。图7展示了在Temple-Color-128数据集上的跟踪精度和跟踪成功率对比图。在该数据集上,SiamDBB在跟踪精度和跟踪成功率方面均取得了最佳性能,跟踪精度达到了73.18%,跟踪成功率达到了51.87%,比SiamFC高出约1.8个百分点。
本节在OTB100数据集下对SV(尺度变化)、OPR(平面外旋转)、IPR(平面内旋转)、OCC(遮挡)、DEF(形变)、FM(快速运动)、IV(照明变化)、BC(背景杂乱)、MB(运动模糊)、OV(出视野)和LR(低分辨率)场景进行性能分析。表5表6分别展示了在不同场景下算法的跟踪精度和跟踪成功率,最优结果进行加粗显示,次优结果由下划线标注。
从表中可以看出,本文算法在大部分挑战因素中取得了最好的跟踪效果。相较于SiamFC,本文算法的性能在所有场景下都获得了大幅度提升。跟踪精度方面,SiamDBB在遮挡、尺度变化、运动模糊、变形、照明变化、快速运动、出视野、平面外旋转和平面内旋转场景下取得了最好的跟踪性能,在低分辨率场景下取得次优的跟踪性能;跟踪成功率方面,SiamDBB在遮挡、尺度变化、运动模糊、变形、快速运动、出视野、平面外旋转、平面内旋转和低分辨场景下取得了最好的跟踪性能。
本文从卷积结构出发,提出了基于多分支和重参数化的孪生网络跟踪算法。通过构建多分支结构来替换单一卷积,在训练阶段可以实现对不同尺度目标的适应性学习,提高模型对尺度变化目标的处理能力,强化对目标信息的挖掘,在跟踪阶段对多分支结构进行重参数化,大幅减少计算复杂度。实验结果表明,本文算法的性能在OTB100、UAV123和Temple-Color-128数据集上,相较于基准算法,获得了较大幅度提升。
  • 国家自然科学基金(61971158,62171150)
  • 泰山学者工程专项经费资助(tsqn202211087)
  • 国家自然基金面上项目资助(62071144)
  • 山东省自然科学基金项目(ZR2023MF091)
参考文献 引证文献
排序方式:
[1]
MARVASTI-ZADEH S M, CHENG L, GHANEI-YAKHDAN H, et al. Deep learning for visual tracking: A comprehensive survey[J]. IEEE Transactions on Intelligent Transportation Systems, 2022, 23(5): 3943-3968.
[2]
ZHU X F, WU X J, XU T, et al. Robust visual object tracking via adaptive attribute aware discriminative correlation filters[J]. IEEE Transactions on Multimedia, 2022, 24: 301-312.
[3]
KUMARI L K, RAMALAKSHMI K, SRINIVASA-RAGHAVAN V. Robust visual object tracking in clustered environment[C]// 2022 5th International Conference on Contemporary Computing and Informatics(IC3I), Uttar Pradesh, India. 2022: 1499-1503.
[4]
REN H, XIANG L, SHI T. Research on background learning correlation filtering algorithm with multifeature fusion[J]. IEEE Access, 2023, 11: 32895-32906.
[5]
KUMAR R, DEB A K. Pedestrian tracking in UAV images with Kalman filter motion estimator and correlation filter[J]. IEEE Aerospace and Electronic Systems Magazine, 2023, 38(7): 4-19.
[6]
WU L, FANG H, CUI X, et al. Improved partbased Kernel correlation filter for object tracking and robot grasping[C]// 2023 China Automation Congress(CAC), Chongqing, China. 2023: 505-510.
[7]
CAO Y, SHI G, ZHANG T, et al. Bayesian correlation filter learning with Gaussian scale mixture model for visual tracking[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2022, 32(5): 3085-3098.
[8]
BERTINETTO L, VALMADRE J, HENRIQUES J F,et al. Fully-convolutional Siamese networks for object tracking[C]//European Conference on Computer Vision,Amsterdam, The Netherlands. 2016: 850-865.
[9]
LI B, YAN J, WU W, et al. High performance visual tracking with Siamese region proposal network[C]// 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA. 2018: 8971-8980.
[10]
GUO D, WANG J, CUI Y, et al. SiamCAR: Siamese fully convolutional classification and regression for visual tracking[C]// Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2020: 6269-6277.
[11]
ZHANG Z, PENG H, FU J, et al. Ocean: Object-aware anchor-free tracking[C]// Proceedings of the European Conference on Computer Vision. 2020: 771-787.
[12]
CHEN Z, ZHONG B, LI G, et al. Siamese box adaptive network for visual tracking[C]// Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2020: 6668-6677.
[13]
HELD D, THRUN S, SAVARESE S. Learning to track at 100 FPS with deep regression networks[C]// European Conference on Computer Vision, Amsterdam, The Netherlands. 2016: 749-765.
[14]
GUO Q, FENG W, ZHOU C, et al. Learning dynamic siamese network for visual object tracking[C]//2017 IEEE International Conference on Computer Vision(ICCV), Venice, Italy. 2017: 1781-1789.
[15]
HAN G, YANG R, GAO H, et al. Deep decoupling classification and regression for visual tracking[J]. IEEE Transactions on Cognitive and Developmental Systems, 2023, 15(3): 1239-1251.
[16]
LI Z, ZHANG J, LI Y, et al. Learning feature channel weighting for real-time visual tracking[J]. IEEE Transactions on Image Processing, 2022, 31: 2190-2200.
[17]
SUN N, ZHANG Z, XUE W. Contrastive-Siamese collaborative networks for visual object tracking[C]// 2022 International Conference on High Performance Big Data and Intelligent Systems(HDIS), Tianjin, China. 2022:95-99.
[18]
SZEGEDY C, LIU W, JIA Y, et al. Going deeper with convolutions[C]// 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA,USA. 2015: 1-9.
[19]
HUANG L, ZHAO X, HUANG K. GOT-10k: A large high-diversity benchmark for generic object tracking in the wild[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2021, 43(5): 1562-1577.
[20]
WU Y, LIM J, YANG M H. Object tracking benchmark[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2015, 37(9): 1834–1848.
[21]
MUELLER M, SMITH N, GHANEM B. A benchmark and simulator for UAV tracking[C]// Proceedings of the European Conference on Computer Vision. 2016:445-461.
[22]
LIANG P, BLASCH E, LING H. Encodingcolor information for visual tracking: Algorithms and benchmark[J]. IEEE Transactions on Image Processing., 2015, 24(12): 5630-5644.
[23]
VALMADRE J, BERTINETTO L, HENRIQUES J,et al. End-to-end representation learning for correlation filter based tracking[C]// Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, USA. 2017: 5000–5008.
[24]
DANELLJAN M, HÄGER G, KHAN F S, et al. Discriminative scale space tracking[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence. 2017, 39(8): 1561-1575.
[25]
HONG Z, CHEN Z, WANG C, et al. MUlti-Store Tracker (MUSTer): A cognitive psychology inspired approach to object tracking[C]// 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA. 2015: 749-758.
[26]
BERTINETTO L, VALMADRE J, GOLODETZ S, et al. Staple: Complementary learners for real-time tracking[C]// 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.2016: 1401-1409.
[27]
ZHANG Jianming, MA Shugao, SCLAROFF S. MEEM: Robust tracking via multiple experts using entropy minimization[C]// Proceedings of the 13th European Conference on Computer Vision, Zurich, Switzerland. 2014: 188-203.
[28]
DONG Xingping, SHEN Jianbing. Triplet loss in Siamese network for object tracking[C]// European Conference on Computer Vision, Munich, Germany. 2018:459-474.
[29]
HENRIQUES J F, CASEIRO R, MARTINS P, et al. High-speed tracking with kernelized correlation filters[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2015, 37(3): 583-596.
[30]
HARE S, GOLODETZ S, SAFFARI A, et al. Struck:Structured output tracking with kernels[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2016, 38(10): 2096-2109.
2024年第45卷第4期
PDF下载
117
52
引用本文
BibTeX
文章信息
doi: 10.12347/j.ycyk.20240408001
  • 接收时间:2024-04-08
  • 首发时间:2026-03-20
  • 出版时间:2024-07-15
补充材料
相关文章
文章信息
作者
出版历史
  • 收稿日期:2024-04-08
  • 修回日期:2024-04-28
基金
国家自然科学基金(61971158,62171150)
泰山学者工程专项经费资助(tsqn202211087)
国家自然基金面上项目资助(62071144)
山东省自然科学基金项目(ZR2023MF091)
作者信息
    哈尔滨工业大学(威海) 威海 264200
参考文献
分享链接
https://castjournals.cast.org.cn/joweb/ycyk/CN/10.12347/j.ycyk.20240408001
分享至
全文二维码

扫描看全文

引用本文
BibTeX
本文的引用情况
2种不同金属材料的力学参数

Family
属数
Number of
genus
种数
Number of
species
占总种数比例
Percentage of
total species (%)

Genus
种数
Number of
species
占总种数比例
Percentage of total
species (%)
鹅膏菌科Amanitaceae 2 11 5.26 鹅膏菌属 Amanita 10 4.78
小菇科 Mycenaceae 2 12 5.74 丝盖伞属 Inocybe 5 2.39
多孔菌科 Polyporaceae 8 14 6.70 蜡蘑属 Laccaria 5 2.39
红菇科 Russulaceae 3 23 11.00 小皮伞属 Marasmius 6 2.87
小菇属 Mycena 11 5.26
光柄菇属 Pluteus 5 2.39
红菇属 Russula 17 8.13
栓菌属 Trametes 5 2.39
关闭全屏