Article(id=1251505540774445632, tenantId=1146029695717560320, journalId=1251233954884272221, issueId=1251505536634667461, articleNumber=null, orderNo=null, doi=10.13682/j.issn.2095-6533.2025.06.013, pmid=null, cstr=null, oa=null, hot=null, price=null, onlineType=0, articleFormat=0, articleType=null, articleTypeStr=null, receivedDate=1734624000000, receivedDateStr=2024-12-20, revisedDate=null, revisedDateStr=null, acceptedDate=null, acceptedDateStr=null, onlineDate=1776311772769, onlineDateStr=2026-04-16, pubDate=1762704000000, pubDateStr=2025-11-10, doiRegisterDate=null, doiRegisterDateStr=null, onlineIssueDate=1776311772769, onlineIssueDateStr=2026-04-16, onlineJustAcceptDate=null, onlineJustAcceptDateStr=null, onlineFirstDate=null, onlineFirstDateStr=null, sourceXml=null, magXml=null, createTime=1776311772769, creator=13701087609, updateTime=1776311772769, updator=13701087609, issue=Issue{id=1251505536634667461, tenantId=1146029695717560320, journalId=1251233954884272221, year='2025', volume='30', issue='6', pageStart='1', pageEnd='130', issueExtLink='null', onlineDate='null', pubDate='null', beforeIssueId=null, nextIssueId=null, price=null, status=1, issueComplete=1, articleOrder=1, issueType=1, specialIssue=null, createTime=1776311771782, creator=13701087609, updateTime=1776311824541, updator=13701087609, preIssue=null, nextIssue=null, ext={EN=IssueExt(id=1251505758014226723, tenantId=1146029695717560320, journalId=1251233954884272221, issueId=1251505536634667461, language=EN, specialIssueTitle=, coverIllustrator=null, specialIssueEditor=, specialIssueAbout=), CN=IssueExt(id=1251505758014226724, tenantId=1146029695717560320, journalId=1251233954884272221, issueId=1251505536634667461, language=CN, specialIssueTitle=, coverIllustrator=null, specialIssueEditor=, specialIssueAbout=)}, issueFiles=null}, startPage=113, endPage=122, ext={EN=ArticleExt(id=1251505540992549452, articleId=1251505540774445632, tenantId=1146029695717560320, journalId=1251233954884272221, language=EN, title=A dual-stream network with complementary feature fusion for pest identification, columnId=null, journalTitle=Journal of Xi'an University of Posts and Telecommunications, columnName=null, runingTitle=null, highlight=null, articleAbstract=

To address the challenges of small inter-class differences in pest details,severe field background interference,and imbalanced sample distribution,a complementary feature fusion dual-stream network for pest recognition is proposed.This network combines the local perception capability of convolutional neural networks with the global modeling ability of the Mamba model,capturing and integrating the global and local information of pest images.A hierarchical multiscale perception module is designed to extract multi-scale image features through grouped hierarchical convolution and enhance pest detail information with a detail enhancement perception strategy.An adaptive focusing Mamba module is designed to locate key pest regions using dynamic convolution operators and reduce complex background interference.Additionally,an attention-weighted fusion module is designed to achieve adaptive interaction and optimization of global and local features through a cross-attention mechanism,further improving the accuracy of semantic expression.A balanced loss function is constructed to mitigate the effects of class imbalance in the dataset.The experimental results show that the network achieves an accuracy of 71.19%on the large-scale pest dataset IP102,and an accuracy of 99.36%on the D0 dataset,demonstrating its ability to effectively identify pest species.

, correspAuthors=null, authorNote=null, correspAuthorsNote=null, copyrightStatement=null, copyrightOwner=null, extLink=null, articleAbsUrl=null, sourceXml=null, magXml=null, pdfUrl=null, pdf=null, pdfFileSize=null, pdfExtLink=null, richHtmlUrl=null, mobilePdfUrl=null, reviewReport=null, pdfFirstPage=null, abstractGraph=null, abstractGraphContent=null, abstractVideo=null, citation=null, cebUrl=null, magXmlContent=null, mapNumber=null, authorCompany=null, fund=null, authors=null, authorsList=Daxiang LI, Jianing SUN, Ying LIU), CN=ArticleExt(id=1251505549158859767, articleId=1251505540774445632, tenantId=1146029695717560320, journalId=1251233954884272221, language=CN, title=面向害虫识别的互补特征融合双流网络, columnId=1251505537641300427, journalTitle=西安邮电大学学报, columnName=人工智能目标检测, runingTitle=null, highlight=null, articleAbstract=

针对害虫类别间细节差异小、田间背景干扰严重及样本分布不均衡等问题,提出一种面向害虫识别的互补特征融合双流网络。该网络结合卷积神经网络的局部感知能力与Mamba模型的全局建模能力,捕获并融合害虫图像的全局与局部信息。设计层次化多尺度感知模块,通过分组层次化卷积提取多尺度图像特征,并采用细节强化感知策略增强害虫细节信息;设计自适应聚焦Mamba模块,利用动态卷积算子定位害虫关键区域,减少复杂背景干扰;设计注意力加权融合模块,通过交叉注意力机制实现全局和局部特征的自适应交互优化,进一步提升语义表达的准确性。最后,构建均衡损失函数,缓解数据集类别不平衡的影响。实验结果表明,该网络在大规模害虫数据集IP102上的准确率达到71.19%,在D0数据集上的准确率为99.36%,能够有效识别害虫种类。

, correspAuthors=null, authorNote=null, correspAuthorsNote=null, copyrightStatement=null, copyrightOwner=null, extLink=null, articleAbsUrl=null, sourceXml=x4ZfDrBoob/0u9/4mR5Caw==, magXml=r70rtWsMeimhvCEMZwfUdQ==, pdfUrl=null, pdf=8Y1X8NnwfDSqqAV8R6WPPg==, pdfFileSize=7828288, pdfExtLink=null, richHtmlUrl=null, mobilePdfUrl=null, reviewReport=null, pdfFirstPage=null, abstractGraph=N4MeBUqPit0pfkGa92AxzQ==, abstractGraphContent=null, abstractVideo=null, citation=null, cebUrl=null, magXmlContent=tXx8eKjeEqG8xva3KUskzw==, mapNumber=null, authorCompany=null, fund=null, authors=

李大湘(1974-),男,湖南麻阳人,博士,西安邮电大学副教授,主要研究方向为遥感图像分类、作物图像病虫害识别与机器学习。E-mail:

孙家宁(2000-),女,陕西大荔人,西安邮电大学硕士研究生,主要研究方向为植物病虫害识别。E-mail:

刘颖(1972-),女,陕西户县人,博士,西安邮电大学教授,主要研究方向为图像处理与模式识别。E-mail:

, authorsList=李大湘, 孙家宁, 刘颖)}, authors=[Author(id=1251505549607649297, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, orderNo=0, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=www_ldx@163.com, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1251505549708312599, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, authorId=1251505549607649297, language=EN, stringName=Daxiang LI, firstName=Daxiang, middleName=null, lastName=LI, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, 2, address=1.School of Communications and Information Engineering,Xi'an University of Posts and Telecommunications,Xi'an 710121,China
2.Xi'an Key Laboratory of Image Processing Technology and Applications for Public Security,Xi'an 710121,China, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1251505549796392987, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, authorId=1251505549607649297, language=CN, stringName=李大湘, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, 2, address=1.西安邮电大学通信与信息工程学院,陕西西安 710121
2.西安市公共安全图像处理技术及应用重点实验室,陕西西安 710121, bio={"img":"UsJKSMv8msoWV7tllm/4Ng==","content":"

李大湘(1974-),男,湖南麻阳人,博士,西安邮电大学副教授,主要研究方向为遥感图像分类、作物图像病虫害识别与机器学习。E-mail:

"}, bioImg=UsJKSMv8msoWV7tllm/4Ng==, bioContent=

李大湘(1974-),男,湖南麻阳人,博士,西安邮电大学副教授,主要研究方向为遥感图像分类、作物图像病虫害识别与机器学习。E-mail:

, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1251505549372768260, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, xref=1., ext=[AuthorCompanyExt(id=1251505549389545477, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, companyId=1251505549372768260, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1.School of Communications and Information Engineering,Xi'an University of Posts and Telecommunications,Xi'an 710121,China), AuthorCompanyExt(id=1251505549397934086, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, companyId=1251505549372768260, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1.西安邮电大学通信与信息工程学院,陕西西安 710121)]), AuthorCompany(id=1251505549473431562, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, xref=2., ext=[AuthorCompanyExt(id=1251505549481820170, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, companyId=1251505549473431562, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=2.Xi'an Key Laboratory of Image Processing Technology and Applications for Public Security,Xi'an 710121,China), AuthorCompanyExt(id=1251505549490208779, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, companyId=1251505549473431562, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=2.西安市公共安全图像处理技术及应用重点实验室,陕西西安 710121)])]), Author(id=1251505549884473379, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, orderNo=1, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=jianingsun@stu.xupt.edu.cn, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1251505549976748074, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, authorId=1251505549884473379, language=EN, stringName=Jianing SUN, firstName=Jianing, middleName=null, lastName=SUN, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, address=1.School of Communications and Information Engineering,Xi'an University of Posts and Telecommunications,Xi'an 710121,China, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1251505550056439854, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, authorId=1251505549884473379, language=CN, stringName=孙家宁, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, address=1.西安邮电大学通信与信息工程学院,陕西西安 710121, bio={"img":"um/BPkwm+kDR5eQpIZzLug==","content":"

孙家宁(2000-),女,陕西大荔人,西安邮电大学硕士研究生,主要研究方向为植物病虫害识别。E-mail:

"}, bioImg=um/BPkwm+kDR5eQpIZzLug==, bioContent=

孙家宁(2000-),女,陕西大荔人,西安邮电大学硕士研究生,主要研究方向为植物病虫害识别。E-mail:

, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1251505549372768260, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, xref=1., ext=[AuthorCompanyExt(id=1251505549389545477, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, companyId=1251505549372768260, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1.School of Communications and Information Engineering,Xi'an University of Posts and Telecommunications,Xi'an 710121,China), AuthorCompanyExt(id=1251505549397934086, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, companyId=1251505549372768260, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1.西安邮电大学通信与信息工程学院,陕西西安 710121)])]), Author(id=1251505550136131632, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, orderNo=2, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=ly_yolanda@sina.com, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1251505550299709495, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, authorId=1251505550136131632, language=EN, stringName=Ying LIU, firstName=Ying, middleName=null, lastName=LIU, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, 2, address=1.School of Communications and Information Engineering,Xi'an University of Posts and Telecommunications,Xi'an 710121,China
2.Xi'an Key Laboratory of Image Processing Technology and Applications for Public Security,Xi'an 710121,China, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1251505550408761403, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, authorId=1251505550136131632, language=CN, stringName=刘颖, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, 2, address=1.西安邮电大学通信与信息工程学院,陕西西安 710121
2.西安市公共安全图像处理技术及应用重点实验室,陕西西安 710121, bio={"img":"dWaMsFtdT37vJfbQ5tY5HA==","content":"

刘颖(1972-),女,陕西户县人,博士,西安邮电大学教授,主要研究方向为图像处理与模式识别。E-mail:

"}, bioImg=dWaMsFtdT37vJfbQ5tY5HA==, bioContent=

刘颖(1972-),女,陕西户县人,博士,西安邮电大学教授,主要研究方向为图像处理与模式识别。E-mail:

, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1251505549372768260, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, xref=1., ext=[AuthorCompanyExt(id=1251505549389545477, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, companyId=1251505549372768260, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1.School of Communications and Information Engineering,Xi'an University of Posts and Telecommunications,Xi'an 710121,China), AuthorCompanyExt(id=1251505549397934086, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, companyId=1251505549372768260, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1.西安邮电大学通信与信息工程学院,陕西西安 710121)]), AuthorCompany(id=1251505549473431562, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, xref=2., ext=[AuthorCompanyExt(id=1251505549481820170, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, companyId=1251505549473431562, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=2.Xi'an Key Laboratory of Image Processing Technology and Applications for Public Security,Xi'an 710121,China), AuthorCompanyExt(id=1251505549490208779, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, companyId=1251505549473431562, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=2.西安市公共安全图像处理技术及应用重点实验室,陕西西安 710121)])])], keywords=[Keyword(id=1251505550668808257, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, language=EN, orderNo=1, keyword=pest recognition), Keyword(id=1251505550752694339, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, language=EN, orderNo=2, keyword=Mamba), Keyword(id=1251505550811414597, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, language=EN, orderNo=3, keyword=multi-scale feature extraction), Keyword(id=1251505550995963978, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, language=EN, orderNo=4, keyword=feature fusion), Keyword(id=1251505551079850062, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, language=EN, orderNo=5, keyword=attention mechanism), Keyword(id=1251505551146958929, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, language=CN, orderNo=1, keyword=害虫识别), Keyword(id=1251505551209873491, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, language=CN, orderNo=2, keyword=Mamba), Keyword(id=1251505551285370965, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, language=CN, orderNo=3, keyword=多尺度特征提取), Keyword(id=1251505551360868440, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, language=CN, orderNo=4, keyword=特征融合), Keyword(id=1251505551432171611, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, language=CN, orderNo=5, keyword=注意力机制)], refs=[Reference(id=1251505553449631893, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, doi=null, pmid=null, pmcid=null, year=2024, volume=11, issue=2, pageStart=249, pageEnd=259, url=null, language=null, rfNumber=[1], rfOrder=0, authorNames=FU X Q, MA Q Y, YANG F F, journalName=Information Processing in Agriculture, refType=null, unstructuredReference=FU X Q,MA Q Y,YANG F F,et al.Crop pest image recognition based on the improved ViT method[J]. Information Processing in Agriculture,2024,11(2):249-259., articleTitle=Crop pest image recognition based on the improved ViT method, refAbstract=null), Reference(id=1251505553520935064, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, doi=null, pmid=null, pmcid=null, year=2024, volume=9, issue=null, pageStart=100521, pageEnd=null, url=null, language=null, rfNumber=[2], rfOrder=1, authorNames=ODOUNFA M G F, GBEMAVO C D S J, TAHI S P G, journalName=Smart Agricultural Technology, refType=null, unstructuredReference=ODOUNFA M G F,GBEMAVO C D S J,TAHI S P G,et al.Deep learning methods for enhanced stress and pest management in market garden crops:A comprehensive analysis[J].Smart Agricultural Technology,2024,9:100521., articleTitle=Deep learning methods for enhanced stress and pest management in market garden crops:A comprehensive analysis, refAbstract=null), Reference(id=1251505553592238234, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, doi=null, pmid=null, pmcid=null, year=2024, volume=8, issue=3, pageStart=709, pageEnd=734, url=null, language=null, rfNumber=[3], rfOrder=2, authorNames=VENKATASAICHANDRAKANTH P, IYAPPARAJA M, journalName=International Journal of Intelligent Robotics and Applications, refType=null, unstructuredReference=VENKATASAICHANDRAKANTH P,IYAPPARAJA M.A survey on pest detection and classification in field crops using artificial intelligence techniques[J]. International Journal of Intelligent Robotics and Applications,2024,8(3):709-734., articleTitle=A survey on pest detection and classification in field crops using artificial intelligence techniques, refAbstract=null), Reference(id=1251505553692901535, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, doi=null, pmid=null, pmcid=null, year=2022, volume=30, issue=8, pageStart=114, pageEnd=118, url=null, language=null, rfNumber=[4], rfOrder=3, authorNames=李睿, 王莹, 王恒, journalName=电子设计工程, refType=null, unstructuredReference=李睿,王莹,王恒.基于树莓派的室内植物病虫害识别系统设计[J].电子设计工程,2022,30(8):114-118., articleTitle=基于树莓派的室内植物病虫害识别系统设计, refAbstract=null), Reference(id=1251505553760010400, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, doi=null, pmid=null, pmcid=null, year=2022, volume=30, issue=8, pageStart=114, pageEnd=118, url=null, language=null, rfNumber=[4], rfOrder=4, authorNames=LI R, WANG Y, WANG H, journalName=Electronic Design Engineering, refType=null, unstructuredReference=LI R,WANG Y,WANG H.Design of house plants diseases and insect pests recognition system based on Raspberry Pi[J]. Electronic Design Engineering, 2022,30(8):114-118.(in Chinese), articleTitle=Design of house plants diseases and insect pests recognition system based on Raspberry Pi, refAbstract=null), Reference(id=1251505553831313571, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, doi=null, pmid=null, pmcid=null, year=2024, volume=44, issue=3, pageStart=48, pageEnd=62, url=null, language=null, rfNumber=[5], rfOrder=5, authorNames=胡海峰, 倪宗煜, 赵海涛, journalName=南京邮电大学学报(自然科学版), refType=null, unstructuredReference=胡海峰,倪宗煜,赵海涛,.无人机场景下基于Transformer的轻量化行人重识别[J].南京邮电大学学报(自然科学版),2024,44(3):48-62., articleTitle=无人机场景下基于Transformer的轻量化行人重识别, refAbstract=null), Reference(id=1251505553927782562, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, doi=null, pmid=null, pmcid=null, year=2024, volume=44, issue=3, pageStart=48, pageEnd=62, url=null, language=null, rfNumber=[5], rfOrder=6, authorNames=HU H F, NI Z Y, ZHAO H T, journalName=Journal of Nanjing University of Posts and Telecommunications(Natural Science Edition), refType=null, unstructuredReference=HU H F,NI Z Y,ZHAO H T,et al.Transformer based lightweight person re-identification in unmanned aerial vehicle images[J].Journal of Nanjing University of Posts and Telecommunications(Natural Science Edition),2024,44(3):48-62.(in Chinese), articleTitle=Transformer based lightweight person re-identification in unmanned aerial vehicle images, refAbstract=null), Reference(id=1251505554015862948, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, doi=null, pmid=null, pmcid=null, year=2024, volume=13, issue=15, pageStart=3008, pageEnd=null, url=null, language=null, rfNumber=[6], rfOrder=7, authorNames=GUO B Y, WANG J J, GUO M H, journalName=Electronics, refType=null, unstructuredReference=GUO B Y,WANG J J,GUO M H,et al.Overview of pest detection and recognition algorithms[J].Electronics,2024,13(15):3008., articleTitle=Overview of pest detection and recognition algorithms, refAbstract=null), Reference(id=1251505554112331942, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, doi=null, pmid=null, pmcid=null, year=2023, volume=42, issue=3, pageStart=133, pageEnd=142, url=null, language=null, rfNumber=[7], rfOrder=8, authorNames=刘志, 翟瑞芳, 彭万伟, journalName=华中农业大学学报, refType=null, unstructuredReference=刘志,翟瑞芳,彭万伟,.融合注意力机制的Cascade R-CNN田间害虫检测方法[J].华中农业大学学报,2023,42(3):133-142., articleTitle=融合注意力机制的Cascade R-CNN田间害虫检测方法, refAbstract=null), Reference(id=1251505554196218025, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, doi=null, pmid=null, pmcid=null, year=2023, volume=42, issue=3, pageStart=133, pageEnd=142, url=null, language=null, rfNumber=[7], rfOrder=9, authorNames=LIU Z, ZHAI R F, PENG W W, journalName=Journal of Huazhong Agricultural University, refType=null, unstructuredReference=LIU Z,ZHAI R F,PENG W W,et al.Field pest detection method based on improved Cascade R-CNN by incorporating attention mechanism[J]. Journal of Huazhong Agricultural University,2023,42(3):133-142.(in Chinese), articleTitle=Field pest detection method based on improved Cascade R-CNN by incorporating attention mechanism, refAbstract=null), Reference(id=1251505554288492716, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, doi=null, pmid=null, pmcid=null, year=2021, volume=13, issue=3, pageStart=705, pageEnd=716, url=null, language=null, rfNumber=[8], rfOrder=10, authorNames=LIU W J, WU G Q, REN F J, journalName=IEEE Transactions on Cognitive and Developmental Systems, refType=null, unstructuredReference=LIU W J,WU G Q,REN F J.Deep multibranch fusion residual network for insect pest recognition[J]. IEEE Transactions on Cognitive and Developmental Systems,2021,13(3):705-716., articleTitle=Deep multibranch fusion residual network for insect pest recognition, refAbstract=null), Reference(id=1251505554380767407, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, doi=null, pmid=null, pmcid=null, year=2024, volume=36, issue=6, pageStart=2867, pageEnd=2882, url=null, language=null, rfNumber=[9], rfOrder=11, authorNames=NANDHINI C, BRINDHA M, journalName=Neural Computing and Applications, refType=null, unstructuredReference=NANDHINI C,BRINDHA M.Visual regenerative fusion network for pest recognition[J].Neural Computing and Applications,2024,36(6):2867-2882., articleTitle=Visual regenerative fusion network for pest recognition, refAbstract=null), Reference(id=1251505554443681970, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, doi=null, pmid=null, pmcid=null, year=2024, volume=127, issue=null, pageStart=107228, pageEnd=null, url=null, language=null, rfNumber=[10], rfOrder=12, authorNames=HECHEN Z, HUANG W, YIN L, journalName=Engineering Applications of Artificial Intelligence, refType=null, unstructuredReference=HECHEN Z,HUANG W,YIN L,et al.Dilated-windows-based vision transformer with efficient-suppressive-self-attention for insect pests classification[J]. Engineering Applications of Artificial Intelligence,2024,127:107228., articleTitle=Dilated-windows-based vision transformer with efficient-suppressive-self-attention for insect pests classification, refAbstract=null), Reference(id=1251505554514985140, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, doi=null, pmid=null, pmcid=null, year=2024, volume=255, issue=null, pageStart=124833, pageEnd=null, url=null, language=null, rfNumber=[11], rfOrder=13, authorNames=FANG M W, TAN Z P, TANG Y, journalName=Expert Systems with Applications, refType=null, unstructuredReference=FANG M W,TAN Z P,TANG Y,et al.Pest-conformer:A hybrid CNN-transformer architecture for large-scale multi-class crop pest recognition[J].Expert Systems with Applications,2024,255:124833., articleTitle=Pest-conformer:A hybrid CNN-transformer architecture for large-scale multi-class crop pest recognition, refAbstract=null), Reference(id=1251505554611454135, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, doi=null, pmid=null, pmcid=null, year=2021, volume=null, issue=null, pageStart=15, pageEnd=35, url=null, language=null, rfNumber=[12], rfOrder=14, authorNames=DOSOVITSKIY A, BEYER L, KOLESNIKOV A, journalName=null, refType=null, unstructuredReference=DOSOVITSKIY A,BEYER L,KOLESNIKOV A,et al.An image is worth 16x16 words:Transformers for image recognition at scale[C]//9th International Conference on Learning Representations(ICLR).Austria:ICLR,2021:15-35., articleTitle=An image is worth 16x16 words:Transformers for image recognition at scale, refAbstract=null), Reference(id=1251505554703728823, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, doi=null, pmid=null, pmcid=null, year=2024, volume=238, issue=null, pageStart=122099, pageEnd=null, url=null, language=null, rfNumber=[13], rfOrder=15, authorNames=PACAL I, journalName=Expert Systems with Applications, refType=null, unstructuredReference=PACAL I.Enhancing crop productivity and sustainability through disease identification in maize leaves:Exploiting a large dataset with an advanced vision transformer model[J].Expert Systems with Applications,2024,238:122099., articleTitle=Enhancing crop productivity and sustainability through disease identification in maize leaves:Exploiting a large dataset with an advanced vision transformer model, refAbstract=null), Reference(id=1251505554812780728, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, doi=null, pmid=null, pmcid=null, year=2023, volume=60, issue=12, pageStart=2973, pageEnd=2981, url=null, language=null, rfNumber=[14], rfOrder=16, authorNames=李雨晴, 陈燕红, 李永可, journalName=新疆农业科学, refType=null, unstructuredReference=李雨晴,陈燕红,李永可,.作物害虫图像智能识别方法[J].新疆农业科学, 2023, 60 (12):2973-2981., articleTitle=作物害虫图像智能识别方法, refAbstract=null), Reference(id=1251505554896666811, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, doi=null, pmid=null, pmcid=null, year=2023, volume=60, issue=12, pageStart=2973, pageEnd=2981, url=null, language=null, rfNumber=[14], rfOrder=17, authorNames=LI Y Q, CHEN Y H, LI Y K, journalName=Xinjiang Agricultural Sciences, refType=null, unstructuredReference=LI Y Q,CHEN Y H,LI Y K,et al.Image intelligent recognition method of crop pests[J].Xinjiang Agricultural Sciences,2023,60(12):2973-2981.(in Chinese), articleTitle=Image intelligent recognition method of crop pests, refAbstract=null), Reference(id=1251505554980552893, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, doi=null, pmid=null, pmcid=null, year=2023, volume=182, issue=2, pageStart=144, pageEnd=158, url=null, language=null, rfNumber=[15], rfOrder=18, authorNames=XIA W S, HAN D Z, LI D, journalName=Annals of Applied Biology, refType=null, unstructuredReference=XIA W S,HAN D Z,LI D,et al.An ensemble learning integration of multiple CNN with improved vision transformer models for pest classification[J].Annals of Applied Biology,2023,182(2):144-158., articleTitle=An ensemble learning integration of multiple CNN with improved vision transformer models for pest classification, refAbstract=null), Reference(id=1251505555077021887, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, doi=null, pmid=null, pmcid=null, year=2020, volume=179, issue=null, pageStart=105809, pageEnd=null, url=null, language=null, rfNumber=[16], rfOrder=19, authorNames=AYAN E, ERBAY H, VARC1N F, journalName=Computers and Electronics in Agriculture, refType=null, unstructuredReference=AYAN E,ERBAY H,VARC1N F.Crop pest classification with a genetic algorithm-based weighted ensemble of deep convolutional neural networks[J]. Computers and Electronics in Agriculture, 2020, 179:105809., articleTitle=Crop pest classification with a genetic algorithm-based weighted ensemble of deep convolutional neural networks, refAbstract=null), Reference(id=1251505555181879489, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, doi=null, pmid=null, pmcid=null, year=2018, volume=77, issue=17, pageStart=22159, pageEnd=22171, url=null, language=null, rfNumber=[17], rfOrder=20, authorNames=LIU Y G, YU J Z, HAN Y H, journalName=Multimedia Tools and Applications, refType=null, unstructuredReference=LIU Y G,YU J Z,HAN Y H.Understanding the effective receptive field in semantic image segmentation[J].Multimedia Tools and Applications,2018,77 (17):22159-22171., articleTitle=Understanding the effective receptive field in semantic image segmentation, refAbstract=null), Reference(id=1251505555274154179, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, doi=null, pmid=null, pmcid=null, year=2024-12-08, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[18], rfOrder=21, authorNames=GU A, GOEL K, RÉC , journalName=null, refType=null, unstructuredReference=GU A,GOEL K,RÉC .Efficiently modeling long sequences with structured state spaces[EB/OL].[2024-12-08].https://doi.org/10.48550/arXiv.2111.00396., articleTitle=Efficiently modeling long sequences with structured state spaces, refAbstract=null), Reference(id=1251505555362234566, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, doi=null, pmid=null, pmcid=null, year=2024-12-08, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[19], rfOrder=22, authorNames=GU A, DAO T, journalName=null, refType=null, unstructuredReference=GU A,DAO T.Mamba:Linear-time sequence modeling with selective state spaces[EB/OL].[2024-12-08].https://doi.org/10.48550/arXiv.2312.00752., articleTitle=Mamba:Linear-time sequence modeling with selective state spaces, refAbstract=null), Reference(id=1251505555450314953, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, doi=null, pmid=null, pmcid=null, year=2024-12-08, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[20], rfOrder=23, authorNames=LIU Y, TIAN Y J, ZHAO Y Z, journalName=null, refType=null, unstructuredReference=LIU Y,TIAN Y J,ZHAO Y Z,et al.VMamba:Visual state space model[EB/OL].[2024-12-08].https://doi.org/10.48550/arXiv.2401.10166., articleTitle=VMamba:Visual state space model, refAbstract=null), Reference(id=1251505555521618124, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, doi=null, pmid=null, pmcid=null, year=2017, volume=null, issue=null, pageStart=6000, pageEnd=6010, url=null, language=null, rfNumber=[21], rfOrder=24, authorNames=VASWANI A, SHAZEER N, PARMAR N, journalName=null, refType=null, unstructuredReference=VASWANI A,SHAZEER N,PARMAR N,et al. Attention is all you need[C]//31st International Conference on Neural Information Processing Systems (NIPS'17).Long Beach:Curran Associates,2017:6000-6010., articleTitle=Attention is all you need, refAbstract=null), Reference(id=1251505555613892815, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, doi=null, pmid=null, pmcid=null, year=2024, volume=null, issue=null, pageStart=5652, pageEnd=5661, url=null, language=null, rfNumber=[22], rfOrder=25, authorNames=XIONG Y W, LI Z Q, CHEN Y T, journalName=null, refType=null, unstructuredReference=XIONG Y W,LI Z Q,CHEN Y T,et al.Efficient deformable ConvNets:Rethinking dynamic and sparse operator for vision applications[C]//2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition.Seattle:IEEE,2024:5652-5661., articleTitle=Efficient deformable ConvNets:Rethinking dynamic and sparse operator for vision applications, refAbstract=null), Reference(id=1251505555693584594, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, doi=null, pmid=null, pmcid=null, year=2024, volume=null, issue=null, pageStart=5710, pageEnd=5714, url=null, language=null, rfNumber=[23], rfOrder=26, authorNames=CHEN B, ZOU X C, ZHANG Y, journalName=null, refType=null, unstructuredReference=CHEN B,ZOU X C,ZHANG Y,et al.LEFormer:A hybrid CNN-transformer architecture for accurate lake extraction from remote sensing imagery[C]//ICASSP 2024-2024 IEEE International Conference on Acoustics,Speech and Signal Processing.Seoul:IEEE, 2024:5710-5714., articleTitle=LEFormer:A hybrid CNN-transformer architecture for accurate lake extraction from remote sensing imagery, refAbstract=null), Reference(id=1251505555781664981, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, doi=null, pmid=null, pmcid=null, year=2020, volume=null, issue=null, pageStart=8779, pageEnd=8788, url=null, language=null, rfNumber=[24], rfOrder=27, authorNames=WU X P, ZHAN C, LAI Y K, journalName=null, refType=null, unstructuredReference=WU X P,ZHAN C,LAI Y K,et al.IP102:A largescale benchmark dataset for insect pest recognition[C]//2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Long Beach: IEEE, 2020:8779-8788., articleTitle=IP102:A largescale benchmark dataset for insect pest recognition, refAbstract=null), Reference(id=1251505555857162455, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, doi=null, pmid=null, pmcid=null, year=2018, volume=152, issue=null, pageStart=233, pageEnd=241, url=null, language=null, rfNumber=[25], rfOrder=28, authorNames=XIE C J, WANG R J, ZHANG J, journalName=Computers and Electronics in Agriculture, refType=null, unstructuredReference=XIE C J,WANG R J,ZHANG J,et al.Multi-level learning features for automatic classification of field crop pests[J].Computers and Electronics in Agriculture,2018,152:233-241., articleTitle=Multi-level learning features for automatic classification of field crop pests, refAbstract=null), Reference(id=1251505555924271322, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, doi=null, pmid=null, pmcid=null, year=2019, volume=null, issue=null, pageStart=1314, pageEnd=1324, url=null, language=null, rfNumber=[26], rfOrder=29, authorNames=HOWARD A, SANDLER M, CHEN B, journalName=null, refType=null, unstructuredReference=HOWARD A, SANDLER M, CHEN B, et al. Searching for MobileNetV3[C]//2019 IEEE/CVF International Conference on Computer Vision. Seoul:IEEE,2019:1314-1324., articleTitle=Searching for MobileNetV3, refAbstract=null), Reference(id=1251505556008157405, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, doi=null, pmid=null, pmcid=null, year=2023, volume=124, issue=null, pageStart=106563, pageEnd=null, url=null, language=null, rfNumber=[27], rfOrder=30, authorNames=WANG C, ZHANG J R, HE J, journalName=Engineering Applications of Artificial Intelligence, refType=null, unstructuredReference=WANG C,ZHANG J R,HE J,et al.A two-stream network with complementary feature fusion for pest image classification[J].Engineering Applications of Artificial Intelligence,2023,124:106563., articleTitle=A two-stream network with complementary feature fusion for pest image classification, refAbstract=null), Reference(id=1251505556071071968, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, doi=null, pmid=null, pmcid=null, year=2022, volume=null, issue=null, pageStart=11966, pageEnd=11976, url=null, language=null, rfNumber=[28], rfOrder=31, authorNames=LIU Z, MAO H Z, WU C Y, journalName=null, refType=null, unstructuredReference=LIU Z,MAO H Z,WU C Y,et al.A ConvNet for the 2020s[C]//2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition.New Orleans:IEEE,2022:11966-11976., articleTitle=A ConvNet for the 2020s, refAbstract=null), Reference(id=1251505556138180834, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, doi=null, pmid=null, pmcid=null, year=2023, volume=null, issue=null, pageStart=12021, pageEnd=12031, url=null, language=null, rfNumber=[29], rfOrder=32, authorNames=CHEN J R, KAO S H, HE H, journalName=null, refType=null, unstructuredReference=CHEN J R,KAO S H,HE H,et al.Run,don't walk:Chasing higher FLOPS for faster neural networks[C]//2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition.Vancouver:IEEE, 2023:12021-12031., articleTitle=Run,don't walk:Chasing higher FLOPS for faster neural networks, refAbstract=null), Reference(id=1251505556213678308, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, doi=null, pmid=null, pmcid=null, year=2022, volume=null, issue=null, pageStart=9992, pageEnd=10002, url=null, language=null, rfNumber=[30], rfOrder=33, authorNames=LIU Z, LIN Y T, CAO Y, journalName=null, refType=null, unstructuredReference=LIU Z,LIN Y T,CAO Y,et al.Swin transformer:Hierarchical vision transformer using shifted windows[C]//2021 IEEE/CVF International Conference on Computer Vision.Montreal:IEEE,2022:9992-10002., articleTitle=Swin transformer:Hierarchical vision transformer using shifted windows, refAbstract=null), Reference(id=1251505556305952997, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, doi=null, pmid=null, pmcid=null, year=2023, volume=82, issue=19, pageStart=29567, pageEnd=29586, url=null, language=null, rfNumber=[31], rfOrder=34, authorNames=SU Z B, LUO J Q, WANG Y, journalName=Multimedia Tools and Applications, refType=null, unstructuredReference=SU Z B,LUO J Q,WANG Y,et al.Comparative study of ensemble models of deep convolutional neural networks for crop pests classification[J].Multimedia Tools and Applications,2023,82(19):29567-29586., articleTitle=Comparative study of ensemble models of deep convolutional neural networks for crop pests classification, refAbstract=null), Reference(id=1251505556398227686, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, doi=null, pmid=null, pmcid=null, year=2024-12-08, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[32], rfOrder=35, authorNames=ZHU L H, LIAO B C, ZHANG Q, journalName=null, refType=null, unstructuredReference=ZHU L H,LIAO B C,ZHANG Q,et al.Vision mamba:Efficient visual representation learning with bidirectional state space model[EB/OL].[2024-12-08].https://doi.org/10.48550/arXiv.2401.09417., articleTitle=Vision mamba:Efficient visual representation learning with bidirectional state space model, refAbstract=null), Reference(id=1251505556494696680, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, doi=null, pmid=null, pmcid=null, year=2020, volume=128, issue=2, pageStart=336, pageEnd=359, url=null, language=null, rfNumber=[33], rfOrder=36, authorNames=SELVARAJU R R, COGSWELL M, DAS A, journalName=International Journal of Computer Vision, refType=null, unstructuredReference=SELVARAJU R R,COGSWELL M,DAS A,et al. Grad-CAM:Visual explanations from deep networks via gradient-based localization[J].International Journal of Computer Vision,2020,128(2):336-359., articleTitle=Grad-CAM:Visual explanations from deep networks via gradient-based localization, refAbstract=null)], funds=null, companyList=[AuthorCompany(id=1251505549372768260, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, xref=1., ext=[AuthorCompanyExt(id=1251505549389545477, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, companyId=1251505549372768260, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1.School of Communications and Information Engineering,Xi'an University of Posts and Telecommunications,Xi'an 710121,China), AuthorCompanyExt(id=1251505549397934086, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, companyId=1251505549372768260, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1.西安邮电大学通信与信息工程学院,陕西西安 710121)]), AuthorCompany(id=1251505549473431562, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, xref=2., ext=[AuthorCompanyExt(id=1251505549481820170, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, companyId=1251505549473431562, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=2.Xi'an Key Laboratory of Image Processing Technology and Applications for Public Security,Xi'an 710121,China), AuthorCompanyExt(id=1251505549490208779, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, companyId=1251505549473431562, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=2.西安市公共安全图像处理技术及应用重点实验室,陕西西安 710121)])], figs=[ArticleFig(id=1251505551562195039, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, language=EN, label=null, caption=null, figureFileSmall=iCMh5N3iCSYWqW7Rd+A7mw==, figureFileBig=N4MeBUqPit0pfkGa92AxzQ==, tableContent=null), ArticleFig(id=1251505551658664035, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, language=CN, label=图1, caption=CFFDS-Net整体架构示意图, figureFileSmall=iCMh5N3iCSYWqW7Rd+A7mw==, figureFileBig=N4MeBUqPit0pfkGa92AxzQ==, tableContent=null), ArticleFig(id=1251505551859990636, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, language=EN, label=null, caption=null, figureFileSmall=zfUkj7Qvo84WllTfv1iqmA==, figureFileBig=CUcoHwpuQFPMdwsWlYNM7A==, tableContent=null), ArticleFig(id=1251505551960653935, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, language=CN, label=图2, caption=HMSP模块结构示意图, figureFileSmall=zfUkj7Qvo84WllTfv1iqmA==, figureFileBig=CUcoHwpuQFPMdwsWlYNM7A==, tableContent=null), ArticleFig(id=1251505552086483059, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, language=EN, label=null, caption=null, figureFileSmall=hgNU55ljTZJq57jUu1/vKw==, figureFileBig=Wyar9G9XS6syeuJ46NYeng==, tableContent=null), ArticleFig(id=1251505552170369143, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, language=CN, label=图3, caption=AF-Mamba模块结构示意图, figureFileSmall=hgNU55ljTZJq57jUu1/vKw==, figureFileBig=Wyar9G9XS6syeuJ46NYeng==, tableContent=null), ArticleFig(id=1251505552245866617, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, language=EN, label=null, caption=null, figureFileSmall=kXxx/RgBLLFNYx8INhOOXw==, figureFileBig=38gRFeGHgTE/j9WtLJ7eDQ==, tableContent=null), ArticleFig(id=1251505552317169788, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, language=CN, label=图4, caption=AWF模块结构示意图, figureFileSmall=kXxx/RgBLLFNYx8INhOOXw==, figureFileBig=38gRFeGHgTE/j9WtLJ7eDQ==, tableContent=null), ArticleFig(id=1251505552413638782, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, language=EN, label=null, caption=null, figureFileSmall=lh7RqhIy06Qx8BrLwT6X8w==, figureFileBig=Ce9z9qMT+or/826RzrIhVQ==, tableContent=null), ArticleFig(id=1251505552493330559, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, language=CN, label=图5, caption=IP102数据集中的害虫样本示例, figureFileSmall=lh7RqhIy06Qx8BrLwT6X8w==, figureFileBig=Ce9z9qMT+or/826RzrIhVQ==, tableContent=null), ArticleFig(id=1251505552556245122, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, language=EN, label=null, caption=null, figureFileSmall=AZY03TvQWkb9tcUWD/0cHA==, figureFileBig=gaiZqoQ829phQkBVus1dwg==, tableContent=null), ArticleFig(id=1251505552631742597, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, language=CN, label=图6, caption=部分害虫在不同网络上的可视化结果, figureFileSmall=AZY03TvQWkb9tcUWD/0cHA==, figureFileBig=gaiZqoQ829phQkBVus1dwg==, tableContent=null), ArticleFig(id=1251505552707240069, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, language=EN, label=null, caption=null, figureFileSmall=null, figureFileBig=null, tableContent=
类别名称型号和参数
硬件中央处理器AMD EPYCTM 9754
图像处理器NVIDIARTX 4090D(24G)
操作系统Windows10
软件框架Pytorch 2.0-cuda 11.8
编程语言Python 3.8
环境Anaconda 4.12
), ArticleFig(id=1251505552778543240, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, language=CN, label=表1, caption=

实验环境

, figureFileSmall=null, figureFileBig=null, tableContent=
类别名称型号和参数
硬件中央处理器AMD EPYCTM 9754
图像处理器NVIDIARTX 4090D(24G)
操作系统Windows10
软件框架Pytorch 2.0-cuda 11.8
编程语言Python 3.8
环境Anaconda 4.12
), ArticleFig(id=1251505552875012234, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, language=EN, label=null, caption=null, figureFileSmall=null, figureFileBig=null, tableContent=
网络准确率/%召回率/%F1分数/%模型参数量/M
DMF-ResNet59.2258.3729.70
ResNet-5049.5040.1025.64
Mobilent-V363.2058.7060.904.30
EfficientNet-B765.7060.9065.2064.70
ConvNeXt68.4067.2067.8027.90
FasterNet66.1030.00
ViT65.5057.7061.4049.30
SwinViT70.2069.7069.9087.70
GAEnsemble67.1367.1365.76
SAEnsemble66.30
Vmamba66.4165.3065.1130.00
ViM62.4256.6858.6226.00
VRFNet68.3468.3368.34
CFFDS-Net71.1971.0171.0829.22
), ArticleFig(id=1251505552971481230, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, language=CN, label=表2, caption=

不同网络在IP102数据集上的性能对比

, figureFileSmall=null, figureFileBig=null, tableContent=
网络准确率/%召回率/%F1分数/%模型参数量/M
DMF-ResNet59.2258.3729.70
ResNet-5049.5040.1025.64
Mobilent-V363.2058.7060.904.30
EfficientNet-B765.7060.9065.2064.70
ConvNeXt68.4067.2067.8027.90
FasterNet66.1030.00
ViT65.5057.7061.4049.30
SwinViT70.2069.7069.9087.70
GAEnsemble67.1367.1365.76
SAEnsemble66.30
Vmamba66.4165.3065.1130.00
ViM62.4256.6858.6226.00
VRFNet68.3468.3368.34
CFFDS-Net71.1971.0171.0829.22
), ArticleFig(id=1251505553042784396, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, language=EN, label=null, caption=null, figureFileSmall=null, figureFileBig=null, tableContent=
网络准确率/%召回率/%F1分数/%模型参数量/M
VRFNet99.1299.1299.13
ResNet-5097.2094.8096.0023.70
MobilentV397.7797.3497.304.30
EfficientNet-B796.2095.2095.7064.70
ConvNeXt97.1094.0095.5027.90
ViT95.2093.6094.4049.30
SwinViT97.6096.8097.3087.70
GAEnsemble98.8198.8198.81
SAEnsemble98.50
Vmamba98.0997.5197.8730.00
ViM97.0796.7196.9526.00
VRFNet99.1299.1299.13
CFFDS-Net99.3699.2899.2729.22
), ArticleFig(id=1251505553126670478, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, language=CN, label=表3, caption=

不同网络在D0数据集上的性能对比

, figureFileSmall=null, figureFileBig=null, tableContent=
网络准确率/%召回率/%F1分数/%模型参数量/M
VRFNet99.1299.1299.13
ResNet-5097.2094.8096.0023.70
MobilentV397.7797.3497.304.30
EfficientNet-B796.2095.2095.7064.70
ConvNeXt97.1094.0095.5027.90
ViT95.2093.6094.4049.30
SwinViT97.6096.8097.3087.70
GAEnsemble98.8198.8198.81
SAEnsemble98.50
Vmamba98.0997.5197.8730.00
ViM97.0796.7196.9526.00
VRFNet99.1299.1299.13
CFFDS-Net99.3699.2899.2729.22
), ArticleFig(id=1251505553227333776, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, language=EN, label=null, caption=null, figureFileSmall=null, figureFileBig=null, tableContent=
网络HMSPAF-MambaAWFLEQ准确率/%F1分数/%
单分支×××64.5159.53
×××66.9465.13
××65.7361.17
××67.4565.76
双分支××68.5767.28
×70.3269.56
×69.4369.17
CFFDS-Net71.1971.08
), ArticleFig(id=1251505553319608467, tenantId=1146029695717560320, journalId=1251233954884272221, articleId=1251505540774445632, language=CN, label=表4, caption=

IP102数据集上的消融实验结果

, figureFileSmall=null, figureFileBig=null, tableContent=
网络HMSPAF-MambaAWFLEQ准确率/%F1分数/%
单分支×××64.5159.53
×××66.9465.13
××65.7361.17
××67.4565.76
双分支××68.5767.28
×70.3269.56
×69.4369.17
CFFDS-Net71.1971.08
)], attaches=null, journal=Journal(id=1251231493423411294, delFlag=0, nameCn=西安邮电大学学报, nameEn=Journal of Xi'an University of Posts and Telecommunications, nameHistory1=null, nameHistory2=null, issn=2095-6533, eissn=, cn=61-1493/TN, coden=null, periodic=双月刊, language=CN, oaType=1, ccby=null, superviseOffice=null, ownerOffice=null, pubOffice=null, editorOffice=null, officeType=null, aims=null, clcCode=null, officeProv=null, officeCity=null, officeAddr=null, officeZip=null, officeEmail=, officePhone=, editDirector=null, officeDirector=null, officeDirectorPhone=null, officeStaffNum=null, officeEmpNum=null, coverPicUrl=tH6CrDBQFqwqmwUccjpjow==, journalPrice=null, startedYear=null, abbrevIsoEn=Journal of Xi'an University of Posts and Telecommunications, journalRemark=null, publicationField=null, createdTime=1776246434792, updatedTime=1776251858439, createdBy=18614031015, updatedBy=13701087609, firstLetterCn=J, firstLetterEn=J, subjectCode=Natural Sciences, subjectName=自然科学, subjectCodeEn=Natural Sciences, subjectNameEn=null, picCn=tH6CrDBQFqwqmwUccjpjow==, picEn=TGVaHCgPGQlZdesVp9aDew==, jcr=null, cjcr=null, exts=[JournalExt(id=1251254241990493156, language=CN, name=西安邮电大学学报, nameHistory1=null, nameHistory2=null, managedBy=, sponsoredBy=, publishedBy=, editorOffice=, officeProv=null, officeCity=null, officeAddr=, officeZip=, editDirector=, officeDirector=null, officePhone=null, coverPicUrl=null, journalRemark=, submitArticleUrl=null, websiteUrl=, createdTime=1776251858470, updatedTime=1776251858470, createdBy=13701087609, updatedBy=13701087609, submissionGuidelinesUrl=, submissionAuthorUrl=https://xayd.cbpt.cnki.net/index.aspx?t=1, submissionEditorUrl=https://xayd.cbpt.cnki.net/index.aspx?t=3, submissionReviewUrl=https://xayd.cbpt.cnki.net/index.aspx?t=2, submissionCeEditorUrl=, submissionAeEditorUrl=, option={"copyright":""}), JournalExt(id=1251254242036630501, language=EN, name=Journal of Xi'an University of Posts and Telecommunications, nameHistory1=null, nameHistory2=null, managedBy=, sponsoredBy=, publishedBy=, editorOffice=, officeProv=null, officeCity=null, officeAddr=, officeZip=, editDirector=, officeDirector=null, officePhone=null, coverPicUrl=null, journalRemark=, submitArticleUrl=null, websiteUrl=, createdTime=1776251858481, updatedTime=1776251858481, createdBy=13701087609, updatedBy=13701087609, submissionGuidelinesUrl=, submissionAuthorUrl=https://xayd.cbpt.cnki.net/index.aspx?t=1, submissionEditorUrl=https://xayd.cbpt.cnki.net/index.aspx?t=3, submissionReviewUrl=https://xayd.cbpt.cnki.net/index.aspx?t=2, submissionCeEditorUrl=, submissionAeEditorUrl=, option={"copyright":""})], databaseList=null, tenantJournalId=1251233954884272221, websiteList=[Website(id=1251257283410346025, webName=null, webTitle=null, webDomain=null, webCopyrigh=null, webIpcNo=null, seoTitle=null, seoKeywords=null, seoDescription=null, tenantJournalId=null, journalId=1251233954884272221, journalNameCn=null, journalNameEn=null, grayFlag=null, tenantId=1146029695717560320, platformId=null, journalGroupId=null, journalGroupNameCn=null, journalGroupNameEn=null, type=1, domain=https://castjournals.cast.org.cn/joweb/xayddxxb/CN, language=CN, createTime=1776252583601, createBy=18614031015, updateTime=1776252942185, updateBy=18614031015, name=西安邮电大学学报-中文, tplId=1146099689490845704, title=西安邮电大学学报, delFlag=0, indexPage=/home, props=[WebsiteProps(id=1251258885185683591, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251257283410346025, code=articleTextType, value=kx, createTime=1776252965494, updateTime=1776252965494, creator=18614031015, updator=18614031015), WebsiteProps(id=1251258885160517764, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251257283410346025, code=banner, value=null, createTime=1776252965488, updateTime=1776252965488, creator=18614031015, updator=18614031015), WebsiteProps(id=1251258885202460810, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251257283410346025, code=grayFlag, value=0, createTime=1776252965498, updateTime=1776252965498, creator=18614031015, updator=18614031015), WebsiteProps(id=1251258885152129155, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251257283410346025, code=logo, value=https://castjournals.cast.org.cn/joweb/xayddxxb/CN/file/pic?fileId=oIRdMBF7r3ynDYM5hP49NA==, createTime=1776252965486, updateTime=1776252965486, creator=18614031015, updator=18614031015), WebsiteProps(id=1251258885219238028, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251257283410346025, code=minRunFlag, value=0, createTime=1776252965502, updateTime=1776252965502, creator=18614031015, updator=18614031015), WebsiteProps(id=1251258885177294982, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251257283410346025, code=picServerUrl, value=https://castjournals.cast.org.cn/joweb/xayddxxb/CN/file/pic, createTime=1776252965492, updateTime=1776252965492, creator=18614031015, updator=18614031015), WebsiteProps(id=1251258885210849419, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251257283410346025, code=silenceFlag, value=0, createTime=1776252965500, updateTime=1776252965500, creator=18614031015, updator=18614031015), WebsiteProps(id=1251258885168906373, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251257283410346025, code=staticResourcePath, value=https://castjournals.cast.org.cn/joweb/cast_kjdb_cn_619/, createTime=1776252965490, updateTime=1776252965490, creator=18614031015, updator=18614031015), WebsiteProps(id=1251258885194072200, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251257283410346025, code=themeColor, value=null, createTime=1776252965496, updateTime=1776252965496, creator=18614031015, updator=18614031015), WebsiteProps(id=1251258885198266505, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251257283410346025, code=themeStyle, value=null, createTime=1776252965497, updateTime=1776252965497, creator=18614031015, updator=18614031015)]), Website(id=1251257283594895425, webName=null, webTitle=null, webDomain=null, webCopyrigh=null, webIpcNo=null, seoTitle=null, seoKeywords=null, seoDescription=null, tenantJournalId=null, journalId=1251233954884272221, journalNameCn=null, journalNameEn=null, grayFlag=null, tenantId=1146029695717560320, platformId=null, journalGroupId=null, journalGroupNameCn=null, journalGroupNameEn=null, type=1, domain=https://castjournals.cast.org.cn/joweb/xayddxxb/EN, language=EN, createTime=1776252583645, createBy=18614031015, updateTime=1776252932924, updateBy=18614031015, name=西安邮电大学学报-英文, tplId=1146101810881728533, title=Journal of Xi'an University of Posts and Telecommunications, delFlag=0, indexPage=/home, props=[WebsiteProps(id=1251258912788398232, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251257283594895425, code=articleTextType, value=kx, createTime=1776252972075, updateTime=1776252972075, creator=18614031015, updator=18614031015), WebsiteProps(id=1251258912767426709, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251257283594895425, code=banner, value=null, createTime=1776252972070, updateTime=1776252972070, creator=18614031015, updator=18614031015), WebsiteProps(id=1251258912805175451, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251257283594895425, code=grayFlag, value=0, createTime=1776252972079, updateTime=1776252972079, creator=18614031015, updator=18614031015), WebsiteProps(id=1251258912759038100, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251257283594895425, code=logo, value=https://castjournals.cast.org.cn/joweb/xayddxxb/EN/file/pic?fileId=oIRdMBF7r3ynDYM5hP49NA==, createTime=1776252972068, updateTime=1776252972068, creator=18614031015, updator=18614031015), WebsiteProps(id=1251258912813564061, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251257283594895425, code=minRunFlag, value=0, createTime=1776252972081, updateTime=1776252972081, creator=18614031015, updator=18614031015), WebsiteProps(id=1251258912780009623, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251257283594895425, code=picServerUrl, value=https://castjournals.cast.org.cn/joweb/xayddxxb/EN/file/pic, createTime=1776252972073, updateTime=1776252972073, creator=18614031015, updator=18614031015), WebsiteProps(id=1251258912809369756, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251257283594895425, code=silenceFlag, value=0, createTime=1776252972080, updateTime=1776252972080, creator=18614031015, updator=18614031015), WebsiteProps(id=1251258912771621014, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251257283594895425, code=staticResourcePath, value=https://castjournals.cast.org.cn/joweb/cast_kjdb_en_623/, createTime=1776252972071, updateTime=1776252972071, creator=18614031015, updator=18614031015), WebsiteProps(id=1251258912792592537, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251257283594895425, code=themeColor, value=null, createTime=1776252972076, updateTime=1776252972076, creator=18614031015, updator=18614031015), WebsiteProps(id=1251258912796786842, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251257283594895425, code=themeStyle, value=null, createTime=1776252972077, updateTime=1776252972077, creator=18614031015, updator=18614031015)])], journalTitle=西安邮电大学学报, weixinUrl=null, journalUrl=https://xayd.cbpt.cnki.net/, iacademicId=null, status=1, seqNo=null, journalTitleEn=Journal of Xi'an University of Posts and Telecommunications, journalPhotoCn=tH6CrDBQFqwqmwUccjpjow==, journalPhotoEn=TGVaHCgPGQlZdesVp9aDew==, journalFirstLetter=J, journalRecommend=null, journalNew=null, journalCollection=null, jcrJf=null, cjcrJf=null, jcrJfStr=null, cjcrJfStr=null, submissionFirstDecision=null, sciSubjectClassification=null, casSubjectClassification=null, citeScore=null, totalCitationFrequency=null, icpCode=null, psCode=null, advertisingLicenseCode=null, copyrightInformation=null, country=null, option=, provinceCode=null, provinceName=null, collectFlag=false), detailUrlCn=https://castjournals.cast.org.cn/joweb/xayddxxb/CN/10.13682/j.issn.2095-6533.2025.06.013, detailUrlEn=https://castjournals.cast.org.cn/joweb/xayddxxb/EN/10.13682/j.issn.2095-6533.2025.06.013, pdfUrlCn=https://castjournals.cast.org.cn/joweb/xayddxxb/CN/PDF/10.13682/j.issn.2095-6533.2025.06.013, pdfUrlEn=https://castjournals.cast.org.cn/joweb/xayddxxb/EN/PDF/10.13682/j.issn.2095-6533.2025.06.013, aliStartDate=null, aliEndDate=null, collectionFlag=false, citedCount=null, citedUrl=null, reference=null)
收藏切换
面向害虫识别的互补特征融合双流网络
收藏切换
PDF下载
李大湘 1, 2 , 孙家宁 1 , 刘颖 1, 2
西安邮电大学学报 | 人工智能目标检测 2025,30(6): 113-122
收起
收藏切换
西安邮电大学学报 | 人工智能目标检测 2025, 30(6): 113-122
面向害虫识别的互补特征融合双流网络
全屏
李大湘1, 2 , 孙家宁1 , 刘颖1, 2
作者信息
  • 1.西安邮电大学通信与信息工程学院,陕西西安 710121
  • 2.西安市公共安全图像处理技术及应用重点实验室,陕西西安 710121
  • 李大湘(1974-),男,湖南麻阳人,博士,西安邮电大学副教授,主要研究方向为遥感图像分类、作物图像病虫害识别与机器学习。E-mail:

    孙家宁(2000-),女,陕西大荔人,西安邮电大学硕士研究生,主要研究方向为植物病虫害识别。E-mail:

    刘颖(1972-),女,陕西户县人,博士,西安邮电大学教授,主要研究方向为图像处理与模式识别。E-mail:

A dual-stream network with complementary feature fusion for pest identification
Daxiang LI1, 2 , Jianing SUN1 , Ying LIU1, 2
Affiliations
  • 1.School of Communications and Information Engineering,Xi'an University of Posts and Telecommunications,Xi'an 710121,China
  • 2.Xi'an Key Laboratory of Image Processing Technology and Applications for Public Security,Xi'an 710121,China
出版时间: 2025-11-10 doi: 10.13682/j.issn.2095-6533.2025.06.013
文章导航
收藏切换

针对害虫类别间细节差异小、田间背景干扰严重及样本分布不均衡等问题,提出一种面向害虫识别的互补特征融合双流网络。该网络结合卷积神经网络的局部感知能力与Mamba模型的全局建模能力,捕获并融合害虫图像的全局与局部信息。设计层次化多尺度感知模块,通过分组层次化卷积提取多尺度图像特征,并采用细节强化感知策略增强害虫细节信息;设计自适应聚焦Mamba模块,利用动态卷积算子定位害虫关键区域,减少复杂背景干扰;设计注意力加权融合模块,通过交叉注意力机制实现全局和局部特征的自适应交互优化,进一步提升语义表达的准确性。最后,构建均衡损失函数,缓解数据集类别不平衡的影响。实验结果表明,该网络在大规模害虫数据集IP102上的准确率达到71.19%,在D0数据集上的准确率为99.36%,能够有效识别害虫种类。

害虫识别  /  Mamba  /  多尺度特征提取  /  特征融合  /  注意力机制

To address the challenges of small inter-class differences in pest details,severe field background interference,and imbalanced sample distribution,a complementary feature fusion dual-stream network for pest recognition is proposed.This network combines the local perception capability of convolutional neural networks with the global modeling ability of the Mamba model,capturing and integrating the global and local information of pest images.A hierarchical multiscale perception module is designed to extract multi-scale image features through grouped hierarchical convolution and enhance pest detail information with a detail enhancement perception strategy.An adaptive focusing Mamba module is designed to locate key pest regions using dynamic convolution operators and reduce complex background interference.Additionally,an attention-weighted fusion module is designed to achieve adaptive interaction and optimization of global and local features through a cross-attention mechanism,further improving the accuracy of semantic expression.A balanced loss function is constructed to mitigate the effects of class imbalance in the dataset.The experimental results show that the network achieves an accuracy of 71.19%on the large-scale pest dataset IP102,and an accuracy of 99.36%on the D0 dataset,demonstrating its ability to effectively identify pest species.

pest recognition  /  Mamba  /  multi-scale feature extraction  /  feature fusion  /  attention mechanism
李大湘, 孙家宁, 刘颖. 面向害虫识别的互补特征融合双流网络. 西安邮电大学学报, 2025 , 30 (6) : 113 -122 . DOI: 10.13682/j.issn.2095-6533.2025.06.013
Daxiang LI, Jianing SUN, Ying LIU. A dual-stream network with complementary feature fusion for pest identification[J]. Journal of Xi'an University of Posts and Telecommunications, 2025 , 30 (6) : 113 -122 . DOI: 10.13682/j.issn.2095-6533.2025.06.013
害虫的广泛传播和快速演变已对全球农业构成了严重威胁[1]。由于害虫种类繁多且具有一定的相似性,传统人工识别方法不仅耗时费力,且需要高度的专业知识与实践经验,在大规模农业生产中难以高效实施[2]。因此,利用图像处理技术研究害虫识别算法[3]快速且准确地识别害虫种类,并及时采取精细化防治措施,已成为保障农业可持续发展的关键[4]
近年来,深度学习技术凭借其自动提取深层特征的能力,在复杂图像识别任务中展现了优越性能[5],逐渐成为害虫识别领域的研究热点[6]。现有的害虫识别网络主要可分为以下3类。第一类,卷积神经网络[7](Convolutional Neural Network,CNN):Liu等[8]基于Resnet在每个残差块中使用三分支提取图像的多尺度特征,设计了改进的深度多分支融合残差(Deep Multibranch Fusion Residual Network,DMF-ResNet)网络,提高了害虫识别的准确率;Nandhini等[9]设计视觉再生融合(Visual Regenerative Fusion Network,VRFNet)网络,利用数据增强技术和特征融合技术有效缓解了害虫识别任务中扭曲图像和遮挡图像的影响。然而,尽管CNN在提取局部特征方面表现优异,但受限于局部感受野,难以捕获复杂的全局特征。第二类,视觉变换器[10-11](Vision Transformer,ViT):Dosovitskiy等[12]通过ViT建模图像中的远程依赖关系,捕捉更为全面的上下文信息。Ishak[13]将ViT引入植物病虫害识别任务中,通过将主干网络中的传统卷积结构替换为SE-Net(Squeeze-Excitation Networks)提高精度,在4类玉米叶片病害数据集上达到了99.24%的准确率。然而,由于ViT缺乏归纳偏置且具有二次计算复杂度,导致其训练缓慢且难以收敛,需要更大的数据集和更长的训练时间。第三类,集成学习(Ensemble Learning,EL)方法[14-15]:Ayan等[16]对7个不同的预训练CNN模型进行微调和再训练,选择效果最好的3个模型进行集成,并使用遗传算法获得优化权重。EL方法通过训练多个基分类器,选择合适的策略组合多个模型的预测结果,获得比单个模型更好的预测效果。但是,该方法需要训练和维护多个模型,计算和存储成本较高,难以适用于实际的农业场景。
尽管上述网络表现良好,但面对背景复杂、种类繁多且类别不平衡的害虫图像时,现有研究存在分类精度低、泛化能力较弱的问题。因此,为了提高真实农业场景中的大规模害虫图像的识别准确性,提出一种面向害虫识别的互补特征融合双流网络(Complementary Feature Fusion Dual-Stream Network,CFFDS-Net)。针对害虫类别间细节差异较小的问题,CFFDS-Net在局部分支采用分层残差连接和细节强化感知的联合策略,捕捉多尺度的图像特征并增强网络对害虫微小差异的辨别能力。在全局分支通过远距离依赖建模和动态区域定位机制,以线性复杂度捕获全局语义信息,有效定位害虫关键区域,抑制背景干扰,并设计双向交叉注意力融合机制,实现全局与局部特征的自适应优化与协同整合。最后,构建均衡损失函数,缓解类别不平衡的负面影响,提升网络对少数类别害虫的识别精度。
CFFDS-Net采用双流主干结构并行处理输入图像,旨在高效提取与融合害虫图像的局部细粒度特征与全局上下文特征,其主要由4个核心模块组成,即层次化多尺度感知(Hierarchical Multi-Scale Perception,HMSP)模块、自适应聚焦Mamba(Adaptive Focus Mamba,AF-Mamba)模块、注意力加权融合(Attention-Weighted Fusion,AWF)模块和均衡损失函数LEQ,整体架构示意图如图1所示。首先,对输入图像IR224×224×3进行初步特征处理。在HMSP分支中,采用由步长为2的7×7卷积层构成的CNN Stem处理输入图像,提取浅层特征的同时将图像分辨率减半,减少计算开销。AF-Mamba分支则通过Patch Partition操作进行图像分块,生成分辨率为56×56×64初始特征图。在后续的特征提取过程中,网络分别对两条分支进行4个阶段(Stage1至Stage4)的特征下采样与表示学习。其中,HMSP分支在每个阶段堆叠不同数量的HMSP模块,逐步生成56×56×64、28×28×128、14×14×256和7×7×512的多层次特征表示。为了提高网络对害虫局部细节的敏感度,在HMSP模块中设计细节强化感知策略,提取更加细致的局部细节特征Flocal。AF-Mamba分支则在Stage2至Stage4阶段对特征图进行Patch Merging下采样操作,同时利用AF-Mamba模块的线性计算复杂度优势和长距离建模能力学习害虫图像的全局上下文关系,捕获全局聚焦特征Fglobal。然后,利用AWF模块对FlocalFglobal进行动态协同融合,自适应平衡FlocalFglobal的贡献度,最终得到互补特征Z。融合后的特征Z通过分类头与Softmax激活函数完成害虫分类任务,并在均衡损失函数LEQ(Equalization Loss,LEQ)的作用下,改善类别不平衡问题,进一步提升少数类别的识别性能。
基于深度神经网络中感受野随层数增加而扩大的特点[17],设计HMSP模块。与传统方法不同,HMSP模块可以直观地分成4个操作步骤,即划分(Split)、分组层次化卷积、细节强化感知(Detail Enhancement Perception,DEP)与特征拼接(Concat)。总结起来,HMSP模块主要是采用分组层次化卷积与DEP操作,且结合残差连接而不断累积扩展感受野,使模型能够在不同尺度上捕捉更加丰富的害虫图像细节信息,从而显著提升特征表达的全面性和精细度。HMSP模块结构示意图如图2所示。
给定XRH×W×C作为输入HMSP模块的特征图谱,其中WHC分别表示X的宽度、高度与通道数。首先,沿通道维度将X均匀划分为4个特征图子集,其中xiCi满足C=4×Ci。随后,对每个子特征xi依次执行一组卷积操作,表示为Ψi(·),其中包括3×3标准卷积和批量归一化,生成对应的特征图子集。HMSP模块最核心的设计是对每组Ψi(·)采用层次化的残差连接。这种设计使Ψi(·)不仅能处理当前层的特征图子集,还能整合前一层提取的特征信息,提升特征表达的丰富性。具体而言,第一个特征图子集x1直接通过Ψi(·)操作生成,然后第二个特征图子集x2与前一层的输出相加,再经卷积层Ψi(·)操作生成,重复该过程直到处理完所有特征图子集。上述计算过程可表示为
式中:⊕表示逐元素求和。
为了增强网络对害虫细节特征的感知能力,设计DEP策略进一步处理,DEP的具体过程如图2右侧部分所示。首先,对输入特征图进行全局平均池化(Global Average Pooling,GAP),得到f1Rh×w。接着将f1重塑为f2R1×1×(h×w,并将f2输入到Softmax函数中,生成权重矩阵。然后,将权重矩阵ADEP与输入特征相乘,实现内容感知的注意力增强,得到增强后的特征图。这个过程可表示为
式中:Reshape和Reshape-1分别表示重塑操作和其反向操作;G(·)表示全局平均池化操作;δ表示Softmax激活函数;⊙表示元素乘法。
最后,将DEP处理后的增强特征拼接起来,并在HMSP中使用残差结构保留原始特征,最终得到输出特征图RH×W×C,其计算过程可表示为
状态空间模型[18](State Space Model,SSM)源于现代控制理论中的线性时不变系统,具有线性复杂度,其核心是在连续时间t上,通过隐藏状态ht)∈RN将输入信号xt)∈R映射为输出信号yt)∈R,该过程可用线性常微分方程表示为
式中:ARN×NBRN×1CRN分别为状态矩阵、输入矩阵和输出矩阵。
在深度学习中,为了更好地处理文本和图像等离散输入,使用零阶保持技术将连续SSM转换为离散SSM,利用可学习的时间尺度参数Δ对AB进行离散化,具体过程为
式中:分别为AB的离散化版本;I为单位矩阵。
经式(5)离散化处理后,式(4)可转换为
式中:hn-1hn分别代表n-1时刻与n时刻的状态信息。
在选择性状态空间模型S6,也称为Mamba[19]中,矩阵BC和时间尺度参数Δ均由输入数据产生,使模型能够根据输入数据自适应调整参数,增强对输入上下文的感知能力。
针对复杂背景下害虫识别任务中的全局依赖关系建模与背景噪声抑制的协同需求,设计AF-Mamba模块,该模块包括关键区域自适应聚焦器(Adaptive Key Region Focuser,AKRF)和2D选择扫描(2D-Selective-Scan,SS2D)机制[20],结构示意图如图3所示。AKRF采用与Transformer[21]相似的结构,结合可变形卷积网络[22](Deformable Convolutional Networks v4,DCNv4)和混合前馈网络[23](Mix-Feed Forward Network,Mix-FFN),通过残差连接将特征进行逐步聚合和传递,增强网络对复杂图像信息的捕捉能力。
给定输入特征图MRH×W×C,其中HWC分别表示特征的高度、宽度与通道数。DCNv4通过动态学习的偏移量Δpk和调制权重Δmk,从输入特征图M中采样,生成具有更强空间感知能力的输出特征图M1,其计算过程可表示为
式中:M1p)表示在每个位置p上聚合的动态特征;K为卷积核大小,例如3×3时K=9;ΔmkΔpk分别为网络动态学习到的调制权重和偏移量,且Δmk∈[0,1];Mp+Δpk)表示输入特征图M在动态采样点p+Δpk处的值。
DCNv4的输出特征M1经过层归一化与残差连接,得到的特征表示为
式中:LN(·)表示层归一化操作。
M2输入到Mix-FFN中并使用残差连接,进行特征聚合与非线性变换,旨在更好地捕捉上下文特征之间的复杂关系,得到增强特征M3RH×W×C,其计算过程为
式中:W1W2为Mix-FFN中全连接层的权重矩阵;SiLU(·)为激活函数。
M3进行线性投影和深度卷积处理,将其输入到SS2D中。SS2D将形状为RH×W×C的特征图沿4个方向(左上到右下、右下到左上、右上到左下、左下到右上)展开,生成4个大小为RH×W×C的独立序列。这些序列通过SSM提取多方向的长距离依赖关系,并将4个方向的特征合并,从而生成完整的2D特征图RH×W×C
有效融合不同特征对提升网络的表征能力至关重要,传统的静态融合策略,如特征相加或特征拼接操作,因缺乏两种特征之间的交互感知能力,导致无法充分利用局部特征与全局特征的互补优势,因此设计AWF模块。该模块通过动态平衡HMSP分支和AF-Mamba分支的贡献,进行交互互补融合,使全局特征提供整体语义信息,局部特征强化细节表达,从而滤除背景干扰,获得更加全面且精准的特征表示。AWF模块结构示意图如图4所示。
FlocalFglocalRH×W×C分别表示来自HMSP分支的局部细节特征和来自AF-Mamba分支的全局聚焦特征。首先,对FlocalFglocal分别进行展平,得到相同维度的特征图谱S1S2S1S2RH×W×C。然后对S1S2进行线性投影,生成查询矩阵Q、键矩阵K和值V矩阵,具体表达式为
式中:WQWKWV1分别表示与QKV1V2相对应的可学习的线性变换矩阵。
不同于自注意力机制,AWF模块中的QK分别来自FglocalFlocal,这使得网络能够在特征空间中引入显示的全局和局部信息的交互关系。对QK进行缩放点积运算,并使用SoftMax函数进行归一化,从而得到注意力权重矩阵为
式中:d为缩放因子,用于平衡点积的数值范围,确保梯度稳定性。
使用注意力权重AAWFV1V2进行加权融合,实现FlocalFglocal的动态交互与整合,得到融合特征为
在自然环境中,难以获取害虫图像并进行标注,导致害虫数据集的类别分布极不均衡,传统交叉熵损失因梯度更新方向被多数类主导,导致分类决策边界严重偏向高频类别。因此,设计均衡损失函数LEQLEQ根据不同类别的样本数量差异,动态调整损失计算,从而对每个类别给予更加公平的关注,避免多数类主导训练过程。
具体来说,设D=为由N张图像组成的任意一个批次的训练样本集合,Il表示第l幅图像,yl表示该图像的类别标签。LEQ引入一个权重系数调整每个类别的损失,具体计算方式为
其中,
αl=1-βTλpl)(1-yl)。
式中:yl分别表示第l幅图像的真实标签与预测标签;αj表示LEQ的权重系数,用来抑制多数类的梯度更新,使网络能够更关注少数类的学习;β为随机变量,用于决定是否保持负样本的梯度,避免在少数类别训练过程中完全忽视负样本;pi表示类别i在数据集中的频率,由类别i的图像数量除以整个数据集的图像总数得到;Tλ(·)表示阈值函数,用于判断l是否属于该数据集中的少数类别,当plλ时输出1,即判断l为少数类别,在训练过程中增强该类的梯度更新,否则输出0。
在实验过程中,采用的软硬件平台配置如表1所示。所有输入图片的分辨率均被统一调整为224×224,在训练与测试过程中,选择Adam优化器,初始学习率设置为0.00003,且采用余弦退火衰减策略进行更新,批处理大小设置为16,训练迭代次数Epochs设置为100。
选取IP102[24]和D0[25]两个公开且常用的害虫数据集评估CFFDS-Net的有效性。IP102是一个用于害虫识别和检测任务的大型农作物害虫数据集,涵盖102种常见的农作物害虫,共计75222张图片,分辨率在150~400像素之间,按6∶1∶3的比例将数据集划分为训练集、验证集和测试集,其样本示例如图5所示。
IP102最大程度保留了真实农田场景的复杂性,具体表现为:1)种内差异突出,种间相似性高。图像覆盖了害虫的整个生命周期,包括卵、幼虫、蛹和成虫4个阶段,导致同一种害虫在不同阶段的外观特征差异显著同时,不同种类害虫在相同的生长阶段具有极大的相似性;2)数据集呈现严重的类别不平衡问题,其中最大类别包含3444张图像,最小类别仅包含42张图像;3)图像中包含大量复杂的背景信息,如叶片、杂草等,导致网络难以捕捉关键的害虫特征。
D0数据集涵盖了在野外捕获的40种害虫,总共4500张害虫图像,分辨率均为200×200,按照7∶2∶1的比例将D0数据集划分为训练、测试和验证3个子集,D0数据集同样存在数据不平衡问题,其中最大类包含238张图片,最小类包含50张图片。
选取准确率、精确率、召回率、F1分数及模型参数量等5个指标作为模型的综合评价指标。准确率是指预测正确的样本数占总样本数的比例,反映模型的总体预测能力;精确率是指模型预测为正且实际为正的样本占模型预测为正类样本总数的比例;召回率是指模型预测为正的样本占实际为正的样本的比例;F1分数则为精确率和召回率的加权调和平均值;模型参数量用来评估各模型的复杂度和计算开销,验证其在实际农业生产应用中的可行性。
为全面评估CFFDS-Net的性能,选取经典网络和最新害虫识别网络进行对比分析,分别为基于CNN的网络:DMF-ResNet、Res-Net50[24]、Mobilent-V3[26]、EfficientNet-B7[27]、ConvNeXt[28]和FasterNet[29];基于Transformer的网络:ViT、Sw inViT[30];基于集成学习方法的网络:GAEnsemble和SAEnsemble[31]。此外,Mamba作为图像处理领域内新兴且具有影响力的模型,已在其他下游任务中得到广泛应用。为进一步验证CFFDS-Net网络的性能,将其与Vmamba和ViM[32]进行比较。最后,将CFFDS-Net与当前最新的双流网络VRFNet进行对比,以验证其在害虫识别任务中的先进性。不同网络在IP102数据集上的实验结果如表2所示。
根据表2的数据可知,CFFDS-Net在IP102数据集上展现了良好的识别效果,其准确率、召回率和F1分数分别达到了71.19%、71.01%和71.08%,达到最优值,且CFFDS-Net的参数量为29.22M,在性能与模型复杂度之间实现了较好的平衡,比较适合实际的模型部署。在基于CNN的网络中,与IP102数据集创建者在ResNet-50网络的测试结果相比,CFFDS-Net的准确率提高了21.69个百分点,同时与其他CNN网络相比,CFFDS-Net的准确率均可以提高2.79~11.97个百分点;在基于Transformer和Mamba的网络中,相比SwinViT和Vmamba,CFFDS-Net的准确率分别提高0.99个百分点和4.78个百分点,表明CFFDS-Net在捕捉远距离特征依赖上具有优势;最后,与最新的双流网络VRFNet相比,CFFDS-Net表现出显著优势,表明CFFDS-Net能够更加有效地提取并融合互补特征。综上所述,在大型复杂害虫数据集IP102上的实验结果表明,CFFDS-Net能够在包含多类害虫和干扰背景的场景下有效地识别和区分不同的害虫种类,展现出了较高的鲁棒性和良好的泛化性能。
为了更全面地评估CFFDS-Net的性能,使用D0数据集进行对比实验,结果如表3所示。由表3数据可知,现有模型在D0数据集上的识别精度均较高,但基于Transformer的网络在D0数据集上表现不佳,这是因为Transformer通常需要大量数据来充分捕捉和学习数据中的复杂特征和模式。相比之下,CFFDS-Net在D0数据集上获得了99.36%的准确率,优于所有模型,表明CFFDS-Net不仅在大型数据集上具备出色的识别能力,在较小的数据集上也展现了强大的泛化性能。
为了验证在CFFDS-Net中的关键模块(即HMSP模块、AF-Mamba模块和AWF模块)以及均衡损失函数LEQ在害虫识别网络中的有效性,基于IP102数据集进行消融实验。实验中,使用交叉熵损失函数作为对比基线,之后使用LEQ代替交叉熵损失函数,以验证LEQ的有效性。在进行双分支网络的模块消融实验时,为验证AWF模块的有效性,使用简单的特征相加操作代替AWF模块融合HMSP模块和AF-Mamba模块提取的特征。消融实验结果如表4所示。
表4可知,在单分支网络的消融实验中,与交叉熵损失函数相比,在使用LEQ后,HMSP模块的准确率和F1分数分别提高了1.22个百分点和1.64个百分点,AF-Mamba模块的准确率和F1分数分别提高0.51个百分点和0.63个百分点,说明LEQ能够有效缓解害虫图像的类别不平衡问题,有效地提升了网络对稀有类别的识别精度。在双分支网络的消融实验中,首先,“HMSP+AF-Mamba+特征相加”虽然不如完整的CFFDS-Net有效,但相比于单分支网络仍有一定的性能提升,这说明HMSP和AF-Mamba模块在特征提取和表示学习方面具有互补性。其次,相比于“HMSP+AF-Mamba+特征相加”,“HMSP+AF-Mamba+AWF”网络的准确率和F1分数分别提高了1.75个百分点和2.28个百分点,由此可见AWF模块能够充分利用局部特征和全局特征之间的互补优势,增强了网络对特征的判别能力。最后,“HMSP+AF-Mamba+AWF+LEQ”的准确率达到71.19%,F1分数达到71.08%,完整网络CFFDS-Net取得了最优性能,说明各模块之间具有良好的互补性。
为了直观展现CFFDS-Net的有效性并直观展示经AWF模块融合所得互补特征图谱对害虫的感知能力,采用Grad-CAM[33]技术对IP102数据集中的部分害虫图像进行了可视化分析,如图6所示。由图6可以看出,与其他网络相比,CFFDS-Net在害虫目标区域定位的精度和关注度方面表现出显著优势。具体而言,CFFDS-Net能够精确聚焦目标区域并关注害虫细节,以第二行中的绿芫菁为例,Restnet-50、EfficientNet等网络易陷入局部关注,但CFFDS-Net的类激活图能够完全覆盖害虫的关键部位,并且关注到害虫的触角、前肢等细节信息。其次,CFFDS-Net在能够有效抑制背景干扰。例如在第三行含有复杂背景的玉米害虫图像中,CFFDSNet的激活区域紧密贴合害虫本体,而其他模型的激活区域则较为分散,部分包含了无关背景信息。此外,CFFDS-Net在不同害虫类型上展现出高度一致的目标关注区域,这种一致性反映了模型在不同害虫类别和环境条件下的卓越泛化能力,确保了其在多样化农业场景中的准确及稳定识别。综上所述,CFFDS-Net在定位精度、背景抑制和目标细节关注方面均表现出显著优势,验证了其在害虫识别任务中的优越性能。
针对真实农业场景中害虫识别任务面临的类间细节差异小、背景干扰严重及类别分布不平衡的挑战,设计了互补特征融合双流网络CFFDS-Net实现对害虫图像的准确识别。利用HMSP模块和AF-Mamba模块捕获局部细节特征和全局语义特征,通过AWF模块动态融合高相关性特征,增强网络的语义表征能力并抑制背景噪声干扰。构建均衡损失函数并自适应调整各类别的权重,增强网络对少数类别的关注,有效缓解了多数类主导训练过程的问题,从而提升了网络对类别分布不均衡数据集的分类性能。实验结果表明,CFFDS-Net能够有效识别害虫图像,所设计的模块均对提高网络的识别精度和泛化能力起到了积极作用。未来工作将进一步探索更轻量化的网络设计,以实现在移动设备上的高效部署,为智能化害虫监测与精准农业管理提供强有力的技术支持。
参考文献 引证文献
排序方式:
[1]
FU X Q,MA Q Y,YANG F F,et al.Crop pest image recognition based on the improved ViT method[J]. Information Processing in Agriculture,2024,11(2):249-259.
[2]
ODOUNFA M G F,GBEMAVO C D S J,TAHI S P G,et al.Deep learning methods for enhanced stress and pest management in market garden crops:A comprehensive analysis[J].Smart Agricultural Technology,2024,9:100521.
[3]
VENKATASAICHANDRAKANTH P,IYAPPARAJA M.A survey on pest detection and classification in field crops using artificial intelligence techniques[J]. International Journal of Intelligent Robotics and Applications,2024,8(3):709-734.
[4]
李睿,王莹,王恒.基于树莓派的室内植物病虫害识别系统设计[J].电子设计工程,2022,30(8):114-118.
LI R,WANG Y,WANG H.Design of house plants diseases and insect pests recognition system based on Raspberry Pi[J]. Electronic Design Engineering, 2022,30(8):114-118.(in Chinese)
[5]
胡海峰,倪宗煜,赵海涛,.无人机场景下基于Transformer的轻量化行人重识别[J].南京邮电大学学报(自然科学版),2024,44(3):48-62.
HU H F,NI Z Y,ZHAO H T,et al.Transformer based lightweight person re-identification in unmanned aerial vehicle images[J].Journal of Nanjing University of Posts and Telecommunications(Natural Science Edition),2024,44(3):48-62.(in Chinese)
[6]
GUO B Y,WANG J J,GUO M H,et al.Overview of pest detection and recognition algorithms[J].Electronics,2024,13(15):3008.
[7]
刘志,翟瑞芳,彭万伟,.融合注意力机制的Cascade R-CNN田间害虫检测方法[J].华中农业大学学报,2023,42(3):133-142.
LIU Z,ZHAI R F,PENG W W,et al.Field pest detection method based on improved Cascade R-CNN by incorporating attention mechanism[J]. Journal of Huazhong Agricultural University,2023,42(3):133-142.(in Chinese)
[8]
LIU W J,WU G Q,REN F J.Deep multibranch fusion residual network for insect pest recognition[J]. IEEE Transactions on Cognitive and Developmental Systems,2021,13(3):705-716.
[9]
NANDHINI C,BRINDHA M.Visual regenerative fusion network for pest recognition[J].Neural Computing and Applications,2024,36(6):2867-2882.
[10]
HECHEN Z,HUANG W,YIN L,et al.Dilated-windows-based vision transformer with efficient-suppressive-self-attention for insect pests classification[J]. Engineering Applications of Artificial Intelligence,2024,127:107228.
[11]
FANG M W,TAN Z P,TANG Y,et al.Pest-conformer:A hybrid CNN-transformer architecture for large-scale multi-class crop pest recognition[J].Expert Systems with Applications,2024,255:124833.
[12]
DOSOVITSKIY A,BEYER L,KOLESNIKOV A,et al.An image is worth 16x16 words:Transformers for image recognition at scale[C]//9th International Conference on Learning Representations(ICLR).Austria:ICLR,2021:15-35.
[13]
PACAL I.Enhancing crop productivity and sustainability through disease identification in maize leaves:Exploiting a large dataset with an advanced vision transformer model[J].Expert Systems with Applications,2024,238:122099.
[14]
李雨晴,陈燕红,李永可,.作物害虫图像智能识别方法[J].新疆农业科学, 2023, 60 (12):2973-2981.
LI Y Q,CHEN Y H,LI Y K,et al.Image intelligent recognition method of crop pests[J].Xinjiang Agricultural Sciences,2023,60(12):2973-2981.(in Chinese)
[15]
XIA W S,HAN D Z,LI D,et al.An ensemble learning integration of multiple CNN with improved vision transformer models for pest classification[J].Annals of Applied Biology,2023,182(2):144-158.
[16]
AYAN E,ERBAY H,VARC1N F.Crop pest classification with a genetic algorithm-based weighted ensemble of deep convolutional neural networks[J]. Computers and Electronics in Agriculture, 2020, 179:105809.
[17]
LIU Y G,YU J Z,HAN Y H.Understanding the effective receptive field in semantic image segmentation[J].Multimedia Tools and Applications,2018,77 (17):22159-22171.
[18]
GU A,GOEL K,RÉC .Efficiently modeling long sequences with structured state spaces[EB/OL].[2024-12-08].https://doi.org/10.48550/arXiv.2111.00396.
[19]
GU A,DAO T.Mamba:Linear-time sequence modeling with selective state spaces[EB/OL].[2024-12-08].https://doi.org/10.48550/arXiv.2312.00752.
[20]
LIU Y,TIAN Y J,ZHAO Y Z,et al.VMamba:Visual state space model[EB/OL].[2024-12-08].https://doi.org/10.48550/arXiv.2401.10166.
[21]
VASWANI A,SHAZEER N,PARMAR N,et al. Attention is all you need[C]//31st International Conference on Neural Information Processing Systems (NIPS'17).Long Beach:Curran Associates,2017:6000-6010.
[22]
XIONG Y W,LI Z Q,CHEN Y T,et al.Efficient deformable ConvNets:Rethinking dynamic and sparse operator for vision applications[C]//2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition.Seattle:IEEE,2024:5652-5661.
[23]
CHEN B,ZOU X C,ZHANG Y,et al.LEFormer:A hybrid CNN-transformer architecture for accurate lake extraction from remote sensing imagery[C]//ICASSP 2024-2024 IEEE International Conference on Acoustics,Speech and Signal Processing.Seoul:IEEE, 2024:5710-5714.
[24]
WU X P,ZHAN C,LAI Y K,et al.IP102:A largescale benchmark dataset for insect pest recognition[C]//2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Long Beach: IEEE, 2020:8779-8788.
[25]
XIE C J,WANG R J,ZHANG J,et al.Multi-level learning features for automatic classification of field crop pests[J].Computers and Electronics in Agriculture,2018,152:233-241.
[26]
HOWARD A, SANDLER M, CHEN B, et al. Searching for MobileNetV3[C]//2019 IEEE/CVF International Conference on Computer Vision. Seoul:IEEE,2019:1314-1324.
[27]
WANG C,ZHANG J R,HE J,et al.A two-stream network with complementary feature fusion for pest image classification[J].Engineering Applications of Artificial Intelligence,2023,124:106563.
[28]
LIU Z,MAO H Z,WU C Y,et al.A ConvNet for the 2020s[C]//2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition.New Orleans:IEEE,2022:11966-11976.
[29]
CHEN J R,KAO S H,HE H,et al.Run,don't walk:Chasing higher FLOPS for faster neural networks[C]//2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition.Vancouver:IEEE, 2023:12021-12031.
[30]
LIU Z,LIN Y T,CAO Y,et al.Swin transformer:Hierarchical vision transformer using shifted windows[C]//2021 IEEE/CVF International Conference on Computer Vision.Montreal:IEEE,2022:9992-10002.
[31]
SU Z B,LUO J Q,WANG Y,et al.Comparative study of ensemble models of deep convolutional neural networks for crop pests classification[J].Multimedia Tools and Applications,2023,82(19):29567-29586.
[32]
ZHU L H,LIAO B C,ZHANG Q,et al.Vision mamba:Efficient visual representation learning with bidirectional state space model[EB/OL].[2024-12-08].https://doi.org/10.48550/arXiv.2401.09417.
[33]
SELVARAJU R R,COGSWELL M,DAS A,et al. Grad-CAM:Visual explanations from deep networks via gradient-based localization[J].International Journal of Computer Vision,2020,128(2):336-359.
2025年第30卷第6期
PDF下载
102
50
引用本文
BibTeX
文章信息
doi: 10.13682/j.issn.2095-6533.2025.06.013
  • 接收时间:2024-12-20
  • 首发时间:2026-04-16
  • 出版时间:2025-11-10
补充材料
相关文章
文章信息
作者
出版历史
  • 收稿日期:2024-12-20
基金
作者信息
    1.西安邮电大学通信与信息工程学院,陕西西安 710121
    2.西安市公共安全图像处理技术及应用重点实验室,陕西西安 710121
参考文献
分享链接
https://castjournals.cast.org.cn/joweb/xayddxxb/CN/10.13682/j.issn.2095-6533.2025.06.013
分享至
全文二维码

扫描看全文

引用本文
BibTeX
本文的引用情况
2种不同金属材料的力学参数

Family
属数
Number of
genus
种数
Number of
species
占总种数比例
Percentage of
total species (%)

Genus
种数
Number of
species
占总种数比例
Percentage of total
species (%)
鹅膏菌科Amanitaceae 2 11 5.26 鹅膏菌属 Amanita 10 4.78
小菇科 Mycenaceae 2 12 5.74 丝盖伞属 Inocybe 5 2.39
多孔菌科 Polyporaceae 8 14 6.70 蜡蘑属 Laccaria 5 2.39
红菇科 Russulaceae 3 23 11.00 小皮伞属 Marasmius 6 2.87
小菇属 Mycena 11 5.26
光柄菇属 Pluteus 5 2.39
红菇属 Russula 17 8.13
栓菌属 Trametes 5 2.39
关闭全屏