Article(id=1249044017460224402, tenantId=1146029695717560320, journalId=1249024232475115590, issueId=1249044006114628363, articleNumber=null, orderNo=null, doi=10.11834/jig.240547, pmid=null, cstr=null, oa=null, hot=null, price=null, onlineType=0, articleFormat=0, articleType=null, articleTypeStr=null, receivedDate=1727107200000, receivedDateStr=2024-09-24, revisedDate=1747670400000, revisedDateStr=2025-05-20, acceptedDate=null, acceptedDateStr=null, onlineDate=1775724899878, onlineDateStr=2026-04-09, pubDate=1765814400000, pubDateStr=2025-12-16, doiRegisterDate=null, doiRegisterDateStr=null, onlineIssueDate=1775724899878, onlineIssueDateStr=2026-04-09, onlineJustAcceptDate=null, onlineJustAcceptDateStr=null, onlineFirstDate=null, onlineFirstDateStr=null, sourceXml=null, magXml=null, createTime=1775724899878, creator=13041195026, updateTime=1775724899878, updator=13041195026, issue=Issue{id=1249044006114628363, tenantId=1146029695717560320, journalId=1249024232475115590, year='2025', volume='30', issue='12', pageStart='3707', pageEnd='3968', issueExtLink='null', onlineDate='null', pubDate='null', beforeIssueId=null, nextIssueId=null, price=null, status=1, issueComplete=1, articleOrder=1, issueType=1, specialIssue=null, createTime=1775724897161, creator=13041195026, updateTime=1775726353303, updator=13041195026, preIssue=null, nextIssue=null, ext={EN=IssueExt(id=1249050113662984471, tenantId=1146029695717560320, journalId=1249024232475115590, issueId=1249044006114628363, language=EN, specialIssueTitle=, coverIllustrator=null, specialIssueEditor=, specialIssueAbout=), CN=IssueExt(id=1249050113667178776, tenantId=1146029695717560320, journalId=1249024232475115590, issueId=1249044006114628363, language=CN, specialIssueTitle=, coverIllustrator=null, specialIssueEditor=, specialIssueAbout=)}, issueFiles=null}, startPage=3824, endPage=3837, ext={EN=ArticleExt(id=1249044021600002469, articleId=1249044017460224402, tenantId=1146029695717560320, journalId=1249024232475115590, language=EN, title=Lightweight pyramid cross-attention network for orbital image defect detection, columnId=1249044019595125147, journalTitle=Journal of Image and Graphics, columnName=Image Analysis and Recognition, runingTitle=null, highlight=null, articleAbstract=
Objective Most existing vision-based rail defect detection methods face challenges such as high parameter counts, computational complexity, slow detection speeds, and limited accuracy. Aiming to overcome these limitations, this paper introduces a lightweight pyramid cross-attention network (LPCANet) for orbital image defect detection using RGB images and depth images.
Method LPCANet adopts MobileNetv2 as its backbone network to extract multiscale feature maps from RGB images. Simultaneously, a lightweight pyramid module (LPM) is employed to extract similarly-sized feature maps from depth images. Each stage of the LPM comprises a sequence of operations including max pooling, a 3 × 3 convolutional layer, batch normalization, and ReLU activation, enabling efficient extraction of features from depth images. By leveraging deep learning, RGB-D technology, and salient object detection, LPCANet efficiently extracts multiscale feature representations from RGB and depth data. The LPM handles depth image features, while the backbone captures detailed pyramid features from RGB images. Subsequently, a cross-attention mechanism (CAM) is applied to integrate the feature maps from both modalities, enhancing the network’s focus on relevant defect regions. Additionally, a spatial feature extractor (SFE) is introduced to further boost defect detection performance. Finally, a “pixel shuffle” operation is used to restore the output to the original image resolution.
Result The proposed scheme was computationally evaluated using the PyTorch library in an environment equipped with an NVIDIA 3090 GPU, alongside several benchmark models for comparison. For the evaluation of LPCANet, three publicly available unsupervised RGB-D rail datasets were used: NEU-RSDDS-AUG, RSDD-TYPE1, and RSDD-TYPE2. Experimental results on the NEU-RSDDS-AUG dataset indicate that LPCANet achieves excellent efficiency, with 9.90 million parameters, a computational complexity of 2.50 G, a model size of 37.95 MB, and a running speed of 162.60 frames per second. Compared to 18 existing rail defect detection schemes, LPCANet exhibits superior lightness in performance. In particular, when compared against CSEPNet, the current best-performing model, LPCANet achieves improvements across several evaluation metrics: +1.48% in Sα, +0.86% in intersection over union (IOU), +0.14% in Fβmax, +0.03% in mean average precision (mAP), and +1.77% in mean absolute error (MAE). An ablation study was conducted on four upsampling methods (interpolation, transposed convolution, patch merging, and “pixel shuffle”) to evaluate their effectiveness within the LPCANet framework. Among these, the “pixel shuffle” method demonstrated clear advantages and was found to be the most suitable for the LPCANet model. Further ablation studies were conducted on four different components (backbone network, LPM, SFE, and CAM). The results indicate that CAM and SFE notably enhance the detection performance of LPCANet. An in-depth analysis of various backbone networks confirmed that LPCANet model is not only compatible with existing backbone networks but also consistently achieves superior detection results. Aiming to evaluate the model’s generalization capability beyond rail datasets, experiments were also conducted on three non-rail defect datasets: DAGM2007, MT, and Kolektor-SDD2. The results show that LPCANet delivers improved performance across three key metrics: mAP, MAE, and IOU, demonstrating its potential for general-purpose defect detection tasks.
Conclusion The LPCANet model proposed in this study effectively combines the advantages of traditional and deep learning approaches, demonstrating strong practical value in the field of rail defect image processing. In the future, this scheme will focus on further reducing the model size to achieve rapid detection speeds while ensuring further improvements in performance quality.
, correspAuthors=Sixu Guo, authorNote=null, correspAuthorsNote=null, copyrightStatement=null, copyrightOwner=null, extLink=null, articleAbsUrl=null, sourceXml=null, magXml=null, pdfUrl=null, pdf=null, pdfFileSize=null, pdfExtLink=null, richHtmlUrl=null, mobilePdfUrl=null, reviewReport=null, pdfFirstPage=null, abstractGraph=null, abstractGraphContent=null, abstractVideo=null, citation=null, cebUrl=null, magXmlContent=null, mapNumber=null, authorCompany=null, fund=null, authors=null, authorsList=Sixu Guo, Huizheng Geng, Li Su, Shen He, Xinyue Zhang), CN=ArticleExt(id=1249044038108783323, articleId=1249044017460224402, tenantId=1146029695717560320, journalId=1249024232475115590, language=CN, title=用于轨道图像缺陷检测的轻量级金字塔交叉注意力网络, columnId=1249044019838394781, journalTitle=中国图象图形学报, columnName=图像分析和识别, runingTitle=null, highlight=null, articleAbstract=
目的 基于视觉的轨道缺陷检测方法大多存在高参数、计算复杂、检测速度慢以及精度低等缺陷,为了解决上述问题,提出一种使用RGB和深度图像进行轨道缺陷检测的轻量级金字塔交叉注意网络(lightweight pyramid cross-attention network for orbital image defect,LPCANet)。
方法 LPCANet模型利用深度学习、RGB-D与显著性目标检测等技术,设计一种轻量级金字塔模块,能够从深度图像中提取多尺度特征图,而骨干模块从RGB图像中捕获金字塔特征细节;然后,将交叉注意力模块(cross-attention mechanism,CAM)应用于两种类型的特征映射;其次,利用空间特征提取子(spatial feature extractor,SFE)提高缺陷检测性能;最后,应用像素洗牌(pixel shuffle)操作恢复原始图像的大小。
结果 在NEU-RSDDS-AUG、RSDD-TYPE1和RSDD-TYPE2 3种公开无服务RGB-D轨道数据集进行实验。结果表明,提出方法在NEU-RSDDS-AUG数据集的运行参数为9.90 M,计算量为2.50 G,模型大小为37.95 MB,运行速度为162.60帧/s,相比现有18种轨道缺陷检测方法,更为轻量化;与当前性能最优的CSEPNet相比,S-度量、交并比、最大F-度量、平均精度和平均绝对误差指标分别提高1.48%、0.86%、0.14%、0.03% 和1.77%;在消融实验中,像素洗牌方法表现出明显优势,更适合LPCANet模型。深入分析各种骨干网络性能,实验表明,LPCANet模型不仅适用现有各种骨干网络,而且检测结果更加优秀。在非轨道数据集DAGM2007、MT和Kolektor-SDD2上进行实验,LPCANet模型在mAP、MAE与IOU指标均有提高,具备一定的泛用性。
结论 提出的LPCANet模型综合了传统模型和深度学习模型的优点,在轨道缺陷图像检测领域具备良好的实际应用价值。
, correspAuthors=郭斯栩, authorNote=null, correspAuthorsNote=
, copyrightStatement=null, copyrightOwner=null, extLink=null, articleAbsUrl=null, sourceXml=cfyAfqmEECLQTaF2tYzJbQ==, magXml=eEQsGx3MfCewPnIeRbo5UQ==, pdfUrl=null, pdf=UTnacIBKTqTOG4tjqGbjig==, pdfFileSize=4532308, pdfExtLink=null, richHtmlUrl=null, mobilePdfUrl=null, reviewReport=null, pdfFirstPage=null, abstractGraph=jm8P4a3RkgI4pduj0pdb7A==, abstractGraphContent=null, abstractVideo=null, citation=null, cebUrl=null, magXmlContent=SJJ2oHzdlJzYhtgk9bOEAQ==, mapNumber=null, authorCompany=null, fund=null, authors=
粟栗,男,教授级高级工程师,主要研究方向为AI数据安全。E-mail: suli@chinamobie
, authorsList=郭斯栩, 耿慧拯, 粟栗, 何申, 张鑫月)}, authors=[Author(id=1249044040428233476, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, orderNo=0, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=guosixu@chinamobile.com, emailSecond=null, emailThird=null, correspondingAuthor=1, authorType=1, ext={EN=AuthorExt(id=1249044040616977166, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, authorId=1249044040428233476, language=EN, stringName=Sixu Guo, firstName=Sixu, middleName=null, lastName=Guo, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=
1, 2, *, address=
1China Mobile Research Institute, Beijing100032, China
2Vehicle Engineering College, Dalian Jiaotong University, Dalian116021, China, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1249044040734417685, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, authorId=1249044040428233476, language=CN, stringName=郭斯栩, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=
1, 2, *, address=
1中国移动通信有限公司研究院,北京100032
2大连交通大学中车学院,大连116021, bio={"content":"
郭斯栩,男,工程师,主要研究方向为图像处理、深度学习和密码学。E-mail: guosixu@chinamobile.com
"}, bioImg=null, bioContent=
郭斯栩,男,工程师,主要研究方向为图像处理、深度学习和密码学。E-mail: guosixu@chinamobile.com
, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1249044038662431472, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, xref=1, ext=[AuthorCompanyExt(id=1249044038675014385, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, companyId=1249044038662431472, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=
1China Mobile Research Institute, Beijing100032, China), AuthorCompanyExt(id=1249044038695985906, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, companyId=1249044038662431472, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=
1中国移动通信有限公司研究院,北京100032)]), AuthorCompany(id=1249044040075911925, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, xref=2, ext=[AuthorCompanyExt(id=1249044040143020794, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, companyId=1249044040075911925, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=
2Vehicle Engineering College, Dalian Jiaotong University, Dalian116021, China), AuthorCompanyExt(id=1249044040168186619, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, companyId=1249044040075911925, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=
2大连交通大学中车学院,大连116021)])]), Author(id=1249044040847663898, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, orderNo=1, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=genghuizheng@chinamobile.com, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1249044041023824676, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, authorId=1249044040847663898, language=EN, stringName=Huizheng Geng, firstName=Huizheng, middleName=null, lastName=Geng, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=
1, address=
1China Mobile Research Institute, Beijing100032, China, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1249044041149653805, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, authorId=1249044040847663898, language=CN, stringName=耿慧拯, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=
1, address=
1中国移动通信有限公司研究院,北京100032, bio={"content":"
耿慧拯,男,副高级工程师,主要研究方向为AI数据安全、图像处理。E-mail: genghuizheng@chinamobile.com
"}, bioImg=null, bioContent=
耿慧拯,男,副高级工程师,主要研究方向为AI数据安全、图像处理。E-mail: genghuizheng@chinamobile.com
, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1249044038662431472, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, xref=1, ext=[AuthorCompanyExt(id=1249044038675014385, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, companyId=1249044038662431472, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=
1China Mobile Research Institute, Beijing100032, China), AuthorCompanyExt(id=1249044038695985906, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, companyId=1249044038662431472, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=
1中国移动通信有限公司研究院,北京100032)])]), Author(id=1249044041292260148, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, orderNo=2, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=null, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1249044041476809533, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, authorId=1249044041292260148, language=EN, stringName=Li Su, firstName=Li, middleName=null, lastName=Su, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=
1, address=
1China Mobile Research Institute, Beijing100032, China, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1249044041627804479, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, authorId=1249044041292260148, language=CN, stringName=粟栗, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=
1, address=
1中国移动通信有限公司研究院,北京100032, bio={"content":"
粟栗,男,教授级高级工程师,主要研究方向为AI数据安全。E-mail: suli@chinamobie
"}, bioImg=null, bioContent=
粟栗,男,教授级高级工程师,主要研究方向为AI数据安全。E-mail: suli@chinamobie
, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1249044038662431472, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, xref=1, ext=[AuthorCompanyExt(id=1249044038675014385, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, companyId=1249044038662431472, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=
1China Mobile Research Institute, Beijing100032, China), AuthorCompanyExt(id=1249044038695985906, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, companyId=1249044038662431472, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=
1中国移动通信有限公司研究院,北京100032)])]), Author(id=1249044041745244995, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, orderNo=3, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=heshen@chinamobile.com, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1249044041879462728, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, authorId=1249044041745244995, language=EN, stringName=Shen He, firstName=Shen, middleName=null, lastName=He, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=
1, address=
1China Mobile Research Institute, Beijing100032, China, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1249044042030457677, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, authorId=1249044041745244995, language=CN, stringName=何申, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=
1, address=
1中国移动通信有限公司研究院,北京100032, bio={"content":"
何申,男,教授级高级工程师,主要研究方向为AI数据安全。E-mail: heshen@chinamobile.com
"}, bioImg=null, bioContent=
何申,男,教授级高级工程师,主要研究方向为AI数据安全。E-mail: heshen@chinamobile.com
, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1249044038662431472, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, xref=1, ext=[AuthorCompanyExt(id=1249044038675014385, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, companyId=1249044038662431472, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=
1China Mobile Research Institute, Beijing100032, China), AuthorCompanyExt(id=1249044038695985906, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, companyId=1249044038662431472, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=
1中国移动通信有限公司研究院,北京100032)])]), Author(id=1249044042185646929, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, orderNo=4, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=zhangxinyue@chinamobile.com, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1249044042307281751, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, authorId=1249044042185646929, language=EN, stringName=Xinyue Zhang, firstName=Xinyue, middleName=null, lastName=Zhang, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=
1, address=
1China Mobile Research Institute, Beijing100032, China, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1249044042382779228, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, authorId=1249044042185646929, language=CN, stringName=张鑫月, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=
1, address=
1中国移动通信有限公司研究院,北京100032, bio={"content":"
张鑫月,女,工程师,主要研究方向为图像处理和AI数据安全。E-mail: zhangxinyue@chinamobile.com
"}, bioImg=null, bioContent=
张鑫月,女,工程师,主要研究方向为图像处理和AI数据安全。E-mail: zhangxinyue@chinamobile.com
, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1249044038662431472, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, xref=1, ext=[AuthorCompanyExt(id=1249044038675014385, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, companyId=1249044038662431472, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=
1China Mobile Research Institute, Beijing100032, China), AuthorCompanyExt(id=1249044038695985906, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, companyId=1249044038662431472, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=
1中国移动通信有限公司研究院,北京100032)])])], keywords=[Keyword(id=1249044042605077351, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, language=EN, orderNo=1, keyword=track defect detection), Keyword(id=1249044042827375471, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, language=EN, orderNo=2, keyword=deep learning), Keyword(id=1249044042965787510, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, language=EN, orderNo=3, keyword=RGB-D), Keyword(id=1249044044609954687, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, language=EN, orderNo=4, keyword=significant object detection), Keyword(id=1249044044798698373, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, language=EN, orderNo=5, keyword=spatial feature extractor(SFE)), Keyword(id=1249044045004219280, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, language=CN, orderNo=1, keyword=轨道缺陷检测), Keyword(id=1249044045230711705, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, language=CN, orderNo=2, keyword=深度学习), Keyword(id=1249044045322986399, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, language=CN, orderNo=3, keyword=RGB-D), Keyword(id=1249044045448815523, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, language=CN, orderNo=4, keyword=显著性目标检测), Keyword(id=1249044045608199086, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, language=CN, orderNo=5, keyword=空间特征提取子(SFE))], refs=[Reference(id=1249044051199205531, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, doi=null, pmid=null, pmcid=null, year=2024, volume=null, issue=null, pageStart=164, pageEnd=177, url=null, language=null, rfNumber=null, rfOrder=0, authorNames=Aqeel M, Sharifi S, Cristani M, Setti F, journalName=null, refType=null, unstructuredReference=
Aqeel M,
Sharifi S,
Cristani M and
Setti F.
2024. Self-supervised learning for robust surface defect detection//Proceedings of the 5th International Conference on Deep Learning Theory and Applications. Dijon, France: Springer:164-177 [DOI:
10.1007/978-3-031-66705-3_11], articleTitle=Self-supervised learning for robust surface defect detection, refAbstract=null), Reference(id=1249044051312451748, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, doi=null, pmid=null, pmcid=null, year=2025, volume=null, issue=null, pageStart=173, pageEnd=183, url=null, language=null, rfNumber=null, rfOrder=1, authorNames=Aqeel M, Sharifi S, Cristani M, Setti F, journalName=null, refType=null, unstructuredReference=
Aqeel M,
Sharifi S,
Cristani M and
Setti F.
2025. Self-supervised iterative refinement for anomaly detection in industrial quality control//Proceedings of the 20th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications. Porto, Portugal: [s.n.]:173-183, articleTitle=Self-supervised iterative refinement for anomaly detection in industrial quality control, refAbstract=null), Reference(id=1249044051442475181, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, doi=null, pmid=null, pmcid=null, year=2023, volume=null, issue=null, pageStart=406, pageEnd=416, url=null, language=null, rfNumber=null, rfOrder=2, authorNames=Cong R M, Liu H Y, Zhang C, Zhang W, Zheng F, Song R, Kwong S, journalName=null, refType=null, unstructuredReference=
Cong R M,
Liu H Y,
Zhang C,
Zhang W,
Zheng F,
Song R and
Kwong S.
2023. Point-aware interaction and CNN-induced refinement network for RGB-D salient object detection//Proceedings of the 31st ACM International Conference on Multimedia. Ottawa, Canada: ACM:406-416 [DOI:
10.1145/3581783.3611982], articleTitle=Point-aware interaction and CNN-induced refinement network for RGB-D salient object detection, refAbstract=null), Reference(id=1249044051543138484, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, doi=null, pmid=null, pmcid=null, year=2022, volume=199, issue=null, pageStart=null, pageEnd=111429, url=null, language=null, rfNumber=null, rfOrder=3, authorNames=Ding T, Li G Y, Liu Z, Wang Y K, journalName=Measurement, refType=null, unstructuredReference=
Ding T,
Li G Y,
Liu Z and
Wang Y K.
2022. Cross-scale edge purification network for salient object detection of steel defect images.
Measurement,
199: #111429 [DOI:
10.1016/j.measurement.2022.111429], articleTitle=Cross-scale edge purification network for salient object detection of steel defect images, refAbstract=null), Reference(id=1249044051656384701, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, doi=null, pmid=null, pmcid=null, year=2020, volume=null, issue=null, pageStart=476, pageEnd=480, url=null, language=null, rfNumber=null, rfOrder=4, authorNames=Dong K, Zhou C J, Ruan Y H, Li Y Z, journalName=null, refType=null, unstructuredReference=
Dong K,
Zhou C J,
Ruan Y H and
Li Y Z.
2020. MobileNetV2 model for image classification//Proceedings of the 2nd International Conference on Information Technology and Computer Application. Guangzhou, China: IEEE:476-480 [DOI:
10.1109/ITCA52113.2020.00106], articleTitle=MobileNetV2 model for image classification, refAbstract=null), Reference(id=1249044051782213829, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, doi=null, pmid=null, pmcid=null, year=2022, volume=null, issue=null, pageStart=12124, pageEnd=12134, url=null, language=null, rfNumber=null, rfOrder=5, authorNames=Dong X Y, Bao J M, Chen D D, Zhang W M, Yu N H, Yuan L, Chen D, Guo B N, journalName=null, refType=null, unstructuredReference=
Dong X Y,
Bao J M,
Chen D D,
Zhang W M,
Yu N H,
Yuan L,
Chen D and
Guo B N.
2022. CSWin transformer: a general vision transformer backbone with cross-shaped windows//Proceedings of 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition. New Orleans, USA: IEEE:12124-12134 [DOI:
10.1109/CVPR52688.2022.01181], articleTitle=CSWin transformer: a general vision transformer backbone with cross-shaped windows, refAbstract=null), Reference(id=1249044051933208783, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, doi=null, pmid=null, pmcid=null, year=2020, volume=null, issue=null, pageStart=2777, pageEnd=2787, url=null, language=null, rfNumber=null, rfOrder=6, authorNames=Fan D P, Ji G P, Sun G L, Cheng M M, Shen J B, Shao L, journalName=null, refType=null, unstructuredReference=
Fan D P,
Ji G P,
Sun G L,
Cheng M M,
Shen J B and
Shao L.
2020. Camouflaged object detection//Proceedings of 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Seattle, USA: IEEE:2777-2787 [DOI:
10.1109/CVPR42600.2020.00285], articleTitle=Camouflaged object detection, refAbstract=null), Reference(id=1249044053506072789, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, doi=null, pmid=null, pmcid=null, year=2020, volume=69, issue=12, pageStart=9709, pageEnd=9719, url=null, language=null, rfNumber=null, rfOrder=7, authorNames=Fan G R, Song K C, Yan Y H, journalName=IEEE Transactions on Instrumentation and Measurement, refType=null, unstructuredReference=
Fan G R,
Song K C and
Yan Y H.
2020. EDRNet: encoder-decoder residual network for salient object detection of strip steel surface defects.
IEEE Transactions on Instrumentation and Measurement,
69(12): 9709-9719 [DOI:
10.1109/TIM.2020.3002277], articleTitle=EDRNet: encoder-decoder residual network for salient object detection of strip steel surface defects, refAbstract=null), Reference(id=1249044053619319005, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, doi=null, pmid=null, pmcid=null, year=2022, volume=62, issue=null, pageStart=753, pageEnd=766, url=null, language=null, rfNumber=null, rfOrder=8, authorNames=Gao Y P, Li X Y, Wang X V, Wang L H, Gao L, journalName=Journal of Manufacturing Systems, refType=null, unstructuredReference=
Gao Y P,
Li X Y,
Wang X V,
Wang L H and
Gao L.
2022. A review on recent advances in vision-based defect recognition towards industrial intelligence.
Journal of Manufacturing Systems,
62: 753-766 [DOI:
10.1016/j.jmsy.2021.05.008], articleTitle=A review on recent advances in vision-based defect recognition towards industrial intelligence, refAbstract=null), Reference(id=1249044053707399395, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, doi=null, pmid=null, pmcid=null, year=2024, volume=20, issue=3, pageStart=4571, pageEnd=4581, url=null, language=null, rfNumber=null, rfOrder=9, authorNames=Huang L M, Gong A J, journalName=IEEE Transactions on Industrial Informatics, refType=null, unstructuredReference=
Huang L M and
Gong A J.
2024. Surface defect detection for no-service rails with skeleton-aware accurate and fast network.
IEEE Transactions on Industrial Informatics,
20(3): 4571-4581 [DOI:
10.1109/TII.2023.3327341], articleTitle=Surface defect detection for no-service rails with skeleton-aware accurate and fast network, refAbstract=null), Reference(id=1249044053824839915, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, doi=null, pmid=null, pmcid=null, year=2023, volume=60, issue=4, pageStart=903, pageEnd=915, url=null, language=null, rfNumber=null, rfOrder=10, authorNames=Jiang Z T, Zhai F S, Qian Y, Xiao Y, Zhang S Q, journalName=Journal of Computer Research and Development, refType=null, unstructuredReference=
Jiang Z T,
Zhai F S,
Qian Y,
Xiao Y and
Zhang S Q.
2023. Low illumination object detection combined with feature enhancement and multi-scale receptive field.
Journal of Computer Research and Development,
60(4): 903-915, articleTitle=Low illumination object detection combined with feature enhancement and multi-scale receptive field, refAbstract=null), Reference(id=1249044053912920305, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, doi=null, pmid=null, pmcid=null, year=2023, volume=60, issue=4, pageStart=903, pageEnd=915, url=null, language=null, rfNumber=null, rfOrder=11, authorNames=江泽涛, 翟丰硕, 钱艺, 肖芸, 张少钦, journalName=计算机研究与发展, refType=null, unstructuredReference=江泽涛, 翟丰硕, 钱艺, 肖芸, 张少钦.
2023. 结合特征增强和多尺度感受野的低照度目标检测.
计算机研究与发展,
60(4): 903-915 [DOI:
10.7544/issn1000-1239.202111092], articleTitle=结合特征增强和多尺度感受野的低照度目标检测, refAbstract=null), Reference(id=1249044053980029176, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, doi=null, pmid=null, pmcid=null, year=2022, volume=32, issue=11, pageStart=7632, pageEnd=7645, url=null, language=null, rfNumber=null, rfOrder=12, authorNames=Jin X, Yi K, Xu J, journalName=IEEE Transactions on Circuits and Systems for Video Technology, refType=null, unstructuredReference=
Jin X,
Yi K and
Xu J.
2022. MoADNet: mobile asymmetric dual-stream networks for real-time and lightweight RGB-D salient object detection.
IEEE Transactions on Circuits and Systems for Video Technology,
32(11): 7632-7645 [DOI:
10.1109/TCSVT.2022.3180274], articleTitle=MoADNet: mobile asymmetric dual-stream networks for real-time and lightweight RGB-D salient object detection, refAbstract=null), Reference(id=1249044054047138045, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, doi=null, pmid=null, pmcid=null, year=2022, volume=null, issue=null, pageStart=2940, pageEnd=2950, url=null, language=null, rfNumber=null, rfOrder=13, authorNames=Ke Y Y, Tsubono T, journalName=null, refType=null, unstructuredReference=
Ke Y Y and
Tsubono T.
2022. Recursive contour-saliency blending network for accurate salient object detection//Proceedings of 2022 IEEE/CVF Winter Conference on Applications of Computer Vision. Waikoloa, USA: IEEE:2940-2950 [DOI:
10.1109/WACV51458.2022.00143], articleTitle=Recursive contour-saliency blending network for accurate salient object detection, refAbstract=null), Reference(id=1249044054126829825, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, doi=null, pmid=null, pmcid=null, year=2021, volume=30, issue=null, pageStart=3528, pageEnd=3542, url=null, language=null, rfNumber=null, rfOrder=14, authorNames=Li G Y, Liu Z, Chen M Y, Bai Z, Lin W S, Ling H B, journalName=IEEE Transactions on Image Processing, refType=null, unstructuredReference=
Li G Y,
Liu Z,
Chen M Y,
Bai Z,
Lin W S and
Ling H B.
2021. Hierarchical alternate interaction network for RGB-D salient object detection.
IEEE Transactions on Image Processing,
30: 3528-3542 [DOI:
10.1109/TIP.2021.3062689], articleTitle=Hierarchical alternate interaction network for RGB-D salient object detection, refAbstract=null), Reference(id=1249044054202327302, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, doi=null, pmid=null, pmcid=null, year=2004, volume=9, issue=3, pageStart=318, pageEnd=322, url=null, language=null, rfNumber=null, rfOrder=15, authorNames=Li M, Ma C, Yang X Q, journalName=Journal of Image and Graphics, refType=null, unstructuredReference=
Li M,
Ma C and
Yang X Q.
2004. Detection of texture defects for machined surface.
Journal of Image and Graphics,
9(3): 318-322, articleTitle=Detection of texture defects for machined surface, refAbstract=null), Reference(id=1249044054277824779, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, doi=null, pmid=null, pmcid=null, year=2004, volume=9, issue=3, pageStart=318, pageEnd=322, url=null, language=null, rfNumber=null, rfOrder=16, authorNames=黎明, 马聪, 杨小芹, journalName=中国图象图形学报, refType=null, unstructuredReference=黎明, 马聪, 杨小芹.
2004. 机械加工零件表面纹理缺陷检测.
中国图象图形学报,
9(3): 318-322 [DOI:
10.3969/j.issn.1006-8961.2004.03.011], articleTitle=机械加工零件表面纹理缺陷检测, refAbstract=null), Reference(id=1249044054512705809, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, doi=null, pmid=null, pmcid=null, year=2020, volume=null, issue=null, pageStart=13756, pageEnd=13765, url=null, language=null, rfNumber=null, rfOrder=17, authorNames=Liu N, Zhang N, Han J W, journalName=null, refType=null, unstructuredReference=
Liu N,
Zhang N and
Han J W.
2020. Learning selective self-mutual attention for RGB-D saliency detection//Proceedings of 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Seattle, USA: IEEE:13756-13765 [DOI:
10.1109/CVPR42600.2020.01377], articleTitle=Learning selective self-mutual attention for RGB-D saliency detection, refAbstract=null), Reference(id=1249044054600786197, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, doi=null, pmid=null, pmcid=null, year=2022a, volume=null, issue=null, pageStart=12009, pageEnd=12019, url=null, language=null, rfNumber=null, rfOrder=18, authorNames=Liu Z, Hu H, Lin Y T, Yao Z L, Xie Y D, Wei Y X, Ning J, Cao Y, Zhang Z, Dong L, Wei F R, Guo B N, journalName=null, refType=null, unstructuredReference=
Liu Z,
Hu H,
Lin Y T,
Yao Z L,
Xie Y D,
Wei Y X,
Ning J,
Cao Y,
Zhang Z,
Dong L,
Wei F R and
Guo B N.
2022a. Swin transformer V2: scaling up capacity and resolution//Proceedings of 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition. New Orleans, USA: IEEE:12009-12019 [DOI:
10.1109/CVPR52688.2022.01170], articleTitle=Swin transformer V2: scaling up capacity and resolution, refAbstract=null), Reference(id=1249044054760169756, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, doi=null, pmid=null, pmcid=null, year=2022b, volume=null, issue=null, pageStart=11976, pageEnd=11986, url=null, language=null, rfNumber=null, rfOrder=19, authorNames=Liu Z, Mao H Z, Wu C Y, Feichtenhofer C, Darrell T, Xie S N, journalName=null, refType=null, unstructuredReference=
Liu Z,
Mao H Z,
Wu C Y,
Feichtenhofer C,
Darrell T and
Xie S N.
2022b. A ConvNet for the 2020s//Proceedings of 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition. New Orleans, USA: IEEE:11976-11986 [DOI:
10.1109/CVPR52688.2022.01167], articleTitle=A ConvNet for the 2020s, refAbstract=null), Reference(id=1249044054848250146, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, doi=null, pmid=null, pmcid=null, year=2023, volume=32, issue=null, pageStart=1026, pageEnd=1038, url=null, language=null, rfNumber=null, rfOrder=20, authorNames=Ma M C, Xia C Q, Xie C X, Chen X W, Li J, journalName=IEEE Transactions on Image Processing, refType=null, unstructuredReference=
Ma M C,
Xia C Q,
Xie C X,
Chen X W and
Li J.
2023. Boosting broader receptive fields for salient object detection.
IEEE Transactions on Image Processing,
32: 1026-1038 [DOI:
10.1109/TIP.2022.3232209], articleTitle=Boosting broader receptive fields for salient object detection, refAbstract=null), Reference(id=1249044054927941927, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, doi=null, pmid=null, pmcid=null, year=2018, volume=5, issue=5, pageStart=12792, pageEnd=12802, url=null, language=null, rfNumber=null, rfOrder=21, authorNames=Manish R, Venkatesh A, Ashok S D, journalName=Materials Today: Proceedings, refType=null, unstructuredReference=
Manish R,
Venkatesh A and
Ashok S D.
2018. Machine vision based image processing techniques for surface finish and defect inspection in a grinding process.
Materials Today: Proceedings,
5(5): 12792-12802 [DOI:
10.1016/j.matpr.2018.02.263], articleTitle=Machine vision based image processing techniques for surface finish and defect inspection in a grinding process, refAbstract=null), Reference(id=1249044055020216620, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, doi=null, pmid=null, pmcid=null, year=2020, volume=null, issue=null, pageStart=10428, pageEnd=10436, url=null, language=null, rfNumber=null, rfOrder=22, authorNames=Radosavovic I, Kosaraju R P, Girshick R, He K M and Dollár P, journalName=null, refType=null, unstructuredReference=
Radosavovic I,
Kosaraju R P,
Girshick R,
He K M and Dollár P.
2020. Designing network design spaces//Proceedings of 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Seattle, USA: IEEE:10428-10436 [DOI:
10.1109/CVPR42600.2020.01044], articleTitle=Designing network design spaces, refAbstract=null), Reference(id=1249044055104102706, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, doi=null, pmid=null, pmcid=null, year=2021, volume=null, issue=null, pageStart=1025, pageEnd=1031, url=null, language=null, rfNumber=null, rfOrder=23, authorNames=Sun Y J, Chen G, Zhou T, Zhang Y, Liu N, journalName=null, refType=null, unstructuredReference=
Sun Y J,
Chen G,
Zhou T,
Zhang Y and
Liu N.
2021. Context-aware cross-level fusion network for camouflaged object detection//Proceedings of the 30th International Joint Conference on Artificial Intelligence. Montreal, Canada: ijcai.org:1025-1031 [DOI:
10.24963/ijcai.2021/142], articleTitle=Context-aware cross-level fusion network for camouflaged object detection, refAbstract=null), Reference(id=1249044055175405878, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, doi=null, pmid=null, pmcid=null, year=2017, volume=22, issue=12, pageStart=1640, pageEnd=1663, url=null, language=null, rfNumber=null, rfOrder=24, authorNames=Tang B, Kong J Y, Wu S Q, journalName=Journal of Image and Graphics, refType=null, unstructuredReference=
Tang B,
Kong J Y and
Wu S Q.
2017. Review of surface defect detection based on machine vision.
Journal of Image and Graphics,
22(12): 1640-1663, articleTitle=Review of surface defect detection based on machine vision, refAbstract=null), Reference(id=1249044055259291964, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, doi=null, pmid=null, pmcid=null, year=2017, volume=22, issue=12, pageStart=1640, pageEnd=1663, url=null, language=null, rfNumber=null, rfOrder=25, authorNames=汤勃, 孔建益, 伍世虔, journalName=中国图象图形学报, refType=null, unstructuredReference=汤勃, 孔建益, 伍世虔.
2017. 机器视觉表面缺陷检测综述.
中国图象图形学报,
22(12): 1640-1663 [DOI:
10.11834/jig.160623], articleTitle=机器视觉表面缺陷检测综述, refAbstract=null), Reference(id=1249044055359955265, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, doi=null, pmid=null, pmcid=null, year=2023, volume=null, issue=null, pageStart=3455, pageEnd=3464, url=null, language=null, rfNumber=null, rfOrder=26, authorNames=Wu Z W, Wang J J, Zhou Z Y, An Z C, Jiang Q P, Demonceaux C, Sun G L, Timofte R, journalName=null, refType=null, unstructuredReference=
Wu Z W,
Wang J J,
Zhou Z Y,
An Z C,
Jiang Q P,
Demonceaux C,
Sun G L and
Timofte R.
2023. Object segmentation by mining cross-modal semantics//Proceedings of the 31st ACM International Conference on Multimedia. Ottawa, Canada: ACM:3455-3464 [DOI:
10.1145/3581783.3611970], articleTitle=Object segmentation by mining cross-modal semantics, refAbstract=null), Reference(id=1249044055448035652, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, doi=null, pmid=null, pmcid=null, year=2023, volume=28, issue=11, pageStart=3497, pageEnd=3508, url=null, language=null, rfNumber=null, rfOrder=27, authorNames=Yan G W, Zhou X J, Jiao R H, He H, journalName=Journal of Image and Graphics, refType=null, unstructuredReference=
Yan G W,
Zhou X J,
Jiao R H and
He H.
2023. Defect detection of tower bolts by fusion of priori information and feature constraints.
Journal of Image and Graphics,
28(11): 3497-3508, articleTitle=Defect detection of tower bolts by fusion of priori information and feature constraints, refAbstract=null), Reference(id=1249044055548698953, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, doi=null, pmid=null, pmcid=null, year=2023, volume=28, issue=11, pageStart=3497, pageEnd=3508, url=null, language=null, rfNumber=null, rfOrder=28, authorNames=阎光伟, 周香君, 焦润海, 何慧, journalName=中国图象图形学报, refType=null, unstructuredReference=阎光伟, 周香君, 焦润海, 何慧.
2023. 融合先验信息和特征约束的杆塔螺栓缺陷检测.
中国图象图形学报,
28(11): 3497-3508 [DOI:
10.11834/jig.221077], articleTitle=融合先验信息和特征约束的杆塔螺栓缺陷检测, refAbstract=null), Reference(id=1249044055624196428, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, doi=null, pmid=null, pmcid=null, year=2022, volume=71, issue=null, pageStart=null, pageEnd=5005714, url=null, language=null, rfNumber=null, rfOrder=29, authorNames=Yang H F, Wang Y Z, Hu J Y, He J T, Yao Z W, Bi Q S, journalName=IEEE Transactions on Instrumentation and Measurement, refType=null, unstructuredReference=
Yang H F,
Wang Y Z,
Hu J Y,
He J T,
Yao Z W and
Bi Q S.
2022. Deep learning and machine vision-based inspection of rail surface defects.
IEEE Transactions on Instrumentation and Measurement,
71: #5005714 [DOI:
10.1109/TIM.2021.3138498], articleTitle=Deep learning and machine vision-based inspection of rail surface defects, refAbstract=null), Reference(id=1249044055703888209, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, doi=null, pmid=null, pmcid=null, year=2017, volume=null, issue=null, pageStart=417, pageEnd=426, url=null, language=null, rfNumber=null, rfOrder=30, authorNames=Yu Z Y, Wu X J, Gu X D, journalName=null, refType=null, unstructuredReference=
Yu Z Y,
Wu X J and
Gu X D.
2017. Fully convolutional networks for surface defect inspection in industrial environment//Proceedings of the 11th International Conference on Computer Vision Systems. Shenzhen, China: Springer:417-426 [DOI:
10.1007/978-3-319-68345-4_37], articleTitle=Fully convolutional networks for surface defect inspection in industrial environment, refAbstract=null), Reference(id=1249044055787774293, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, doi=null, pmid=null, pmcid=null, year=2022, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=null, rfOrder=31, authorNames=Yun Y K, Lin W, journalName=null, refType=null, unstructuredReference=
Yun Y K and
Lin W.
2022. SelfReformer: self-refined network with transformer for salient object detection [EB/OL]. [2024-09-24].
https://arxiv.org/pdf/2205.11283.pdf, articleTitle=SelfReformer: self-refined network with transformer for salient object detection, refAbstract=null), Reference(id=1249044055867466074, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, doi=null, pmid=null, pmcid=null, year=2024, volume=34, issue=3, pageStart=1656, pageEnd=1669, url=null, language=null, rfNumber=null, rfOrder=32, authorNames=Zeng Z H, Liu H J, Chen F L, Tan X H, journalName=IEEE Transactions on Circuits and Systems for Video Technology, refType=null, unstructuredReference=
Zeng Z H,
Liu H J,
Chen F L and
Tan X H.
2024. AirSOD: a lightweight network for RGB-D salient object detection.
IEEE Transactions on Circuits and Systems for Video Technology,
34(3): 1656-1669 [DOI:
10.1109/TCSVT.2023.3295588], articleTitle=AirSOD: a lightweight network for RGB-D salient object detection, refAbstract=null), Reference(id=1249044055951352157, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, doi=null, pmid=null, pmcid=null, year=2022, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=null, rfOrder=33, authorNames=Zhang J, Ding R W, Ban M J, Guo T Y, journalName=null, refType=null, unstructuredReference=
Zhang J,
Ding R W,
Ban M J and
Guo T Y.
2022. FDSNeT: an accurate real-time surface defect segmentation network//Proceedings of 2022 IEEE International Conference on Acoustics, Speech and Signal Processing. Singapore, Singapore: IEEE [DOI:
10.1109/ICASSP43922.2022.9747311], articleTitle=FDSNeT: an accurate real-time surface defect segmentation network, refAbstract=null), Reference(id=1249044056039432543, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, doi=null, pmid=null, pmcid=null, year=2024, volume=229, issue=null, pageStart=null, pageEnd=114398, url=null, language=null, rfNumber=null, rfOrder=34, authorNames=Zhou X, Zhou S H, Zhang Y C, Ren Z H, Jiang Z Y, Luo H F, journalName=Measurement, refType=null, unstructuredReference=
Zhou X,
Zhou S H,
Zhang Y C,
Ren Z H,
Jiang Z Y and
Luo H F.
2024. GDALR: global dual attention and local representations in transformer for surface defect detection.
Measurement,
229: #114398 [DOI:
10.1016/j.measurement.2024.114398], articleTitle=GDALR: global dual attention and local representations in transformer for surface defect detection, refAbstract=null), Reference(id=1249044056114930020, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, doi=null, pmid=null, pmcid=null, year=2021, volume=9, issue=null, pageStart=149465, pageEnd=149476, url=null, language=null, rfNumber=null, rfOrder=35, authorNames=Zhou X F, Fang H, Fei X B, Shi R, Zhang J Y, journalName=IEEE Access, refType=null, unstructuredReference=
Zhou X F,
Fang H,
Fei X B,
Shi R and
Zhang J Y.
2021. Edge-aware multi-level interactive network for salient object detection of strip steel surface defects.
IEEE Access,
9: 149465-149476 [DOI:
10.1109/ACCESS.2021.3124814], articleTitle=Edge-aware multi-level interactive network for salient object detection of strip steel surface defects, refAbstract=null)], funds=null, companyList=[AuthorCompany(id=1249044038662431472, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, xref=1, ext=[AuthorCompanyExt(id=1249044038675014385, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, companyId=1249044038662431472, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=
1China Mobile Research Institute, Beijing100032, China), AuthorCompanyExt(id=1249044038695985906, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, companyId=1249044038662431472, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=
1中国移动通信有限公司研究院,北京100032)]), AuthorCompany(id=1249044040075911925, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, xref=2, ext=[AuthorCompanyExt(id=1249044040143020794, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, companyId=1249044040075911925, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=
2Vehicle Engineering College, Dalian Jiaotong University, Dalian116021, China), AuthorCompanyExt(id=1249044040168186619, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, companyId=1249044040075911925, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=
2大连交通大学中车学院,大连116021)])], figs=[ArticleFig(id=1249044045826302905, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, language=EN, label=Fig.1, caption=
Differences between orbital depth images and normal depth images, figureFileSmall=xyRGiZrpsCDTfqGVLBHtrQ==, figureFileBig=jm8P4a3RkgI4pduj0pdb7A==, tableContent=null), ArticleFig(id=1249044046086349769, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, language=CN, label=图1, caption=
轨道深度图像与普通深度图像的差异, figureFileSmall=xyRGiZrpsCDTfqGVLBHtrQ==, figureFileBig=jm8P4a3RkgI4pduj0pdb7A==, tableContent=null), ArticleFig(id=1249044046526751705, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, language=EN, label=Fig.2, caption=
Process of LPCANet model, figureFileSmall=NXG6SZ81XJ0gCS1mr29RsA==, figureFileBig=vk/jdCOd3XtN/+SmrYnsaw==, tableContent=null), ArticleFig(id=1249044046665163743, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, language=CN, label=图2, caption=
LPCANet模型的流程, figureFileSmall=NXG6SZ81XJ0gCS1mr29RsA==, figureFileBig=vk/jdCOd3XtN/+SmrYnsaw==, tableContent=null), ArticleFig(id=1249044046858101733, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, language=EN, label=Fig.3, caption=
Pixel distribution of LPCANet feature maps after channel compression ((a) stage 1; (b) stage 2), figureFileSmall=bY98T2kIyB3IA3erZIaMJg==, figureFileBig=f+DQY92gcqFHNGjAxwzuwg==, tableContent=null), ArticleFig(id=1249044047130731500, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, language=CN, label=图3, caption=
LPCANet的特征图通道压缩后的像素分布, figureFileSmall=bY98T2kIyB3IA3erZIaMJg==, figureFileBig=f+DQY92gcqFHNGjAxwzuwg==, tableContent=null), ArticleFig(id=1249044047302697972, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, language=EN, label=Fig.4, caption=
SFE mode, figureFileSmall=hjeaE+mt/4mFNrvhVtWj3w==, figureFileBig=TX9tVTdiDxI0KLNOg4CASg==, tableContent=null), ArticleFig(id=1249044047495635963, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, language=CN, label=图4, caption=
空间特征提取子, figureFileSmall=hjeaE+mt/4mFNrvhVtWj3w==, figureFileBig=TX9tVTdiDxI0KLNOg4CASg==, tableContent=null), ArticleFig(id=1249044049081081860, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, language=EN, label=Fig.5, caption=
Comparison of computational load between LPCANet and state-of-the-art models, figureFileSmall=1r1VuDCrjCREB/ytSBn7Fg==, figureFileBig=9RtiNBfKA3YQp8TSmH8LFA==, tableContent=null), ArticleFig(id=1249044049194328074, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, language=CN, label=图5, caption=
LPCANet与先进模型之间的计算负载比较, figureFileSmall=1r1VuDCrjCREB/ytSBn7Fg==, figureFileBig=9RtiNBfKA3YQp8TSmH8LFA==, tableContent=null), ArticleFig(id=1249044049278214162, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, language=EN, label=Fig. 6, caption=
Comparison of different sampling methods, figureFileSmall=mRLQWIHZbz8FoZYP7Zr40A==, figureFileBig=YX/jgnSk4+AJ1Z4bSGDgRg==, tableContent=null), ArticleFig(id=1249044049399848987, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, language=CN, label=图6, caption=
不同采样方法的比较, figureFileSmall=mRLQWIHZbz8FoZYP7Zr40A==, figureFileBig=YX/jgnSk4+AJ1Z4bSGDgRg==, tableContent=null), ArticleFig(id=1249044049508900897, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, language=EN, label=Fig.7, caption=
Influence of LPM channel dimension, figureFileSmall=glg7kDW2+kd6QT/VGWBHCg==, figureFileBig=vF6nzXho58lgGrULs0ssTg==, tableContent=null), ArticleFig(id=1249044049592786983, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, language=CN, label=图7, caption=
LPM通道维度的影响, figureFileSmall=glg7kDW2+kd6QT/VGWBHCg==, figureFileBig=vF6nzXho58lgGrULs0ssTg==, tableContent=null), ArticleFig(id=1249044049689255981, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, language=EN, label=Fig.8, caption=
Influence of different components on segmentation mask, figureFileSmall=0ABxRMxPZOxqZq9FAP910g==, figureFileBig=p7IJ0KRxEXLHibbrOOWtog==, tableContent=null), ArticleFig(id=1249044049785724981, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, language=CN, label=图8, caption=
不同组件对分割掩码的影响, figureFileSmall=0ABxRMxPZOxqZq9FAP910g==, figureFileBig=p7IJ0KRxEXLHibbrOOWtog==, tableContent=null), ArticleFig(id=1249044049877999676, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, language=EN, label=Fig.9, caption=
Comparison of PR and ROC curves between various backbone networks, top-level SOD, SOD-D,, figureFileSmall=nVaieFNwsSs7NjCtV0wHWg==, figureFileBig=BbLH1MJYzwmM7TblGWIYFw==, tableContent=null), ArticleFig(id=1249044049961885761, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, language=CN, label=图9, caption=
各种骨干网络、顶级SOD、SOD-D、缺陷检测模型与本文所提LPCANet之间的精确率—召回率曲线和ROC曲线比较defect detection models, and the LPCANet proposed in this scheme ((a) PR curves; (b) ROC curves)
, figureFileSmall=nVaieFNwsSs7NjCtV0wHWg==, figureFileBig=BbLH1MJYzwmM7TblGWIYFw==, tableContent=null), ArticleFig(id=1249044050054160456, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, language=EN, label=Tab.1, caption=
Quantitative comparison between proposed model and 18 other methods on the NEU-RSDDS-AUG dataset
, figureFileSmall=null, figureFileBig=null, tableContent=
| 方法 | mAP | IOU | MAE |  |  |  |
|---|
| HAINet | 92.45 | 81.50 | 7.51 | 87.70 | 91.89 | 82.96 |
| CONet | 86.48 | 71.91 | 13.11 | 79.21 | 84.32 | 75.19 |
| XMSNet | 91.31 | 80.11 | 9.06 | 84.80 | 89.47 | 81.10 |
| PICRNet | 92.25 | 81.75 | 7.46 | 87.70 | 91.41 | 83.46 |
| TriTransNet | 88.36 | 80.11 | 6.90 | 87.93 | 92.47 | 83.28 |
| MoADNet | 90.19 | 80.97 | 7.20 | 88.14 | 91.81 | 83.13 |
| C2FNet | 91.05 | 82.97 | 6.52 | 88.24 | 92.09 | 84.37 |
| PGNet | 93.18 | 81.94 | 7.82 | 87.05 | 92.21 | 83.80 |
| SRNet | 91.75 | 79.20 | 9.27 | 85.64 | 90.55 | 81.95 |
| SINet | 94.02 | 82.02 | 8.15 | 88.34 | 9.21 | 84.01 |
| BBRFNet | 93.73 | 81.71 | 7.79 | 88.31 | 91.83 | 83.28 |
| RCSBNet | 90.30 | 81.29 | 6.88 | 88.14 | 92.17 | 83.71 |
| FDSNet | 83.95 | 68.62 | 18.49 | 73.92 | 81.14 | 71.86 |
| CSEPNet | 94.40 | 82.22 | 8.88 | 88.43 | 92.37 | 83.10 |
| EMINet | 81.04 | 77.82 | 7.69 | 8.54 | 90.99 | 81.76 |
| EDRNet | 81.80 | 78.25 | 7.61 | 86.15 | 90.77 | 82.43 |
| DACNet | 83.03 | 79.40 | 6.95 | 88.68 | 91.76 | 82.90 |
| GDALRNet | 92.26 | 82.10 | 6.93 | 88.24 | 92.10 | 83.98 |
| LPCANet | 94.43 | 83.08 | 7.11 | 88.57 | 92.17 | 84.58 |
), ArticleFig(id=1249044050159018064, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, language=CN, label=表1, caption=
提出的模型与其他18种方法在 NEU-RSDDS-AUG数据集的定量比较
, figureFileSmall=null, figureFileBig=null, tableContent=
| 方法 | mAP | IOU | MAE |  |  |  |
|---|
| HAINet | 92.45 | 81.50 | 7.51 | 87.70 | 91.89 | 82.96 |
| CONet | 86.48 | 71.91 | 13.11 | 79.21 | 84.32 | 75.19 |
| XMSNet | 91.31 | 80.11 | 9.06 | 84.80 | 89.47 | 81.10 |
| PICRNet | 92.25 | 81.75 | 7.46 | 87.70 | 91.41 | 83.46 |
| TriTransNet | 88.36 | 80.11 | 6.90 | 87.93 | 92.47 | 83.28 |
| MoADNet | 90.19 | 80.97 | 7.20 | 88.14 | 91.81 | 83.13 |
| C2FNet | 91.05 | 82.97 | 6.52 | 88.24 | 92.09 | 84.37 |
| PGNet | 93.18 | 81.94 | 7.82 | 87.05 | 92.21 | 83.80 |
| SRNet | 91.75 | 79.20 | 9.27 | 85.64 | 90.55 | 81.95 |
| SINet | 94.02 | 82.02 | 8.15 | 88.34 | 9.21 | 84.01 |
| BBRFNet | 93.73 | 81.71 | 7.79 | 88.31 | 91.83 | 83.28 |
| RCSBNet | 90.30 | 81.29 | 6.88 | 88.14 | 92.17 | 83.71 |
| FDSNet | 83.95 | 68.62 | 18.49 | 73.92 | 81.14 | 71.86 |
| CSEPNet | 94.40 | 82.22 | 8.88 | 88.43 | 92.37 | 83.10 |
| EMINet | 81.04 | 77.82 | 7.69 | 8.54 | 90.99 | 81.76 |
| EDRNet | 81.80 | 78.25 | 7.61 | 86.15 | 90.77 | 82.43 |
| DACNet | 83.03 | 79.40 | 6.95 | 88.68 | 91.76 | 82.90 |
| GDALRNet | 92.26 | 82.10 | 6.93 | 88.24 | 92.10 | 83.98 |
| LPCANet | 94.43 | 83.08 | 7.11 | 88.57 | 92.17 | 84.58 |
), ArticleFig(id=1249044050251292757, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, language=EN, label=Tab.2, caption=
Quantitative comparison between proposed model and 18 other methods on the RSDDs-TYPE1 dataset
, figureFileSmall=null, figureFileBig=null, tableContent=
| 方法 | mAP | IOU | MAE |  |  |  |
|---|
| HAINet | 83.77 | 76.45 | 12.34 | 78.45 | 87.65 | 82.34 |
| CONet | 81.53 | 72.89 | 16.78 | 82.34 | 84.32 | 74.56 |
| XMSNet | 84.56 | 81.23 | 9.56 | 75.67 | 89.10 | 81.23 |
| PICRNet | 87.12 | 74.56 | 14.23 | 86.78 | 82.78 | 83.78 |
| TriTransNet | 82.31 | 79.34 | 18.90 | 74.23 | 90.56 | 80.90 |
| MoADNet | 86.17 | 80.78 | 11.09 | 80.90 | 81.23 | 84.12 |
| C2FNet | 86.76 | 77.12 | 7.98 | 79.56 | 86.45 | 82.67 |
| PGNet | 87.44 | 71.90 | 15.45 | 84.12 | 88.98 | 81.45 |
| SRNet | 86.10 | 82.56 | 13.67 | 76.89 | 85.67 | 83.65 |
| SINet | 87.48 | 75.32 | 19.87 | 81.34 | 83.45 | 80.78 |
| BBRFNet | 86.29 | 78.67 | 8.45 | 77.65 | 91.78 | 84.98 |
| RCSBNet | 84.43 | 83.09 | 17.32 | 87.23 | 87.12 | 82.10 |
| FDSNet | 79.98 | 70.45 | 19.12 | 73.45 | 80.34 | 69.54 |
| CSEPNet | 87.50 | 73.87 | 12.78 | 85.67 | 89.56 | 83.21 |
| EMINet | 71.11 | 81.54 | 7.89 | 72.98 | 84.78 | 80.45 |
| EDRNet | 70.98 | 79.10 | 14.56 | 83.10 | 86.90 | 84.89 |
| DACNet | 72.68 | 72.34 | 16.34 | 88.09 | 82.12 | 82.76 |
| GDALRNet | 87.12 | 80.65 | 19.21 | 79.76 | 85.34 | 81.34 |
| LPCANet | 87.52 | 83.90 | 7.85 | 88.71 | 91.80 | 85.90 |
), ArticleFig(id=1249044050339373146, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, language=CN, label=表2, caption=
提出的模型与其他18种方法在 RSDDs-TYPE1数据集的定量比较
, figureFileSmall=null, figureFileBig=null, tableContent=
| 方法 | mAP | IOU | MAE |  |  |  |
|---|
| HAINet | 83.77 | 76.45 | 12.34 | 78.45 | 87.65 | 82.34 |
| CONet | 81.53 | 72.89 | 16.78 | 82.34 | 84.32 | 74.56 |
| XMSNet | 84.56 | 81.23 | 9.56 | 75.67 | 89.10 | 81.23 |
| PICRNet | 87.12 | 74.56 | 14.23 | 86.78 | 82.78 | 83.78 |
| TriTransNet | 82.31 | 79.34 | 18.90 | 74.23 | 90.56 | 80.90 |
| MoADNet | 86.17 | 80.78 | 11.09 | 80.90 | 81.23 | 84.12 |
| C2FNet | 86.76 | 77.12 | 7.98 | 79.56 | 86.45 | 82.67 |
| PGNet | 87.44 | 71.90 | 15.45 | 84.12 | 88.98 | 81.45 |
| SRNet | 86.10 | 82.56 | 13.67 | 76.89 | 85.67 | 83.65 |
| SINet | 87.48 | 75.32 | 19.87 | 81.34 | 83.45 | 80.78 |
| BBRFNet | 86.29 | 78.67 | 8.45 | 77.65 | 91.78 | 84.98 |
| RCSBNet | 84.43 | 83.09 | 17.32 | 87.23 | 87.12 | 82.10 |
| FDSNet | 79.98 | 70.45 | 19.12 | 73.45 | 80.34 | 69.54 |
| CSEPNet | 87.50 | 73.87 | 12.78 | 85.67 | 89.56 | 83.21 |
| EMINet | 71.11 | 81.54 | 7.89 | 72.98 | 84.78 | 80.45 |
| EDRNet | 70.98 | 79.10 | 14.56 | 83.10 | 86.90 | 84.89 |
| DACNet | 72.68 | 72.34 | 16.34 | 88.09 | 82.12 | 82.76 |
| GDALRNet | 87.12 | 80.65 | 19.21 | 79.76 | 85.34 | 81.34 |
| LPCANet | 87.52 | 83.90 | 7.85 | 88.71 | 91.80 | 85.90 |
), ArticleFig(id=1249044050423259235, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, language=EN, label=Tab.3, caption=
Quantitative comparison between proposed model and 18 other methods on the RSDDs-TYPE2 dataset
, figureFileSmall=null, figureFileBig=null, tableContent=
| 方法 | mAP | IOU | MAE |  |  |  |
|---|
| HAINet | 82.85 | 76.34 | 15.67 | 82.34 | 86.78 | 82.45 |
| CONet | 80.93 | 82.12 | 20.34 | 79.65 | 82.34 | 75.12 |
| XMSNet | 83.77 | 70.56 | 12.78 | 86.78 | 90.12 | 83.67 |
| PICRNet | 85.49 | 78.90 | 9.89 | 77.23 | 84.56 | 81.90 |
| TriTransNet | 79.91 | 81.23 | 18.45 | 84.56 | 89.78 | 84.23 |
| MoADNet | 84.98 | 74.87 | 14.23 | 90.12 | 81.90 | 80.78 |
| C2FNet | 95.30 | 79.45 | 11.56 | 88.78 | 91.23 | 85.65 |
| PGNet | 86.64 | 69.78 | 21.78 | 75.45 | 87.45 | 81.34 |
| SRNet | 84.67 | 83.65 | 8.67 | 81.90 | 85.67 | 83.78 |
| SINet | 87.10 | 73.21 | 16.90 | 85.34 | 88.32 | 82.10 |
| BBRFNet | 84.25 | 77.54 | 13.45 | 76.87 | 92.10 | 84.56 |
| RCSBNet | 82.78 | 80.12 | 19.12 | 89.23 | 83.45 | 80.98 |
| FDSNet | 77.69 | 75.98 | 23.46 | 83.45 | 80.76 | 70.12 |
| CSEPNet | 85.91 | 84.01 | 17.56 | 87.67 | 84.98 | 81.54 |
| EMINet | 70.08 | 71.34 | 10.34 | 78.98 | 91.54 | 83.21 |
| EDRNet | 70.43 | 72.67 | 21.23 | 80.10 | 82.12 | 85.90 |
| DACNet | 71.96 | 84.45 | 14.87 | 90.76 | 87.89 | 82.76 |
| GDALRNet | 86.65 | 78.23 | 16.54 | 84.54 | 90.34 | 84.02 |
| LPCANet | 86.63 | 84.47 | 8.44 | 91.20 | 92.11 | 86.61 |
), ArticleFig(id=1249044050515533929, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, language=CN, label=表3, caption=
提出的模型与其他18种方法在 RSDDs-TYPE2数据集的定量比较
, figureFileSmall=null, figureFileBig=null, tableContent=
| 方法 | mAP | IOU | MAE |  |  |  |
|---|
| HAINet | 82.85 | 76.34 | 15.67 | 82.34 | 86.78 | 82.45 |
| CONet | 80.93 | 82.12 | 20.34 | 79.65 | 82.34 | 75.12 |
| XMSNet | 83.77 | 70.56 | 12.78 | 86.78 | 90.12 | 83.67 |
| PICRNet | 85.49 | 78.90 | 9.89 | 77.23 | 84.56 | 81.90 |
| TriTransNet | 79.91 | 81.23 | 18.45 | 84.56 | 89.78 | 84.23 |
| MoADNet | 84.98 | 74.87 | 14.23 | 90.12 | 81.90 | 80.78 |
| C2FNet | 95.30 | 79.45 | 11.56 | 88.78 | 91.23 | 85.65 |
| PGNet | 86.64 | 69.78 | 21.78 | 75.45 | 87.45 | 81.34 |
| SRNet | 84.67 | 83.65 | 8.67 | 81.90 | 85.67 | 83.78 |
| SINet | 87.10 | 73.21 | 16.90 | 85.34 | 88.32 | 82.10 |
| BBRFNet | 84.25 | 77.54 | 13.45 | 76.87 | 92.10 | 84.56 |
| RCSBNet | 82.78 | 80.12 | 19.12 | 89.23 | 83.45 | 80.98 |
| FDSNet | 77.69 | 75.98 | 23.46 | 83.45 | 80.76 | 70.12 |
| CSEPNet | 85.91 | 84.01 | 17.56 | 87.67 | 84.98 | 81.54 |
| EMINet | 70.08 | 71.34 | 10.34 | 78.98 | 91.54 | 83.21 |
| EDRNet | 70.43 | 72.67 | 21.23 | 80.10 | 82.12 | 85.90 |
| DACNet | 71.96 | 84.45 | 14.87 | 90.76 | 87.89 | 82.76 |
| GDALRNet | 86.65 | 78.23 | 16.54 | 84.54 | 90.34 | 84.02 |
| LPCANet | 86.63 | 84.47 | 8.44 | 91.20 | 92.11 | 86.61 |
), ArticleFig(id=1249044050612002928, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, language=EN, label=Tab.4, caption=
Quantitative comparisons between the proposed model on non-orbit datasets and eight other methods
, figureFileSmall=null, figureFileBig=null, tableContent=
| 方法 | DAGM2007 | MT | Kolektor-SDD2 |
|---|
| mAP | IOU | MAE | mAP | IOU | MAE | mAP | IOU | MAE |
|---|
| XMSNet | 89.34 | 88.90 | 8.54 | 92.76 | 91.78 | 10.23 | 86.54 | 87.65 | 9.78 |
| PICRNet | 91.23 | 81.23 | 11.65 | 87.45 | 79.56 | 8.90 | 90.12 | 82.78 | 10.45 |
| SRNet | 85.67 | 80.45 | 9.32 | 93.89 | 82.12 | 11.12 | 94.00 | 79.98 | 8.76 |
| BBRFNet | 88.78 | 82.34 | 10.89 | 90.34 | 80.67 | 8.15 | 85.98 | 81.01 | 9.54 |
| RCSBNet | 92.10 | 79.12 | 8.45 | 86.45 | 82.89 | 11.78 | 91.56 | 80.45 | 10.32 |
| FDSNet | 87.23 | 81.65 | 9.98 | 93.01 | 79.78 | 8.67 | 89.78 | 82.56 | 11.01 |
| CSEPNet | 90.56 | 80.98 | 11.45 | 85.12 | 81.34 | 9.23 | 92.34 | 82.78 | 8.29 |
| GDALRNet | 88.90 | 79.56 | 10.67 | 91.78 | 81.90 | 11.56 | 87.65 | 80.23 | 9.87 |
| LPCANet | 94.46 | 83.98 | 8.71 | 93.79 | 82.71 | 8.52 | 93.83 | 82.83 | 8.67 |
), ArticleFig(id=1249044050716860540, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, language=CN, label=表4, caption=
非轨道数据集上提出的模型与其他8种方法之间的定量比较表
, figureFileSmall=null, figureFileBig=null, tableContent=
| 方法 | DAGM2007 | MT | Kolektor-SDD2 |
|---|
| mAP | IOU | MAE | mAP | IOU | MAE | mAP | IOU | MAE |
|---|
| XMSNet | 89.34 | 88.90 | 8.54 | 92.76 | 91.78 | 10.23 | 86.54 | 87.65 | 9.78 |
| PICRNet | 91.23 | 81.23 | 11.65 | 87.45 | 79.56 | 8.90 | 90.12 | 82.78 | 10.45 |
| SRNet | 85.67 | 80.45 | 9.32 | 93.89 | 82.12 | 11.12 | 94.00 | 79.98 | 8.76 |
| BBRFNet | 88.78 | 82.34 | 10.89 | 90.34 | 80.67 | 8.15 | 85.98 | 81.01 | 9.54 |
| RCSBNet | 92.10 | 79.12 | 8.45 | 86.45 | 82.89 | 11.78 | 91.56 | 80.45 | 10.32 |
| FDSNet | 87.23 | 81.65 | 9.98 | 93.01 | 79.78 | 8.67 | 89.78 | 82.56 | 11.01 |
| CSEPNet | 90.56 | 80.98 | 11.45 | 85.12 | 81.34 | 9.23 | 92.34 | 82.78 | 8.29 |
| GDALRNet | 88.90 | 79.56 | 10.67 | 91.78 | 81.90 | 11.56 | 87.65 | 80.23 | 9.87 |
| LPCANet | 94.46 | 83.98 | 8.71 | 93.79 | 82.71 | 8.52 | 93.83 | 82.83 | 8.67 |
), ArticleFig(id=1249044050809135234, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, language=EN, label=Tab.5, caption=
Influence of SFE at different stages
, figureFileSmall=null, figureFileBig=null, tableContent=
第1 阶段 | 第2 阶段 | 第3 阶段 | 第4 阶段 | 参数量/M | FLOPs/G | 运行速度 /(帧/s) | mAP/% | IOU/% | MAE/% | /% | /% | /% |
|---|
| √ | - | - | - | 9.69 | 2.40 | 92.38 | 94.79 | 82.16 | 8.41 | 86.62 | 91.50 | 83.54 |
| √ | √ | - | - | 9.70 | 2.41 | 83.88 | 94.15 | 82.55 | 7.48 | 87.61 | 91.63 | 83.73 |
| √ | √ | √ | - | 9.90 | 2.50 | 162.60 | 94.43 | 83.08 | 7.11 | 88.57 | 92.17 | 84.58 |
| √ | √ | √ | √ | 10.73 | 2.58 | 84.81 | 94.21 | 82.72 | 7.22 | 88.21 | 91.90 | 84.25 |
), ArticleFig(id=1249044050897215625, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, language=CN, label=表5, caption=
不同阶段空间特征提取子的影响
, figureFileSmall=null, figureFileBig=null, tableContent=
第1 阶段 | 第2 阶段 | 第3 阶段 | 第4 阶段 | 参数量/M | FLOPs/G | 运行速度 /(帧/s) | mAP/% | IOU/% | MAE/% | /% | /% | /% |
|---|
| √ | - | - | - | 9.69 | 2.40 | 92.38 | 94.79 | 82.16 | 8.41 | 86.62 | 91.50 | 83.54 |
| √ | √ | - | - | 9.70 | 2.41 | 83.88 | 94.15 | 82.55 | 7.48 | 87.61 | 91.63 | 83.73 |
| √ | √ | √ | - | 9.90 | 2.50 | 162.60 | 94.43 | 83.08 | 7.11 | 88.57 | 92.17 | 84.58 |
| √ | √ | √ | √ | 10.73 | 2.58 | 84.81 | 94.21 | 82.72 | 7.22 | 88.21 | 91.90 | 84.25 |
), ArticleFig(id=1249044050997878928, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, language=EN, label=Tab.6, caption=
The impact of different backbone networks on segmentation results
, figureFileSmall=null, figureFileBig=null, tableContent=
| 模型 | 参数量/M | FLOPs/G | 运行速度/(帧/s) | mAP/% | IOU/% | MAE/% | /% | /% | /% |
|---|
| RegNet | 53.32 | 9.79 | 96.48 | 93.56 | 83.43 | 6.41 | 89.74 | 93.34 | 84.89 |
| ConvNeXt | 66.63 | 14.38 | 101.40 | 94.72 | 84.35 | 5.80 | 90.23 | 93.99 | 86.31 |
| Cswin | 45.20 | 10.17 | 29.08 | 93.85 | 84.00 | 5.74 | 90.07 | 94.15 | 85.48 |
| Swinv2 | 66.14 | 11.50 | 46.98 | 95.19 | 85.02 | 5.68 | 90.55 | 94.08 | 86.63 |
| 本文 | 9.90 | 2.50 | 162.60 | 94.43 | 83.08 | 7.11 | 88.57 | 92.17 | 84.58 |
), ArticleFig(id=1249044051085959317, tenantId=1146029695717560320, journalId=1249024232475115590, articleId=1249044017460224402, language=CN, label=表6, caption=
不同骨干网络对分割结果的影响
, figureFileSmall=null, figureFileBig=null, tableContent=
| 模型 | 参数量/M | FLOPs/G | 运行速度/(帧/s) | mAP/% | IOU/% | MAE/% | /% | /% | /% |
|---|
| RegNet | 53.32 | 9.79 | 96.48 | 93.56 | 83.43 | 6.41 | 89.74 | 93.34 | 84.89 |
| ConvNeXt | 66.63 | 14.38 | 101.40 | 94.72 | 84.35 | 5.80 | 90.23 | 93.99 | 86.31 |
| Cswin | 45.20 | 10.17 | 29.08 | 93.85 | 84.00 | 5.74 | 90.07 | 94.15 | 85.48 |
| Swinv2 | 66.14 | 11.50 | 46.98 | 95.19 | 85.02 | 5.68 | 90.55 | 94.08 | 86.63 |
| 本文 | 9.90 | 2.50 | 162.60 | 94.43 | 83.08 | 7.11 | 88.57 | 92.17 | 84.58 |
)], attaches=null, journal=Journal(id=1249023527618129992, delFlag=0, nameCn=中国图象图形学报, nameEn=Journal of Image and Graphics, nameHistory1=null, nameHistory2=null, issn=1006-8961, eissn=null, cn=11-3758, coden=CODEN ZTTXFZ, periodic=0, language=CN, oaType=null, ccby=null, superviseOffice=null, ownerOffice=null, pubOffice=null, editorOffice=null, officeType=null, aims=null, clcCode=null, officeProv=null, officeCity=null, officeAddr=null, officeZip=null, officeEmail=null, officePhone=null, editDirector=null, officeDirector=null, officeDirectorPhone=null, officeStaffNum=null, officeEmpNum=null, coverPicUrl=uirXtX858YS3zEpFXZttJA==, journalPrice=null, startedYear=null, abbrevIsoEn=Journal of Image and Graphics, journalRemark=null, publicationField=null, createdTime=1775720014721, updatedTime=1775720337198, createdBy=18614031015, updatedBy=13701087609, firstLetterCn=J, firstLetterEn=J, subjectCode=Engineering, subjectName=null, subjectCodeEn=Engineering, subjectNameEn=null, picCn=uirXtX858YS3zEpFXZttJA==, picEn=bud7qaxfvWHeFsbyBTAiKQ==, jcr=null, cjcr=null, exts=[JournalExt(id=1249024880377786590, language=CN, name=中国图象图形学报, nameHistory1=null, nameHistory2=null, managedBy=, sponsoredBy=, publishedBy=, editorOffice=, officeProv=null, officeCity=null, officeAddr=, officeZip=, editDirector=, officeDirector=null, officePhone=null, coverPicUrl=null, journalRemark=, submitArticleUrl=null, websiteUrl=, createdTime=1775720337242, updatedTime=1775720337242, createdBy=13701087609, updatedBy=13701087609, submissionGuidelinesUrl=, submissionAuthorUrl=https://journal.ids.fzyun.cn/auth/realms/journal/protocol/openid-connect/auth?client_id=journal-cjig-author&redirect_uri=https%3A%2F%2Fcjig.portal.founderss.cn%2Foauth%2Fcallback&response_type=code&scope=phone+openid+email+profile&state=e6369def-2842-41d8, submissionEditorUrl=https://journal.ids.fzyun.cn/auth/realms/journal/protocol/openid-connect/auth?client_id=journal-cjig-editor&redirect_uri=https%3A%2F%2Fcjigeditor.portal.founderss.cn%2Foauth%2Fcallback&response_type=code&scope=phone+openid+email+profile&state=9ccec05b-6bd, submissionReviewUrl=https://journal.ids.fzyun.cn/auth/realms/journal/protocol/openid-connect/auth?client_id=journal-cjig-author&redirect_uri=https%3A%2F%2Fcjig.portal.founderss.cn%2Foauth%2Fcallback&response_type=code&scope=phone+openid+email+profile&state=1e8a31c8-5434-4f78, submissionCeEditorUrl=, submissionAeEditorUrl=, option={"copyright":""}), JournalExt(id=1249024880449089759, language=EN, name=Journal of Image and Graphics, nameHistory1=null, nameHistory2=null, managedBy=, sponsoredBy=, publishedBy=, editorOffice=, officeProv=null, officeCity=null, officeAddr=, officeZip=, editDirector=, officeDirector=null, officePhone=null, coverPicUrl=null, journalRemark=, submitArticleUrl=null, websiteUrl=, createdTime=1775720337259, updatedTime=1775720337259, createdBy=13701087609, updatedBy=13701087609, submissionGuidelinesUrl=, submissionAuthorUrl=https://journal.ids.fzyun.cn/auth/realms/journal/protocol/openid-connect/auth?client_id=journal-cjig-author&redirect_uri=https%3A%2F%2Fcjig.portal.founderss.cn%2Foauth%2Fcallback&response_type=code&scope=phone+openid+email+profile&state=e6369def-2842-41d8, submissionEditorUrl=https://journal.ids.fzyun.cn/auth/realms/journal/protocol/openid-connect/auth?client_id=journal-cjig-editor&redirect_uri=https%3A%2F%2Fcjigeditor.portal.founderss.cn%2Foauth%2Fcallback&response_type=code&scope=phone+openid+email+profile&state=9ccec05b-6bd, submissionReviewUrl=https://journal.ids.fzyun.cn/auth/realms/journal/protocol/openid-connect/auth?client_id=journal-cjig-author&redirect_uri=https%3A%2F%2Fcjig.portal.founderss.cn%2Foauth%2Fcallback&response_type=code&scope=phone+openid+email+profile&state=1e8a31c8-5434-4f78, submissionCeEditorUrl=, submissionAeEditorUrl=, option={"copyright":""})], databaseList=null, tenantJournalId=1249024232475115590, websiteList=[Website(id=1249025782459334881, webName=null, webTitle=null, webDomain=null, webCopyrigh=null, webIpcNo=null, seoTitle=null, seoKeywords=null, seoDescription=null, tenantJournalId=null, journalId=1249024232475115590, journalNameCn=null, journalNameEn=null, grayFlag=null, tenantId=1146029695717560320, platformId=null, journalGroupId=null, journalGroupNameCn=null, journalGroupNameEn=null, type=1, domain=https://castjournals.cast.org.cn/joweb/zgtxtxxb/CN, language=CN, createTime=1775720552315, createBy=18614031015, updateTime=1775720586268, updateBy=18614031015, name=中国图象图形学报-中文, tplId=1146099689490845704, title=中国图象图形学报, delFlag=0, indexPage=/home, props=[WebsiteProps(id=1249026166254928133, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1249025782459334881, code=articleTextType, value=kx, createTime=1775720643819, updateTime=1775720643819, creator=18614031015, updator=18614031015), WebsiteProps(id=1249026166221373698, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1249025782459334881, code=banner, value=null, createTime=1775720643811, updateTime=1775720643811, creator=18614031015, updator=18614031015), WebsiteProps(id=1249026166271705352, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1249025782459334881, code=grayFlag, value=0, createTime=1775720643823, updateTime=1775720643823, creator=18614031015, updator=18614031015), WebsiteProps(id=1249026166212985089, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1249025782459334881, code=logo, value=https://castjournals.cast.org.cn/joweb/zgtxtxxb/CN/file/pic?fileId=TDRjKTHfgAnvFKZaDA70wA==, createTime=1775720643809, updateTime=1775720643809, creator=18614031015, updator=18614031015), WebsiteProps(id=1249026166288482570, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1249025782459334881, code=minRunFlag, value=0, createTime=1775720643827, updateTime=1775720643827, creator=18614031015, updator=18614031015), WebsiteProps(id=1249026166246539524, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1249025782459334881, code=picServerUrl, value=https://castjournals.cast.org.cn/joweb/zgtxtxxb/CN/file/pic, createTime=1775720643817, updateTime=1775720643817, creator=18614031015, updator=18614031015), WebsiteProps(id=1249026166280093961, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1249025782459334881, code=silenceFlag, value=0, createTime=1775720643825, updateTime=1775720643825, creator=18614031015, updator=18614031015), WebsiteProps(id=1249026166233956611, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1249025782459334881, code=staticResourcePath, value=https://castjournals.cast.org.cn/joweb/cast_kjdb_cn_619/, createTime=1775720643814, updateTime=1775720643814, creator=18614031015, updator=18614031015), WebsiteProps(id=1249026166259122438, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1249025782459334881, code=themeColor, value=null, createTime=1775720643820, updateTime=1775720643820, creator=18614031015, updator=18614031015), WebsiteProps(id=1249026166267511047, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1249025782459334881, code=themeStyle, value=null, createTime=1775720643822, updateTime=1775720643822, creator=18614031015, updator=18614031015)]), Website(id=1249025782681633001, webName=null, webTitle=null, webDomain=null, webCopyrigh=null, webIpcNo=null, seoTitle=null, seoKeywords=null, seoDescription=null, tenantJournalId=null, journalId=1249024232475115590, journalNameCn=null, journalNameEn=null, grayFlag=null, tenantId=1146029695717560320, platformId=null, journalGroupId=null, journalGroupNameCn=null, journalGroupNameEn=null, type=1, domain=https://castjournals.cast.org.cn/joweb/zgtxtxxb/EN, language=EN, createTime=1775720552368, createBy=18614031015, updateTime=1775720607118, updateBy=18614031015, name=中国图象图形学报-英文, tplId=1146101810881728533, title=Journal of Image and Graphics, delFlag=0, indexPage=/home, props=[WebsiteProps(id=1249026195371786511, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1249025782681633001, code=articleTextType, value=kx, createTime=1775720650761, updateTime=1775720650761, creator=18614031015, updator=18614031015), WebsiteProps(id=1249026195355009292, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1249025782681633001, code=banner, value=null, createTime=1775720650757, updateTime=1775720650757, creator=18614031015, updator=18614031015), WebsiteProps(id=1249026195392758034, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1249025782681633001, code=grayFlag, value=0, createTime=1775720650766, updateTime=1775720650766, creator=18614031015, updator=18614031015), WebsiteProps(id=1249026195342426379, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1249025782681633001, code=logo, value=https://castjournals.cast.org.cn/joweb/zgtxtxxb/EN/file/pic?fileId=TDRjKTHfgAnvFKZaDA70wA==, createTime=1775720650754, updateTime=1775720650754, creator=18614031015, updator=18614031015), WebsiteProps(id=1249026195409535252, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1249025782681633001, code=minRunFlag, value=0, createTime=1775720650770, updateTime=1775720650770, creator=18614031015, updator=18614031015), WebsiteProps(id=1249026195367592206, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1249025782681633001, code=picServerUrl, value=https://castjournals.cast.org.cn/joweb/zgtxtxxb/EN/file/pic, createTime=1775720650760, updateTime=1775720650760, creator=18614031015, updator=18614031015), WebsiteProps(id=1249026195401146643, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1249025782681633001, code=silenceFlag, value=0, createTime=1775720650768, updateTime=1775720650768, creator=18614031015, updator=18614031015), WebsiteProps(id=1249026195359203597, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1249025782681633001, code=staticResourcePath, value=https://castjournals.cast.org.cn/joweb/cast_kjdb_en_623/, createTime=1775720650758, updateTime=1775720650758, creator=18614031015, updator=18614031015), WebsiteProps(id=1249026195380175120, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1249025782681633001, code=themeColor, value=null, createTime=1775720650763, updateTime=1775720650763, creator=18614031015, updator=18614031015), WebsiteProps(id=1249026195388563729, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1249025782681633001, code=themeStyle, value=null, createTime=1775720650765, updateTime=1775720650765, creator=18614031015, updator=18614031015)])], journalTitle=中国图象图形学报, weixinUrl=null, journalUrl=https://www.cjig.cn/, iacademicId=null, status=1, seqNo=null, journalTitleEn=Journal of Image and Graphics, journalPhotoCn=uirXtX858YS3zEpFXZttJA==, journalPhotoEn=bud7qaxfvWHeFsbyBTAiKQ==, journalFirstLetter=J, journalRecommend=null, journalNew=null, journalCollection=null, jcrJf=null, cjcrJf=null, jcrJfStr=null, cjcrJfStr=null, submissionFirstDecision=null, sciSubjectClassification=null, casSubjectClassification=null, citeScore=null, totalCitationFrequency=null, icpCode=null, psCode=null, advertisingLicenseCode=null, copyrightInformation=null, country=null, option=, provinceCode=null, provinceName=null, collectFlag=false), detailUrlCn=https://castjournals.cast.org.cn/joweb/zgtxtxxb/CN/10.11834/jig.240547, detailUrlEn=https://castjournals.cast.org.cn/joweb/zgtxtxxb/EN/10.11834/jig.240547, pdfUrlCn=https://castjournals.cast.org.cn/joweb/zgtxtxxb/CN/PDF/10.11834/jig.240547, pdfUrlEn=https://castjournals.cast.org.cn/joweb/zgtxtxxb/EN/PDF/10.11834/jig.240547, aliStartDate=null, aliEndDate=null, collectionFlag=false, citedCount=null, citedUrl=null, reference=null)