Article(id=1254010454876020880, tenantId=1146029695717560320, journalId=1251234646239789153, issueId=1254010452460106357, articleNumber=null, orderNo=null, doi=10.12399/j.issn.2097-163x.2025.05.003, pmid=null, cstr=null, oa=null, hot=null, price=null, onlineType=0, articleFormat=0, articleType=null, articleTypeStr=null, receivedDate=1751817600000, receivedDateStr=2025-07-07, revisedDate=1753286400000, revisedDateStr=2025-07-24, acceptedDate=null, acceptedDateStr=null, onlineDate=1776908990829, onlineDateStr=2026-04-23, pubDate=null, pubDateStr=null, doiRegisterDate=null, doiRegisterDateStr=null, onlineIssueDate=1776908990829, onlineIssueDateStr=2026-04-23, onlineJustAcceptDate=null, onlineJustAcceptDateStr=null, onlineFirstDate=null, onlineFirstDateStr=null, sourceXml=null, magXml=null, createTime=1776908990829, creator=13041195026, updateTime=1776908990829, updator=13041195026, issue=Issue{id=1254010452460106357, tenantId=1146029695717560320, journalId=1251234646239789153, year='2025', volume='4', issue='5', pageStart='1', pageEnd='96', issueExtLink='null', onlineDate='null', pubDate='null', beforeIssueId=null, nextIssueId=null, price=null, status=1, issueComplete=1, articleOrder=1, issueType=1, specialIssue=null, createTime=1776908990253, creator=13041195026, updateTime=1777355431505, updator=13041195026, preIssue=null, nextIssue=null, ext={EN=IssueExt(id=1255882962894242489, tenantId=1146029695717560320, journalId=1251234646239789153, issueId=1254010452460106357, language=EN, specialIssueTitle=, coverIllustrator=null, specialIssueEditor=, specialIssueAbout=), CN=IssueExt(id=1255882962894242490, tenantId=1146029695717560320, journalId=1251234646239789153, issueId=1254010452460106357, language=CN, specialIssueTitle=, coverIllustrator=null, specialIssueEditor=, specialIssueAbout=)}, issueFiles=null}, startPage=42, endPage=53, ext={EN=ArticleExt(id=1254010455450640530, articleId=1254010454876020880, tenantId=1146029695717560320, journalId=1251234646239789153, language=EN, title=A node device-type identification method based on large language models and retrieval-augmented generation, columnId=1254010453361881720, journalTitle=Information Countermeasure Technology, columnName=Research Articles, runingTitle=null, highlight=null, articleAbstract=

In large-scale cyberspace mapping,rapidly and accurately detecting node information and identifying the operational status of devices is one of the core research contents. Currently,the version iteration speed of cyberspace devices is accelerating,and a large number of new-type devices are constantly emerging. How to track and identify the device type of the measured node has become a new challenge that needs to be solved urgently. Aiming at the problem that current research relies too much on existing knowledge and cannot adapt to device upgrade changes,a node device type identification method based on large language model(LLM)and retrieval-augmented generation(RAG)technology was proposed. First,relevant data were collected from RFC documents and Internet device manufacturer websites,and a knowledge vector database was constructed based on the embedding model.Then,the detected node feature information was encoded,and relevant background knowledge was retrieved from the vector database. The retrieved knowledge and node feature information were jointly constructed into prompts for the LLM. The reasoning ability of the LLM was used to identify the device type of the probed node. Finally,the effectiveness and performance of the method were verified through ablation experiments and real-network tests.

, correspAuthors=null, authorNote=null, correspAuthorsNote=null, copyrightStatement=null, copyrightOwner=null, extLink=null, articleAbsUrl=null, sourceXml=null, magXml=null, pdfUrl=null, pdf=null, pdfFileSize=null, pdfExtLink=null, richHtmlUrl=null, mobilePdfUrl=null, reviewReport=null, pdfFirstPage=null, abstractGraph=null, abstractGraphContent=null, abstractVideo=null, citation=null, cebUrl=null, magXmlContent=null, mapNumber=null, authorCompany=null, fund=null, authors=null, authorsList=Guozheng YANG, Chiyu CHEN, Zhaobin SHEN, Dongzhen QI, Junyu PAN), CN=ArticleExt(id=1254010462627094747, articleId=1254010454876020880, tenantId=1146029695717560320, journalId=1251234646239789153, language=CN, title=一种基于大语言模型及RAG技术的节点设备类型识别方法, columnId=1254010453533848188, journalTitle=信息对抗技术, columnName=研究论文, runingTitle=null, highlight=null, articleAbstract=

在大规模网络空间测绘中,快速准确探测节点信息并识别设备运行状态,是核心研究内容之一。当前,网络空间设备版本迭代速度加快,大量新型设备不断涌现,如何跟踪并识别被测节点的设备类型,成为亟待解决的新挑战。针对当前研究过于依赖已有知识,无法适应设备升级变化的问题,提出了一种基于大语言模型(large language model,LLM)和检索增强生成(retrieval-augmented generation,RAG)技术的节点设备类型识别方法。首先,从RFC文档和互联网设备厂商站点收集相关资料,基于嵌入模型构建知识向量数据库;然后,对探测得到的节点特征信息进行编码,从向量数据库中检索相关背景知识,将其与节点特征信息共同构造为提示词并输入LLM,利用其推理能力实现对被测节点的设备类型识别;最后,通过消融实验和实网测试,验证了该方法的有效性和性能。

, correspAuthors=null, authorNote=null, correspAuthorsNote=
通信作者:陈驰昱,E-mail:
, copyrightStatement=null, copyrightOwner=null, extLink=null, articleAbsUrl=null, sourceXml=2MtriDUXjq2HfSVrnyqtFQ==, magXml=uOiwSJnWFaKulITAXEIVEw==, pdfUrl=null, pdf=6UdvJZoGHtaHFNIxZH4olA==, pdfFileSize=3842709, pdfExtLink=null, richHtmlUrl=null, mobilePdfUrl=null, reviewReport=null, pdfFirstPage=null, abstractGraph=UDvtmxIEiLaggTo43NLB7w==, abstractGraphContent=null, abstractVideo=null, citation=null, cebUrl=null, magXmlContent=xml+n6iptA5vz507OmgpuQ==, mapNumber=null, authorCompany=null, fund=null, authors=

杨国正 男,1982年生,博士,教授,研究方向为网络空间测绘、网络安全态势感知 E-mail:

陈驰昱 男,1996年生,硕士研究生,研究方向为网络安全态势感知 E-mail:

沈照斌 男,2001年生,硕士研究生,研究方向为网络空间测绘 E-mail:

齐冬震 男,2000年生,硕士研究生,研究方向为路由安全 E-mail:

潘俊宇 男,2003年生,硕士研究生,研究方向为网络空间测绘 E-mail:

, authorsList=杨国正, 陈驰昱, 沈照斌, 齐冬震, 潘俊宇)}, authors=[Author(id=1254010463176548589, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, orderNo=0, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=yangguoz0218@163.com, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1254010463310766324, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, authorId=1254010463176548589, language=EN, stringName=Guozheng YANG, firstName=Guozheng, middleName=null, lastName=YANG, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, 2, address=1College of Electronic Engineering, National University of Defense Technology, Hefei 230037, China
2Anhui Province Key Laboratory of Cyberspace Security Situation Awareness and Evaluation, Hefei 230037, China, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1254010463440789752, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, authorId=1254010463176548589, language=CN, stringName=杨国正, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, 2, address=1国防科技大学电子对抗学院,安徽 合肥 230037
2安徽省网络空间安全态势感知与评估重点实验室,安徽 合肥 230037, bio={"img":"qu2sp2oZYVX+IlhP5d2c9Q==","content":"

杨国正 男,1982年生,博士,教授,研究方向为网络空间测绘、网络安全态势感知 E-mail:

"}, bioImg=qu2sp2oZYVX+IlhP5d2c9Q==, bioContent=

杨国正 男,1982年生,博士,教授,研究方向为网络空间测绘、网络安全态势感知 E-mail:

, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1254010462895530206, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, xref=1, ext=[AuthorCompanyExt(id=1254010462908113119, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, companyId=1254010462895530206, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1College of Electronic Engineering, National University of Defense Technology, Hefei 230037, China), AuthorCompanyExt(id=1254010462916501728, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, companyId=1254010462895530206, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1国防科技大学电子对抗学院,安徽 合肥 230037)]), AuthorCompany(id=1254010463042330852, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, xref=2, ext=[AuthorCompanyExt(id=1254010463075885286, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, companyId=1254010463042330852, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=2Anhui Province Key Laboratory of Cyberspace Security Situation Awareness and Evaluation, Hefei 230037, China), AuthorCompanyExt(id=1254010463084273895, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, companyId=1254010463042330852, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=2安徽省网络空间安全态势感知与评估重点实验室,安徽 合肥 230037)])]), Author(id=1254010463621144828, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, orderNo=1, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=chenchiyu14@nudt.edu.cn, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1254010464065741057, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, authorId=1254010463621144828, language=EN, stringName=Chiyu CHEN, firstName=Chiyu, middleName=null, lastName=CHEN, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, address=1College of Electronic Engineering, National University of Defense Technology, Hefei 230037, China, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1254010465802182915, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, authorId=1254010463621144828, language=CN, stringName=陈驰昱, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, address=1国防科技大学电子对抗学院,安徽 合肥 230037, bio={"img":"UXanMbzf+4wL5HppYP+z0w==","content":"

陈驰昱 男,1996年生,硕士研究生,研究方向为网络安全态势感知 E-mail:

"}, bioImg=UXanMbzf+4wL5HppYP+z0w==, bioContent=

陈驰昱 男,1996年生,硕士研究生,研究方向为网络安全态势感知 E-mail:

, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1254010462895530206, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, xref=1, ext=[AuthorCompanyExt(id=1254010462908113119, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, companyId=1254010462895530206, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1College of Electronic Engineering, National University of Defense Technology, Hefei 230037, China), AuthorCompanyExt(id=1254010462916501728, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, companyId=1254010462895530206, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1国防科技大学电子对抗学院,安徽 合肥 230037)])]), Author(id=1254010465915429127, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, orderNo=2, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=zhaobin19@nudt.edu.cn, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1254010466070618382, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, authorId=1254010465915429127, language=EN, stringName=Zhaobin SHEN, firstName=Zhaobin, middleName=null, lastName=SHEN, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, 2, address=1College of Electronic Engineering, National University of Defense Technology, Hefei 230037, China
2Anhui Province Key Laboratory of Cyberspace Security Situation Awareness and Evaluation, Hefei 230037, China, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1254010466179670286, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, authorId=1254010465915429127, language=CN, stringName=沈照斌, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, 2, address=1国防科技大学电子对抗学院,安徽 合肥 230037
2安徽省网络空间安全态势感知与评估重点实验室,安徽 合肥 230037, bio={"img":"7i/V2CNAIeU9GdcQ32Gsdg==","content":"

沈照斌 男,2001年生,硕士研究生,研究方向为网络空间测绘 E-mail:

"}, bioImg=7i/V2CNAIeU9GdcQ32Gsdg==, bioContent=

沈照斌 男,2001年生,硕士研究生,研究方向为网络空间测绘 E-mail:

, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1254010462895530206, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, xref=1, ext=[AuthorCompanyExt(id=1254010462908113119, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, companyId=1254010462895530206, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1College of Electronic Engineering, National University of Defense Technology, Hefei 230037, China), AuthorCompanyExt(id=1254010462916501728, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, companyId=1254010462895530206, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1国防科技大学电子对抗学院,安徽 合肥 230037)]), AuthorCompany(id=1254010463042330852, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, xref=2, ext=[AuthorCompanyExt(id=1254010463075885286, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, companyId=1254010463042330852, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=2Anhui Province Key Laboratory of Cyberspace Security Situation Awareness and Evaluation, Hefei 230037, China), AuthorCompanyExt(id=1254010463084273895, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, companyId=1254010463042330852, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=2安徽省网络空间安全态势感知与评估重点实验室,安徽 合肥 230037)])]), Author(id=1254010466339053842, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, orderNo=3, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=qidongzhen@nudt.edu.cn, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1254010466481660184, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, authorId=1254010466339053842, language=EN, stringName=Dongzhen QI, firstName=Dongzhen, middleName=null, lastName=QI, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, address=1College of Electronic Engineering, National University of Defense Technology, Hefei 230037, China, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1254010466666209562, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, authorId=1254010466339053842, language=CN, stringName=齐冬震, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, address=1国防科技大学电子对抗学院,安徽 合肥 230037, bio={"img":"X7LVMXb1KiUL5PP4midDXw==","content":"

齐冬震 男,2000年生,硕士研究生,研究方向为路由安全 E-mail:

"}, bioImg=X7LVMXb1KiUL5PP4midDXw==, bioContent=

齐冬震 男,2000年生,硕士研究生,研究方向为路由安全 E-mail:

, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1254010462895530206, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, xref=1, ext=[AuthorCompanyExt(id=1254010462908113119, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, companyId=1254010462895530206, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1College of Electronic Engineering, National University of Defense Technology, Hefei 230037, China), AuthorCompanyExt(id=1254010462916501728, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, companyId=1254010462895530206, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1国防科技大学电子对抗学院,安徽 合肥 230037)])]), Author(id=1254010466867536158, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, orderNo=4, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=1599561241@qq.com, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1254010466984976674, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, authorId=1254010466867536158, language=EN, stringName=Junyu PAN, firstName=Junyu, middleName=null, lastName=PAN, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, address=1College of Electronic Engineering, National University of Defense Technology, Hefei 230037, China, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1254010467060474148, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, authorId=1254010466867536158, language=CN, stringName=潘俊宇, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, address=1国防科技大学电子对抗学院,安徽 合肥 230037, bio={"img":"/chNA3wsggdrMKH3wqajxQ==","content":"

潘俊宇 男,2003年生,硕士研究生,研究方向为网络空间测绘 E-mail:

"}, bioImg=/chNA3wsggdrMKH3wqajxQ==, bioContent=

潘俊宇 男,2003年生,硕士研究生,研究方向为网络空间测绘 E-mail:

, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1254010462895530206, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, xref=1, ext=[AuthorCompanyExt(id=1254010462908113119, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, companyId=1254010462895530206, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1College of Electronic Engineering, National University of Defense Technology, Hefei 230037, China), AuthorCompanyExt(id=1254010462916501728, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, companyId=1254010462895530206, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1国防科技大学电子对抗学院,安徽 合肥 230037)])])], keywords=[Keyword(id=1254010467173720360, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, language=EN, orderNo=1, keyword=cyberspace mapping), Keyword(id=1254010467274383658, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, language=EN, orderNo=2, keyword=LLM), Keyword(id=1254010467383435564, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, language=EN, orderNo=3, keyword=RAG), Keyword(id=1254010467458933041, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, language=EN, orderNo=4, keyword=device type identification), Keyword(id=1254010467580567858, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, language=CN, orderNo=1, keyword=网络测量), Keyword(id=1254010467794477364, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, language=CN, orderNo=2, keyword=LLM), Keyword(id=1254010467920306486, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, language=CN, orderNo=3, keyword=RAG), Keyword(id=1254010468029358392, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, language=CN, orderNo=4, keyword=设备类型识别)], refs=[Reference(id=1254010472332714341, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, doi=null, pmid=null, pmcid=null, year=2016, volume=18, issue=1, pageStart=94, pageEnd=104, url=null, language=null, rfNumber=[1], rfOrder=0, authorNames=XU Q, ZHENG R, SAAD W, journalName=IEEE Communications Surveys & Tutorials, refType=null, unstructuredReference=XU Q,ZHENG R,SAAD W,et al.Device fingerprinting in wireless networks: challenges and opportunities[J]. IEEE Communications Surveys & Tutorials, 2016, 18(1): 94-104., articleTitle=Device fingerprinting in wireless networks: challenges and opportunities, refAbstract=null), Reference(id=1254010472466932071, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, doi=null, pmid=null, pmcid=null, year=2023, volume=10, issue=7, pageStart=6388, pageEnd=6399, url=null, language=null, rfNumber=[2], rfOrder=1, authorNames=WAN S F, LI Q, WANG H, journalName=IEEE Internet of Things Journal, refType=null, unstructuredReference=WAN S F, LI Q, WANG H, et al. DevTag: a benchmark for fingerprinting IoT devices[J]. IEEE Internet of Things Journal, 2023, 10(7): 6388-6399., articleTitle=DevTag: a benchmark for fingerprinting IoT devices, refAbstract=null), Reference(id=1254010472622121321, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, doi=null, pmid=null, pmcid=null, year=2025-07-24, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[3], rfOrder=2, authorNames=null, journalName=null, refType=null, unstructuredReference=Recog: a recognition framework[EB/OL].[2025-07-24].https://github.com/rapid7/recog/#recog-a%20-recognition-framework., articleTitle=Recog: a recognition framework, refAbstract=null), Reference(id=1254010472743756139, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, doi=null, pmid=null, pmcid=null, year=2018, volume=null, issue=null, pageStart=1, pageEnd=2, url=null, language=null, rfNumber=[4], rfOrder=3, authorNames=LI Q, FENG X, WANG H, journalName=null, refType=null, unstructuredReference=LI Q, FENG X, WANG H, et al. Discovery of Internet of Thing devices based on rules[C]//Proceedings of 2018 IEEE Conference on Computer Communications Poster and Demo.[S.l.]: IEEE, 2018: 1-2., articleTitle=Discovery of Internet of Thing devices based on rules, refAbstract=null), Reference(id=1254010472852808045, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, doi=null, pmid=null, pmcid=null, year=2018, volume=null, issue=null, pageStart=327, pageEnd=341, url=null, language=null, rfNumber=[5], rfOrder=4, authorNames=FENG X, LI Q, WANG H, journalName=null, refType=null, unstructuredReference=FENG X, LI Q, WANG H, et al. Acquisitional rule-based engine for discovering Internet-of-Things devices[C]//Proceedings of the 27th USENIX Security Symposium.[S.l.:s.n.], 2018: 327-341., articleTitle=Acquisitional rule-based engine for discovering Internet-of-Things devices, refAbstract=null), Reference(id=1254010472945082735, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, doi=null, pmid=null, pmcid=null, year=2020, volume=50, issue=3, pageStart=23, pageEnd=29, url=null, language=null, rfNumber=[6], rfOrder=5, authorNames=JAVED T, HASEEB M, ABDULLAH M, journalName=ACM SIGCOMM Computer Communication Review, refType=null, unstructuredReference=JAVED T, HASEEB M, ABDULLAH M, et al. Using application layer banner data to automatically identify IoT devices[J]. ACM SIGCOMM Computer Communication Review, 2020, 50(3): 23-29., articleTitle=Using application layer banner data to automatically identify IoT devices, refAbstract=null), Reference(id=1254010473033163121, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, doi=null, pmid=null, pmcid=null, year=2021, volume=null, issue=null, pageStart=405, pageEnd=410, url=null, language=null, rfNumber=[7], rfOrder=6, authorNames=CHENG H, DONG W Y, ZHENG Y, journalName=null, refType=null, unstructuredReference=CHENG H, DONG W Y, ZHENG Y, et al. Identify IoT devices through web interface characteristics[C]//Proceedings of 2021 IEEE International Conference on Computer and Communication Systems.[S.l.]:IEEE, 2021: 405-410., articleTitle=Identify IoT devices through web interface characteristics, refAbstract=null), Reference(id=1254010473104466291, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, doi=null, pmid=null, pmcid=null, year=2022, volume=null, issue=null, pageStart=1, pageEnd=7, url=null, language=null, rfNumber=[8], rfOrder=7, authorNames=WU Y C, LI C L, YANG J H, journalName=null, refType=null, unstructuredReference=WU Y C, LI C L, YANG J H, et al. WebIoT:classifying Internet of Things devices at Internet scale through web characteristics[C]//Proceedings of 2022 IEEE Symposium on Computers and Communications.[S.l.]: IEEE, 2022: 1-7., articleTitle=WebIoT:classifying Internet of Things devices at Internet scale through web characteristics, refAbstract=null), Reference(id=1254010474719273333, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, doi=null, pmid=null, pmcid=null, year=2023, volume=null, issue=null, pageStart=478, pageEnd=484, url=null, language=null, rfNumber=[9], rfOrder=8, authorNames=SARABI A, YIN T X, LIU M Y, journalName=null, refType=null, unstructuredReference=SARABI A, YIN T X, LIU M Y. An LLM-based framework for fingerprinting Internet-connected devices[C]//Proceedings of 2023 ACM on Internet Measurement Conference. New York: ACM, 2023: 478-484., articleTitle=An LLM-based framework for fingerprinting Internet-connected devices, refAbstract=null), Reference(id=1254010474840908150, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, doi=null, pmid=null, pmcid=null, year=2022, volume=null, issue=null, pageStart=1, pageEnd=8, url=null, language=null, rfNumber=[10], rfOrder=9, authorNames=UEDA T, SASAKI T, YOSHIOKA K, journalName=null, refType=null, unstructuredReference=UEDA T, SASAKI T, YOSHIOKA K, et al. An Internet-wide view of connected cars: discovery of exposed automotive devices[C]//Proceedings of the 17th International Conference on Availability,Reliability and Security. New York: ACM, 2022: 1-8., articleTitle=An Internet-wide view of connected cars: discovery of exposed automotive devices, refAbstract=null), Reference(id=1254010474945765751, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, doi=null, pmid=null, pmcid=null, year=2022, volume=null, issue=null, pageStart=2379, pageEnd=2396, url=null, language=null, rfNumber=[11], rfOrder=10, authorNames=SASAKI T, FUJITA A, GAÑÁN C H, journalName=null, refType=null, unstructuredReference=SASAKI T, FUJITA A, GAÑÁN C H, et al. Exposed infrastructures: discovery, attacks and remediation of insecure ICS remote management devices[C]//Proceedings of 2022 IEEE Symposium on Security and Privacy.[S.l.]: IEEE, 2022: 2379-2396., articleTitle=Exposed infrastructures: discovery, attacks and remediation of insecure ICS remote management devices, refAbstract=null), Reference(id=1254010475038040441, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, doi=null, pmid=null, pmcid=null, year=2023, volume=null, issue=null, pageStart=1, pageEnd=18, url=null, language=null, rfNumber=[12], rfOrder=11, authorNames=NASR T, TORABI S, BOU-HARBE, journalName=null, refType=null, unstructuredReference=NASR T,TORABI S,BOU-HARBE,et al.ChargePrint:a framework for Internet-scale discovery and security analysis of EV charging management systems[C]//Proceedings of the 30th Annual Network and Distributed System Security Symposium.[S.l.:s.n.], 2023:1-18., articleTitle=ChargePrint:a framework for Internet-scale discovery and security analysis of EV charging management systems, refAbstract=null), Reference(id=1254010475117732218, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, doi=null, pmid=null, pmcid=null, year=2024, volume=null, issue=null, pageStart=1, pageEnd=6, url=null, language=null, rfNumber=[13], rfOrder=12, authorNames=CHEN C Y, LU Y L, YANG G Z, journalName=null, refType=null, unstructuredReference=CHEN C Y, LU Y L, YANG G Z, et al. ZBanner:fast stateless scanning capable of obtaining responses over TCP[C]//Proceedings of 2024 IEEE International Performance,Computing,and Communications Conference.[S.l.]: IEEE, 2024: 1-6., articleTitle=ZBanner:fast stateless scanning capable of obtaining responses over TCP, refAbstract=null), Reference(id=1254010475189035387, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, doi=null, pmid=null, pmcid=null, year=2025-07-24, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[14], rfOrder=13, authorNames=LEE S, SHAKIR A, KOENIG D, journalName=null, refType=null, unstructuredReference=LEE S, SHAKIR A, KOENIG D, et al. Open source strikes bread-new fluffy embeddings model[EB/OL]. (2024-03-08)[2025-07-24]. https://www.mixedbread.ai/blog/mxbai-embed-large-v1., articleTitle=Open source strikes bread-new fluffy embeddings model, refAbstract=null), Reference(id=1254010475272921468, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, doi=null, pmid=null, pmcid=null, year=2024, volume=null, issue=null, pageStart=1825, pageEnd=1839, url=null, language=null, rfNumber=[15], rfOrder=14, authorNames=LI X M, LI J, journalName=null, refType=null, unstructuredReference=LI X M, LI J. AoE: angle-optimized embeddings for semantic textual similarity[C]//Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics. Bangkok: Association for Computational Linguistics, 2024: 1825-1839., articleTitle=AoE: angle-optimized embeddings for semantic textual similarity, refAbstract=null)], funds=null, companyList=[AuthorCompany(id=1254010462895530206, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, xref=1, ext=[AuthorCompanyExt(id=1254010462908113119, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, companyId=1254010462895530206, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1College of Electronic Engineering, National University of Defense Technology, Hefei 230037, China), AuthorCompanyExt(id=1254010462916501728, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, companyId=1254010462895530206, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1国防科技大学电子对抗学院,安徽 合肥 230037)]), AuthorCompany(id=1254010463042330852, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, xref=2, ext=[AuthorCompanyExt(id=1254010463075885286, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, companyId=1254010463042330852, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=2Anhui Province Key Laboratory of Cyberspace Security Situation Awareness and Evaluation, Hefei 230037, China), AuthorCompanyExt(id=1254010463084273895, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, companyId=1254010463042330852, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=2安徽省网络空间安全态势感知与评估重点实验室,安徽 合肥 230037)])], figs=[ArticleFig(id=1254010468247462203, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, language=EN, label=Fig.1, caption=Scheme architecture of node device type identification, figureFileSmall=4mXQqwEYuCM69YK1plpZow==, figureFileBig=UDvtmxIEiLaggTo43NLB7w==, tableContent=null), ArticleFig(id=1254010468364902716, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, language=CN, label=图1, caption=节点设备类型识别方案总体架构, figureFileSmall=4mXQqwEYuCM69YK1plpZow==, figureFileBig=UDvtmxIEiLaggTo43NLB7w==, tableContent=null), ArticleFig(id=1254010468612366655, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, language=EN, label=Fig.2, caption=Flowchart of node information feature extraction, figureFileSmall=yC7bVzYxaFWRSOGzTQFQYw==, figureFileBig=BScMvOCYnXmTnyNdcnDRYQ==, tableContent=null), ArticleFig(id=1254010468784333121, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, language=CN, label=图2, caption=节点探测信息特征提取工作流程图, figureFileSmall=yC7bVzYxaFWRSOGzTQFQYw==, figureFileBig=BScMvOCYnXmTnyNdcnDRYQ==, tableContent=null), ArticleFig(id=1254010470281699651, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, language=EN, label=Fig.3, caption=Vector database construction and background knowledge retrieval, figureFileSmall=Y9cTOzXWWKCmQIvRJLLjxA==, figureFileBig=gVtruRNfbHicXWvjxEDIEQ==, tableContent=null), ArticleFig(id=1254010470382362948, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, language=CN, label=图3, caption=向量数据库构建与背景知识检索, figureFileSmall=Y9cTOzXWWKCmQIvRJLLjxA==, figureFileBig=gVtruRNfbHicXWvjxEDIEQ==, tableContent=null), ArticleFig(id=1254010470499803462, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, language=EN, label=Fig.4, caption=Design of prompt words in the inference stage of LLM, figureFileSmall=nmYb+Z22dr3DNRum8Wgegg==, figureFileBig=nC6a5S17iOd4FkwanH+X8A==, tableContent=null), ArticleFig(id=1254010470587883848, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, language=CN, label=图4, caption=LLM推理阶段提示词设计, figureFileSmall=nmYb+Z22dr3DNRum8Wgegg==, figureFileBig=nC6a5S17iOd4FkwanH+X8A==, tableContent=null), ArticleFig(id=1254010470684352841, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, language=EN, label=Fig.5, caption=Normalized heatmap of confusion matrix for LingoVision experimental results, figureFileSmall=51saNfX2XxjRiKXrjUy81w==, figureFileBig=SamTT7rz4+PyUE7y1lwrJA==, tableContent=null), ArticleFig(id=1254010470835347787, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, language=CN, label=图5, caption=LingoVision实验结果的混淆矩阵归一化热力图, figureFileSmall=51saNfX2XxjRiKXrjUy81w==, figureFileBig=SamTT7rz4+PyUE7y1lwrJA==, tableContent=null), ArticleFig(id=1254010470927622476, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, language=EN, label=Fig.6, caption=Classification score bar chart of LingoVision experimental results, figureFileSmall=ucrA15nXuQ4gotGmhf4DVQ==, figureFileBig=bE8YM7kj/CEc6f4b1EUSyA==, tableContent=null), ArticleFig(id=1254010471015702862, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, language=CN, label=图6, caption=LingoVision实验结果的分类分数柱状图, figureFileSmall=ucrA15nXuQ4gotGmhf4DVQ==, figureFileBig=bE8YM7kj/CEc6f4b1EUSyA==, tableContent=null), ArticleFig(id=1254010471091200336, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, language=EN, label=Fig.7, caption=Normalized heatmap of confusion matrix in ablation experiment, figureFileSmall=viiIHcKtFn+9Kjf7xq63Cg==, figureFileBig=0n7ZO2sIlkqigRYRGWvT6A==, tableContent=null), ArticleFig(id=1254010471170892114, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, language=CN, label=图7, caption=消融实验混淆矩阵归一化热力图, figureFileSmall=viiIHcKtFn+9Kjf7xq63Cg==, figureFileBig=0n7ZO2sIlkqigRYRGWvT6A==, tableContent=null), ArticleFig(id=1254010471267361109, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, language=EN, label=Fig.8, caption=Bar chart of ablation experiment classification scores, figureFileSmall=QONtwQ5f35Lu1OKUPrV8EQ==, figureFileBig=+WTh3ZO4xlCtq21SIFpNHA==, tableContent=null), ArticleFig(id=1254010471355441495, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, language=CN, label=图8, caption=消融实验分类分数柱状图, figureFileSmall=QONtwQ5f35Lu1OKUPrV8EQ==, figureFileBig=+WTh3ZO4xlCtq21SIFpNHA==, tableContent=null), ArticleFig(id=1254010471439327575, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, language=EN, label=Tab.1, caption=

Examples of node device type identification

, figureFileSmall=null, figureFileBig=null, tableContent=
示例关键特征的类型特征内容重点部分背景知识判别设备类型
1HTTP协议响应横幅WWW-Authenticate:Basic realm=”TP-LINK Wireless N 3G/4G Router MR3420”无需无线路由器
2FTP协议响应横幅220 ET0021B78D865C Dell B2360dn Laser Printer FTP Server NH6.CY.N632 ready无需打印机
3HTTP协议响应横幅Server:cisco-IOS思科公司是全球领先的网络解决方案供应商,主要产品包括路由器、交换机等网络设备路由器
4TLS证书信息证书发行者组织名:WatchGuardWatchGuard公司是全球排名前列的专业防火墙产品制造商之一,主要业务涉及众多安全产品,包括防火墙与网关设备等防火墙设备
5端口协议该开放端口运行IPCAM协议IPCAM是一种用于摄像头在网络中传输数字视频流的通信协议网络摄像头
6HTTPS协议响应横幅SESSIONID=…Secoway USG2110Secoway USG是华为旗下的防火墙终端产品系列防火墙设备
), ArticleFig(id=1254010471560962392, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, language=CN, label=表1, caption=

节点设备类型识别相关示例

, figureFileSmall=null, figureFileBig=null, tableContent=
示例关键特征的类型特征内容重点部分背景知识判别设备类型
1HTTP协议响应横幅WWW-Authenticate:Basic realm=”TP-LINK Wireless N 3G/4G Router MR3420”无需无线路由器
2FTP协议响应横幅220 ET0021B78D865C Dell B2360dn Laser Printer FTP Server NH6.CY.N632 ready无需打印机
3HTTP协议响应横幅Server:cisco-IOS思科公司是全球领先的网络解决方案供应商,主要产品包括路由器、交换机等网络设备路由器
4TLS证书信息证书发行者组织名:WatchGuardWatchGuard公司是全球排名前列的专业防火墙产品制造商之一,主要业务涉及众多安全产品,包括防火墙与网关设备等防火墙设备
5端口协议该开放端口运行IPCAM协议IPCAM是一种用于摄像头在网络中传输数字视频流的通信协议网络摄像头
6HTTPS协议响应横幅SESSIONID=…Secoway USG2110Secoway USG是华为旗下的防火墙终端产品系列防火墙设备
), ArticleFig(id=1254010471661625690, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, language=EN, label=Tab.2, caption=

Mean statistics of LingoVision experimental results

, figureFileSmall=null, figureFileBig=null, tableContent=
精确率/%召回率/%F1值
宏平均值81760.74
加权平均值84750.76
), ArticleFig(id=1254010471787454811, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, language=CN, label=表2, caption=

LingoVision实验结果均值统计

, figureFileSmall=null, figureFileBig=null, tableContent=
精确率/%召回率/%F1值
宏平均值81760.74
加权平均值84750.76
), ArticleFig(id=1254010471862952284, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, language=EN, label=Tab.3, caption=

Identification and statistical results for unknown nodes

, figureFileSmall=null, figureFileBig=null, tableContent=
识别出的设备类型独立设备种类数量识别出的独立设备数量同特征网络节点数量/个
所属厂商数具体型号数
路由器98916406
网络摄像头5447782
VoIP网关11147883
打印机322138
负载均衡器110460
网络存储设备4439480
无线接入点33314835
VPN设备2221603
防火墙3224690
总计312726103277
), ArticleFig(id=1254010471930061150, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, language=CN, label=表3, caption=

针对未知节点的识别统计结果

, figureFileSmall=null, figureFileBig=null, tableContent=
识别出的设备类型独立设备种类数量识别出的独立设备数量同特征网络节点数量/个
所属厂商数具体型号数
路由器98916406
网络摄像头5447782
VoIP网关11147883
打印机322138
负载均衡器110460
网络存储设备4439480
无线接入点33314835
VPN设备2221603
防火墙3224690
总计312726103277
), ArticleFig(id=1254010472005558624, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, language=EN, label=Tab.4, caption=

Examples of identification and manual verification for unknown nodes

, figureFileSmall=null, figureFileBig=null, tableContent=
示例关键特征的类型特征内容重点部分系统推断及人工核验结果
设备类型所属厂商具体型号
1SNMP协议横幅响应… Cisco IOS Software,C800 Software (C800-UNIVERSALK9-M),Version 15.5 (3)M5,RELEASE SOFTWARE (fc1) Technical Support…路由器CiscoC800 15.5(3)M5
2RTSP协议横幅响应RTSP/1.0 200 OK CSeq:1 Server:IP Network Camera RTSP Server Public: DESCRIBE,SETUP,PLAY,TEARDOWN,GET PARAMETER,SET PARAMETER网络摄像头未知未知
3未知协议横幅响应$DEVINFO,000001,13.5,WLAN,ADHOC,M1-WiFi,6,78,MO-DEM,movistar,4,HSDPA,GPS,ON,30000,1,PORT,1,SERIAL,115200,ON,PORT,2,SERIAL,115200,ON,SERVICE,USB,0,30000,1,38450141无线接入点MovistarM1-WIFI
4POP3协议横幅响应+OK-ERR FortiGate firewall user authentication is needed.防火墙FortiGate未知
5HTTP协议横幅响应HTTP/1.1200 OK Server:ZhiDa NAS…connect-src data:
ws:wss:;default-src’self’’unsafe-eval’data:
blob:
https://*.synology.com
https://www.synology.cn/
https://help.synology.cn/
网络存储设备SynologyZhiDa NAS
), ArticleFig(id=1254010472127193442, tenantId=1146029695717560320, journalId=1251234646239789153, articleId=1254010454876020880, language=CN, label=表4, caption=

针对未知节点的识别及人工核验示例

, figureFileSmall=null, figureFileBig=null, tableContent=
示例关键特征的类型特征内容重点部分系统推断及人工核验结果
设备类型所属厂商具体型号
1SNMP协议横幅响应… Cisco IOS Software,C800 Software (C800-UNIVERSALK9-M),Version 15.5 (3)M5,RELEASE SOFTWARE (fc1) Technical Support…路由器CiscoC800 15.5(3)M5
2RTSP协议横幅响应RTSP/1.0 200 OK CSeq:1 Server:IP Network Camera RTSP Server Public: DESCRIBE,SETUP,PLAY,TEARDOWN,GET PARAMETER,SET PARAMETER网络摄像头未知未知
3未知协议横幅响应$DEVINFO,000001,13.5,WLAN,ADHOC,M1-WiFi,6,78,MO-DEM,movistar,4,HSDPA,GPS,ON,30000,1,PORT,1,SERIAL,115200,ON,PORT,2,SERIAL,115200,ON,SERVICE,USB,0,30000,1,38450141无线接入点MovistarM1-WIFI
4POP3协议横幅响应+OK-ERR FortiGate firewall user authentication is needed.防火墙FortiGate未知
5HTTP协议横幅响应HTTP/1.1200 OK Server:ZhiDa NAS…connect-src data:
ws:wss:;default-src’self’’unsafe-eval’data:
blob:
https://*.synology.com
https://www.synology.cn/
https://help.synology.cn/
网络存储设备SynologyZhiDa NAS
)], attaches=null, journal=Journal(id=1251231495319236709, delFlag=0, nameCn=信息对抗技术, nameEn=Information Countermeasure Technology, nameHistory1=null, nameHistory2=null, issn=2097-163X, eissn=, cn=34-1340/E, coden=null, periodic=1, language=CN, oaType=1, ccby=null, superviseOffice=null, ownerOffice=null, pubOffice=null, editorOffice=null, officeType=null, aims=null, clcCode=null, officeProv=null, officeCity=null, officeAddr=null, officeZip=null, officeEmail=, officePhone=, editDirector=null, officeDirector=null, officeDirectorPhone=null, officeStaffNum=null, officeEmpNum=null, coverPicUrl=28NAsfRnEEIAv0GvsYImxg==, journalPrice=null, startedYear=null, abbrevIsoEn=Information Countermeasure Technology, journalRemark=null, publicationField=null, createdTime=1776246435243, updatedTime=1776397616785, createdBy=18614031015, updatedBy=13701087609, firstLetterCn=I, firstLetterEn=I, subjectCode=Engineering, subjectName=工程, subjectCodeEn=Engineering, subjectNameEn=null, picCn=28NAsfRnEEIAv0GvsYImxg==, picEn=W2zeGuQU+j8zRKRf0eBlGA==, jcr=null, cjcr=null, exts=[JournalExt(id=1251865596783182768, language=CN, name=信息对抗技术, nameHistory1=null, nameHistory2=null, managedBy=, sponsoredBy=, publishedBy=, editorOffice=, officeProv=null, officeCity=null, officeAddr=, officeZip=, editDirector=, officeDirector=null, officePhone=null, coverPicUrl=null, journalRemark=, submitArticleUrl=null, websiteUrl=, createdTime=1776397616811, updatedTime=1776397616811, createdBy=13701087609, updatedBy=13701087609, submissionGuidelinesUrl=, submissionAuthorUrl=http://xxdkjs.ijournals.cn/xxdk/author/login, submissionEditorUrl=http://xxdkjs.ijournals.cn/xxdk/editor/login, submissionReviewUrl=http://xxdkjs.ijournals.cn/xxdk/reviewer/login, submissionCeEditorUrl=, submissionAeEditorUrl=, option={"copyright":""}), JournalExt(id=1251865596854485937, language=EN, name=Information Countermeasure Technology, nameHistory1=null, nameHistory2=null, managedBy=, sponsoredBy=, publishedBy=, editorOffice=, officeProv=null, officeCity=null, officeAddr=, officeZip=, editDirector=, officeDirector=null, officePhone=null, coverPicUrl=null, journalRemark=, submitArticleUrl=null, websiteUrl=, createdTime=1776397616828, updatedTime=1776397616828, createdBy=13701087609, updatedBy=13701087609, submissionGuidelinesUrl=, submissionAuthorUrl=http://xxdkjs.ijournals.cn/xxdk/author/login, submissionEditorUrl=http://xxdkjs.ijournals.cn/xxdk/editor/login, submissionReviewUrl=http://xxdkjs.ijournals.cn/xxdk/reviewer/login, submissionCeEditorUrl=, submissionAeEditorUrl=, option={"copyright":""})], databaseList=null, tenantJournalId=1251234646239789153, websiteList=[Website(id=1251257283527786546, webName=null, webTitle=null, webDomain=null, webCopyrigh=null, webIpcNo=null, seoTitle=null, seoKeywords=null, seoDescription=null, tenantJournalId=null, journalId=1251234646239789153, journalNameCn=null, journalNameEn=null, grayFlag=null, tenantId=1146029695717560320, platformId=null, journalGroupId=null, journalGroupNameCn=null, journalGroupNameEn=null, type=1, domain=https://castjournals.cast.org.cn/joweb/xxdkjs/CN, language=CN, createTime=1776252583629, createBy=18614031015, updateTime=1776253877246, updateBy=18614031015, name=信息对抗技术-中文, tplId=1146099689490845704, title=信息对抗技术, delFlag=0, indexPage=/home, props=[WebsiteProps(id=1251262819858068136, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251257283527786546, code=articleTextType, value=kx, createTime=1776253903593, updateTime=1776253903593, creator=18614031015, updator=18614031015), WebsiteProps(id=1251262819841290917, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251257283527786546, code=banner, value=null, createTime=1776253903589, updateTime=1776253903589, creator=18614031015, updator=18614031015), WebsiteProps(id=1251262819879039659, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251257283527786546, code=grayFlag, value=0, createTime=1776253903598, updateTime=1776253903598, creator=18614031015, updator=18614031015), WebsiteProps(id=1251262819837096612, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251257283527786546, code=logo, value=https://castjournals.cast.org.cn/joweb/xxdkjs/CN/file/pic?fileId=ejr3qWSnlyLKPBFIFKSmqw==, createTime=1776253903588, updateTime=1776253903588, creator=18614031015, updator=18614031015), WebsiteProps(id=1251262819891622573, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251257283527786546, code=minRunFlag, value=0, createTime=1776253903601, updateTime=1776253903601, creator=18614031015, updator=18614031015), WebsiteProps(id=1251262819853873831, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251257283527786546, code=picServerUrl, value=https://castjournals.cast.org.cn/joweb/xxdkjs/CN/file/pic, createTime=1776253903592, updateTime=1776253903592, creator=18614031015, updator=18614031015), WebsiteProps(id=1251262819887428268, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251257283527786546, code=silenceFlag, value=0, createTime=1776253903600, updateTime=1776253903600, creator=18614031015, updator=18614031015), WebsiteProps(id=1251262819845485222, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251257283527786546, code=staticResourcePath, value=https://castjournals.cast.org.cn/joweb/cast_kjdb_cn_619/, createTime=1776253903590, updateTime=1776253903590, creator=18614031015, updator=18614031015), WebsiteProps(id=1251262819866456745, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251257283527786546, code=themeColor, value=null, createTime=1776253903595, updateTime=1776253903595, creator=18614031015, updator=18614031015), WebsiteProps(id=1251262819870651050, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251257283527786546, code=themeStyle, value=null, createTime=1776253903596, updateTime=1776253903596, creator=18614031015, updator=18614031015)]), Website(id=1251257283611672652, webName=null, webTitle=null, webDomain=null, webCopyrigh=null, webIpcNo=null, seoTitle=null, seoKeywords=null, seoDescription=null, tenantJournalId=null, journalId=1251234646239789153, journalNameCn=null, journalNameEn=null, grayFlag=null, tenantId=1146029695717560320, platformId=null, journalGroupId=null, journalGroupNameCn=null, journalGroupNameEn=null, type=1, domain=https://castjournals.cast.org.cn/joweb/xxdkjs/EN, language=EN, createTime=1776252583649, createBy=18614031015, updateTime=1776253872573, updateBy=18614031015, name=信息对抗技术-英文, tplId=1146101810881728533, title=Information Countermeasure Technology, delFlag=0, indexPage=/home, props=[WebsiteProps(id=1251262790762185057, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251257283611672652, code=articleTextType, value=kx, createTime=1776253896656, updateTime=1776253896656, creator=18614031015, updator=18614031015), WebsiteProps(id=1251262790737019230, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251257283611672652, code=banner, value=null, createTime=1776253896650, updateTime=1776253896650, creator=18614031015, updator=18614031015), WebsiteProps(id=1251262790778962276, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251257283611672652, code=grayFlag, value=0, createTime=1776253896660, updateTime=1776253896660, creator=18614031015, updator=18614031015), WebsiteProps(id=1251262790728630621, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251257283611672652, code=logo, value=https://castjournals.cast.org.cn/joweb/xxdkjs/EN/file/pic?fileId=ejr3qWSnlyLKPBFIFKSmqw==, createTime=1776253896648, updateTime=1776253896648, creator=18614031015, updator=18614031015), WebsiteProps(id=1251262790791545190, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251257283611672652, code=minRunFlag, value=0, createTime=1776253896663, updateTime=1776253896663, creator=18614031015, updator=18614031015), WebsiteProps(id=1251262790753796448, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251257283611672652, code=picServerUrl, value=https://castjournals.cast.org.cn/joweb/xxdkjs/EN/file/pic, createTime=1776253896654, updateTime=1776253896654, creator=18614031015, updator=18614031015), WebsiteProps(id=1251262790783156581, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251257283611672652, code=silenceFlag, value=0, createTime=1776253896661, updateTime=1776253896661, creator=18614031015, updator=18614031015), WebsiteProps(id=1251262790745407839, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251257283611672652, code=staticResourcePath, value=https://castjournals.cast.org.cn/joweb/cast_kjdb_en_623/, createTime=1776253896652, updateTime=1776253896652, creator=18614031015, updator=18614031015), WebsiteProps(id=1251262790766379362, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251257283611672652, code=themeColor, value=null, createTime=1776253896657, updateTime=1776253896657, creator=18614031015, updator=18614031015), WebsiteProps(id=1251262790770573667, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1251257283611672652, code=themeStyle, value=null, createTime=1776253896658, updateTime=1776253896658, creator=18614031015, updator=18614031015)])], journalTitle=信息对抗技术, weixinUrl=null, journalUrl=http://xxdkjs.ijournals.cn/, iacademicId=null, status=1, seqNo=null, journalTitleEn=Information Countermeasure Technology, journalPhotoCn=28NAsfRnEEIAv0GvsYImxg==, journalPhotoEn=W2zeGuQU+j8zRKRf0eBlGA==, journalFirstLetter=I, journalRecommend=null, journalNew=null, journalCollection=null, jcrJf=null, cjcrJf=null, jcrJfStr=null, cjcrJfStr=null, submissionFirstDecision=null, sciSubjectClassification=null, casSubjectClassification=null, citeScore=null, totalCitationFrequency=null, icpCode=null, psCode=null, advertisingLicenseCode=null, copyrightInformation=null, country=null, option=, provinceCode=null, provinceName=null, collectFlag=false), detailUrlCn=https://castjournals.cast.org.cn/joweb/xxdkjs/CN/10.12399/j.issn.2097-163x.2025.05.003, detailUrlEn=https://castjournals.cast.org.cn/joweb/xxdkjs/EN/10.12399/j.issn.2097-163x.2025.05.003, pdfUrlCn=https://castjournals.cast.org.cn/joweb/xxdkjs/CN/PDF/10.12399/j.issn.2097-163x.2025.05.003, pdfUrlEn=https://castjournals.cast.org.cn/joweb/xxdkjs/EN/PDF/10.12399/j.issn.2097-163x.2025.05.003, aliStartDate=null, aliEndDate=null, collectionFlag=false, citedCount=null, citedUrl=null, reference=null)
收藏切换
一种基于大语言模型及RAG技术的节点设备类型识别方法
收藏切换
PDF下载
杨国正 1, 2 , 陈驰昱 1 , 沈照斌 1, 2 , 齐冬震 1 , 潘俊宇 1
信息对抗技术 | 研究论文 2025,4(5): 42-53
收起
收藏切换
信息对抗技术 | 研究论文 2025, 4(5): 42-53
一种基于大语言模型及RAG技术的节点设备类型识别方法
全屏
杨国正1, 2 , 陈驰昱1 , 沈照斌1, 2 , 齐冬震1 , 潘俊宇1
作者信息
  • 1国防科技大学电子对抗学院,安徽 合肥 230037
  • 2安徽省网络空间安全态势感知与评估重点实验室,安徽 合肥 230037
  • 杨国正 男,1982年生,博士,教授,研究方向为网络空间测绘、网络安全态势感知 E-mail:

    陈驰昱 男,1996年生,硕士研究生,研究方向为网络安全态势感知 E-mail:

    沈照斌 男,2001年生,硕士研究生,研究方向为网络空间测绘 E-mail:

    齐冬震 男,2000年生,硕士研究生,研究方向为路由安全 E-mail:

    潘俊宇 男,2003年生,硕士研究生,研究方向为网络空间测绘 E-mail:

通讯作者:

通信作者:陈驰昱,E-mail:
A node device-type identification method based on large language models and retrieval-augmented generation
Guozheng YANG1, 2 , Chiyu CHEN1 , Zhaobin SHEN1, 2 , Dongzhen QI1 , Junyu PAN1
Affiliations
  • 1College of Electronic Engineering, National University of Defense Technology, Hefei 230037, China
  • 2Anhui Province Key Laboratory of Cyberspace Security Situation Awareness and Evaluation, Hefei 230037, China
doi: 10.12399/j.issn.2097-163x.2025.05.003
文章导航
收藏切换

在大规模网络空间测绘中,快速准确探测节点信息并识别设备运行状态,是核心研究内容之一。当前,网络空间设备版本迭代速度加快,大量新型设备不断涌现,如何跟踪并识别被测节点的设备类型,成为亟待解决的新挑战。针对当前研究过于依赖已有知识,无法适应设备升级变化的问题,提出了一种基于大语言模型(large language model,LLM)和检索增强生成(retrieval-augmented generation,RAG)技术的节点设备类型识别方法。首先,从RFC文档和互联网设备厂商站点收集相关资料,基于嵌入模型构建知识向量数据库;然后,对探测得到的节点特征信息进行编码,从向量数据库中检索相关背景知识,将其与节点特征信息共同构造为提示词并输入LLM,利用其推理能力实现对被测节点的设备类型识别;最后,通过消融实验和实网测试,验证了该方法的有效性和性能。

网络测量  /  LLM  /  RAG  /  设备类型识别

In large-scale cyberspace mapping,rapidly and accurately detecting node information and identifying the operational status of devices is one of the core research contents. Currently,the version iteration speed of cyberspace devices is accelerating,and a large number of new-type devices are constantly emerging. How to track and identify the device type of the measured node has become a new challenge that needs to be solved urgently. Aiming at the problem that current research relies too much on existing knowledge and cannot adapt to device upgrade changes,a node device type identification method based on large language model(LLM)and retrieval-augmented generation(RAG)technology was proposed. First,relevant data were collected from RFC documents and Internet device manufacturer websites,and a knowledge vector database was constructed based on the embedding model.Then,the detected node feature information was encoded,and relevant background knowledge was retrieved from the vector database. The retrieved knowledge and node feature information were jointly constructed into prompts for the LLM. The reasoning ability of the LLM was used to identify the device type of the probed node. Finally,the effectiveness and performance of the method were verified through ablation experiments and real-network tests.

cyberspace mapping  /  LLM  /  RAG  /  device type identification
杨国正, 陈驰昱, 沈照斌, 齐冬震, 潘俊宇. 一种基于大语言模型及RAG技术的节点设备类型识别方法. 信息对抗技术, 2025 , 4 (5) : 42 -53 . DOI: 10.12399/j.issn.2097-163x.2025.05.003
Guozheng YANG, Chiyu CHEN, Zhaobin SHEN, Dongzhen QI, Junyu PAN. A node device-type identification method based on large language models and retrieval-augmented generation[J]. Information Countermeasure Technology, 2025 , 4 (5) : 42 -53 . DOI: 10.12399/j.issn.2097-163x.2025.05.003
随着互联网技术与应用的快速发展,网络和终端设备的种类和数量不断增加,如何在大规模网络空间测绘中,快速准确识别被测节点的设备类型成为新的挑战。目前,针对节点设备类型的识别方法主要有基于指纹库和基于机器学习模型2类。基于指纹库的方法需要预先构建各种设备的探测指纹库,再通过类似正则表达式的方式匹配探测标识符实现设备类型识别。基于机器学习模型的方法主要通过标签数据训练特定的机器学习模型,根据探测节点特征自动判断所属设备类型。这2类方法均适用于识别已知设备类型,其识别能力依赖于人工构建的指纹库或者训练使用标签数据的质量。然而,随着网络空间设备版本升级节奏的加快和大量新设备的涌现,这些方法在识别新设备方面存在较大局限。
近年来,随着大语言模型(large language model,LLM)的快速发展,基于自然语言处理的设备类型识别方法逐渐受到关注。LLM擅长从文本中提取复杂信息,并在无监督学习的基础上展现出强大的推理能力。针对大规模网络测绘中的设备类别识别这一问题,本文提出了一种基于LLM和检索增强生成(retrieval-augmented generation,RAG)技术的节点设备类别识别方法。该方案通过采集RFC(request for comments)文档和互联网设备厂商网页信息,构建网络协议和设备品牌知识向量数据库;在具体分析探测节点设备类型时,从向量数据库中检索相关背景知识,将其精心构造为提示词,并输入LLM,利用LLM的推理能力完成设备类型识别,最后通过实验验证该方法的有效性。
在网络空间测绘领域,对测量得到的节点进行设备类型识别主要基于设备指纹技术(device fingerprinting,DFP)[1],即利用设备进行网络通信的各层面特征生成指纹(或签名),并借助这些指纹(或签名)实现对设备的识别或分类,这里的设备包括网络设备、嵌入式设备和物联网设备等类型。根据特征获取、指纹生成和识别方式等过程的差异,设备类型识别技术可被划分为基于规则的方法和基于机器学习模型的方法[2]
著名网络扫描工具Nmap(network mapper)的配套特征文件和Rapid7安全公司的开源项目Recog[3],都是典型的基于规则的指纹库项目。前者在脚本库中包含了大量服务和设备的指纹匹配规则;后者利用开源社区贡献,持续补充包括图标、响应和协议字段等形式的各类指纹信息,涉及操作系统、供应商、硬件和软件信息等方面内容。这类传统的规则匹配方式虽然识别准确度较高,但需要大量专家知识和人工参与,并且无法识别没有入库的未知设备。
为了降低指纹(规则)的构建成本,一些基于自动化构建规则的识别方法应运而生。LI等[4]发现,物联网设备的应用层响应数据通常包含与其制造商高度相关的内容,通过爬取关于设备描述的网页,并使用实体命名识别提取设备标注,提出了一种基于规则的物联网设备发现与识别方法。基于类似的策略,规则采集引擎ARE[5]利用来自物联网设备的应用层响应数据和相关网站中的产品描述来构建设备规则,以“类型—供应商—产品”的三元组形式生成物联网设备标注规则。然而,JAVED等[6]发现基于相同原理实现的ARE引擎无法达到其宣称的精度,这反映出此类自动化的规则生成方案不仅设计复杂,而且在面对未知设备时存在泛化能力较弱的问题。
CHENG等[7]从HTTP响应头部及响应体中提取各类统计特征,通过有监督的机器学习模型,在不需要人工指纹库支持的情况下,可以达到97.5%的准确率。WebIoT框架[8]将提取网页的图像特征和统计特征结合为设备特征作为神经网络的输入,在含20万条物联网设备的标注数据集上进行训练,也取得了较好的精度。但无论是机器学习还是深度学习模型的训练,都需要大量标签数据,且训练成本较高。
2023年,SARABI等[9]意识到由互联网扫描生成的大量文本数据非常适合于训练LLM,因此使用数亿条应用层横幅训练了一个基于Transformer的掩码语言模型,并设计对比损失函数进行微调使其能够生成时间上稳定的横幅嵌入,同时保持功能相似的硬件/软件产品嵌入在向量空间中距离相近。通过该模型可以对HTTP横幅进行聚类分析,并为每个聚类生成基于文本的指纹。LLM虽然可以有效生成具有语义的嵌入并形成聚类,但由于训练数据本身不具备标签信息,无法自动对聚类标识,导致生成的指纹表征无法识别具体的设备类型。
除上述研究外,当前研究在如何发现互联网中新出现的特定类型设备方面进行了相关尝试。UEDA等[10]通过在车载设备网站中提取关键字并在Censys中匹配和聚类Web页面,发现了12种暴露在互联网上的车载设备。SASAKI等[11]利用一般远程管理设备Web页面间的相似性和一般网站Web页面之间的异构性,精心设计了一套迭代流程来寻找互联网中的工业控制系统(industrial control systems,ICS)管理设备,同时借助ICS远程管理设备Web页面中普遍存在的特征信息,自动标注设备型号、地理位置等关键信息。ChargePrint框架[12]通过迭代初始种子指纹以及分类、聚类的方法扩展设备搜索引擎的功能,在互联网上搜索电动汽车充电管理系统,并评估其安全性。
总的来说,现有研究方法多聚焦于如何基于已有知识和样本完成对设备类型的判别,在对未知设备识别方面,仅少量研究针对特定类型设备,设计了较为复杂的方案流程,且需要人工参与,难以泛化成通用的自动化设备类别识别方法。为此,本文从提高网络空间测绘节点设备类型识别率的角度,引入LLM及RAG技术,提出一种新的设备类型识别方法。
在网络空间测绘过程中,测量节点无法有效识别设备类型,源于以下几种原因:
1)版本迭代。由于设备型号、固件版本等升级变化,使当前特征与已有指纹库中的特征产生偏差。
2)定制配置。厂商或用户自行修改、定制过的设备,其具体配置导致当前特征与已有指纹库中的特征产生差异。
3)新出现设备。互联网中新出现的未知品牌或型号设备,缺少指纹或样本训练数据。
针对上述无法有效识别设备类型的情况,本文基于LLM和RAG技术,设计一种智能化识别方案。该方案的立足点是:在网络探测中发现的节点虽然无法被传统方法有效识别类型,但节点某些端口和协议特征中隐含了该设备的类型特征,特别是其端口响应数据的文本内容中蕴含了可被理解的相关信息。在此情况下,即使方案中并无该设备的预设指纹,且未使用特征相近的类似标签数据做过训练,也能够基于探测到的特征信息推理出设备类型。方案总体架构如图1所示,主要包括节点探测信息特征提取、背景知识检索和LLM推理3个部分。
节点探测信息特征提取部分的具体工作流程如图2所示。
针对网络测绘中指定的地址范围,以流行端口的Top 20为测量端口,首先使用应用层扫描器[13]快速发现其开放端口服务和横幅信息,包括非加密信息获取和TLS加密信息获取2部分;然后使用当前最新版本的Nmap7.95扫描器内置指纹库(Nmap-service-probes)对已知的设备类型进行匹配识别,由此筛选和过滤出不能被指纹库匹配的节点,结合探测得到的相关信息形成该节点的端口和协议特征。
本方案选择的端口和协议特征都能够基于大规模应用层探测技术高效获取,主要包括以下元素:
1)开放端口列表;
2)各开放端口所运行应用层协议;
3)各开放端口响应的横幅信息;
4)运行TLS协议的各开放端口中包含的证书信息,包括证书主题(Subject)和证书发行者(Issuer)的通用名、组织名、组织单元等。
在节点设备类型识别中,传统方法依赖于预定义的特征匹配和规则库,当设备特征信息变化频繁时识别效果较差。LLM虽然可以根据设备特征对其类型进行推理和识别,但由于模型的参数和训练数据有限,可能因缺乏相关的领域知识或较新的资料导致无法准确识别。表1列出了实际探测中根据关键特征和背景知识人工使用LLM判断设备类型的相关示例。
表1的示例,可以形成如下认知:
1)节点某开放端口运行应用层协议,并且其响应横幅中包含了设备的类型和版本信息,则LLM在无需特定领域背景知识的情况下判断设备类型,如示例1和示例2;
2)节点所有开放端口和协议特征中并不直接包含设备类型信息,但其响应横幅或TLS证书信息中包含服务器厂商名称,基于相关背景知识可以推断该设备的类型,如示例3和示例4;
3)节点某开放端口只响应了没有语义的二进制消息,但该开放端口运行协议被识别为一种设备专用协议,基于网络协议相关的背景知识可以推断该设备类型,如示例5;
4)节点的开放端口响应横幅中虽没有指示设备类型,但其包含了产品型号,又由于该具体型号在传统指纹库中没有收录,因此未能识别。然而基于型号所属系列产品信息的背景指示,可以推断该设备类型,如示例6。
综上所述,当节点特征中包含设备类型或型号时,LLM能够根据提供的上下文信息准确识别设备类型和型号;当节点特征中并不显示包含设备类型标识符时,使用RAG技术为大模型补充相关背景知识,同样可以实现对设备类型的有效识别。基于这种策略,向量数据库构建与背景知识检索模块的实现流程如图3所示。
构建向量数据库的知识资料主要来源于RFC文档和各种互联网厂商设备的开源资料。RFC文档中详细定义了各种网络协议的规范和行为,为理解网络设备运行机制提供重要基础知识;各种互联网厂商设备的介绍资料包含了网络设备的功能、工作机制、使用协议和生产厂商等信息,这些信息为理解节点探测得到的特征信息进而识别设备类型提供重要支撑。
为了支持针对节点特征信息的语义化查询功能,这些知识资料需要被转化为向量表示。具体来说,需要采用嵌入技术,将文本内容通过嵌入模型转化为向量表示集合V={v1v2,…,vn},向量维度为d。这些向量可以捕捉文档中的语义信息,使得与设备特征中的语义相似的资料能够在向量空间中距离更近。这些嵌入向量随后被存入向量数据库。
在未识别节点特征提取完成后,输入其端口和协议特征信息,向量数据库能够根据这些输入信息生成对应的相同维度的嵌入向量w,通过计算wV中向量的距离
快速在向量数据库中找到最相似的向量,从而返回至与该节点最相关的背景知识。例如,通过查询返回有关协议或者与横幅内容相关的设备、厂商等资料。
需要说明的是,虽然向量数据库的构建也需要收集相关领域的资料,但相比于传统的指纹库构建和模型的标签数据集整理,能够省去大量的专家知识和相关成本投入,无需复杂的数据清洗过程,自动化程度较高,并且易于资料的更新处理。
在节点特征提取和背景知识检索的基础上,需要进一步使用LLM进行设备类型推断,其中的关键在于构建合理的提示词(亦称提示工程,prompt engineering),通过有效的问题引导LLM对设备的类型进行判断和分类。提示词不仅要向LLM传递设备的具体协议特征和相关背景知识,还需确保逻辑性和引导性,以帮助模型在推理过程中沿着正确的方向进行分析。
为确保LLM能够准确区分不同类型的设备,本方案设计的提示词中提供了每种设备类别的详细描述,涵盖其典型协议特征、常用端口、典型行为以及具体的应用场景。部分举例如下:
1)路由器。连接2个或多个网络的硬件设备,利用路由协议(如BGP、OSP等)管理网络流量,通过HTTP、HTTPS、SSH或TELNET协议进行配置,广泛应用于家庭、企业和服务提供商环境。
2)网络摄像头。一种结合传统摄像机与网络技术的新一代摄像机,常使用RTSP、RTP、RTCP等协议进行视频流传输,可通过HTTP、HTTPS、SSH或TELNET等协议进行管理,一般工作在特定的端口范围。
3)打印机。作为网络节点独立存在,通过网络打印服务接入互联网,常使用IPP、LPD/LPR、HTTP或专有协议进行通信,设备响应较为标准化,典型应用领域为办公场景。
提示词的另一个重要作用是规范化模型输出格式。通过提示词引导,LLM可以按照预设的格式返回结果,这对于后续方案的自动化实现非常重要。本方案使用结构化的JSON格式作为输出格式,规范化输出的提示词设计包括:
1)指示模型输出JSON格式。提示词中明确要求模型按指定格式返回结果。例如,在推理设备类型时,模型输出的JSON应包含设备类型、推理依据和置信度等字段。
2)统一字段定义。在提示词中定义字段名称(如设备类型、置信度等),确保模型输出的一致性和规范性。
基于以上思路,本方案设计的提示词形式如图4所示。由图4可以看出,提示词设计主要包含5个部分,其中:
①设备类型范围和详细定义用来明确类型划分的标准和边界;
②当前设备节点端口和协议特征是进行类型推断的关键依据;
③推断设备类型可能用到的思路可以为模型提供推理依据;
④相关背景知识为模型补充可用的推理材料;
⑤输出格式和示例模板用以规范化模型的输出,便于程序提取。
使用精心构建的提示词,引导LLM结合设备特征与背景知识进行推理,并使用结构化格式输出结果,能够提高设备类型识别的准确性和规范性。
基于上述设计方案,本文实现了面向网络空间测绘的节点探测与类型识别原型系统LingoVision。本节主要设计了3个方面的实验,分别是LingoVision系统基线方案实验,对部分模块进行删减的消融实验,以及使用该系统对真实互联网中无法被现有网络空间搜索引擎和流行指纹库识别的节点进行设备类型识别的实验。
考虑到方案中节点特征提取部分涉及大规模网络扫描,结合此类扫描节点轻量级部署的实际情况,本文将实验环境限制在常规配置下:实验均在配备12核CPU、16 GB内存的笔记本电脑上进行,主机上运行Ubuntu22.04LTS操作系统及6.5.0-41-generic版本Linux内核,无其他特殊设置。
LingoVision系统选取2个轻量级且易于部署在探测节点的模型,具体如下:
1)嵌入模型。使用开源嵌入模型mxbai-embed-large-v1[14-15],通过对7亿对比学习数据对进行训练,并结合3 000万高质量三元组数据进行微调,使其灵活处理不同主题和领域,旨在将文本转换为稠密的向量表示,用于各种自然语言处理任务,如语义搜索、信息检索和文本聚类。
2)语言模型。使用由Meta开发的LLaMA3大语言模型的8B版本。LLaMA3是一种自回归语言模型,采用优化的Transformer架构,专为生成式任务和对话场景进行了调优。8B版本的模型拥有80亿参数,并通过大规模公开数据集进行预训练。此版本模型支持输入文本并生成文本或代码,具备强大的语言理解和生成能力。
使用开源的Faiss(Facebook AI similarity search)向量数据库,每次查询时返回前3条最相关数据。在方案的具体实现中,通过下载和爬虫爬取的数据经过语义切分后,得到4 562条RFC相关数据和1 241条设备厂商相关数据。切分时,文本块词数(chunk size)设置为1 000,文本重叠词数(chunk overlap)设置为200。
为了验证本方案效果,测试数据集需要同时包含较为全面的端口、协议特征以及设备标签,而已有的节点设备类型识别工作中并没有满足条件的可用数据集。因此,本数据集主要通过手动收集现有网络空间搜索引擎中的相关记录获得。数据集中包含333个设备条目以及每种设备类型的多个特征信息,共涉及14种不同的设备类,包括端口、协议、TLS证书、服务横幅(Banner)和网页标题等特征。每一条数据记录代表一个节点及其相关开放端口的协议特征。由于这些节点条目中包含了无法通过Nmap指纹库匹配其类型的记录,这为测试方案的鲁棒性和泛化能力带来了挑战。
数据集中涉及的设备类型标签来源于现有网络空间搜索引擎,具体类型及其简称包括:路由器(router)、网络存储设备(NAS)、网络摄像头(webcam)、防火墙(firewall)、VoIP网关/适配器(VoIP adapter)、网关(gateway)、打印机(printer)、无线接入点(WAP)、VPN设备(VPN)、负载均衡器(load balancer)、代理服务器(proxy server)、工控设备(ICS)、媒体设备(media device)、邮件服务器(mail server)。
1)准确率(accuracy)。识别正确的样本数占总样本数的比例,即:
式中,TTP表示正确识别为当前类别的样本数,TTN表示正确识别为非当前类别的样本数,FFP表示错误识别为当前类别的样本数,FFN表示错误识别为非当前类别的样本数。
2)精确率(precision)。识别为正类中真正是正类的比例,即:
3)召回率(recall)。在所有实际为正类的样本中,正确识别为正类的样本的比例,反映了方案捕获正类样本的能力,可表示为:
4)F1值(F1-Score)。旨在综合精确率和召回率的表现,提供一个平衡指标,对于二分类情况来说,有:
由于本实验为多分类情况,因此使用了以下2种均值计算方式:
5)宏平均值(macro-average)。为每个类别单独计算精确率、召回率及F1值,然后计算这些值的平均值。宏平均值不考虑每个类别的样本数量。具体表示为:
式中,n表示类别的数量,Xi表示第i个类别的精确率、召回率及F1值。
6)加权平均值(weighted-average)。在宏平均值的基础上,每个类别的精确率、召回率及F1值会根据该类别的样本数量进行加权。具体表示为:
式中,TTP,iFFN,i分别表示第i个类别的真正例和假负例的数量,Yi表示第i个类别的精确率、召回率或F1值。
使用LingoVision系统针对数据集所有条目进行节点设备类别识别测试,该系统集合了本方案中的功能模块,是后续消融实验的基线方案。根据实验结果绘制的混淆矩阵归一化热力图(如图5所示)展示了对每类设备分类的准确率以及误分类的情况;相关分类分数柱状图(如图6所示)展示了每类设备分类的精确率、召回率和F1值。
由实验的结果数据可以看出,不同设备类型的识别精度存在差异,就混淆矩阵热力图所展现的单个设备分类准确率而言,针对路由器、网络存储设备、VoIP网关和媒体设备的分类表现最为优异,准确率超过了90%。这表明这些设备本身的协议特征在分类过程中较为明显,LingoVision系统能够较好地捕捉特征中蕴含的设备类型信息;针对网络摄像头、VPN设备、代理服务器和邮件服务器的分类表现也较好,识别准确率均高于80%。这些设备的特征信息较为独特或集中,能够有效帮助模型进行推断。然而,针对防火墙、打印机、负载均衡器和工控设备的分类表现较为一般,准确率在60%~80%,这是因为这些设备在某些特征上与其他设备存在一定重叠,导致分类模糊。从人工识别的角度来看,也难以根据设备的端口和协议特征对其进行精细划分。同时,相互之间强相关的设备类型虽然导致误分类,但这样的分类也已经完成了粒度较粗的类型识别。
针对图6,相关分数的均值结果见表2所列。
在均值分数报告中可以看到LingoVision系统的整体表现,宏平均值下的精确率为81%,召回率为76%,F1值为0.74,这表明模型在各类别上总体表现较好,但个别设备类型的表现拉低了整体的召回率和F1值。加权平均值下的精确率为84%,召回率为75%,F1值为0.76,加权平均值比宏平均值稍高,说明模型在处理设备类别时,主要类别的分类效果较好,而一些少数类别表现较差。
总体来看,LingoVision系统在测试数据集上的表现良好,大部分设备类型能够准确识别。对于少部分表现不佳的设备类型,其分类表现除了受限于设备类型的特征不够明显,也因为所使用的数据集分类标签之间不完全正交,具有强关联关系的设备间特征存在较大重叠。尽管如此,LingoVision系统还是能够将这些设备分类到最相关的类型当中,实现粗粒度的正确分类。
为了进一步评估方案中各模块策略对LingoVision系统的贡献,本文采用消融实验进行分析。通过去除RAG技术和精心设计的提示词这2部分,深入分析方案各部分对分类效果的影响。其中,未采用RAG技术的简化系统命名为LingoVision-1;不对提示词进行精心设计,只包含图4中的②、④和⑤部分,简化系统命名为LingoVision-2。随后,分别对所有条目进行节点设备类别识别测试,绘制的混淆矩阵归一化热力图如图7所示,相关分类分数柱状图如图8所示。
从实验结果来看,去除RAG技术之后,系统受影响较大。在混淆矩阵热力图中,防火墙、打印机和VoIP网关的分类表现下降最为明显,准确率均下降至原来的1/2左右,表明这几类设备在没有检索背景知识时,系统难以充分利用现有的设备特征进行准确分类,RAG在其中起到了关键作用。
在不使用精心设计的提示词的条件下,少部分设备类型分类准确率有轻微下降,如VPN设备和无线接入点等。大部分设备类型的分类准确率和各分类分数相比基线实验来说较为稳定。但同时也出现因缺少提示词设计,出现代理服务器这一类型的分类准确率下降至0的极端情况,并且全部被误分类到路由器类。这表明精心设计的提示词确实起到了标签间边界划分的作用,能够对个别设备的识别起到关键作用。
为验证本方案对于真实的未识别节点的设备类型识别效果,本文搜集了部分网络空间搜索引擎无法识别其设备类型的真实节点,这些节点经Nmap在线检测仍无法确定具体的设备类型。因此,使用LingoVision系统针对这些未知设备进行识别。
此外,实网探测的节点设备类型分布与数据集不同,主要体现在基准真相的标签数量更多更丰富。而当LLM由于领域知识不足或缺乏事实验证能力时,实网探测更容易导致大模型幻觉的产生,即在生成内容时,产生与事实不符、虚构或具有误导性信息的现象。尽管这些回答可能流畅且有逻辑性,但内容上存在错误或与输入指令不一致。为了避免模型面对超出提示词预设范围的节点设备类型时出现幻觉的情况,除了以RAG手段作为缓解外,本文在LingoVision的实网探测版本提示词中,增加了unknown和others设备类型选项,允许大模型在推理节点设备类型遭遇歧义或不明确时能够作出更客观的回答,进一步避免幻觉的产生。对未知节点的识别结果见表3所列。
对于这些识别出来的设备,由于当前没有对应的基准真相作为标签,本文通过人工的方式逐一对其端口和协议特征进行核验,均能从响应横幅、TLS证书或网页标题等文本特征中确认LingoVision识别结果的合理性,具体核验结果的示例及使用到的关键特征见表4所列。
从识别结果中可以看到,LingoVision系统基于真实存在的互联网节点特征信息,能够有效识别现有流行指纹库和网络空间搜索引擎无法识别的设备。识别出来的设备类型包括9类,各类型涉及的具体设备型号(版本)共计31种,并能识别出其中大部分设备类型的所属厂商和具体型号。经过网络空间搜索引擎中的数据比对,在互联网中与这些具体型号(版本)设备具有相同特征的IP节点数量超过10万个。
针对网络空间测绘中存在大量节点设备类型无法识别的问题,本文提出了一种基于LLM和RAG技术的节点设备识别方案,充分考虑了各种互联网设备在应用层协议中暴露的端口和特征信息,构建了一个能够部署在探测节点的未知设备识别原型系统LingoVision。通过系列实验验证了该系统方案的有效性。本文所提出的未知节点设备类型识别方法为网络空间测绘下的新设备发现提供了新的思路,且该方法具备较高的扩展性和准确性,可为后续的网络资产管理和安全态势感知提供技术支撑。
参考文献 引证文献
排序方式:
[1]
XU Q,ZHENG R,SAAD W,et al.Device fingerprinting in wireless networks: challenges and opportunities[J]. IEEE Communications Surveys & Tutorials, 2016, 18(1): 94-104.
[2]
WAN S F, LI Q, WANG H, et al. DevTag: a benchmark for fingerprinting IoT devices[J]. IEEE Internet of Things Journal, 2023, 10(7): 6388-6399.
[3]
Recog: a recognition framework[EB/OL].[2025-07-24].https://github.com/rapid7/recog/#recog-a%20-recognition-framework.
[4]
LI Q, FENG X, WANG H, et al. Discovery of Internet of Thing devices based on rules[C]//Proceedings of 2018 IEEE Conference on Computer Communications Poster and Demo.[S.l.]: IEEE, 2018: 1-2.
[5]
FENG X, LI Q, WANG H, et al. Acquisitional rule-based engine for discovering Internet-of-Things devices[C]//Proceedings of the 27th USENIX Security Symposium.[S.l.:s.n.], 2018: 327-341.
[6]
JAVED T, HASEEB M, ABDULLAH M, et al. Using application layer banner data to automatically identify IoT devices[J]. ACM SIGCOMM Computer Communication Review, 2020, 50(3): 23-29.
[7]
CHENG H, DONG W Y, ZHENG Y, et al. Identify IoT devices through web interface characteristics[C]//Proceedings of 2021 IEEE International Conference on Computer and Communication Systems.[S.l.]:IEEE, 2021: 405-410.
[8]
WU Y C, LI C L, YANG J H, et al. WebIoT:classifying Internet of Things devices at Internet scale through web characteristics[C]//Proceedings of 2022 IEEE Symposium on Computers and Communications.[S.l.]: IEEE, 2022: 1-7.
[9]
SARABI A, YIN T X, LIU M Y. An LLM-based framework for fingerprinting Internet-connected devices[C]//Proceedings of 2023 ACM on Internet Measurement Conference. New York: ACM, 2023: 478-484.
[10]
UEDA T, SASAKI T, YOSHIOKA K, et al. An Internet-wide view of connected cars: discovery of exposed automotive devices[C]//Proceedings of the 17th International Conference on Availability,Reliability and Security. New York: ACM, 2022: 1-8.
[11]
SASAKI T, FUJITA A, GAÑÁN C H, et al. Exposed infrastructures: discovery, attacks and remediation of insecure ICS remote management devices[C]//Proceedings of 2022 IEEE Symposium on Security and Privacy.[S.l.]: IEEE, 2022: 2379-2396.
[12]
NASR T,TORABI S,BOU-HARBE,et al.ChargePrint:a framework for Internet-scale discovery and security analysis of EV charging management systems[C]//Proceedings of the 30th Annual Network and Distributed System Security Symposium.[S.l.:s.n.], 2023:1-18.
[13]
CHEN C Y, LU Y L, YANG G Z, et al. ZBanner:fast stateless scanning capable of obtaining responses over TCP[C]//Proceedings of 2024 IEEE International Performance,Computing,and Communications Conference.[S.l.]: IEEE, 2024: 1-6.
[14]
LEE S, SHAKIR A, KOENIG D, et al. Open source strikes bread-new fluffy embeddings model[EB/OL]. (2024-03-08)[2025-07-24]. https://www.mixedbread.ai/blog/mxbai-embed-large-v1.
[15]
LI X M, LI J. AoE: angle-optimized embeddings for semantic textual similarity[C]//Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics. Bangkok: Association for Computational Linguistics, 2024: 1825-1839.
2025年第4卷第5期
PDF下载
72
30
引用本文
BibTeX
文章信息
doi: 10.12399/j.issn.2097-163x.2025.05.003
  • 接收时间:2025-07-07
  • 首发时间:2026-04-23
补充材料
相关文章
文章信息
作者
出版历史
  • 收稿日期:2025-07-07
  • 修回日期:2025-07-24
基金
作者信息
    1国防科技大学电子对抗学院,安徽 合肥 230037
    2安徽省网络空间安全态势感知与评估重点实验室,安徽 合肥 230037

通讯作者:

通信作者:陈驰昱,E-mail:
参考文献
分享链接
https://castjournals.cast.org.cn/joweb/xxdkjs/CN/10.12399/j.issn.2097-163x.2025.05.003
分享至
全文二维码

扫描看全文

引用本文
BibTeX
本文的引用情况
2种不同金属材料的力学参数

Family
属数
Number of
genus
种数
Number of
species
占总种数比例
Percentage of
total species (%)

Genus
种数
Number of
species
占总种数比例
Percentage of total
species (%)
鹅膏菌科Amanitaceae 2 11 5.26 鹅膏菌属 Amanita 10 4.78
小菇科 Mycenaceae 2 12 5.74 丝盖伞属 Inocybe 5 2.39
多孔菌科 Polyporaceae 8 14 6.70 蜡蘑属 Laccaria 5 2.39
红菇科 Russulaceae 3 23 11.00 小皮伞属 Marasmius 6 2.87
小菇属 Mycena 11 5.26
光柄菇属 Pluteus 5 2.39
红菇属 Russula 17 8.13
栓菌属 Trametes 5 2.39
关闭全屏