In large-scale cyberspace mapping,rapidly and accurately detecting node information and identifying the operational status of devices is one of the core research contents. Currently,the version iteration speed of cyberspace devices is accelerating,and a large number of new-type devices are constantly emerging. How to track and identify the device type of the measured node has become a new challenge that needs to be solved urgently. Aiming at the problem that current research relies too much on existing knowledge and cannot adapt to device upgrade changes,a node device type identification method based on large language model(LLM)and retrieval-augmented generation(RAG)technology was proposed. First,relevant data were collected from RFC documents and Internet device manufacturer websites,and a knowledge vector database was constructed based on the embedding model.Then,the detected node feature information was encoded,and relevant background knowledge was retrieved from the vector database. The retrieved knowledge and node feature information were jointly constructed into prompts for the LLM. The reasoning ability of the LLM was used to identify the device type of the probed node. Finally,the effectiveness and performance of the method were verified through ablation experiments and real-network tests.
| 科 Family | 属数 Number of genus | 种数 Number of species | 占总种数比例 Percentage of total species (%) | 属 Genus | 种数 Number of species | 占总种数比例 Percentage of total species (%) |
|---|---|---|---|---|---|---|
| 鹅膏菌科Amanitaceae | 2 | 11 | 5.26 | 鹅膏菌属 Amanita | 10 | 4.78 |
| 小菇科 Mycenaceae | 2 | 12 | 5.74 | 丝盖伞属 Inocybe | 5 | 2.39 |
| 多孔菌科 Polyporaceae | 8 | 14 | 6.70 | 蜡蘑属 Laccaria | 5 | 2.39 |
| 红菇科 Russulaceae | 3 | 23 | 11.00 | 小皮伞属 Marasmius | 6 | 2.87 |
| 小菇属 Mycena | 11 | 5.26 | ||||
| 光柄菇属 Pluteus | 5 | 2.39 | ||||
| 红菇属 Russula | 17 | 8.13 | ||||
| 栓菌属 Trametes | 5 | 2.39 |