收藏切换
Named Entity Recognition in Beiyang Government Documents Resources Using Large Language Models
收藏切换
PDF
Jun Deng*, Zishu Zhang, Yubing Pan, Dongyu Ye, Yanyu Chang
Journal of Modern Information | 2026, 46(3) : 44 - 55
Less
收藏切换
Journal of Modern Information | 2026, 46(3): 44-55
DATA INTELLIGENCE and KNOWLEDGE SERVICE
Named Entity Recognition in Beiyang Government Documents Resources Using Large Language Models
Full
Jun Deng*, Zishu Zhang, Yubing Pan, Dongyu Ye, Yanyu Chang
Affiliations
  • 1School of Business and Management,Jilin University,Changchun130012,China
Published: 2026-03-01 doi: 10.3969/j.issn.1008-0821.2026.03.004
Outline
收藏切换
Purpose/Significance

Addressing the challenges in named entity recognition(NER) for Beiyang Government Document Resources due to linguistic complexity, diversity, and lack of annotation data, this paper proposes a large language model-based NER framework adapted for low-resource scenarios. This framework provides methodological su⁃pport for structured mining and knowledge reorganization of modern historical documents. [Methods/

Process

This framework integrated retrieval-enhanced generation with efficient parameter fine-tuning. It used Faiss vector retrieval to build a dynamic context example selection method and used the LoRA strategy to add domain knowledge to large language models.On a custom corpus, the study designed seven special entity types,including persons, places, organizations, time, positions, events,and document types. The study then compared two deep learning entity recognition methods, BERT-BiLSTM-CRF and RoBERTa-BiLSTM-CRF,with Baichuan-4B, DcepSeck-R1, Xunzi-Qwen3-8B, Qwen3-4B,Llama, and GPT-4. The study evaluated large language models performance under different sampling methods.

Result/Conclusion

Experiments demonstrate that compared to traditional deep learning models and general-purpose large language mo⁃dels, the synergistic paradigm integrating LoRA fine-tuning with RAG significantly enhances entity recognition performance, achieving an overall F1 score of 0.857. A framework that uses RAG with large, fine-tuned language models for named entity recognition in Beiyang Government Document Resources works well together, and it achieves accurate entity identification in these historical records. This shows that large language models are practical and can be scaled when processing historical documents with limited resources.

beiyang government document resources  /  large language model  /  named entity recognition  /  low-resource scenarios  /  retrieval augmented generation  /  LoRA fine-tuning
Jun Deng, Zishu Zhang, Yubing Pan, Dongyu Ye, Yanyu Chang. Named Entity Recognition in Beiyang Government Documents Resources Using Large Language Models[J]. Journal of Modern Information, 2026 , 46 (3) : 44 -55 . DOI: 10.3969/j.issn.1008-0821.2026.03.004
Year 2026 volume 46 Issue 3
PDF
22
9
Cite this Article
BibTeX
Article Info
doi: 10.3969/j.issn.1008-0821.2026.03.004
  • Receive Date:2026-01-09
  • Online Date:2026-06-05
  • Published:2026-03-01
Article Data
Affiliations
History
  • Received:2026-01-09
Affiliations
    1School of Business and Management,Jilin University,Changchun130012,China
References
Share
https://castjournals.cast.org.cn/joweb/xdqb/EN/10.3969/j.issn.1008-0821.2026.03.004
Share to
QR

Scan QR to access full text

Cite this article
BibTeX
Citations
表12种不同金属材料的力学参数

Family
属数
Number of
genus
种数
Number of
species
占总种数比例
Percentage of
total species (%)

Genus
种数
Number of
species
占总种数比例
Percentage of total
species (%)
鹅膏菌科Amanitaceae 2 11 5.26 鹅膏菌属 Amanita 10 4.78
小菇科 Mycenaceae 2 12 5.74 丝盖伞属 Inocybe 5 2.39
多孔菌科 Polyporaceae 8 14 6.70 蜡蘑属 Laccaria 5 2.39
红菇科 Russulaceae 3 23 11.00 小皮伞属 Marasmius 6 2.87
小菇属 Mycena 11 5.26
光柄菇属 Pluteus 5 2.39
红菇属 Russula 17 8.13
栓菌属 Trametes 5 2.39
关闭全屏
  • BibTeX
  • EndNote
  • RefWorks
  • TxT