Article(id=1242140593107841508, tenantId=1146029695717560320, journalId=1146031591421210625, issueId=1242140568093012308, articleNumber=null, orderNo=14, doi=10.3981/j.issn.1000-7857.2023.15.012, pmid=null, cstr=null, oa=null, hot=null, price=null, onlineType=0, articleFormat=0, articleType=null, articleTypeStr=null, receivedDate=1675612800000, receivedDateStr=2023-02-06, revisedDate=1678291200000, revisedDateStr=2023-03-09, acceptedDate=null, acceptedDateStr=null, onlineDate=1693382156943, onlineDateStr=2023-08-30, pubDate=1691856000000, pubDateStr=2023-08-13, doiRegisterDate=null, doiRegisterDateStr=null, onlineIssueDate=1693382156943, onlineIssueDateStr=2023-08-30, onlineJustAcceptDate=null, onlineJustAcceptDateStr=null, onlineFirstDate=null, onlineFirstDateStr=null, sourceXml=null, magXml=null, createTime=1774078995249, creator=sys-migrate, updateTime=1774078995249, updator=sys-migrate, issue=Issue{id=1242140568093012308, tenantId=1146029695717560320, journalId=1146031591421210625, year='2023', volume='41', issue='15', pageStart='1', pageEnd='132', issueExtLink='null', onlineDate='null', pubDate='1691856000000', pubDateStr='2023-08-13', beforeIssueId=null, nextIssueId=null, price=null, status=1, issueComplete=1, articleOrder=3, issueType=-1, specialIssue=null, createTime=1774078989285, creator='sys-migrate', updateTime=1774078989285, updator='sys-migrate', preIssue=null, nextIssue=null, articleTotal=null, ext=null, issueFiles=null, downloadFileDto=null}, startPage=113, endPage=123, ext={EN=ArticleExt(id=1242140596895294277, articleId=1242140593107841508, tenantId=1146029695717560320, journalId=1146031591421210625, language=EN, title=An entity mapping technology of national grid public data model integrating BERT and congestion filtering, columnId=1150494644690366681, journalTitle=Science & Technology Review, columnName=Papers, runingTitle=null, highlight=null, articleAbstract=Aiming at the problems of current SG-CIM (state grid-common information model) such as difficult to achieve automatic update iteration and low efficient mining of new elements, an SG-CIM model automatic mapping technology based on BERT model and blocking filtering is proposed. On the basis of the existing SG-CIM, an SG-CIM knowledge map and data table knowledge graph are constructed at first. Secondly, by studying the entity alignment method based on BERT model and blocking filtering, the mapping relationship between the two knowledge graphs is established. Finally, the effectiveness of the proposed method is verified by experimental analysis of the text mapping effect. Results show that the accuracy of BERT model after finetuning on a self-made data set is more than 95%. This method lays a foundation for subsequent mining of new elements and automatic updating iteration of SG-CIM., authors=LI Yufei1, HAO Baocong1, LOU Yiwei2*, YANG Shiyu1, GAO Shijie3, ZHANG Pengyu1, authorsList=LI Yufei, HAO Baocong, LOU Yiwei, YANG Shiyu, GAO Shijie, ZHANG Pengyu, authorCompany=1. Big Data Center of State Grid Corporation of China, Beijing 100053, China
2. School of Computer Science, Peking University, Beijing 100871, China
3. Beijing Zhongdian Puhua Information Technology Co., Ltd., Beijing 100085, China, correspAuthors=null, authorNote=null, correspAuthorsNote=null, copyrightStatement=null, copyrightOwner=null, extLink=null, articleAbsUrl=null, sourceXml=null, magXml=null, pdfUrl=null, pdf=iFumIYgBJEV0axSAoLC+pw==, pdfFileSize=3798915, pdfExtLink=null, richHtmlUrl=null, mobilePdfUrl=null, reviewReport=null, pdfFirstPage=null, abstractGraph=null, abstractGraphContent=null, abstractVideo=null, citation=null, cebUrl=null, magXmlContent=null, mapNumber=null, fund=null), CN=ArticleExt(id=1242140595662168879, articleId=1242140593107841508, tenantId=1146029695717560320, journalId=1146031591421210625, language=CN, title=融合BERT和阻塞过滤的国家电网公共数据模型实体映射技术, columnId=1242117055294537731, journalTitle=科技导报, columnName=论文, runingTitle=null, highlight=null, articleAbstract=针对目前国家电网公共数据模型SG-CIM(state grid-common information model)难以实现自动更新迭代和挖掘新元素效率较低等问题,提出了一种基于知识图谱和BERT(bidirectional encoder representations from transformers)模型的SG-CIM模型自动映射技术。在现有SG-CIM模型的基础上,构建出SG-CIM知识图谱和数据表知识图谱;通过研究基于BERT模型和阻塞过滤的实体映射技术,在2个知识图谱之间建立映射关系;对文本方法映射效果进行实验分析,结果表明在自制数据集上微调后BERT模型的精确度在88%以上。, authors=李雨霏1,郝保聪1,楼轶维2*,杨诗语1,高士杰3,张鹏宇1, authorsList=李雨霏,郝保聪,楼轶维,杨诗语,高士杰,张鹏宇, authorCompany=1. 国家电网有限公司大数据中心,北京 100053
2. 北京大学计算机学院,北京 100871
3. 北京中电普华信息技术有限公司,北京 100085, correspAuthors=null, authorNote=李雨霏,高级工程师,研究方向为大数据应用技术等,电子信箱:15101537383@126.com, correspAuthorsNote=楼轶维(通信作者),博士研究生,研究方向为大数据应用技术,电子信箱:cyfqylyw@gmail.com, copyrightStatement=null, copyrightOwner=null, extLink=null, articleAbsUrl=null, sourceXml=null, magXml=null, pdfUrl=null, pdf=hQm8ii/Irs352q0rCaqGGA==, pdfFileSize=3798915, pdfExtLink=null, richHtmlUrl=null, mobilePdfUrl=null, reviewReport=null, pdfFirstPage=null, abstractGraph=null, abstractGraphContent=null, abstractVideo=null, citation=null, cebUrl=null, magXmlContent=null, mapNumber=null, fund=国网大数据中心科技项目(SGSJ0000SJJS2200040))}, authors=null, keywords=[Keyword(id=1242140595125297957, tenantId=1146029695717560320, journalId=1146031591421210625, articleId=1242140593107841508, language=CN, orderNo=1, keyword=知识图谱), Keyword(id=1242140595196601127, tenantId=1146029695717560320, journalId=1146031591421210625, articleId=1242140593107841508, language=CN, orderNo=1, keyword=SG-CIM模型), Keyword(id=1242140595293070120, tenantId=1146029695717560320, journalId=1146031591421210625, articleId=1242140593107841508, language=CN, orderNo=1, keyword=BERT模型), Keyword(id=1242140595406316330, tenantId=1146029695717560320, journalId=1146031591421210625, articleId=1242140593107841508, language=CN, orderNo=1, keyword=阻塞过滤), Keyword(id=1242140595473425196, tenantId=1146029695717560320, journalId=1146031591421210625, articleId=1242140593107841508, language=CN, orderNo=1, keyword=实体对齐), Keyword(id=1242140595536339757, tenantId=1146029695717560320, journalId=1146031591421210625, articleId=1242140593107841508, language=CN, orderNo=1, keyword=实体映射), Keyword(id=1242140596421337919, tenantId=1146029695717560320, journalId=1146031591421210625, articleId=1242140593107841508, language=EN, orderNo=1, keyword=Knowledge graph), Keyword(id=1242140596505224001, tenantId=1146029695717560320, journalId=1146031591421210625, articleId=1242140593107841508, language=EN, orderNo=1, keyword=SG-CIM model), Keyword(id=1242140596584915778, tenantId=1146029695717560320, journalId=1146031591421210625, articleId=1242140593107841508, language=EN, orderNo=1, keyword=BERT model), Keyword(id=1242140596668801859, tenantId=1146029695717560320, journalId=1146031591421210625, articleId=1242140593107841508, language=EN, orderNo=1, keyword=entity alignment), Keyword(id=1242140596765270852, tenantId=1146029695717560320, journalId=1146031591421210625, articleId=1242140593107841508, language=EN, orderNo=1, keyword=Establish Mapping)], refs=null, funds=null, companyList=null, figs=null, attaches=null, journal=Journal(id=1125356956822126595, delFlag=0, nameCn=科技导报, nameEn=Science & Technology Review, nameHistory1=null, nameHistory2=null, issn=1000-7857, eissn=, cn=11-1421/N, coden=null, periodic=3, language=CN, oaType=0, ccby=null, superviseOffice=null, ownerOffice=null, pubOffice=null, editorOffice=null, officeType=null, aims=null, clcCode=null, officeProv=null, officeCity=null, officeAddr=null, officeZip=null, officeEmail=null, officePhone=null, editDirector=null, officeDirector=null, officeDirectorPhone=null, officeStaffNum=null, officeEmpNum=null, coverPicUrl=wfghvu3bhh/dKxuZ+ucVHA==, journalPrice=null, startedYear=null, abbrevIsoEn=Sci Technol Rev, journalRemark=null, publicationField=null, createdTime=null, updatedTime=1774230116083, createdBy=null, updatedBy=13041195026, firstLetterCn=S, firstLetterEn=S, subjectCode=Natural Sciences, subjectName=自然科学, subjectCodeEn=Natural Sciences, subjectNameEn=null, picCn=wfghvu3bhh/dKxuZ+ucVHA==, picEn=yjSfclmpNm7ihn9NbTZ69g==, jcr=null, cjcr=null, exts=[JournalExt(id=1242774439910290156, language=CN, name=科技导报, nameHistory1=null, nameHistory2=null, managedBy=中国科学技术协会, sponsoredBy=中国科学技术协会, publishedBy=科技导报社, editorOffice=, officeProv=null, officeCity=null, officeAddr=, officeZip=, editDirector=, officeDirector=null, officePhone=null, coverPicUrl=null, journalRemark=, submitArticleUrl=null, websiteUrl=http://www.kjdb.org/CN/home, createdTime=1774230116107, updatedTime=1774230116107, createdBy=13041195026, updatedBy=13041195026, submissionGuidelinesUrl=http://www.kjdb.org/CN/column/column7.shtml, submissionAuthorUrl=https://kjdbauthor.cast.org.cn/webm, submissionEditorUrl=https://kjdbeditor.cast.org.cn/webm/, submissionReviewUrl=https://kjdbauthor.cast.org.cn/webm, submissionCeEditorUrl=https://kjdbeditor.cast.org.cn/webm/, submissionAeEditorUrl=https://kjdbeditor.cast.org.cn/webm/, option={"copyright":""}), JournalExt(id=1242774439960621805, language=EN, name=Science & Technology Review, nameHistory1=null, nameHistory2=null, managedBy=, sponsoredBy=, publishedBy=, editorOffice=, officeProv=null, officeCity=null, officeAddr=, officeZip=, editDirector=, officeDirector=null, officePhone=null, coverPicUrl=null, journalRemark=, submitArticleUrl=null, websiteUrl=http://www.kjdb.org/EN/home, createdTime=1774230116119, updatedTime=1774230116119, createdBy=13041195026, updatedBy=13041195026, submissionGuidelinesUrl=http://www.kjdb.org/EN/column/column7.shtml, submissionAuthorUrl=https://kjdbauthor.manuscriptcloud.com/login, submissionEditorUrl=https://kjdbeditor.manuscriptcloud.com/login, submissionReviewUrl=https://kjdbauthor.manuscriptcloud.com/login, submissionCeEditorUrl=https://kjdbeditor.manuscriptcloud.com/login, submissionAeEditorUrl=https://kjdbeditor.manuscriptcloud.com/login, option={"copyright":""})], databaseList=null, tenantJournalId=1146031591421210625, websiteList=[Website(id=1146104741081231361, webName=null, webTitle=null, webDomain=null, webCopyrigh=null, webIpcNo=null, seoTitle=null, seoKeywords=null, seoDescription=null, tenantJournalId=null, journalId=1146031591421210625, journalNameCn=null, journalNameEn=null, grayFlag=null, tenantId=1146029695717560320, platformId=null, journalGroupId=null, journalGroupNameCn=null, journalGroupNameEn=null, type=1, domain=https://castjournals.cast.org.cn/joweb/kjdb/CN, language=CN, createTime=1751182263881, createBy=18614031015, updateTime=1751778001962, updateBy=18614031015, name=科技导报, tplId=1146099689490845704, title=科技导报, delFlag=0, indexPage=/home, props=[WebsiteProps(id=1148021146403992296, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1146104741081231361, code=articleTextType, value=kx, createTime=1751639170504, updateTime=1751639170504, creator=18614031015, updator=18614031015), WebsiteProps(id=1148021146378826469, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1146104741081231361, code=banner, value=null, createTime=1751639170498, updateTime=1751639170498, creator=18614031015, updator=18614031015), WebsiteProps(id=1148021146366243556, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1146104741081231361, code=logo, value=https://castjournals.cast.org.cn/joweb/kjdb/CN/file/pic?fileId=9GHSf7eGlIPH0Tv/OOdstA==, createTime=1751639170495, updateTime=1751639170495, creator=18614031015, updator=18614031015), WebsiteProps(id=1148021146395603687, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1146104741081231361, code=picServerUrl, value=https://castjournals.cast.org.cn/joweb/kjdb/CN/file/pic, createTime=1751639170502, updateTime=1751639170502, creator=18614031015, updator=18614031015), WebsiteProps(id=1148021146387215078, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1146104741081231361, code=staticResourcePath, value=https://castjournals.cast.org.cn/joweb/cast_kjdb_cn_619/, createTime=1751639170500, updateTime=1751639170500, creator=18614031015, updator=18614031015)]), Website(id=1146105254833139715, webName=null, webTitle=null, webDomain=null, webCopyrigh=null, webIpcNo=null, seoTitle=null, seoKeywords=null, seoDescription=null, tenantJournalId=null, journalId=1146031591421210625, journalNameCn=null, journalNameEn=null, grayFlag=null, tenantId=1146029695717560320, platformId=null, journalGroupId=null, journalGroupNameCn=null, journalGroupNameEn=null, type=1, domain=https://castjournals.cast.org.cn/joweb/kjdb/EN, language=EN, createTime=1751182386363, createBy=18614031015, updateTime=1753500121937, updateBy=18614031015, name=科技导报, tplId=1146101810881728533, title=Science & Technology Review, delFlag=0, indexPage=/home, props=[WebsiteProps(id=1155838567709528217, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1146105254833139715, code=articleTextType, value=kx, createTime=1753502988984, updateTime=1753502988984, creator=18614031015, updator=18614031015), WebsiteProps(id=1155838567692750998, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1146105254833139715, code=banner, value=null, createTime=1753502988980, updateTime=1753502988980, creator=18614031015, updator=18614031015), WebsiteProps(id=1155838567688556693, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1146105254833139715, code=logo, value=https://castjournals.cast.org.cn/joweb/kjdb/EN/file/pic?fileId=9GHSf7eGlIPH0Tv/OOdstA==, createTime=1753502988979, updateTime=1753502988979, creator=18614031015, updator=18614031015), WebsiteProps(id=1155838567705333912, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1146105254833139715, code=picServerUrl, value=https://castjournals.cast.org.cn/joweb/kjdb/EN/file/pic, createTime=1753502988983, updateTime=1753502988983, creator=18614031015, updator=18614031015), WebsiteProps(id=1155838567701139607, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1146105254833139715, code=staticResourcePath, value=https://castjournals.cast.org.cn/joweb/cast_kjdb_en_623/, createTime=1753502988982, updateTime=1753502988982, creator=18614031015, updator=18614031015)])], journalTitle=科技导报, weixinUrl=null, journalUrl=null, iacademicId=null, status=1, seqNo=null, journalTitleEn=Science & Technology Review, journalPhotoCn=wfghvu3bhh/dKxuZ+ucVHA==, journalPhotoEn=yjSfclmpNm7ihn9NbTZ69g==, journalFirstLetter=S, journalRecommend=null, journalNew=null, journalCollection=1, jcrJf=null, cjcrJf=0.91, jcrJfStr=null, cjcrJfStr=null, submissionFirstDecision=null, sciSubjectClassification=null, casSubjectClassification=null, citeScore=null, totalCitationFrequency=null, icpCode=null, psCode=null, advertisingLicenseCode=null, copyrightInformation=null, country=null, option=, provinceCode=null, provinceName=null, collectFlag=false, interPubPlatform=null, interPubPlatformUrl=null), detailUrlCn=https://castjournals.cast.org.cn/joweb/kjdb/CN/10.3981/j.issn.1000-7857.2023.15.012, detailUrlEn=https://castjournals.cast.org.cn/joweb/kjdb/EN/10.3981/j.issn.1000-7857.2023.15.012, pdfUrlCn=https://castjournals.cast.org.cn/joweb/kjdb/CN/PDF/10.3981/j.issn.1000-7857.2023.15.012, pdfUrlEn=https://castjournals.cast.org.cn/joweb/kjdb/EN/PDF/10.3981/j.issn.1000-7857.2023.15.012, aliStartDate=null, aliEndDate=null, collectionFlag=false, citedCount=null, citedUrl=null, previewStatus=0, delFlag=0, hasFullText=0, orderTime=1691856000000, fullTextJson=null, articleText=null, reference=null)
收藏切换
融合BERT和阻塞过滤的国家电网公共数据模型实体映射技术
收藏切换
PDF下载
科技导报 | 论文 2023,41(15): 113-123
收起
收藏切换
科技导报 | 论文 2023, 41(15): 113-123
融合BERT和阻塞过滤的国家电网公共数据模型实体映射技术
全屏
李雨霏1,郝保聪1,楼轶维2*,杨诗语1,高士杰3,张鹏宇1
作者信息
    1. 国家电网有限公司大数据中心,北京 100053
    2. 北京大学计算机学院,北京 100871
    3. 北京中电普华信息技术有限公司,北京 100085

通讯作者:

楼轶维(通信作者),博士研究生,研究方向为大数据应用技术,电子信箱:cyfqylyw@gmail.com
An entity mapping technology of national grid public data model integrating BERT and congestion filtering
Affiliations
出版时间: 2023-08-13 doi: 10.3981/j.issn.1000-7857.2023.15.012
文章导航
收藏切换
针对目前国家电网公共数据模型SG-CIM(state grid-common information model)难以实现自动更新迭代和挖掘新元素效率较低等问题,提出了一种基于知识图谱和BERT(bidirectional encoder representations from transformers)模型的SG-CIM模型自动映射技术。在现有SG-CIM模型的基础上,构建出SG-CIM知识图谱和数据表知识图谱;通过研究基于BERT模型和阻塞过滤的实体映射技术,在2个知识图谱之间建立映射关系;对文本方法映射效果进行实验分析,结果表明在自制数据集上微调后BERT模型的精确度在88%以上。
知识图谱  /  SG-CIM模型  /  BERT模型  /  阻塞过滤  /  实体对齐  /  实体映射
Aiming at the problems of current SG-CIM (state grid-common information model) such as difficult to achieve automatic update iteration and low efficient mining of new elements, an SG-CIM model automatic mapping technology based on BERT model and blocking filtering is proposed. On the basis of the existing SG-CIM, an SG-CIM knowledge map and data table knowledge graph are constructed at first. Secondly, by studying the entity alignment method based on BERT model and blocking filtering, the mapping relationship between the two knowledge graphs is established. Finally, the effectiveness of the proposed method is verified by experimental analysis of the text mapping effect. Results show that the accuracy of BERT model after finetuning on a self-made data set is more than 95%. This method lays a foundation for subsequent mining of new elements and automatic updating iteration of SG-CIM.
Knowledge graph  /  SG-CIM model  /  BERT model  /  entity alignment  /  Establish Mapping
李雨霏,郝保聪,楼轶维,杨诗语,高士杰,张鹏宇. 融合BERT和阻塞过滤的国家电网公共数据模型实体映射技术. 科技导报, 2023 , 41 (15) : 113 -123 . DOI: 10.3981/j.issn.1000-7857.2023.15.012
LI Yufei, HAO Baocong, LOU Yiwei, YANG Shiyu, GAO Shijie, ZHANG Pengyu. An entity mapping technology of national grid public data model integrating BERT and congestion filtering[J]. Science & Technology Review, 2023 , 41 (15) : 113 -123 . DOI: 10.3981/j.issn.1000-7857.2023.15.012
2023年第41卷第15期
PDF下载
1910
1595
引用本文
BibTeX
文章信息
doi: 10.3981/j.issn.1000-7857.2023.15.012
  • 接收时间:2023-02-06
  • 首发时间:2023-08-30
  • 出版时间:2023-08-13
补充材料
相关文章
文章信息
作者
出版历史
  • 收稿日期:2023-02-06
  • 修回日期:2023-03-09
基金
作者信息

通讯作者:

楼轶维(通信作者),博士研究生,研究方向为大数据应用技术,电子信箱:cyfqylyw@gmail.com
参考文献
分享链接
https://castjournals.cast.org.cn/joweb/kjdb/CN/10.3981/j.issn.1000-7857.2023.15.012
分享至
全文二维码

扫描看全文

引用本文
BibTeX
本文的引用情况
2种不同金属材料的力学参数

Family
属数
Number of
genus
种数
Number of
species
占总种数比例
Percentage of
total species (%)

Genus
种数
Number of
species
占总种数比例
Percentage of total
species (%)
鹅膏菌科Amanitaceae 2 11 5.26 鹅膏菌属 Amanita 10 4.78
小菇科 Mycenaceae 2 12 5.74 丝盖伞属 Inocybe 5 2.39
多孔菌科 Polyporaceae 8 14 6.70 蜡蘑属 Laccaria 5 2.39
红菇科 Russulaceae 3 23 11.00 小皮伞属 Marasmius 6 2.87
小菇属 Mycena 11 5.26
光柄菇属 Pluteus 5 2.39
红菇属 Russula 17 8.13
栓菌属 Trametes 5 2.39
关闭全屏