Article(id=1242140399033201162, tenantId=1146029695717560320, journalId=1146031591421210625, issueId=1242140382134350274, articleNumber=null, orderNo=11, doi=10.3981/j.issn.1000-7857.2023.07.009, pmid=null, cstr=null, oa=null, hot=null, price=null, onlineType=0, articleFormat=0, articleType=null, articleTypeStr=null, receivedDate=1659456000000, receivedDateStr=2022-08-03, revisedDate=1667404800000, revisedDateStr=2022-11-03, acceptedDate=null, acceptedDateStr=null, onlineDate=1682587642177, onlineDateStr=2023-04-27, pubDate=1681315200000, pubDateStr=2023-04-13, doiRegisterDate=null, doiRegisterDateStr=null, onlineIssueDate=1682587642177, onlineIssueDateStr=2023-04-27, onlineJustAcceptDate=null, onlineJustAcceptDateStr=null, onlineFirstDate=null, onlineFirstDateStr=null, sourceXml=null, magXml=null, createTime=1774078948978, creator=sys-migrate, updateTime=1774078948978, updator=sys-migrate, issue=Issue{id=1242140382134350274, tenantId=1146029695717560320, journalId=1146031591421210625, year='2023', volume='41', issue='7', pageStart='1', pageEnd='127', issueExtLink='null', onlineDate='null', pubDate='1681315200000', pubDateStr='2023-04-13', beforeIssueId=null, nextIssueId=null, price=null, status=1, issueComplete=1, articleOrder=3, issueType=-1, specialIssue=null, createTime=1774078944950, creator='sys-migrate', updateTime=1774078944950, updator='sys-migrate', preIssue=null, nextIssue=null, articleTotal=null, ext=null, issueFiles=null, downloadFileDto=null}, startPage=79, endPage=97, ext={EN=ArticleExt(id=1242140405282709571, articleId=1242140399033201162, tenantId=1146029695717560320, journalId=1146031591421210625, language=EN, title=Principle and methodology of AlphaGo family algorithms, columnId=1242116810380743325, journalTitle=Science & Technology Review, columnName=Reviews, runingTitle=null, highlight=null, articleAbstract=AlphaGo family algorithms are important milestones in the history of artificial intelligence. These algorithms not only solve the typical complete information game problem such as Go but also are applicable to a wider range of problems. According to their development, this paper summarizes the fundamental principle and technical characteristics for the series of algorithms from AlphaGo Fan to MuZero, elaborating how the AlphaGo family algorithms work. The key technologies employed, including the Monte Carlo tree search, modeling and training of deep neural networks, are surveyed and compared. The AlphaGo family algorithms are of significant instructive value for addressing various problems in practice, from algorithm design, neural network modeling to model utilization. This paper helps to quickly understand the principle of these algorithms and is expected to provide useful reference for the further research and development of algorithms., authors=ZHANG Sheng1, LONG Qiang2*, KONG Yinan3, WANG Yu2, authorsList=ZHANG Sheng, LONG Qiang, KONG Yinan, WANG Yu, authorCompany=1. Aerospace Technology Institute, China Aerodynamic Research and Development Center, Mianyang 621000, China
2. School of Mathematics and Physics, Southwest University of Science and Technology, Mianyang 621000, China
3. Computational Aerodynamics Institute, China Aerodynamic Research and Development Center, Mianyang 621000, China, correspAuthors=null, authorNote=null, correspAuthorsNote=null, copyrightStatement=null, copyrightOwner=null, extLink=null, articleAbsUrl=null, sourceXml=null, magXml=null, pdfUrl=null, pdf=5INZFhwGMIrJt51OLt0Khg==, pdfFileSize=4620804, pdfExtLink=null, richHtmlUrl=null, mobilePdfUrl=null, reviewReport=null, pdfFirstPage=null, abstractGraph=null, abstractGraphContent=null, abstractVideo=null, citation=null, cebUrl=null, magXmlContent=null, mapNumber=null, fund=null), CN=ArticleExt(id=1242140403701456951, articleId=1242140399033201162, tenantId=1146029695717560320, journalId=1146031591421210625, language=CN, title=围棋人工智能AlphaGo系列算法的原理与方法, columnId=1242134377686180284, journalTitle=科技导报, columnName=综述, runingTitle=null, highlight=null, articleAbstract=围棋人工智能AlphaGo系列算法是人工智能发展历史中的重要里程碑事件。它们不仅成功地求解了以围棋为代表的完全信息博弈问题,而且具有更加广泛的适用性。依算法的发展历程,从基本原理与技术特征方面对 AlphaGo Fan 到 MuZero 的一系列算法进行了梳理,说明了AlphaGo系列算法的落子原理,阐释与对比了其中采用的关键技术:蒙特卡洛树搜索和深度神经网络的建模及训练。AlphaGo系列算法对解决实践中的其他重要问题,从算法设计、神经网络建模到模型利用等方面都具有重要的参考意义,本文的总结有助于快速地掌握这些算法的基本原理,从而为相关算法的研究与拓展提供有益参考。, authors=章胜1,龙强2*,孔轶男3,王宇2, authorsList=章胜,龙强,孔轶男,王宇, authorCompany=1. 中国空气动力研究与发展中心空天技术研究所,绵阳 621000
2. 西南科技大学数理学院,绵阳 621000
3. 中国空气动力研究与发展中心计算空气动力研究所,绵阳 621000, correspAuthors=null, authorNote=章胜,副研究员,研究方向为系统优化与人工智能,电子信箱:zszhangshengzs1@outlook.com, correspAuthorsNote=龙强(通信作者),副教授,研究方向为人工智能,电子信箱:Longqiang@163.com, copyrightStatement=null, copyrightOwner=null, extLink=null, articleAbsUrl=null, sourceXml=null, magXml=null, pdfUrl=null, pdf=M21sApEAuzy4h7rcVTW0kQ==, pdfFileSize=4620804, pdfExtLink=null, richHtmlUrl=null, mobilePdfUrl=null, reviewReport=null, pdfFirstPage=null, abstractGraph=null, abstractGraphContent=null, abstractVideo=null, citation=null, cebUrl=null, magXmlContent=null, mapNumber=null, fund=国家自然科学基金项目(11902332))}, authors=[Author(id=1277396067499029104, tenantId=1146029695717560320, journalId=null, articleId=1242140399033201162, orderNo=null, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=null, email=null, emailSecond=null, emailThird=null, correspondingAuthor=null, authorType=null, ext={}, companyList=null)], keywords=[Keyword(id=1242140403152003120, tenantId=1146029695717560320, journalId=1146031591421210625, articleId=1242140399033201162, language=CN, orderNo=1, keyword=人工智能), Keyword(id=1242140403235889202, tenantId=1146029695717560320, journalId=1146031591421210625, articleId=1242140399033201162, language=CN, orderNo=1, keyword=AlphaGo系列算法), Keyword(id=1242140403332358195, tenantId=1146029695717560320, journalId=1146031591421210625, articleId=1242140399033201162, language=CN, orderNo=1, keyword=蒙特卡洛树搜索), Keyword(id=1242140403424632884, tenantId=1146029695717560320, journalId=1146031591421210625, articleId=1242140399033201162, language=CN, orderNo=1, keyword=深度神经网络), Keyword(id=1242140403508518965, tenantId=1146029695717560320, journalId=1146031591421210625, articleId=1242140399033201162, language=CN, orderNo=1, keyword=强化学习), Keyword(id=1242140404791975998, tenantId=1146029695717560320, journalId=1146031591421210625, articleId=1242140399033201162, language=EN, orderNo=1, keyword=artificial intelligence), Keyword(id=1242140404875862079, tenantId=1146029695717560320, journalId=1146031591421210625, articleId=1242140399033201162, language=EN, orderNo=1, keyword=AlphaGo family algorithms), Keyword(id=1242140404972331072, tenantId=1146029695717560320, journalId=1146031591421210625, articleId=1242140399033201162, language=EN, orderNo=1, keyword=Monte Carlo tree search), Keyword(id=1242140405052022849, tenantId=1146029695717560320, journalId=1146031591421210625, articleId=1242140399033201162, language=EN, orderNo=1, keyword=deep neural network), Keyword(id=1242140405135908930, tenantId=1146029695717560320, journalId=1146031591421210625, articleId=1242140399033201162, language=EN, orderNo=1, keyword=reinforcement learning)], refs=null, funds=null, companyList=null, figs=null, attaches=null, journal=Journal(id=1125356956822126595, delFlag=0, nameCn=科技导报, nameEn=Science & Technology Review, nameHistory1=null, nameHistory2=null, issn=1000-7857, eissn=, cn=11-1421/N, coden=null, periodic=3, language=CN, oaType=0, ccby=null, superviseOffice=null, ownerOffice=null, pubOffice=null, editorOffice=null, officeType=null, aims=null, clcCode=null, officeProv=null, officeCity=null, officeAddr=null, officeZip=null, officeEmail=null, officePhone=null, editDirector=null, officeDirector=null, officeDirectorPhone=null, officeStaffNum=null, officeEmpNum=null, coverPicUrl=wfghvu3bhh/dKxuZ+ucVHA==, journalPrice=null, startedYear=null, abbrevIsoEn=Sci Technol Rev, journalRemark=null, publicationField=null, createdTime=null, updatedTime=1774230116083, createdBy=null, updatedBy=13041195026, firstLetterCn=S, firstLetterEn=S, subjectCode=Natural Sciences, subjectName=自然科学, subjectCodeEn=Natural Sciences, subjectNameEn=null, picCn=wfghvu3bhh/dKxuZ+ucVHA==, picEn=yjSfclmpNm7ihn9NbTZ69g==, jcr=null, cjcr=null, exts=[JournalExt(id=1242774439910290156, language=CN, name=科技导报, nameHistory1=null, nameHistory2=null, managedBy=中国科学技术协会, sponsoredBy=中国科学技术协会, publishedBy=科技导报社, editorOffice=, officeProv=null, officeCity=null, officeAddr=, officeZip=, editDirector=, officeDirector=null, officePhone=null, coverPicUrl=null, journalRemark=, submitArticleUrl=null, websiteUrl=http://www.kjdb.org/CN/home, createdTime=1774230116107, updatedTime=1774230116107, createdBy=13041195026, updatedBy=13041195026, submissionGuidelinesUrl=http://www.kjdb.org/CN/column/column7.shtml, submissionAuthorUrl=https://kjdbauthor.cast.org.cn/webm, submissionEditorUrl=https://kjdbeditor.cast.org.cn/webm/, submissionReviewUrl=https://kjdbauthor.cast.org.cn/webm, submissionCeEditorUrl=https://kjdbeditor.cast.org.cn/webm/, submissionAeEditorUrl=https://kjdbeditor.cast.org.cn/webm/, option={"copyright":""}), JournalExt(id=1242774439960621805, language=EN, name=Science & Technology Review, nameHistory1=null, nameHistory2=null, managedBy=, sponsoredBy=, publishedBy=, editorOffice=, officeProv=null, officeCity=null, officeAddr=, officeZip=, editDirector=, officeDirector=null, officePhone=null, coverPicUrl=null, journalRemark=, submitArticleUrl=null, websiteUrl=http://www.kjdb.org/EN/home, createdTime=1774230116119, updatedTime=1774230116119, createdBy=13041195026, updatedBy=13041195026, submissionGuidelinesUrl=http://www.kjdb.org/EN/column/column7.shtml, submissionAuthorUrl=https://kjdbauthor.manuscriptcloud.com/login, submissionEditorUrl=https://kjdbeditor.manuscriptcloud.com/login, submissionReviewUrl=https://kjdbauthor.manuscriptcloud.com/login, submissionCeEditorUrl=https://kjdbeditor.manuscriptcloud.com/login, submissionAeEditorUrl=https://kjdbeditor.manuscriptcloud.com/login, option={"copyright":""})], databaseList=null, tenantJournalId=1146031591421210625, websiteList=[Website(id=1146104741081231361, webName=null, webTitle=null, webDomain=null, webCopyrigh=null, webIpcNo=null, seoTitle=null, seoKeywords=null, seoDescription=null, tenantJournalId=null, journalId=1146031591421210625, journalNameCn=null, journalNameEn=null, grayFlag=null, tenantId=1146029695717560320, platformId=null, journalGroupId=null, journalGroupNameCn=null, journalGroupNameEn=null, type=1, domain=https://castjournals.cast.org.cn/joweb/kjdb/CN, language=CN, createTime=1751182263881, createBy=18614031015, updateTime=1751778001962, updateBy=18614031015, name=科技导报, tplId=1146099689490845704, title=科技导报, delFlag=0, indexPage=/home, props=[WebsiteProps(id=1148021146403992296, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1146104741081231361, code=articleTextType, value=kx, createTime=1751639170504, updateTime=1751639170504, creator=18614031015, updator=18614031015), WebsiteProps(id=1148021146378826469, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1146104741081231361, code=banner, value=null, createTime=1751639170498, updateTime=1751639170498, creator=18614031015, updator=18614031015), WebsiteProps(id=1148021146366243556, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1146104741081231361, code=logo, value=https://castjournals.cast.org.cn/joweb/kjdb/CN/file/pic?fileId=9GHSf7eGlIPH0Tv/OOdstA==, createTime=1751639170495, updateTime=1751639170495, creator=18614031015, updator=18614031015), WebsiteProps(id=1148021146395603687, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1146104741081231361, code=picServerUrl, value=https://castjournals.cast.org.cn/joweb/kjdb/CN/file/pic, createTime=1751639170502, updateTime=1751639170502, creator=18614031015, updator=18614031015), WebsiteProps(id=1148021146387215078, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1146104741081231361, code=staticResourcePath, value=https://castjournals.cast.org.cn/joweb/cast_kjdb_cn_619/, createTime=1751639170500, updateTime=1751639170500, creator=18614031015, updator=18614031015)]), Website(id=1146105254833139715, webName=null, webTitle=null, webDomain=null, webCopyrigh=null, webIpcNo=null, seoTitle=null, seoKeywords=null, seoDescription=null, tenantJournalId=null, journalId=1146031591421210625, journalNameCn=null, journalNameEn=null, grayFlag=null, tenantId=1146029695717560320, platformId=null, journalGroupId=null, journalGroupNameCn=null, journalGroupNameEn=null, type=1, domain=https://castjournals.cast.org.cn/joweb/kjdb/EN, language=EN, createTime=1751182386363, createBy=18614031015, updateTime=1753500121937, updateBy=18614031015, name=科技导报, tplId=1146101810881728533, title=Science & Technology Review, delFlag=0, indexPage=/home, props=[WebsiteProps(id=1155838567709528217, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1146105254833139715, code=articleTextType, value=kx, createTime=1753502988984, updateTime=1753502988984, creator=18614031015, updator=18614031015), WebsiteProps(id=1155838567692750998, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1146105254833139715, code=banner, value=null, createTime=1753502988980, updateTime=1753502988980, creator=18614031015, updator=18614031015), WebsiteProps(id=1155838567688556693, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1146105254833139715, code=logo, value=https://castjournals.cast.org.cn/joweb/kjdb/EN/file/pic?fileId=9GHSf7eGlIPH0Tv/OOdstA==, createTime=1753502988979, updateTime=1753502988979, creator=18614031015, updator=18614031015), WebsiteProps(id=1155838567705333912, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1146105254833139715, code=picServerUrl, value=https://castjournals.cast.org.cn/joweb/kjdb/EN/file/pic, createTime=1753502988983, updateTime=1753502988983, creator=18614031015, updator=18614031015), WebsiteProps(id=1155838567701139607, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1146105254833139715, code=staticResourcePath, value=https://castjournals.cast.org.cn/joweb/cast_kjdb_en_623/, createTime=1753502988982, updateTime=1753502988982, creator=18614031015, updator=18614031015)])], journalTitle=科技导报, weixinUrl=null, journalUrl=null, iacademicId=null, status=1, seqNo=null, journalTitleEn=Science & Technology Review, journalPhotoCn=wfghvu3bhh/dKxuZ+ucVHA==, journalPhotoEn=yjSfclmpNm7ihn9NbTZ69g==, journalFirstLetter=S, journalRecommend=null, journalNew=null, journalCollection=1, jcrJf=null, cjcrJf=0.91, jcrJfStr=null, cjcrJfStr=null, submissionFirstDecision=null, sciSubjectClassification=null, casSubjectClassification=null, citeScore=null, totalCitationFrequency=null, icpCode=null, psCode=null, advertisingLicenseCode=null, copyrightInformation=null, country=null, option=, provinceCode=null, provinceName=null, collectFlag=false, interPubPlatform=null, interPubPlatformUrl=null), detailUrlCn=https://castjournals.cast.org.cn/joweb/kjdb/CN/10.3981/j.issn.1000-7857.2023.07.009, detailUrlEn=https://castjournals.cast.org.cn/joweb/kjdb/EN/10.3981/j.issn.1000-7857.2023.07.009, pdfUrlCn=https://castjournals.cast.org.cn/joweb/kjdb/CN/PDF/10.3981/j.issn.1000-7857.2023.07.009, pdfUrlEn=https://castjournals.cast.org.cn/joweb/kjdb/EN/PDF/10.3981/j.issn.1000-7857.2023.07.009, aliStartDate=null, aliEndDate=null, collectionFlag=false, citedCount=null, citedUrl=null, previewStatus=0, delFlag=0, hasFullText=0, orderTime=1681315200000, fullTextJson=null, articleText=null, reference=null)
收藏切换
围棋人工智能AlphaGo系列算法的原理与方法
收藏切换
PDF下载
科技导报 | 综述 2023,41(7): 79-97
收起
收藏切换
科技导报 | 综述 2023, 41(7): 79-97
围棋人工智能AlphaGo系列算法的原理与方法
全屏
作者信息

通讯作者:

龙强(通信作者),副教授,研究方向为人工智能,电子信箱:Longqiang@163.com
Principle and methodology of AlphaGo family algorithms
Affiliations
出版时间: 2023-04-13 doi: 10.3981/j.issn.1000-7857.2023.07.009
文章导航
收藏切换
围棋人工智能AlphaGo系列算法是人工智能发展历史中的重要里程碑事件。它们不仅成功地求解了以围棋为代表的完全信息博弈问题,而且具有更加广泛的适用性。依算法的发展历程,从基本原理与技术特征方面对 AlphaGo Fan 到 MuZero 的一系列算法进行了梳理,说明了AlphaGo系列算法的落子原理,阐释与对比了其中采用的关键技术:蒙特卡洛树搜索和深度神经网络的建模及训练。AlphaGo系列算法对解决实践中的其他重要问题,从算法设计、神经网络建模到模型利用等方面都具有重要的参考意义,本文的总结有助于快速地掌握这些算法的基本原理,从而为相关算法的研究与拓展提供有益参考。
人工智能  /  AlphaGo系列算法  /  蒙特卡洛树搜索  /  深度神经网络  /  强化学习
AlphaGo family algorithms are important milestones in the history of artificial intelligence. These algorithms not only solve the typical complete information game problem such as Go but also are applicable to a wider range of problems. According to their development, this paper summarizes the fundamental principle and technical characteristics for the series of algorithms from AlphaGo Fan to MuZero, elaborating how the AlphaGo family algorithms work. The key technologies employed, including the Monte Carlo tree search, modeling and training of deep neural networks, are surveyed and compared. The AlphaGo family algorithms are of significant instructive value for addressing various problems in practice, from algorithm design, neural network modeling to model utilization. This paper helps to quickly understand the principle of these algorithms and is expected to provide useful reference for the further research and development of algorithms.
artificial intelligence  /  AlphaGo family algorithms  /  Monte Carlo tree search  /  deep neural network  /  reinforcement learning
章胜,龙强,孔轶男,王宇. 围棋人工智能AlphaGo系列算法的原理与方法. 科技导报, 2023 , 41 (7) : 79 -97 . DOI: 10.3981/j.issn.1000-7857.2023.07.009
ZHANG Sheng, LONG Qiang, KONG Yinan, WANG Yu. Principle and methodology of AlphaGo family algorithms[J]. Science & Technology Review, 2023 , 41 (7) : 79 -97 . DOI: 10.3981/j.issn.1000-7857.2023.07.009
2023年第41卷第7期
PDF下载
1107
268
引用本文
BibTeX
文章信息
doi: 10.3981/j.issn.1000-7857.2023.07.009
  • 接收时间:2022-08-03
  • 首发时间:2023-04-27
  • 出版时间:2023-04-13
补充材料
相关文章
文章信息
作者
出版历史
  • 收稿日期:2022-08-03
  • 修回日期:2022-11-03
基金
作者信息

通讯作者:

龙强(通信作者),副教授,研究方向为人工智能,电子信箱:Longqiang@163.com
参考文献
分享链接
https://castjournals.cast.org.cn/joweb/kjdb/CN/10.3981/j.issn.1000-7857.2023.07.009
分享至
全文二维码

扫描看全文

引用本文
BibTeX
本文的引用情况
2种不同金属材料的力学参数

Family
属数
Number of
genus
种数
Number of
species
占总种数比例
Percentage of
total species (%)

Genus
种数
Number of
species
占总种数比例
Percentage of total
species (%)
鹅膏菌科Amanitaceae 2 11 5.26 鹅膏菌属 Amanita 10 4.78
小菇科 Mycenaceae 2 12 5.74 丝盖伞属 Inocybe 5 2.39
多孔菌科 Polyporaceae 8 14 6.70 蜡蘑属 Laccaria 5 2.39
红菇科 Russulaceae 3 23 11.00 小皮伞属 Marasmius 6 2.87
小菇属 Mycena 11 5.26
光柄菇属 Pluteus 5 2.39
红菇属 Russula 17 8.13
栓菌属 Trametes 5 2.39
关闭全屏