Article(id=1241779356830339198, tenantId=1146029695717560320, journalId=1205117023404326918, issueId=1241779355555266850, articleNumber=null, orderNo=null, doi=10.16155/j.0254-1793.2024.05.15, pmid=null, cstr=null, oa=null, hot=null, price=null, onlineType=0, articleFormat=0, articleType=null, articleTypeStr=null, receivedDate=1689696000000, receivedDateStr=2023-07-19, revisedDate=null, revisedDateStr=null, acceptedDate=null, acceptedDateStr=null, onlineDate=1773992869813, onlineDateStr=2026-03-20, pubDate=1717084800000, pubDateStr=2024-05-31, doiRegisterDate=null, doiRegisterDateStr=null, onlineIssueDate=1773992869813, onlineIssueDateStr=2026-03-20, onlineJustAcceptDate=null, onlineJustAcceptDateStr=null, onlineFirstDate=null, onlineFirstDateStr=null, sourceXml=null, magXml=null, createTime=1773992869813, creator=13701087609, updateTime=1773992869813, updator=13701087609, issue=Issue{id=1241779355555266850, tenantId=1146029695717560320, journalId=1205117023404326918, year='2024', volume='44', issue='5', pageStart='737', pageEnd='920', issueExtLink='null', onlineDate='null', pubDate='null', beforeIssueId=null, nextIssueId=null, price=null, status=1, issueComplete=1, articleOrder=1, issueType=-1, specialIssue=null, createTime=1773992869509, creator=13701087609, updateTime=1773992925624, updator=13701087609, preIssue=null, nextIssue=null, ext={EN=IssueExt(id=1241779590985749489, tenantId=1146029695717560320, journalId=1205117023404326918, issueId=1241779355555266850, language=EN, specialIssueTitle=, coverIllustrator=null, specialIssueEditor=, specialIssueAbout=), CN=IssueExt(id=1241779590985749490, tenantId=1146029695717560320, journalId=1205117023404326918, issueId=1241779355555266850, language=CN, specialIssueTitle=, coverIllustrator=null, specialIssueEditor=, specialIssueAbout=)}, issueFiles=null}, startPage=866, endPage=873, ext={EN=ArticleExt(id=1241779358302539903, articleId=1241779356830339198, tenantId=1146029695717560320, journalId=1205117023404326918, language=EN, title=Research on the application of machine learning related techniques in the classification of Astragali Radix characterized by flavonoids, columnId=1239148841803501731, journalTitle=Chinese Journal of Pharmaceutical Analysis, columnName=Quality Control, runingTitle=null, highlight=null, articleAbstract=
Objective:

To establish a three classification model for cultivated,semi-wild,and wild Astragali Radix characterized by flavonoids,and explore and evaluate the application of techniques of automated machine learning and data augmentation in the field of drug analysis.

Methods:

Firstly,correlation analysis and principal component analysis were conducted on the flavonoid content data of Astragali Radix,and models of decision tree and logistic regression were established to analyze the importance of flavonoid components based on the models. Then,using the AutoGluon framework with 5 as num_bag_folds,2 sets of 30 models respectively through 64 batches of real data and 600 batches of virtual data generated based on real data with the TVAE table data generation algorithm for training were obtained,and these models were evaluated by accuracy.

Results:

The analysis of machine learning models,indicated that formononetin,campanulin and onospin played the important roles in the quality control of Astragali Radix,especially for the source grade control. The accuracy of model prediction showed that the models based on Neural Net and tree-model always had the best classification effect for Astragali Radix. The virtual data generated by data augmentation technique is basically consistent with the actual data in terms of the accuracy trend of the model training process.

Conclusion:

Related techniques of machine learning have good application value in the classification of Astragali Radix characterized by flavonoids.

, correspAuthors=Feng WEI, authorNote=null, correspAuthorsNote=null, copyrightStatement=null, copyrightOwner=null, extLink=null, articleAbsUrl=null, sourceXml=null, magXml=null, pdfUrl=null, pdf=null, pdfFileSize=null, pdfExtLink=null, richHtmlUrl=null, mobilePdfUrl=null, reviewReport=null, pdfFirstPage=null, abstractGraph=null, abstractGraphContent=null, abstractVideo=null, citation=null, cebUrl=null, magXmlContent=null, mapNumber=null, authorCompany=null, fund=null, authors=null, authorsList=Yan SHI, Ning LI, Feng WEI, Shuang-cheng MA), CN=ArticleExt(id=1241779362933051546, articleId=1241779356830339198, tenantId=1146029695717560320, journalId=1205117023404326918, language=CN, title=机器学习相关技术在以黄酮为特征的黄芪分类中的应用研究, columnId=1239148842025799861, journalTitle=药物分析杂志, columnName=质量分析, runingTitle=null, highlight=null, articleAbstract=
目的:

建立以黄酮类成分为特征的栽培黄芪、半野生黄芪和野生黄芪的三分类模型,并且对自动机器学习技术和数据增强技术在药物分析领域中的应用进行探索和评价。

方法:

首先,对黄芪的黄酮类成分含量数据进行相关性分析、主成分分析,建立决策树和逻辑回归模型,根据模型分析黄酮类成分的重要性程度;然后,使用TVAE表格数据生成算法,根据真实数据生成600批虚拟数据,使用自动学习框架AutoGluon,num_bag_folds设为5,分别对64批真实数据和600批虚拟数据进行学习,得到2组共30个模型,依据准确率进行评估。

结果:

对机器学习模型的分析可知,芒柄花素、毛蕊异黄酮葡萄糖苷和刺芒柄花苷这3种黄酮类成分对于黄芪质量,尤其是来源等级的控制具有重要意义;2组共30个模型预测准确率表明,基于NeuralNet的模型和基于树模型的机器学习算法对于黄酮成分数据表征的黄芪而言分类效果最好;数据增强技术生成的虚拟数据与真实数据在所训练得到的模型准确率趋势方面基本一致。

结论:

机器学习相关技术在以黄酮为特征的黄芪分类中具有较好的应用价值。

, correspAuthors=魏锋, authorNote=null, correspAuthorsNote=
* Tel:(010)53852020;E-mail:
, copyrightStatement=null, copyrightOwner=null, extLink=null, articleAbsUrl=null, sourceXml=4ZbPw7l8eWW2IIpVtKh+sQ==, magXml=wrMnqf5R07PRe11l2nU2Xw==, pdfUrl=null, pdf=QlM3WYObs2SQXSDMYZahzw==, pdfFileSize=3506771, pdfExtLink=null, richHtmlUrl=null, mobilePdfUrl=null, reviewReport=null, pdfFirstPage=null, abstractGraph=9Jeit5X0kRMOGRHV71kBbw==, abstractGraphContent=null, abstractVideo=null, citation=null, cebUrl=null, magXmlContent=sCsEGcjfPdyhVUSc2HTp1g==, mapNumber=null, authorCompany=null, fund=null, authors=

石岩 Tel:(010)53852081;E-mail:

李宁 Tel:13811671528;E-mail:

, authorsList=石岩, 李宁, 魏锋, 马双成)}, authors=[Author(id=1241779363482505395, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, orderNo=0, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=shiyan@nifdc.org.cn, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1241779363595751609, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, authorId=1241779363482505395, language=EN, stringName=Yan SHI, firstName=Yan, middleName=null, lastName=SHI, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, address=1.National Institutes for Food and Drug Control, Beijing 102629, China, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1241779363713192126, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, authorId=1241779363482505395, language=CN, stringName=石岩, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, address=1.中国食品药品检定研究院,北京 102629, bio={"content":"

石岩 Tel:(010)53852081;E-mail:

"}, bioImg=null, bioContent=

石岩 Tel:(010)53852081;E-mail:

, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1241779363209875616, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, xref=1., ext=[AuthorCompanyExt(id=1241779363218264226, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, companyId=1241779363209875616, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1.National Institutes for Food and Drug Control, Beijing 102629, China), AuthorCompanyExt(id=1241779363226652836, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, companyId=1241779363209875616, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1.中国食品药品检定研究院,北京 102629)])]), Author(id=1241779363826438341, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, orderNo=1, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=642781540@qq.com, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1241779364061319377, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, authorId=1241779363826438341, language=EN, stringName=Ning LI, firstName=Ning, middleName=null, lastName=LI, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=2, address=2.Beijing Institute for Drug Control, Beijing 102206, China, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1241779364170371291, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, authorId=1241779363826438341, language=CN, stringName=李宁, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=2, address=2.北京市药品检验研究院,北京 102206, bio={"content":"

李宁 Tel:13811671528;E-mail:

"}, bioImg=null, bioContent=

李宁 Tel:13811671528;E-mail:

, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1241779363339899051, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, xref=2., ext=[AuthorCompanyExt(id=1241779363348287660, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, companyId=1241779363339899051, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=2.Beijing Institute for Drug Control, Beijing 102206, China), AuthorCompanyExt(id=1241779363360870573, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, companyId=1241779363339899051, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=2.北京市药品检验研究院,北京 102206)])]), Author(id=1241779364266840292, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, orderNo=2, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=weifeng@nifdc.org.cn, emailSecond=null, emailThird=null, correspondingAuthor=1, authorType=1, ext={EN=AuthorExt(id=1241779364405252335, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, authorId=1241779364266840292, language=EN, stringName=Feng WEI, firstName=Feng, middleName=null, lastName=WEI, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, *, address=1.National Institutes for Food and Drug Control, Beijing 102629, China, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1241779364539470069, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, authorId=1241779364266840292, language=CN, stringName=魏锋, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, *, address=1.中国食品药品检定研究院,北京 102629, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1241779363209875616, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, xref=1., ext=[AuthorCompanyExt(id=1241779363218264226, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, companyId=1241779363209875616, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1.National Institutes for Food and Drug Control, Beijing 102629, China), AuthorCompanyExt(id=1241779363226652836, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, companyId=1241779363209875616, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1.中国食品药品检定研究院,北京 102629)])]), Author(id=1241779364640133374, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, orderNo=3, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=null, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1241779364770156805, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, authorId=1241779364640133374, language=EN, stringName=Shuang-cheng MA, firstName=Shuang-cheng, middleName=null, lastName=MA, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, address=1.National Institutes for Food and Drug Control, Beijing 102629, China, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1241779364879208716, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, authorId=1241779364640133374, language=CN, stringName=马双成, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, address=1.中国食品药品检定研究院,北京 102629, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1241779363209875616, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, xref=1., ext=[AuthorCompanyExt(id=1241779363218264226, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, companyId=1241779363209875616, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1.National Institutes for Food and Drug Control, Beijing 102629, China), AuthorCompanyExt(id=1241779363226652836, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, companyId=1241779363209875616, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1.中国食品药品检定研究院,北京 102629)])])], keywords=[Keyword(id=1241779365009232149, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, language=EN, orderNo=1, keyword=Astragali Radix), Keyword(id=1241779365122478368, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, language=EN, orderNo=2, keyword=flavonoids), Keyword(id=1241779365256696107, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, language=EN, orderNo=3, keyword=campanulin), Keyword(id=1241779365399302452, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, language=EN, orderNo=4, keyword=onospin), Keyword(id=1241779365483188541, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, language=EN, orderNo=5, keyword=calycosin), Keyword(id=1241779365609017672, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, language=EN, orderNo=6, keyword=kaempferol), Keyword(id=1241779367097995603, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, language=EN, orderNo=7, keyword=isorhamnetin), Keyword(id=1241779367190270298, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, language=EN, orderNo=8, keyword=formononetin), Keyword(id=1241779367366431078, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, language=EN, orderNo=9, keyword=machine learning), Keyword(id=1241779367525814639, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, language=EN, orderNo=10, keyword=artificial intelligence), Keyword(id=1241779367622283636, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, language=EN, orderNo=11, keyword=data augmentation), Keyword(id=1241779367806833022, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, language=CN, orderNo=1, keyword=黄芪), Keyword(id=1241779367949439366, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, language=CN, orderNo=2, keyword=黄酮), Keyword(id=1241779368033325454, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, language=CN, orderNo=3, keyword=毛蕊异黄酮葡萄糖苷), Keyword(id=1241779368222069146, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, language=CN, orderNo=4, keyword=刺芒柄花苷), Keyword(id=1241779368352092577, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, language=CN, orderNo=5, keyword=毛蕊异黄酮), Keyword(id=1241779368469533096, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, language=CN, orderNo=6, keyword=山柰酚), Keyword(id=1241779368603750833, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, language=CN, orderNo=7, keyword=异鼠李素), Keyword(id=1241779368721191353, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, language=CN, orderNo=8, keyword=芒柄花素), Keyword(id=1241779368813466047, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, language=CN, orderNo=9, keyword=机器学习), Keyword(id=1241779368939295178, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, language=CN, orderNo=10, keyword=人工智能), Keyword(id=1241779369065124314, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, language=CN, orderNo=11, keyword=数据增强)], refs=[Reference(id=1241779373032936060, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, doi=null, pmid=null, pmcid=null, year=2020, volume=null, issue=null, pageStart=315, pageEnd=null, url=null, language=null, rfNumber=[1], rfOrder=0, authorNames=null, journalName=null, refType=null, unstructuredReference=中华人民共和国药典2020年版.一部 [S]. 2020:315, articleTitle=null, refAbstract=null), Reference(id=1241779373167153796, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, doi=null, pmid=null, pmcid=null, year=2020, volume=null, issue=null, pageStart=315, pageEnd=null, url=null, language=null, rfNumber=[1], rfOrder=1, authorNames=null, journalName=null, refType=null, unstructuredReference=ChP 2020. Vol Ⅰ [S]. 2020:315, articleTitle=null, refAbstract=null), Reference(id=1241779373301371535, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, doi=null, pmid=null, pmcid=null, year=2014, volume=52, issue=3, pageStart=226, pageEnd=null, url=null, language=null, rfNumber=[2], rfOrder=2, authorNames=DU HW, ZHAO XL, ZHANG AH, journalName=J Chromatogr Sci, refType=null, unstructuredReference=DU HWZHAO XLZHANG AHet al. Rapid separation,identification and analysis of Astragalus membranaceus Fisch. using liquid chromatography-tandem mass spectrometry[J]. J Chromatogr Sci201452(3):226, articleTitle=Rapid separation,identification and analysis of Astragalus membranaceus Fisch. using liquid chromatography-tandem mass spectrometry, refAbstract=null), Reference(id=1241779373477532315, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, doi=null, pmid=null, pmcid=null, year=2015, volume=20, issue=9, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[3], rfOrder=3, authorNames=LI K, GAO F, WANG G, journalName=Molecules, refType=null, unstructuredReference=LI KGAO FWANG Get al. Identification of cultured and natural Astragalus root based on monosaccharide mapping[J]. Molecules201520(9):16466, articleTitle=Identification of cultured and natural Astragalus root based on monosaccharide mapping, refAbstract=null), Reference(id=1241779373607555748, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, doi=null, pmid=null, pmcid=null, year=2017, volume=1070, issue=null, pageStart=76, pageEnd=null, url=null, language=null, rfNumber=[4], rfOrder=4, authorNames=LEE SM, JEONG JS, KWON HJ, journalName=J Chromatogr B, refType=null, unstructuredReference=LEE SMJEONG JSKWON HJet al. Quantification of isoflavonoids and triterpene saponins in Astragali Radix,the root of Astragalus membranaceus,via reverse-phase high-performance liquid chromatography coupled with integrated pulsed amperometric detection[J]. J Chromatogr B20171070:76, articleTitle=Quantification of isoflavonoids and triterpene saponins in Astragali Radix,the root of Astragalus membranaceus,via reverse-phase high-performance liquid chromatography coupled with integrated pulsed amperometric detection, refAbstract=null), Reference(id=1241779373724996270, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, doi=null, pmid=null, pmcid=null, year=2022, volume=42, issue=7, pageStart=1120, pageEnd=null, url=null, language=null, rfNumber=[5], rfOrder=5, authorNames=石岩, 贾天颖, 李向日, journalName=药物分析杂志, refType=null, unstructuredReference=石岩,贾天颖,李向日,.黄芪中多种黄酮类成分的测定研究[J].药物分析杂志202242(7):1120, articleTitle=黄芪中多种黄酮类成分的测定研究, refAbstract=null), Reference(id=1241779373863408312, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, doi=null, pmid=null, pmcid=null, year=2022, volume=42, issue=7, pageStart=1120, pageEnd=null, url=null, language=null, rfNumber=[5], rfOrder=6, authorNames=SHI Y, JIA TY, LI XR, journalName=Chin J Pharm Anal, refType=null, unstructuredReference=SHI YJIA TYLI XRet al. Quantification of flavonoid compounds in Astragali Radix[J]. Chin J Pharm Anal202242(7):1120, articleTitle=Quantification of flavonoid compounds in Astragali Radix, refAbstract=null), Reference(id=1241779374014403261, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, doi=null, pmid=null, pmcid=null, year=2020, volume=40, issue=4, pageStart=722, pageEnd=null, url=null, language=null, rfNumber=[6], rfOrder=7, authorNames=张丽, 钱大玮, 卜凡淑, journalName=药物分析杂志, refType=null, unstructuredReference=张丽,钱大玮,卜凡淑,.基于UPLC-MS的黄芪药材质量评价研究[J].药物分析杂志202040(4):722, articleTitle=基于UPLC-MS的黄芪药材质量评价研究, refAbstract=null), Reference(id=1241779374173786822, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, doi=null, pmid=null, pmcid=null, year=2020, volume=40, issue=4, pageStart=722, pageEnd=null, url=null, language=null, rfNumber=[6], rfOrder=8, authorNames=ZHANG L, QIAN DW, BU FS, journalName=Chin J Pharm Anal, refType=null, unstructuredReference=ZHANG LQIAN DWBU FSet al. Study on quality evaluation of Astragali Radix base on UPLC-MS [J]. Chin J Pharm Anal202040(4):722, articleTitle=Study on quality evaluation of Astragali Radix base on UPLC-MS, refAbstract=null), Reference(id=1241779374303810252, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, doi=null, pmid=null, pmcid=null, year=2020, volume=45, issue=13, pageStart=3183, pageEnd=null, url=null, language=null, rfNumber=[7], rfOrder=9, authorNames=赵晨光, 李存玉, 杨珊, journalName=中国中药杂志, refType=null, unstructuredReference=赵晨光,李存玉,杨珊,.基于道地产区蒙古黄芪的质量差异性分析[J].中国中药杂志202045(13):3183, articleTitle=基于道地产区蒙古黄芪的质量差异性分析, refAbstract=null), Reference(id=1241779374429639376, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, doi=null, pmid=null, pmcid=null, year=2020, volume=45, issue=13, pageStart=3183, pageEnd=null, url=null, language=null, rfNumber=[7], rfOrder=10, authorNames=ZHAO CG, LI CY, YANG S, journalName=China J Chin Mater Med, refType=null, unstructuredReference=ZHAO CGLI CYYANG Set al. Analysis of qualitu difference based on Astragalus membranaceus var. mongholicus in genuine region [J]. China J Chin Mater Med202045(13):3183, articleTitle=Analysis of qualitu difference based on Astragalus membranaceus var. mongholicus in genuine region, refAbstract=null), Reference(id=1241779374530302680, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, doi=null, pmid=null, pmcid=null, year=2020, volume=37, issue=5, pageStart=620, pageEnd=null, url=null, language=null, rfNumber=[8], rfOrder=11, authorNames=裴文菡, 何凡, 程青松, journalName=中国现代应用药学, refType=null, unstructuredReference=裴文菡,何凡,程青松,.中药黄芪质量评价方法的研究进展[J].中国现代应用药学202037(5):620, articleTitle=中药黄芪质量评价方法的研究进展, refAbstract=null), Reference(id=1241779376057029338, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, doi=null, pmid=null, pmcid=null, year=2020, volume=37, issue=5, pageStart=620, pageEnd=null, url=null, language=null, rfNumber=[8], rfOrder=12, authorNames=PEI WH, HE F, CHENG QS, journalName=Chin J Mod Appl Pharm, refType=null, unstructuredReference=PEI WHHE FCHENG QSet al. Research progress on the quality evaluation methods of traditional Chinese medicine Astragali Radix[J]. Chin J Mod Appl Pharm202037(5):620, articleTitle=Research progress on the quality evaluation methods of traditional Chinese medicine Astragali Radix, refAbstract=null), Reference(id=1241779376161886947, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, doi=null, pmid=null, pmcid=null, year=2019, volume=null, issue=null, pageStart=91, pageEnd=null, url=null, language=null, rfNumber=[9], rfOrder=13, authorNames=李航, journalName=统计学习方法, refType=null, unstructuredReference=李航.统计学习方法[M].第2版.北京:清华大学出版社,2019:91, articleTitle=null, refAbstract=null), Reference(id=1241779376275133163, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, doi=null, pmid=null, pmcid=null, year=2019, volume=null, issue=null, pageStart=91, pageEnd=null, url=null, language=null, rfNumber=[9], rfOrder=14, authorNames=LI H, journalName=Statistical Learning Methods, refType=null, unstructuredReference=LI H. Statistical Learning Methods[M]. 2nd Ed. Beijing:Tsinghua University Press,2019:91, articleTitle=null, refAbstract=null), Reference(id=1241779376405156596, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, doi=null, pmid=null, pmcid=null, year=2009, volume=null, issue=null, pageStart=119, pageEnd=null, url=null, language=null, rfNumber=[10], rfOrder=15, authorNames=HASTIE T, TIBSHIRANI R, FRIEDMAN J, journalName=The Elements of Statistical Learning:Data Mining,Inference,and Prediction, refType=null, unstructuredReference=HASTIE TTIBSHIRANI RFRIEDMAN J. The Elements of Statistical Learning:Data Mining,Inference,and Prediction[M]. 2nd Ed. New York:Springer,2009:119, articleTitle=null, refAbstract=null), Reference(id=1241779376526791419, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, doi=null, pmid=null, pmcid=null, year=2017, volume=null, issue=null, pageStart=134, pageEnd=null, url=null, language=null, rfNumber=[11], rfOrder=16, authorNames=GÉRON A, journalName=Hands-on Machine Learning with Scikit-learn,Keras & Tensorflow:Concepts,Tools,and Techniques to Build Intelligent Systems, refType=null, unstructuredReference=GÉRON A. Hands-on Machine Learning with Scikit-learn,Keras & Tensorflow:Concepts,Tools,and Techniques to Build Intelligent Systems [M]. Sebastopol:O’REILLY,2017:134, articleTitle=null, refAbstract=null), Reference(id=1241779376644231937, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, doi=null, pmid=null, pmcid=null, year=2020, volume=null, issue=28, pageStart=7, pageEnd=null, url=null, language=null, rfNumber=[12], rfOrder=17, authorNames=习近平, journalName=中华人民共和国国务院公报, refType=null, unstructuredReference=习近平.在教育文化卫生体育领域专家代表座谈会上的讲话[J].中华人民共和国国务院公报2020 (28):7, articleTitle=在教育文化卫生体育领域专家代表座谈会上的讲话, refAbstract=null), Reference(id=1241779376744895239, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, doi=null, pmid=null, pmcid=null, year=2020, volume=null, issue=28, pageStart=7, pageEnd=null, url=null, language=null, rfNumber=[12], rfOrder=18, authorNames=XI JP, journalName=Gazette of the State Council of the People’s Republic of China, refType=null, unstructuredReference=XI JP. Speech at the symposium for representative of expert in the fields of education,culture,health,and sports[J]. Gazette of the State Council of the People’s Republic of China2020(28):7, articleTitle=Speech at the symposium for representative of expert in the fields of education,culture,health,and sports, refAbstract=null), Reference(id=1241779376828781324, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, doi=null, pmid=null, pmcid=null, year=2022, volume=63, issue=17, pageStart=1601, pageEnd=null, url=null, language=null, rfNumber=[13], rfOrder=19, authorNames=黄璐琦, journalName=中医杂志, refType=null, unstructuredReference=黄璐琦.对中医药发展规律及特点的传承与创新认识[J].中医杂志202263(17):1601, articleTitle=对中医药发展规律及特点的传承与创新认识, refAbstract=null), Reference(id=1241779376916861714, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, doi=null, pmid=null, pmcid=null, year=2022, volume=63, issue=17, pageStart=1601, pageEnd=null, url=null, language=null, rfNumber=[13], rfOrder=20, authorNames=HUANG LQ, journalName=J Tradit Chin Med, refType=null, unstructuredReference=HUANG LQ. Inheritance and innovation of the development rules and characteristics of traditional Chinese medicine[J]. J Tradit Chin Med202263(17):1601, articleTitle=Inheritance and innovation of the development rules and characteristics of traditional Chinese medicine, refAbstract=null)], funds=null, companyList=[AuthorCompany(id=1241779363209875616, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, xref=1., ext=[AuthorCompanyExt(id=1241779363218264226, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, companyId=1241779363209875616, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1.National Institutes for Food and Drug Control, Beijing 102629, China), AuthorCompanyExt(id=1241779363226652836, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, companyId=1241779363209875616, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1.中国食品药品检定研究院,北京 102629)]), AuthorCompany(id=1241779363339899051, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, xref=2., ext=[AuthorCompanyExt(id=1241779363348287660, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, companyId=1241779363339899051, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=2.Beijing Institute for Drug Control, Beijing 102206, China), AuthorCompanyExt(id=1241779363360870573, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, companyId=1241779363339899051, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=2.北京市药品检验研究院,北京 102206)])], figs=[ArticleFig(id=1241779369232896486, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, language=EN, label=Fig.1, caption=Research flow diagram, figureFileSmall=QPQLF8Rhvb2An+29b9WniA==, figureFileBig=9Jeit5X0kRMOGRHV71kBbw==, tableContent=null), ArticleFig(id=1241779369333559792, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, language=CN, label=图1, caption=研究流程示意图, figureFileSmall=QPQLF8Rhvb2An+29b9WniA==, figureFileBig=9Jeit5X0kRMOGRHV71kBbw==, tableContent=null), ArticleFig(id=1241779369614578181, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, language=EN, label=Fig.2, caption=Pearson correlation analysis heat map, figureFileSmall=LD9xq6rAQAqt/9pgEUBCOA==, figureFileBig=y2qGyYDj2NG1zxuwk4eWIg==, tableContent=null), ArticleFig(id=1241779369769767438, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, language=CN, label=图2, caption=Pearson相关性分析热度图, figureFileSmall=LD9xq6rAQAqt/9pgEUBCOA==, figureFileBig=y2qGyYDj2NG1zxuwk4eWIg==, tableContent=null), ArticleFig(id=1241779370054980123, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, language=EN, label=Fig.3, caption=Score plot of principle component analysis, figureFileSmall=scGEFIhgijAY/vy9dVr6EQ==, figureFileBig=j7h/Q7eppNGNN5BVKL8ZJg==, tableContent=null), ArticleFig(id=1241779371577512482, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, language=CN, label=图3, caption=主成分分析得分分布图, figureFileSmall=scGEFIhgijAY/vy9dVr6EQ==, figureFileBig=j7h/Q7eppNGNN5BVKL8ZJg==, tableContent=null), ArticleFig(id=1241779371699147303, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, language=EN, label=Fig.4, caption=Decision tree diagram, figureFileSmall=03PYhYG9reA30LaVdZ1NAg==, figureFileBig=FdIuejtNnf2SUkEZJZeo5Q==, tableContent=null), ArticleFig(id=1241779371904668207, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, language=CN, label=图4, caption=决策树示意图, figureFileSmall=03PYhYG9reA30LaVdZ1NAg==, figureFileBig=FdIuejtNnf2SUkEZJZeo5Q==, tableContent=null), ArticleFig(id=1241779372005331513, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, language=EN, label=Fig.5, caption=Data box diagram, figureFileSmall=Oy6wz+fVosEViHJjSgwXmQ==, figureFileBig=mthHoUVnAIjlnxyV+2obzw==, tableContent=null), ArticleFig(id=1241779372139549251, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, language=CN, label=图5, caption=数据箱式图, figureFileSmall=Oy6wz+fVosEViHJjSgwXmQ==, figureFileBig=mthHoUVnAIjlnxyV+2obzw==, tableContent=null), ArticleFig(id=1241779372244406861, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, language=EN, label=Tab.1, caption=

Model prediction accuracy

, figureFileSmall=null, figureFileBig=null, tableContent=
序号
(code)
模型
(model)
真实数据建模准确率
(accuracy based on true data)
模拟数据建模准确率
(accuracy based on analog data)
测试集(test set)交叉验证(cross validation)测试集(test set)交叉验证(cross validation)
1NeuralNetTorch_BAG_L10.9680.8440.8750.993
2ExtraTreesEntr_BAG_L10.9670.8910.8910.993
3ExtraTreesGini_BAG_L10.9650.8750.8910.992
4NeuralNetFastAI_BAG_L10.9630.9530.8130.992
5WeightedEnsemble_L20.9630.9530.8910.997
6LightGBM_BAG_L10.9450.8910.8280.992
7CatBoost_BAG_L10.9430.8750.8440.992
8RandomForestGini_BAG_L10.9420.9060.8440.997
9RandomForestEntr_BAG_L10.9380.8910.8440.995
10XGBoost_BAG_L10.9130.8590.8280.988
11LightGBMXT_BAG_L10.9100.8590.8910.997
12LightGBMLarge_BAG_L10.8730.8590.8130.992
13KNeighborsDist_BAG_L10.8450.7810.8280.973
14KNeighborsUnif_BAG_L10.8400.6880.8280.973
), ArticleFig(id=1241779372437344857, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, language=CN, label=表1, caption=

模型预测准确率

, figureFileSmall=null, figureFileBig=null, tableContent=
序号
(code)
模型
(model)
真实数据建模准确率
(accuracy based on true data)
模拟数据建模准确率
(accuracy based on analog data)
测试集(test set)交叉验证(cross validation)测试集(test set)交叉验证(cross validation)
1NeuralNetTorch_BAG_L10.9680.8440.8750.993
2ExtraTreesEntr_BAG_L10.9670.8910.8910.993
3ExtraTreesGini_BAG_L10.9650.8750.8910.992
4NeuralNetFastAI_BAG_L10.9630.9530.8130.992
5WeightedEnsemble_L20.9630.9530.8910.997
6LightGBM_BAG_L10.9450.8910.8280.992
7CatBoost_BAG_L10.9430.8750.8440.992
8RandomForestGini_BAG_L10.9420.9060.8440.997
9RandomForestEntr_BAG_L10.9380.8910.8440.995
10XGBoost_BAG_L10.9130.8590.8280.988
11LightGBMXT_BAG_L10.9100.8590.8910.997
12LightGBMLarge_BAG_L10.8730.8590.8130.992
13KNeighborsDist_BAG_L10.8450.7810.8280.973
14KNeighborsUnif_BAG_L10.8400.6880.8280.973
), ArticleFig(id=1241779372538008158, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, language=EN, label=Tab.2, caption=

Coefficients of logistic regression model

, figureFileSmall=null, figureFileBig=null, tableContent=
类别
(category)
毛蕊异黄酮葡萄糖苷
(campanulin)
刺芒柄花苷
(onospin)
毛蕊异黄酮
(calycosin)
山柰酚
(kaempferol)
异鼠李素
(isorhamnetin)
芒柄花素
(formononetin)
总和
(sum)
BY1.4050.978-1.057-0.4220.240-1.1770.268
YS-0.776-0.8980.2180.233-0.580-0.792-0.532
ZP-0.630-0.0800.8400.1900.3401.9690.263
), ArticleFig(id=1241779372668031591, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, language=CN, label=表2, caption=

逻辑回归模型系数

, figureFileSmall=null, figureFileBig=null, tableContent=
类别
(category)
毛蕊异黄酮葡萄糖苷
(campanulin)
刺芒柄花苷
(onospin)
毛蕊异黄酮
(calycosin)
山柰酚
(kaempferol)
异鼠李素
(isorhamnetin)
芒柄花素
(formononetin)
总和
(sum)
BY1.4050.978-1.057-0.4220.240-1.1770.268
YS-0.776-0.8980.2180.233-0.580-0.792-0.532
ZP-0.630-0.0800.8400.1900.3401.9690.263
), ArticleFig(id=1241779372747723372, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, language=EN, label=Tab.3, caption=

Evaluation for results of data augmentation

, figureFileSmall=null, figureFileBig=null, tableContent=
类别
(category)
相似程度打分(score on similarity)
GaussianCopulaCopulaGANCTGANTVAE
ZP0.820.630.700.80
BY0.690.680.680.74
YS0.720.790.770.82
), ArticleFig(id=1241779372881941105, tenantId=1146029695717560320, journalId=1205117023404326918, articleId=1241779356830339198, language=CN, label=表3, caption=

数据增强结果评价

, figureFileSmall=null, figureFileBig=null, tableContent=
类别
(category)
相似程度打分(score on similarity)
GaussianCopulaCopulaGANCTGANTVAE
ZP0.820.630.700.80
BY0.690.680.680.74
YS0.720.790.770.82
)], attaches=null, journal=Journal(id=1205114436974182403, delFlag=0, nameCn=药物分析杂志, nameEn=Chinese Journal of Pharmaceutical Analysis, nameHistory1=null, nameHistory2=null, issn=0254-1793, eissn=, cn=11-2224/R, coden=null, periodic=月刊, language=CN, oaType=1, ccby=null, superviseOffice=null, ownerOffice=null, pubOffice=null, editorOffice=null, officeType=null, aims=null, clcCode=null, officeProv=null, officeCity=null, officeAddr=null, officeZip=null, officeEmail=, officePhone=, editDirector=null, officeDirector=null, officeDirectorPhone=null, officeStaffNum=null, officeEmpNum=null, coverPicUrl=E+iq5BU+frOtpo6YF6JpQQ==, journalPrice=null, startedYear=null, abbrevIsoEn=Chinese Journal of Pharmaceutical Analysis, journalRemark=null, publicationField=null, createdTime=1765251271721, updatedTime=1765252167692, createdBy=18614031015, updatedBy=13701087609, firstLetterCn=C, firstLetterEn=C, subjectCode=Life Sciences, subjectName=生命医学, subjectCodeEn=Life Sciences, subjectNameEn=null, picCn=E+iq5BU+frOtpo6YF6JpQQ==, picEn=n/3T/+AptjYTGmmMsE/a0Q==, jcr=null, cjcr=null, exts=[JournalExt(id=1205118195070906572, language=CN, name=药物分析杂志, nameHistory1=null, nameHistory2=null, managedBy=, sponsoredBy=, publishedBy=, editorOffice=, officeProv=null, officeCity=null, officeAddr=, officeZip=, editDirector=, officeDirector=null, officePhone=null, coverPicUrl=null, journalRemark=, submitArticleUrl=null, websiteUrl=, createdTime=1765252167718, updatedTime=1765252167718, createdBy=13701087609, updatedBy=13701087609, submissionGuidelinesUrl=, submissionAuthorUrl=https://ywfxauthor.manuscriptcloud.com/login, submissionEditorUrl=https://ywfxeditor.manuscriptcloud.com/login, submissionReviewUrl=https://ywfxauthor.manuscriptcloud.com/login, submissionCeEditorUrl=, submissionAeEditorUrl=, option={"copyright":""}), JournalExt(id=1205118195133821133, language=EN, name=Chinese Journal of Pharmaceutical Analysis, nameHistory1=null, nameHistory2=null, managedBy=, sponsoredBy=, publishedBy=, editorOffice=, officeProv=null, officeCity=null, officeAddr=, officeZip=, editDirector=, officeDirector=null, officePhone=null, coverPicUrl=null, journalRemark=, submitArticleUrl=null, websiteUrl=, createdTime=1765252167733, updatedTime=1765252167733, createdBy=13701087609, updatedBy=13701087609, submissionGuidelinesUrl=, submissionAuthorUrl=https://ywfxauthor.manuscriptcloud.com/login, submissionEditorUrl=https://ywfxeditor.manuscriptcloud.com/login, submissionReviewUrl=https://ywfxauthor.manuscriptcloud.com/login, submissionCeEditorUrl=, submissionAeEditorUrl=, option={"copyright":""})], databaseList=null, tenantJournalId=1205117023404326918, websiteList=[Website(id=1205118568045195478, webName=null, webTitle=null, webDomain=null, webCopyrigh=null, webIpcNo=null, seoTitle=null, seoKeywords=null, seoDescription=null, tenantJournalId=null, journalId=1205117023404326918, journalNameCn=null, journalNameEn=null, grayFlag=null, tenantId=1146029695717560320, platformId=null, journalGroupId=null, journalGroupNameCn=null, journalGroupNameEn=null, type=1, domain=https://castjournals.cast.org.cn/joweb/ywfxzz/CN, language=CN, createTime=1765252256642, createBy=18614031015, updateTime=1765252521924, updateBy=18614031015, name=药物分析杂志-中文, tplId=1146099689490845704, title=药物分析杂志, delFlag=0, indexPage=/home, props=[WebsiteProps(id=1205154326156128935, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1205118568045195478, code=articleTextType, value=kx, createTime=1765260782040, updateTime=1765260782040, creator=18614031015, updator=18614031015), WebsiteProps(id=1205154326135157412, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1205118568045195478, code=banner, value=null, createTime=1765260782035, updateTime=1765260782035, creator=18614031015, updator=18614031015), WebsiteProps(id=1205154326172906154, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1205118568045195478, code=grayFlag, value=0, createTime=1765260782044, updateTime=1765260782044, creator=18614031015, updator=18614031015), WebsiteProps(id=1205154326126768803, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1205118568045195478, code=logo, value=https://castjournals.cast.org.cn/joweb/ywfxzz/CN/file/pic?fileId=5+7UKv7ckvsJ8AsWJeRI6g==, createTime=1765260782033, updateTime=1765260782033, creator=18614031015, updator=18614031015), WebsiteProps(id=1205154326181294764, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1205118568045195478, code=minRunFlag, value=0, createTime=1765260782046, updateTime=1765260782046, creator=18614031015, updator=18614031015), WebsiteProps(id=1205154326147740326, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1205118568045195478, code=picServerUrl, value=https://castjournals.cast.org.cn/joweb/ywfxzz/CN/file/pic, createTime=1765260782038, updateTime=1765260782038, creator=18614031015, updator=18614031015), WebsiteProps(id=1205154326177100459, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1205118568045195478, code=silenceFlag, value=0, createTime=1765260782045, updateTime=1765260782045, creator=18614031015, updator=18614031015), WebsiteProps(id=1205154326143546021, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1205118568045195478, code=staticResourcePath, value=https://castjournals.cast.org.cn/joweb/cast_kjdb_cn_619/, createTime=1765260782037, updateTime=1765260782037, creator=18614031015, updator=18614031015), WebsiteProps(id=1205154326160323240, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1205118568045195478, code=themeColor, value=null, createTime=1765260782041, updateTime=1765260782041, creator=18614031015, updator=18614031015), WebsiteProps(id=1205154326164517545, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1205118568045195478, code=themeStyle, value=null, createTime=1765260782042, updateTime=1765260782042, creator=18614031015, updator=18614031015)]), Website(id=1205118568124887259, webName=null, webTitle=null, webDomain=null, webCopyrigh=null, webIpcNo=null, seoTitle=null, seoKeywords=null, seoDescription=null, tenantJournalId=null, journalId=1205117023404326918, journalNameCn=null, journalNameEn=null, grayFlag=null, tenantId=1146029695717560320, platformId=null, journalGroupId=null, journalGroupNameCn=null, journalGroupNameEn=null, type=1, domain=https://castjournals.cast.org.cn/joweb/ywfxzz/EN, language=EN, createTime=1765252256661, createBy=18614031015, updateTime=1765252537379, updateBy=18614031015, name=药物分析杂志-英文, tplId=1146101810881728533, title=Chinese Journal of Pharmaceutical Analysis, delFlag=0, indexPage=/home, props=[WebsiteProps(id=1205154361170174338, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1205118568124887259, code=articleTextType, value=kx, createTime=1765260790388, updateTime=1765260790388, creator=18614031015, updator=18614031015), WebsiteProps(id=1205154361145008511, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1205118568124887259, code=banner, value=null, createTime=1765260790382, updateTime=1765260790382, creator=18614031015, updator=18614031015), WebsiteProps(id=1205154361195340165, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1205118568124887259, code=grayFlag, value=0, createTime=1765260790394, updateTime=1765260790394, creator=18614031015, updator=18614031015), WebsiteProps(id=1205154361132425598, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1205118568124887259, code=logo, value=https://castjournals.cast.org.cn/joweb/ywfxzz/EN/file/pic?fileId=5+7UKv7ckvsJ8AsWJeRI6g==, createTime=1765260790379, updateTime=1765260790379, creator=18614031015, updator=18614031015), WebsiteProps(id=1205154361207923079, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1205118568124887259, code=minRunFlag, value=0, createTime=1765260790397, updateTime=1765260790397, creator=18614031015, updator=18614031015), WebsiteProps(id=1205154361157591425, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1205118568124887259, code=picServerUrl, value=https://castjournals.cast.org.cn/joweb/ywfxzz/EN/file/pic, createTime=1765260790385, updateTime=1765260790385, creator=18614031015, updator=18614031015), WebsiteProps(id=1205154361199534470, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1205118568124887259, code=silenceFlag, value=0, createTime=1765260790396, updateTime=1765260790396, creator=18614031015, updator=18614031015), WebsiteProps(id=1205154361149202816, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1205118568124887259, code=staticResourcePath, value=https://castjournals.cast.org.cn/joweb/cast_kjdb_en_623/, createTime=1765260790383, updateTime=1765260790383, creator=18614031015, updator=18614031015), WebsiteProps(id=1205154361174368643, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1205118568124887259, code=themeColor, value=null, createTime=1765260790389, updateTime=1765260790389, creator=18614031015, updator=18614031015), WebsiteProps(id=1205154361186951556, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1205118568124887259, code=themeStyle, value=null, createTime=1765260790392, updateTime=1765260790392, creator=18614031015, updator=18614031015)])], journalTitle=药物分析杂志, weixinUrl=null, journalUrl=null, iacademicId=null, status=1, seqNo=null, journalTitleEn=Chinese Journal of Pharmaceutical Analysis, journalPhotoCn=E+iq5BU+frOtpo6YF6JpQQ==, journalPhotoEn=n/3T/+AptjYTGmmMsE/a0Q==, journalFirstLetter=C, journalRecommend=null, journalNew=null, journalCollection=null, jcrJf=null, cjcrJf=null, jcrJfStr=null, cjcrJfStr=null, submissionFirstDecision=null, sciSubjectClassification=null, casSubjectClassification=null, citeScore=null, totalCitationFrequency=null, icpCode=null, psCode=null, advertisingLicenseCode=null, copyrightInformation=null, country=null, option=, provinceCode=null, provinceName=null, collectFlag=false), detailUrlCn=https://castjournals.cast.org.cn/joweb/ywfxzz/CN/10.16155/j.0254-1793.2024.05.15, detailUrlEn=https://castjournals.cast.org.cn/joweb/ywfxzz/EN/10.16155/j.0254-1793.2024.05.15, pdfUrlCn=https://castjournals.cast.org.cn/joweb/ywfxzz/CN/PDF/10.16155/j.0254-1793.2024.05.15, pdfUrlEn=https://castjournals.cast.org.cn/joweb/ywfxzz/EN/PDF/10.16155/j.0254-1793.2024.05.15, aliStartDate=null, aliEndDate=null, collectionFlag=false, citedCount=null, citedUrl=null, reference=null)
收藏切换
机器学习相关技术在以黄酮为特征的黄芪分类中的应用研究
收藏切换
PDF下载
石岩 1 , 李宁 2 , 魏锋 1, * , 马双成 1
药物分析杂志 | 质量分析 2024,44(5): 866-873
收起
收藏切换
药物分析杂志 | 质量分析 2024, 44(5): 866-873
机器学习相关技术在以黄酮为特征的黄芪分类中的应用研究
全屏
石岩1 , 李宁2 , 魏锋1, * , 马双成1
作者信息
  • 1.中国食品药品检定研究院,北京 102629
  • 2.北京市药品检验研究院,北京 102206
  • 石岩 Tel:(010)53852081;E-mail:

    李宁 Tel:13811671528;E-mail:

通讯作者:

* Tel:(010)53852020;E-mail:
Research on the application of machine learning related techniques in the classification of Astragali Radix characterized by flavonoids
Yan SHI1 , Ning LI2 , Feng WEI1, * , Shuang-cheng MA1
Affiliations
  • 1.National Institutes for Food and Drug Control, Beijing 102629, China
  • 2.Beijing Institute for Drug Control, Beijing 102206, China
出版时间: 2024-05-31 doi: 10.16155/j.0254-1793.2024.05.15
文章导航
收藏切换
目的:

建立以黄酮类成分为特征的栽培黄芪、半野生黄芪和野生黄芪的三分类模型,并且对自动机器学习技术和数据增强技术在药物分析领域中的应用进行探索和评价。

方法:

首先,对黄芪的黄酮类成分含量数据进行相关性分析、主成分分析,建立决策树和逻辑回归模型,根据模型分析黄酮类成分的重要性程度;然后,使用TVAE表格数据生成算法,根据真实数据生成600批虚拟数据,使用自动学习框架AutoGluon,num_bag_folds设为5,分别对64批真实数据和600批虚拟数据进行学习,得到2组共30个模型,依据准确率进行评估。

结果:

对机器学习模型的分析可知,芒柄花素、毛蕊异黄酮葡萄糖苷和刺芒柄花苷这3种黄酮类成分对于黄芪质量,尤其是来源等级的控制具有重要意义;2组共30个模型预测准确率表明,基于NeuralNet的模型和基于树模型的机器学习算法对于黄酮成分数据表征的黄芪而言分类效果最好;数据增强技术生成的虚拟数据与真实数据在所训练得到的模型准确率趋势方面基本一致。

结论:

机器学习相关技术在以黄酮为特征的黄芪分类中具有较好的应用价值。

黄芪  /  黄酮  /  毛蕊异黄酮葡萄糖苷  /  刺芒柄花苷  /  毛蕊异黄酮  /  山柰酚  /  异鼠李素  /  芒柄花素  /  机器学习  /  人工智能  /  数据增强
Objective:

To establish a three classification model for cultivated,semi-wild,and wild Astragali Radix characterized by flavonoids,and explore and evaluate the application of techniques of automated machine learning and data augmentation in the field of drug analysis.

Methods:

Firstly,correlation analysis and principal component analysis were conducted on the flavonoid content data of Astragali Radix,and models of decision tree and logistic regression were established to analyze the importance of flavonoid components based on the models. Then,using the AutoGluon framework with 5 as num_bag_folds,2 sets of 30 models respectively through 64 batches of real data and 600 batches of virtual data generated based on real data with the TVAE table data generation algorithm for training were obtained,and these models were evaluated by accuracy.

Results:

The analysis of machine learning models,indicated that formononetin,campanulin and onospin played the important roles in the quality control of Astragali Radix,especially for the source grade control. The accuracy of model prediction showed that the models based on Neural Net and tree-model always had the best classification effect for Astragali Radix. The virtual data generated by data augmentation technique is basically consistent with the actual data in terms of the accuracy trend of the model training process.

Conclusion:

Related techniques of machine learning have good application value in the classification of Astragali Radix characterized by flavonoids.

Astragali Radix  /  flavonoids  /  campanulin  /  onospin  /  calycosin  /  kaempferol  /  isorhamnetin  /  formononetin  /  machine learning  /  artificial intelligence  /  data augmentation
石岩, 李宁, 魏锋, 马双成. 机器学习相关技术在以黄酮为特征的黄芪分类中的应用研究. 药物分析杂志, 2024 , 44 (5) : 866 -873 . DOI: 10.16155/j.0254-1793.2024.05.15
Yan SHI, Ning LI, Feng WEI, Shuang-cheng MA. Research on the application of machine learning related techniques in the classification of Astragali Radix characterized by flavonoids[J]. Chinese Journal of Pharmaceutical Analysis, 2024 , 44 (5) : 866 -873 . DOI: 10.16155/j.0254-1793.2024.05.15
黄芪为豆科植物蒙古黄芪Astragalus membranaceus (Fisch.) Bge. var. mongholicus (Bge.) Hsiao和膜荚黄芪Astragalus membranaceus (Fisch.) Bge.的干燥根,是我国传统的常用中药材和饮片,具有补气升阳、固表止汗、利水消肿等功效,在气虚乏力、食少便溏等症的临床治疗中多有应用[1]。黄芪中的主要活性成分可以分为黄酮类、皂苷类及多糖类等[2-4],黄酮类成分主要包括毛蕊异黄酮、山柰酚、芒柄花苷、芒柄花素等。前期对黄酮类成分的研究中[5],对所测得的野生、半野生以及栽培品的黄芪中的主要黄酮类成分的含量分布,总结出一些规律,但是尚无法据此对主要黄酮类成分做到准确分类。目前市场上的黄芪多以栽培品为主,野生或半野生黄芪资源较少,价格相对也较昂贵,尤其是野生黄芪的资源几近枯竭,更是价高而难得[6-8]。为了规范黄芪这一常用中药材的市场秩序,保护野生黄芪资源,有必要对不同来源的黄芪进行等级化研究,建立能够明确区分不同来源黄芪的数据模型,客观、准确地评判黄芪的质量。前期研究[5]表明,黄芪中黄酮类成分的含量分布具有较多的重叠情况,仅靠简单的1个或几个成分含量的关系难以做到准确区分。为了探究野生和栽培对黄芪药材质量的影响,本研究根据黄芪所含特征黄酮类成分与含量,采用自动机器学习(automated machine learning,AutoML)与数据增强(data augmentation)技术的联合建模方法,对不同来源的黄芪药材的区分展开可行性研究。AutoML是将机器学习与现实问题结合应用的端到端自动化的过程,可以帮助数据科学家、分析师和开发者高效、高质量地构建机器学习模型。数据增强技术是通过对原始数据进行分布模拟合成,生成更多样本来扩充数据集的方法,可以提高模型的泛化能力和稳定性。本研究采用的AutoML框架为亚马逊公司的开源AutoGluon,而数据增强技术则采用了4种主流的生成表格数据的技术:高斯Copula(GaussianCopula)、条件表格生成对抗网络(conditional tabular generative adversarial network,CTGAN)、Copula生成对抗网络(Copula generative adversarial network,CopulaGAN)、基于三元组的变分自动编码器(triplet-based variational autoencoder,TVAE)。目前,生成表格数据的技术多是基于对抗神经网络(GAN)和变分自编码器(VAE)而实现的,区别主要在于数据转换及生成策略。本研究基于前期研究获得的真实实验数据,根据数据特点(样本量少且数据分布缺乏均衡),采用数据增强获得足够数量虚拟数据,通过4种生成数据模型的评估得分,选定其中的TVAE作为生成虚拟表格数据模型,通过虚拟表格数据投喂训练AutoML模型,根据真实数据预测验证结果,优选AutoGluon框架的模型bag及堆叠(stack)形式。
对于药物分析领域内的机器学习相关应用而言,可将其流程总结如图1所示。本着“大胆假设,小心求证”准则,本研究在使用基本机器学习算法对黄芪进行基于黄酮类成分的分类预测的同时,初步揭示了不同来源黄芪样品中具有特征性的黄酮类成分。此外,还引入了目前机器学习领域中的相关热点技术,即AutoML技术和数据增强技术,对黄芪的数据进行了系列的应用研究,评估了AutoML各模型、二级融合模型及数据增强技术的应用效果。
本研究是前期黄芪中黄酮类成分测定研究工作的后续[5],所使用的不同来源的黄芪样品以及测定数据均来自前期研究。黄芪样品即1~2年生人工栽培黄芪30批(A1~A30),5年生半野生黄芪24批(YX1~YX24),5~6年生野生黄芪10批(HY1~HY10);3个类别黄芪(人工栽培黄芪、半野生黄芪和野生黄芪)分别以ZP、BY和YS表示。测定数据即以上64批黄芪样品中毛蕊异黄酮葡萄糖苷、刺芒柄花苷、毛蕊异黄酮、山柰酚、异鼠李素和芒柄花素的含量,连同六者的含量总和,共7组数据作为输入特征。
本研究中涉及的数据处理、AutoML及数据增强等技术,均在Python计算机编程语言(美国Python Software Foundation,version:3.8.8)中编码和运行。
将前期研究[5]得到的64批黄芪样品的特征变量进行Pearson相关性分析,绘制各成分数据之间相关性热图,见图2
对数据进行标准化预处理,然后进行主成分分析,以累计方差达95%以上为标准选取主成分。实际选取了4个主成分,累计方差为97.6%。64批样品在这4个主成分之间形成的二维空间的得分分布情况见图3
决策树以信息熵表征不纯度和计算信息增益,选择最大深度为3,绘制决策树树状图,见图4。其中,芒柄花素、毛蕊异黄酮葡萄糖苷和刺芒柄花苷这3个成分分别分布在根节点及近根节点,表明其在分类中的重要性。
对数据进行标准化处理,然后将变量数据及分类信息输入逻辑回归分类模型,根据多分类的实际情况选择Softmax回归,惩罚项参数选择岭回归(l2),优化算法选择lbfgs。由于YS类别中样品数量较少,且该模型并非最终目标模型,故选择5折交叉验证法实现模型的粗略验证评价。
使用TVAE,分别对不同来源黄芪样品的黄酮类成分含量及其含量总和进行数据的深度学习,epochs为400,batch_size为10,每个来源的黄芪样品模拟生成200批数据,共模拟生成600批数据。
使用python语言环境下的AutoGluon自动机器学习框架,对64批真实的黄芪数据以及模拟生成的600批数据分别进行学习,得到2组模型;交换数据,分别使用模拟生成的600批数据和64批真实的黄芪数据作为测试集对模型进行验证,即真实数据和模拟生成的数据各自进行对方数据训练得到的模型的外部验证。自动学习框架AutoGluon选择不使用多层堆叠模式,num_bag_folds选择5。框架内各学习算法模型的测试集验证准确率和训练集交叉验证准确率结果见表1
图2中方格代表横纵坐标变量之间的Pearson相关性,方格颜色越红,代表2个变量之间相关性数值越大,即相关性越强。在排除含量总和情况下,毛蕊异黄酮葡萄糖苷与刺芒柄花苷属于极强相关,二者相关系数为0.96;而芒柄花素与毛蕊异黄酮葡萄糖苷、刺芒柄花苷及异鼠李素三者之间属于极弱相关,相关系数分别为0.033、0.051和0.062。该项研究结果初步显示芒柄花素属于比较特殊的成分。
为避免数据各变量差异对算法产生不必要的影响,在进行主成分分析及后续的逻辑回归之前,首先对数据进行标准化预处理。图3为数据经过标准化处理后各批样品主成分得分分布图。该图显示,总体来说YS样品分布相对集中,而BY与ZP样品分布较分散;YS样品较多分布在BY和ZP样品之间位置;ZP、BY和YS样品的分布并无明显的区域界限差异。可见,简单的常规分类算法可能较难取得良好的效果。
决策树算法是机器学习的一种基础分类算法,可以采用树形作为形象化表示和理解,所有样品由根节点出发,依据一定指标规则对特征变量进行选择而分枝,最终到达代表类别的叶节点。其中节点位置特征变量的选择直接决定了决策树模型的分类效果,该处通常以信息增益(information gain,IG)为具体选择指标,以信息熵(information entropy)或基尼(Gini)系数作为衡量。本研究以信息熵为模型的度量,绘制决策树形象化树形示意图,如图4所示。由图4可见,选择在根节点以芒柄花素为特征,表明芒柄花素可以给样品分析带来最大的信息增益,即最大化信息熵。此结果与图2的相关性分析结果相契合,表明芒柄花素在黄芪的黄酮类成分中具有较为重要的指标性意义。依据该成分含量是否>0.009%,可将野生黄芪、栽培黄芪与大部分的半野生黄芪区分开来。此外,决策树分析结果还显示结合上述芒柄花素成分含量情况,毛蕊异黄酮葡萄糖苷成分含量以0.075%为界,可将10批野生黄芪中的8批从64批总样本中区分出来;刺芒柄花苷成分含量以0.12%为界,可将30批栽培黄芪中的28批从64批总样本中区分出来。为了避免样本过拟合情况的发生,决策树分析进行了剪枝设定,决策树进行到根节点下3层即止。
逻辑回归模型预测的平均准确率为82.2%,表明模型在部分程度上具有一定的预测能力,参与建模的各黄酮类成分在BY、YS和ZP 3个等级分类模型方程中的系数见表2
从原理上来看,逻辑回归模型是根据计算样本二分类的概率大小,来预测样本的二分类问题,但也能解决样本多分类的预测问题[9-11],后者主要通过Softmax回归实现[9]。Softmax回归,即多项式逻辑回归,首先计算各类别得分,然后使用Softmax函数(归一化指数)对各得分求得各类别概率值高低[8],计算过程中,各变量的系数大小一定程度上可以表征该变量对各类别得分影响的大小,从而影响最终分类概率。因此,表2列出的逻辑回归各变量系数,在一定程度上是这几个黄酮类成分在黄芪等级分类中重要性的表征。逻辑回归模型对BY、YS、ZP等级分类的系数绝对值排前2位的分别为毛蕊异黄酮葡萄糖苷和芒柄花素、刺芒柄花苷和芒柄花素,以及芒柄花素和毛蕊异黄酮,其中以芒柄花素系数出现的次数最多,可见芒柄花素在黄酮类成分中对于黄芪来源等级的表征的重要性。
真实数据分布及异常值等情况如箱式图(图5)所示,6个黄酮类成分中,以毛蕊异黄酮葡萄糖苷与刺芒柄花苷的含量分布最为集中,其次为芒柄花素。这3个成分的含量数据均有异常值出现,且均为数值较大的异常值。
从以上相关性分析、决策树模型和逻辑回归分类模型来看,芒柄花素是区分不同来源等级黄芪的重要指标性成分,其次为毛蕊异黄酮葡萄糖苷和刺芒柄花苷;从箱式图也可看出,这3个成分数据分布较集中,适用于表征黄芪药材质量信息,对黄芪质量,尤其是来源等级的控制具有重要意义。若仅以这3个成分的数据区分黄芪来源等级,分类策略参考决策树分析,基于这3个成分的决策树分析结果可到树的第2层,64批数据有60批分类正确,准确率已可达到93.8%。
机器学习是一种基于样本数据的科学,样本数据的大小和质量能够直接影响其应用的效果。本研究真实数据仅64批,尤其是YS类别仅有10批,对于机器学习来说,样本数量较少。此类情况在药物分析领域内较为普遍,样本收集的困难和数据的实验成本在一定程度上制约了实验数据的样本规模。因此,本研究尝试以真实数据为基础,利用深度学习中的数据增强技术生成虚拟数据,以考察此途径的可行性。
本研究分别使用了4种主要的表格数据生成技术,即GaussianCopula、CopulaGAN、CTGAN和TVAE,最终选择采用TVAE技术。在处理高维、稀疏和不平衡的数据时,以及具有复杂关系和约束的数据时,TVAE常常具有较好的效果。见表3
AutoGluon是亚马逊公司开发的AutoML框架,与通常的AutoML框架基于超参数搜索优化策略不同,AutoGluon依赖于融合多个无需超参数搜索的模型,在相同数据上训练多个不同类型的模型,然后以stacking形式通过线性模型输出加权和结果,其中权重是通过训练获得。此外,AutoGluon还采用了源于K-则交叉验证的K-则交叉bagging技术,对同类别的模型使用不同初始权重或数据块,对输出的结果做平均以减小模型输出方差。AutoGluon还有多层stacking技术可供选择,但为了避免过拟合产生,多配合K-则交叉bagging技术使用。AutoGluon除了对表格数据进行处理外,还支持文本、图像等多模态数据学习。得益于这些先进的技术和思想,AutoGluon在机器学习竞赛中获得了较好的成绩,在预测房价比赛中获得了第1名,更在国际知名的机器学习Kaggle竞赛上,在对泰坦尼克号生还预测中轻松取得前10%的成绩。
表1展示了框架内各模型分别使用真实数据和虚拟数据情况下的学习预测准确率,内含交叉验证与测试集验证结果。首先,基于真实数据建立的AutoML内各模型以NeuralNet模型(NeuralNetTorch和NeuralNetFastAI)及树模型类(ExtraTreesEntr、ExtraTreesGini、LightGBM、CatBoost、RandomForestGini和RandomForestEntr等)准确率最高,同时,WeightedEnsemble作为模型融合的元模型(Meta Model)在真实数据交叉验证和模拟数据的测试集验证中都具有相当高的正确率;其次,基于模拟数据建立的AutoML内各模型中,主要以树模型类(ExtraTreesEntr、ExtraTreesGini、LightGBMXT、CatBoost、RandomForestGini和RandomForestEntr等)准确率最高,同时,WeightedEnsemble在真实数据交叉验证和真实数据的测试集验证中均具有最高的正确率。这部分结果表明:基于NeuralNet的模型和基于树模型的机器学习算法对于黄酮成分数据表征的黄芪而言分类效果最好,然而,从模型可解释性及复杂程度考虑,基于树模型的各算法应该是更合适的选择;AutoML框架中各模型的各种验证显示,相较于真实数据而言,模型对于模拟生成的数据往往能够产生较好的分类预测结果,可见本研究使用的基于TVAE算法的数据增强技术模拟生成的数据分布与真实数据的确尚存在一定的差异。然而,从各类模型预测结果的准确率趋势来看,两类数据的确存在一定的相似性,对于药物分析数据而言,数据增强技术是可以用于数据不足之时的模型选择,甚至是用于模型训练。
传统医药是优秀传统文化的重要载体,发展中医药事业应“传承创新发展”,应立足于“中医药特点”,发挥“中医药的独特优势”[12]。整体思维是中医药的主要思维规律,该认识具有重悟性而不重论证的特点[13]。从创新发展和中医药整体思维角度出发,对于中药领域以及机器学习的本质而言,机器学习的相关技术应是目前应当尝试和发展的研究方向。本研究正是基于此宗旨,以黄芪样品中黄酮类成分含量为分析数据,使用了目前机器学习领域的热点技术,在解决黄芪不同等级的三分类及特征性黄酮类成分问题的基础上,考察实践了不同类型机器学习模型效果以及数据增强技术模拟生成的数据在模型训练中的应用效果,确定了AutoML框架和数据增强技术在中药质量分析和评价领域的应用价值及应用前景。
一般来讲,机器学习所建立的预测模型属于“黑盒子”,外界只清楚模型的输入和输出结果,至于其内部对输入数据的计算过程,则模糊不清。若仅仅将其用于样本分类或定量结果的预测,则低估了该领域技术的实际应用价值,如果能在准确率基础之上,开展基于输入数据的模型解释能力的研究,可以更加丰富和加强其在中药质量分析和评价领域的应用场景和应用意义。鉴于此,急需展开对于机器学习模型的特征解释,这也是下一步研究的着力点。
参考文献 引证文献
排序方式:
[1]
中华人民共和国药典2020年版.一部 [S]. 2020:315
ChP 2020. Vol Ⅰ [S]. 2020:315
[2]
DU HWZHAO XLZHANG AHet al. Rapid separation,identification and analysis of Astragalus membranaceus Fisch. using liquid chromatography-tandem mass spectrometry[J]. J Chromatogr Sci201452(3):226
[3]
LI KGAO FWANG Get al. Identification of cultured and natural Astragalus root based on monosaccharide mapping[J]. Molecules201520(9):16466
[4]
LEE SMJEONG JSKWON HJet al. Quantification of isoflavonoids and triterpene saponins in Astragali Radix,the root of Astragalus membranaceus,via reverse-phase high-performance liquid chromatography coupled with integrated pulsed amperometric detection[J]. J Chromatogr B20171070:76
[5]
石岩,贾天颖,李向日,.黄芪中多种黄酮类成分的测定研究[J].药物分析杂志202242(7):1120
SHI YJIA TYLI XRet al. Quantification of flavonoid compounds in Astragali Radix[J]. Chin J Pharm Anal202242(7):1120
[6]
张丽,钱大玮,卜凡淑,.基于UPLC-MS的黄芪药材质量评价研究[J].药物分析杂志202040(4):722
ZHANG LQIAN DWBU FSet al. Study on quality evaluation of Astragali Radix base on UPLC-MS [J]. Chin J Pharm Anal202040(4):722
[7]
赵晨光,李存玉,杨珊,.基于道地产区蒙古黄芪的质量差异性分析[J].中国中药杂志202045(13):3183
ZHAO CGLI CYYANG Set al. Analysis of qualitu difference based on Astragalus membranaceus var. mongholicus in genuine region [J]. China J Chin Mater Med202045(13):3183
[8]
裴文菡,何凡,程青松,.中药黄芪质量评价方法的研究进展[J].中国现代应用药学202037(5):620
PEI WHHE FCHENG QSet al. Research progress on the quality evaluation methods of traditional Chinese medicine Astragali Radix[J]. Chin J Mod Appl Pharm202037(5):620
[9]
李航.统计学习方法[M].第2版.北京:清华大学出版社,2019:91
LI H. Statistical Learning Methods[M]. 2nd Ed. Beijing:Tsinghua University Press,2019:91
[10]
HASTIE TTIBSHIRANI RFRIEDMAN J. The Elements of Statistical Learning:Data Mining,Inference,and Prediction[M]. 2nd Ed. New York:Springer,2009:119
[11]
GÉRON A. Hands-on Machine Learning with Scikit-learn,Keras & Tensorflow:Concepts,Tools,and Techniques to Build Intelligent Systems [M]. Sebastopol:O’REILLY,2017:134
[12]
习近平.在教育文化卫生体育领域专家代表座谈会上的讲话[J].中华人民共和国国务院公报2020 (28):7
XI JP. Speech at the symposium for representative of expert in the fields of education,culture,health,and sports[J]. Gazette of the State Council of the People’s Republic of China2020(28):7
[13]
黄璐琦.对中医药发展规律及特点的传承与创新认识[J].中医杂志202263(17):1601
HUANG LQ. Inheritance and innovation of the development rules and characteristics of traditional Chinese medicine[J]. J Tradit Chin Med202263(17):1601
2024年第44卷第5期
PDF下载
65
27
引用本文
BibTeX
文章信息
doi: 10.16155/j.0254-1793.2024.05.15
  • 接收时间:2023-07-19
  • 首发时间:2026-03-20
  • 出版时间:2024-05-31
补充材料
相关文章
文章信息
作者
出版历史
  • 收稿日期:2023-07-19
基金
作者信息
    1.中国食品药品检定研究院,北京 102629
    2.北京市药品检验研究院,北京 102206

通讯作者:

* Tel:(010)53852020;E-mail:
参考文献
分享链接
https://castjournals.cast.org.cn/joweb/ywfxzz/CN/10.16155/j.0254-1793.2024.05.15
分享至
全文二维码

扫描看全文

引用本文
BibTeX
本文的引用情况
2种不同金属材料的力学参数

Family
属数
Number of
genus
种数
Number of
species
占总种数比例
Percentage of
total species (%)

Genus
种数
Number of
species
占总种数比例
Percentage of total
species (%)
鹅膏菌科Amanitaceae 2 11 5.26 鹅膏菌属 Amanita 10 4.78
小菇科 Mycenaceae 2 12 5.74 丝盖伞属 Inocybe 5 2.39
多孔菌科 Polyporaceae 8 14 6.70 蜡蘑属 Laccaria 5 2.39
红菇科 Russulaceae 3 23 11.00 小皮伞属 Marasmius 6 2.87
小菇属 Mycena 11 5.26
光柄菇属 Pluteus 5 2.39
红菇属 Russula 17 8.13
栓菌属 Trametes 5 2.39
关闭全屏