Article(id=1241102739212652677, tenantId=1146029695717560320, journalId=1227665162245664772, issueId=1241102730337505325, articleNumber=null, orderNo=null, doi=10.20043/j.cnki.MPM.202404262, pmid=null, cstr=null, oa=null, hot=null, price=null, onlineType=0, articleFormat=0, articleType=null, articleTypeStr=null, receivedDate=1713196800000, receivedDateStr=2024-04-16, revisedDate=null, revisedDateStr=null, acceptedDate=null, acceptedDateStr=null, onlineDate=1773831551601, onlineDateStr=2026-03-18, pubDate=1721836800000, pubDateStr=2024-07-25, doiRegisterDate=null, doiRegisterDateStr=null, onlineIssueDate=1773831551601, onlineIssueDateStr=2026-03-18, onlineJustAcceptDate=null, onlineJustAcceptDateStr=null, onlineFirstDate=null, onlineFirstDateStr=null, sourceXml=null, magXml=null, createTime=1773831551601, creator=13701087609, updateTime=1773831551601, updator=13701087609, issue=Issue{id=1241102730337505325, tenantId=1146029695717560320, journalId=1227665162245664772, year='2024', volume='51', issue='14', pageStart='2497', pageEnd='2688', issueExtLink='null', onlineDate='null', pubDate='null', beforeIssueId=null, nextIssueId=null, price=null, status=1, issueComplete=1, articleOrder=1, issueType=-1, specialIssue=null, createTime=1773831549486, creator=13701087609, updateTime=1773831697291, updator=13701087609, preIssue=null, nextIssue=null, ext={EN=IssueExt(id=1241103350322745680, tenantId=1146029695717560320, journalId=1227665162245664772, issueId=1241102730337505325, language=EN, specialIssueTitle=, coverIllustrator=null, specialIssueEditor=, specialIssueAbout=), CN=IssueExt(id=1241103350322745681, tenantId=1146029695717560320, journalId=1227665162245664772, issueId=1241102730337505325, language=CN, specialIssueTitle=, coverIllustrator=null, specialIssueEditor=, specialIssueAbout=)}, issueFiles=null}, startPage=2669, endPage=2674, ext={EN=ArticleExt(id=1241102739598528655, articleId=1241102739212652677, tenantId=1146029695717560320, journalId=1227665162245664772, language=EN, title=Prognostic prediction model for patients with low-grade gliomas based on multi-omics data, columnId=1228016569138213037, journalTitle=Modern Preventive Medicine, columnName=Clinical Medicine and Prevention, runingTitle=null, highlight=null, articleAbstract=
Objective

To explore the application of integrated clustering methods in identifying low-grade glioma subtypes and prognostic prediction.

Methods

A comprehensive clustering algorithm (MOVICS), which pools ten clustering algorithms, was used to integrate the multi-omics data of LGG patients downloaded from TCGA to obtain cluster subtypes; prognostic factors of LGG were analyzed by multifactorial Cox regression. A random forest classification prediction model was constructed using mRNA data to evaluate the classification performance and externally validated with the CGGA dataset.

Results

LGG patients were clustered into two subtypes, and the difference in survival between the two groups was statistically significant (χ2=54.410, P<0.001). The results of multifactorial Cox regression analysis showed that age (HR=1.053,95%CI: 1.037-1.069), cancer grade (HR=2.733,95%CI: 1.836-4.069) and cluster typing (HR=3.210,95%CI: 2.216-4.650) were all prognostic factors for LGG, and the results of Nomogram plots, calibration curves and ROC curves indicated good predictive performance of the model. The average prediction accuracy of the ten-fold cross-validated RF model was 87.81%, and the C-indexes of the training set, the internal validation set, and the two external validation sets were 0.717, 0.721, 0.574, and 0.572, and the Brier scores were 0.044, 0.066, 0.179, and 0.128, and the differences in the survival of the two external validation datasets were all statistically significance (P<0.05).

Conclusion

The comprehensive clustering method can effectively identify LGG subtypes, which are prognostic factors for LGG patients, and has been validated in an external dataset, CGGA, which can provide an important theoretical basis for clinical personalized treatment of LGG.

, correspAuthors=null, authorNote=null, correspAuthorsNote=null, copyrightStatement=null, copyrightOwner=null, extLink=null, articleAbsUrl=null, sourceXml=null, magXml=null, pdfUrl=null, pdf=null, pdfFileSize=null, pdfExtLink=null, richHtmlUrl=null, mobilePdfUrl=null, reviewReport=null, pdfFirstPage=null, abstractGraph=null, abstractGraphContent=null, abstractVideo=null, citation=null, cebUrl=null, magXmlContent=null, mapNumber=null, authorCompany=null, fund=null, authors=null, authorsList=Yi-rong LIU, Yue REN, Yang QIN, Shu-qi WU, Jin-fang ZHAO, Tian-e LUO), CN=ArticleExt(id=1241102744459727219, articleId=1241102739212652677, tenantId=1146029695717560320, journalId=1227665162245664772, language=CN, title=基于多组学数据构建低级别胶质瘤患者预后预测模型, columnId=1228016570119680182, journalTitle=现代预防医学, columnName=临床与预防, runingTitle=null, highlight=null, articleAbstract=
目的

探讨综合聚类方法在识别低级别胶质瘤(LGG)亚型和预后预测中的应用。

方法

采用集合了十种聚类算法的综合聚类算法(MOVICS)对TCGA下载的LGG患者多组学数据进行整合,得到聚类亚型;通过多因素Cox回归分析LGG的预后因素。使用mRNA数据构建随机森林分类预测模型来评估分类性能,用CGGA数据集进行外部验证。

结果

LGG患者经过聚类分为两型,两组生存率差异具有统计学意义(χ2=54.410,P<0.001)。多因素Cox回归分析结果表明,年龄(HR=1.053,95%CI:1.037~1.069)、癌症分级(HR=2.733,95%CI:1.836~4.069)和聚类亚型(HR=3.210,95%CI:2.216~4.650)都是LGG的预后因素,Nomogram图、校准曲线和ROC曲线结果表明模型的预测性能良好。十折交叉验证RF模型的平均预测准确率为87.81%,训练集、内部验证集和两个外部验证集的C指数分别为0.717、0.721、0.574和0.572,Brier评分分别为0.044、0.066、0.179和0.128,两个外部验证数据集的生存差异均有统计学意义(P<0.05)。

结论

综合聚类方法能够有效识别LGG亚型,其亚型是LGG患者的预后因素,并在外部数据集CGGA中得到验证,可为LGG的临床个性化治疗提供重要的理论依据。

, correspAuthors=null, authorNote=null, correspAuthorsNote=
罗天娥,E-mail:
, copyrightStatement=本刊刊出的所有文章不代表中华预防医学会和本刊编委会的观点,除非特别声明。, copyrightOwner=中华预防医学会和四川大学华西公共卫生学院, extLink=null, articleAbsUrl=null, sourceXml=aiTgyvXtmV2mQqp6O1kFIw==, magXml=NQg7SiVt0D4V1j7xuAolaA==, pdfUrl=null, pdf=Ovq9MY31LmwsxvaxaL+NNg==, pdfFileSize=881351, pdfExtLink=null, richHtmlUrl=null, mobilePdfUrl=null, reviewReport=null, pdfFirstPage=null, abstractGraph=fj9sWPNboMpSwL2BYiF3hw==, abstractGraphContent=null, abstractVideo=null, citation=null, cebUrl=null, magXmlContent=O0Qa7sZhciusB/nxYZih6Q==, mapNumber=null, authorCompany=null, fund=null, authors=

刘毅蓉(1998—),女,硕士在读,研究方向:机器学习在疾病分类和预后预测中的应用

, authorsList=刘毅蓉, 任月, 秦阳, 武舒琪, 赵晋芳, 罗天娥)}, authors=[Author(id=1241102744853991846, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, orderNo=0, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=null, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1241102744950460849, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, authorId=1241102744853991846, language=EN, stringName=Yi-rong LIU, firstName=Yi-rong, middleName=null, lastName=LIU, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, address=Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, Shanxi 030001, China, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1241102745084678588, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, authorId=1241102744853991846, language=CN, stringName=刘毅蓉, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, address=1.山西医科大学公共卫生学院卫生统计学教研室,山西 太原 030001, bio={"content":"

刘毅蓉(1998—),女,硕士在读,研究方向:机器学习在疾病分类和预后预测中的应用

"}, bioImg=null, bioContent=

刘毅蓉(1998—),女,硕士在读,研究方向:机器学习在疾病分类和预后预测中的应用

, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1241102744686219663, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, xref=1., ext=[AuthorCompanyExt(id=1241102744694608273, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, companyId=1241102744686219663, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, Shanxi 030001, China), AuthorCompanyExt(id=1241102744698802578, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, companyId=1241102744686219663, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1.山西医科大学公共卫生学院卫生统计学教研室,山西 太原 030001)])]), Author(id=1241102745185341898, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, orderNo=1, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=null, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1241102745302782419, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, authorId=1241102745185341898, language=EN, stringName=Yue REN, firstName=Yue, middleName=null, lastName=REN, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, address=Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, Shanxi 030001, China, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1241102745386668506, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, authorId=1241102745185341898, language=CN, stringName=任月, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, address=1.山西医科大学公共卫生学院卫生统计学教研室,山西 太原 030001, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1241102744686219663, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, xref=1., ext=[AuthorCompanyExt(id=1241102744694608273, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, companyId=1241102744686219663, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, Shanxi 030001, China), AuthorCompanyExt(id=1241102744698802578, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, companyId=1241102744686219663, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1.山西医科大学公共卫生学院卫生统计学教研室,山西 太原 030001)])]), Author(id=1241102745491526118, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, orderNo=2, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=null, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1241102745692852720, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, authorId=1241102745491526118, language=EN, stringName=Yang QIN, firstName=Yang, middleName=null, lastName=QIN, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, address=Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, Shanxi 030001, China, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1241102745789321722, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, authorId=1241102745491526118, language=CN, stringName=秦阳, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, address=1.山西医科大学公共卫生学院卫生统计学教研室,山西 太原 030001, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1241102744686219663, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, xref=1., ext=[AuthorCompanyExt(id=1241102744694608273, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, companyId=1241102744686219663, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, Shanxi 030001, China), AuthorCompanyExt(id=1241102744698802578, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, companyId=1241102744686219663, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1.山西医科大学公共卫生学院卫生统计学教研室,山西 太原 030001)])]), Author(id=1241102745902567941, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, orderNo=3, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=null, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1241102746053562900, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, authorId=1241102745902567941, language=EN, stringName=Shu-qi WU, firstName=Shu-qi, middleName=null, lastName=WU, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, address=Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, Shanxi 030001, China, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1241102746145837598, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, authorId=1241102745902567941, language=CN, stringName=武舒琪, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, address=1.山西医科大学公共卫生学院卫生统计学教研室,山西 太原 030001, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1241102744686219663, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, xref=1., ext=[AuthorCompanyExt(id=1241102744694608273, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, companyId=1241102744686219663, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, Shanxi 030001, China), AuthorCompanyExt(id=1241102744698802578, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, companyId=1241102744686219663, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1.山西医科大学公共卫生学院卫生统计学教研室,山西 太原 030001)])]), Author(id=1241102746229723686, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, orderNo=4, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=null, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1241102746305221172, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, authorId=1241102746229723686, language=EN, stringName=Jin-fang ZHAO, firstName=Jin-fang, middleName=null, lastName=ZHAO, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, 2, address=Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, Shanxi 030001, China, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1241102746410078776, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, authorId=1241102746229723686, language=CN, stringName=赵晋芳, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, 2, address=1.山西医科大学公共卫生学院卫生统计学教研室,山西 太原 030001
2.煤炭环境致病与防治教育部重点实验室, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1241102744686219663, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, xref=1., ext=[AuthorCompanyExt(id=1241102744694608273, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, companyId=1241102744686219663, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, Shanxi 030001, China), AuthorCompanyExt(id=1241102744698802578, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, companyId=1241102744686219663, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1.山西医科大学公共卫生学院卫生统计学教研室,山西 太原 030001)]), AuthorCompany(id=1241102744770105755, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, xref=2., ext=[AuthorCompanyExt(id=1241102744778494364, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, companyId=1241102744770105755, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=2.煤炭环境致病与防治教育部重点实验室)])]), Author(id=1241102746502353473, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, orderNo=5, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=luotiane1977@163.com, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1241102746640765521, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, authorId=1241102746502353473, language=EN, stringName=Tian-e LUO, firstName=Tian-e, middleName=null, lastName=LUO, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, 2, address=Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, Shanxi 030001, China, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1241102748171686495, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, authorId=1241102746502353473, language=CN, stringName=罗天娥, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, 2, address=1.山西医科大学公共卫生学院卫生统计学教研室,山西 太原 030001
2.煤炭环境致病与防治教育部重点实验室, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1241102744686219663, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, xref=1., ext=[AuthorCompanyExt(id=1241102744694608273, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, companyId=1241102744686219663, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, Shanxi 030001, China), AuthorCompanyExt(id=1241102744698802578, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, companyId=1241102744686219663, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1.山西医科大学公共卫生学院卫生统计学教研室,山西 太原 030001)]), AuthorCompany(id=1241102744770105755, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, xref=2., ext=[AuthorCompanyExt(id=1241102744778494364, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, companyId=1241102744770105755, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=2.煤炭环境致病与防治教育部重点实验室)])])], keywords=[Keyword(id=1241102748335264364, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, language=EN, orderNo=1, keyword=Multi-omics clustering), Keyword(id=1241102748423344757, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, language=EN, orderNo=2, keyword=Low-grade glioma), Keyword(id=1241102748515619456, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, language=EN, orderNo=3, keyword=Prognosis prediction), Keyword(id=1241102748641448585, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, language=EN, orderNo=4, keyword=Random forest), Keyword(id=1241102748763083410, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, language=CN, orderNo=1, keyword=多组学聚类), Keyword(id=1241102748918272672, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, language=CN, orderNo=2, keyword=低级别胶质瘤), Keyword(id=1241102749035713193, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, language=CN, orderNo=3, keyword=预后预测), Keyword(id=1241102749165736631, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, language=CN, orderNo=4, keyword=随机森林)], refs=[Reference(id=1241102753431344017, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, doi=null, pmid=null, pmcid=null, year=2024, volume=56, issue=2, pageStart=199, pageEnd=206, url=null, language=null, rfNumber=[1], rfOrder=0, authorNames=蔡祥, 王仁东, 王世佳, journalName=北京大学学报:医学版, refType=null, unstructuredReference=蔡祥,王仁东,王世佳,等.胶质母细胞瘤恶性进展中不同细胞亚群的动态轨迹和细胞通讯网络[J].北京大学学报:医学版202456(2):199-206., articleTitle=胶质母细胞瘤恶性进展中不同细胞亚群的动态轨迹和细胞通讯网络, refAbstract=null), Reference(id=1241102753569756060, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, doi=null, pmid=null, pmcid=null, year=2024, volume=56, issue=2, pageStart=199, pageEnd=206, url=null, language=null, rfNumber=[1], rfOrder=1, authorNames=Cai X, Wang RD, Wang SJ, journalName=Journal of Peking University(Health Sciences), refType=null, unstructuredReference=Cai X,Wang RD, Wang SJ, et al. Dynamic trajectory and cell communication of different cell clusters in malignant progression of glioblastoma[J]. Journal of Peking University(Health Sciences), 2024, 56(2): 199-206., articleTitle=Dynamic trajectory and cell communication of different cell clusters in malignant progression of glioblastoma, refAbstract=null), Reference(id=1241102753687196578, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, doi=null, pmid=null, pmcid=null, year=2023, volume=40, issue=3, pageStart=382, pageEnd=385, url=null, language=null, rfNumber=[2], rfOrder=2, authorNames=兰宁, 张永超, 李淼, journalName=中国卫生统计, refType=null, unstructuredReference=兰宁,张永超,李淼,等.结合正则化方法的区块森林在癌症多组学数据预后预测中的应用研究[J].中国卫生统计202340(3):382-385., articleTitle=结合正则化方法的区块森林在癌症多组学数据预后预测中的应用研究, refAbstract=null), Reference(id=1241102753796248486, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, doi=null, pmid=null, pmcid=null, year=2023, volume=40, issue=3, pageStart=382, pageEnd=385, url=null, language=null, rfNumber=[2], rfOrder=3, authorNames=Lan N, Zhang YC, Li M, journalName=Chinese Journal of Health Statistics, refType=null, unstructuredReference=Lan N, Zhang YC, Li M, et al. Application of block forest combined with regularization method in prognostic prediction of cancer multi-omics data[J]. Chinese Journal of Health Statistics, 2023, 40(3): 382-385., articleTitle=Application of block forest combined with regularization method in prognostic prediction of cancer multi-omics data, refAbstract=null), Reference(id=1241102753905300398, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, doi=null, pmid=null, pmcid=null, year=2020, volume=41, issue=5, pageStart=788, pageEnd=793, url=null, language=null, rfNumber=[3], rfOrder=4, authorNames=龙智平, 王帆, journalName=中华流行病学杂志, refType=null, unstructuredReference=龙智平,王帆.多组学整合分析的设计及统计方法在肿瘤流行病学研究中的应用[J].中华流行病学杂志202041(5):788-793., articleTitle=多组学整合分析的设计及统计方法在肿瘤流行病学研究中的应用, refAbstract=null), Reference(id=1241102754010158005, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, doi=null, pmid=null, pmcid=null, year=2020, volume=41, issue=5, pageStart=788, pageEnd=793, url=null, language=null, rfNumber=[3], rfOrder=5, authorNames=Long ZP, Wang F, journalName=Chinese Journal of Epidemiology, refType=null, unstructuredReference=Long ZP, Wang F. Study design and statistical methods used for integrative analysis on multi-omics in cancer epidemiology[J]. Chinese Journal of Epidemiology, 2020, 41(5): 788-793., articleTitle=Study design and statistical methods used for integrative analysis on multi-omics in cancer epidemiology, refAbstract=null), Reference(id=1241102754127598527, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, doi=null, pmid=null, pmcid=null, year=2024, volume=35, issue=2, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[4], rfOrder=6, authorNames=Liu XD, Wang WH, Zhang XL, journalName=Molecular Therapy. Nucleic Acids, refType=null, unstructuredReference=Liu XD, Wang WH, Zhang XL, et al. Metabolism pathway-based subtyping in endometrial cancer: An integrated study by multi-omics analysis and machine learning algorithms[J]. Molecular Therapy. Nucleic Acids, 2024, 35(2): 102155., articleTitle=Metabolism pathway-based subtyping in endometrial cancer: An integrated study by multi-omics analysis and machine learning algorithms, refAbstract=null), Reference(id=1241102754240844742, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, doi=null, pmid=null, pmcid=null, year=2024, volume=1870, issue=5, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[5], rfOrder=7, authorNames=Chakraborty S, Sharma G, Karmakar S, journalName=Biochim Biophys Acta Mol Basis Dis, refType=null, unstructuredReference=Chakraborty S, Sharma G, Karmakar S, et al. Multi-OMICS approaches in cancer biology: New era in cancer therapy[J]. Biochim Biophys Acta Mol Basis Dis, 2024, 1870(5): 167120., articleTitle=Multi-OMICS approaches in cancer biology: New era in cancer therapy, refAbstract=null), Reference(id=1241102754354090955, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, doi=null, pmid=null, pmcid=null, year=2021, volume=57, issue=23, pageStart=1, pageEnd=17, url=null, language=null, rfNumber=[6], rfOrder=8, authorNames=钟雅婷, 林艳梅, 陈定甲, journalName=计算机工程与应用, refType=null, unstructuredReference=钟雅婷,林艳梅,陈定甲,等.多组学数据整合分析和应用研究综述[J].计算机工程与应用202157(23):1-17., articleTitle=多组学数据整合分析和应用研究综述, refAbstract=null), Reference(id=1241102754454754257, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, doi=null, pmid=null, pmcid=null, year=2021, volume=57, issue=23, pageStart=1, pageEnd=17, url=null, language=null, rfNumber=[6], rfOrder=9, authorNames=Zhong YT, Lin YM, Chen DJ, journalName=Computer Engineering and Applications, refType=null, unstructuredReference=Zhong YT, Lin YM, Chen DJ, et al. Review on integration analysis and application of multi-omics data[J]. Computer Engineering and Applications, 2021, 57(23): 1-17., articleTitle=Review on integration analysis and application of multi-omics data, refAbstract=null), Reference(id=1241102754547028949, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, doi=null, pmid=null, pmcid=null, year=2024, volume=25, issue=2, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[7], rfOrder=10, authorNames=Cai YY, Wang SF, journalName=Briefings in Bioinformatics, refType=null, unstructuredReference=Cai YY, Wang SF. Deeply integrating latent consistent representations in high-noise multi-omics data for cancer subtyping[J]. Briefings in Bioinformatics, 2024, 25(2): bbae061., articleTitle=Deeply integrating latent consistent representations in high-noise multi-omics data for cancer subtyping, refAbstract=null), Reference(id=1241102754643497949, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, doi=null, pmid=null, pmcid=null, year=2020, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[8], rfOrder=11, authorNames=高振博, journalName=null, refType=null, unstructuredReference=高振博.基于多组学数据的聚类方法研究[D].大连:大连理工大学,2020., articleTitle=基于多组学数据的聚类方法研究, refAbstract=null), Reference(id=1241102754723189730, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, doi=null, pmid=null, pmcid=null, year=2020, volume=null, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[8], rfOrder=12, authorNames=Gao ZB, journalName=null, refType=null, unstructuredReference=Gao ZB. Research on clustering method based on multi-omics data[D].Dalian: Dalian University of Technology, 2020., articleTitle=Research on clustering method based on multi-omics data, refAbstract=null), Reference(id=1241102754798687206, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, doi=null, pmid=null, pmcid=null, year=2021, volume=36, issue=22/23, pageStart=5539, pageEnd=5541, url=null, language=null, rfNumber=[9], rfOrder=13, authorNames=Lu XF, Meng JL, Zhou YJ, journalName=Bioinformatics, refType=null, unstructuredReference=Lu XF, Meng JL, Zhou YJ, et al. MOVICS: an R package for multi-omics integration and visualization in cancer subtyping[J]. Bioinformatics, 2021, 36(22/23): 5539-5541., articleTitle=MOVICS: an R package for multi-omics integration and visualization in cancer subtyping, refAbstract=null), Reference(id=1241102754878378984, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, doi=null, pmid=null, pmcid=null, year=2022, volume=11, issue=23, pageStart=3784, pageEnd=null, url=null, language=null, rfNumber=[10], rfOrder=14, authorNames=Guan Y, Yue SY, Chen YD, journalName=Cells (Basel, Switzerland), refType=null, unstructuredReference=Guan Y, Yue SY, Chen YD, et al. Molecular cluster mining of adrenocortical carcinoma via Multi-Omics data analysis Aids precise clinical therapy[J]. Cells (Basel, Switzerland), 2022, 11(23): 3784., articleTitle=Molecular cluster mining of adrenocortical carcinoma via Multi-Omics data analysis Aids precise clinical therapy, refAbstract=null), Reference(id=1241102754962265070, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, doi=null, pmid=null, pmcid=null, year=2021, volume=28, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[11], rfOrder=15, authorNames=Macaulay BO, Aribisala BS, Akande SA, journalName=Cancer Treatmentand Research Communications, refType=null, unstructuredReference=Macaulay BO, Aribisala BS, Akande SA, et al. Breast cancer risk prediction in African women using Random Forest Classifier[J]. Cancer Treatmentand Research Communications, 2021, 28: 100396., articleTitle=Breast cancer risk prediction in African women using Random Forest Classifier, refAbstract=null), Reference(id=1241102755037762547, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, doi=null, pmid=null, pmcid=null, year=2024, volume=15, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[12], rfOrder=16, authorNames=Halder RK, Uddin MN, Uddin MA, journalName=Journal of Pathology Informatics, refType=null, unstructuredReference=Halder RK,Uddin MN, Uddin MA, et al. ML-CKDP: machine learning-based chronic kidney disease prediction with smart web application[J]. Journal of Pathology Informatics, 2024, 15: 100371., articleTitle=ML-CKDP: machine learning-based chronic kidney disease prediction with smart web application, refAbstract=null), Reference(id=1241102755117454323, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, doi=null, pmid=null, pmcid=null, year=2017, volume=12, issue=5, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[13], rfOrder=17, authorNames=Chalise P, Fridley BL, journalName=PLOS One, refType=null, unstructuredReference=Chalise P, Fridley BL. Integrative clustering of multi-level ’omic data based on non-negative matrix factorization algorithm[J]. PLOS One, 2017, 12(5): e0176278., articleTitle=Integrative clustering of multi-level ’omic data based on non-negative matrix factorization algorithm, refAbstract=null), Reference(id=1241102755205534714, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, doi=null, pmid=null, pmcid=null, year=2019, volume=null, issue=12, pageStart=76, pageEnd=80, url=null, language=null, rfNumber=[14], rfOrder=18, authorNames=王俊丰, 贾晓霞, 李志强, journalName=信息技术, refType=null, unstructuredReference=王俊丰,贾晓霞,李志强.基于K-means算法改进的短文本聚类研究与实现[J].信息技术2019(12):76-80., articleTitle=基于K-means算法改进的短文本聚类研究与实现, refAbstract=null), Reference(id=1241102755289420795, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, doi=null, pmid=null, pmcid=null, year=2019, volume=null, issue=12, pageStart=76, pageEnd=80, url=null, language=null, rfNumber=[14], rfOrder=19, authorNames=Wang JF, Jia XX, Li ZQ, journalName=Information Technology, refType=null, unstructuredReference=Wang JF, Jia XX, Li ZQ. Research and implementation of short text clustering based on improved K-means algorithm[J]. Information Technology, 2019(12): 76-80., articleTitle=Research and implementation of short text clustering based on improved K-means algorithm, refAbstract=null), Reference(id=1241102755398472706, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, doi=null, pmid=null, pmcid=null, year=2022, volume=20, issue=null, pageStart=5203, pageEnd=5217, url=null, language=null, rfNumber=[15], rfOrder=20, authorNames=Du ZX, Wang YQ, Liang JQ, journalName=Computational and Structural Biotechnology Journal, refType=null, unstructuredReference=Du ZX, Wang YQ, Liang JQ, et al. Association of glioma CD44 expression with glial dynamics in the tumour microenvironment and patient prognosis[J]. Computational and Structural Biotechnology Journal, 2022, 20: 5203-5217., articleTitle=Association of glioma CD44 expression with glial dynamics in the tumour microenvironment and patient prognosis, refAbstract=null), Reference(id=1241102755494940675, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, doi=null, pmid=null, pmcid=null, year=2024, volume=10, issue=7, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[16], rfOrder=21, authorNames=Li L, Zhang WW, Sun YJ, journalName=Heliyon, refType=null, unstructuredReference=Li L, Zhang WW, Sun YJ, et al. A clinical prognostic model of oxidative stress-related genes linked to tumor immune cell infiltration and the prognosis of ovarian cancer patients[J]. Heliyon, 2024, 10(7): e28442., articleTitle=A clinical prognostic model of oxidative stress-related genes linked to tumor immune cell infiltration and the prognosis of ovarian cancer patients, refAbstract=null), Reference(id=1241102755583021065, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, doi=null, pmid=null, pmcid=null, year=2018, volume=46, issue=20, pageStart=10546, pageEnd=10562, url=null, language=null, rfNumber=[17], rfOrder=22, authorNames=Rappoport N, Shamir R, journalName=Nucleic Acids Research, refType=null, unstructuredReference=Rappoport N, Shamir R. Multi-omic and multi-view clustering algorithms: review and cancer benchmark[J]. Nucleic Acids Research, 2018, 46(20): 10546-10562., articleTitle=Multi-omic and multi-view clustering algorithms: review and cancer benchmark, refAbstract=null), Reference(id=1241102755675295759, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, doi=null, pmid=null, pmcid=null, year=2024, volume=12, issue=null, pageStart=null, pageEnd=null, url=null, language=null, rfNumber=[18], rfOrder=23, authorNames=Gao DD, Zhou QY, Hou DQ, journalName=PeerJ, refType=null, unstructuredReference=Gao DD, Zhou QY, Hou DQ, et al. A novel peroxisome-related gene signature predicts clinical prognosis and is associated with immune microenvironment in low-grade glioma[J]. PeerJ, 2024, 12: e16874., articleTitle=A novel peroxisome-related gene signature predicts clinical prognosis and is associated with immune microenvironment in low-grade glioma, refAbstract=null), Reference(id=1241102757222993939, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, doi=null, pmid=null, pmcid=null, year=2024, volume=16, issue=10, pageStart=8697, pageEnd=8716, url=null, language=null, rfNumber=[19], rfOrder=24, authorNames=Zhao ZR, Ma YH, Liu Y, journalName=Aging, refType=null, unstructuredReference=Zhao ZR, Ma YH, Liu Y, et al. A cuproptosis-based prognostic model for predicting survival in low-grade glioma[J]. Aging, 2024, 16(10): 8697-8716., articleTitle=A cuproptosis-based prognostic model for predicting survival in low-grade glioma, refAbstract=null)], funds=[Fund(id=1241102753309709191, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, awardId=201801D121210, language=CN, fundingSource=山西省自然科学基金(201801D121210), fundOrder=null, country=null)], companyList=[AuthorCompany(id=1241102744686219663, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, xref=1., ext=[AuthorCompanyExt(id=1241102744694608273, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, companyId=1241102744686219663, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, Shanxi 030001, China), AuthorCompanyExt(id=1241102744698802578, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, companyId=1241102744686219663, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1.山西医科大学公共卫生学院卫生统计学教研室,山西 太原 030001)]), AuthorCompany(id=1241102744770105755, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, xref=2., ext=[AuthorCompanyExt(id=1241102744778494364, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, companyId=1241102744770105755, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=2.煤炭环境致病与防治教育部重点实验室)])], figs=[ArticleFig(id=1241102749320925901, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, language=EN, label=Fig.1, caption=Optimal number of clusters, figureFileSmall=KcEZh10A+JMRamEteFbj1w==, figureFileBig=fj9sWPNboMpSwL2BYiF3hw==, tableContent=null), ArticleFig(id=1241102749425783512, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, language=CN, label=图1, caption=最佳聚类数, figureFileSmall=KcEZh10A+JMRamEteFbj1w==, figureFileBig=fj9sWPNboMpSwL2BYiF3hw==, tableContent=null), ArticleFig(id=1241102749677441777, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, language=EN, label=Fig.2, caption=KM survival curves for TCGA-LGG, figureFileSmall=/+93j1tL3SqxJcnh4LCq/A==, figureFileBig=L3iFJy1wPYR9QP4DbhzcRA==, tableContent=null), ArticleFig(id=1241102749786493693, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, language=CN, label=图2, caption=TCGA-LGG-KM生存曲线, figureFileSmall=/+93j1tL3SqxJcnh4LCq/A==, figureFileBig=L3iFJy1wPYR9QP4DbhzcRA==, tableContent=null), ArticleFig(id=1241102749887156999, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, language=EN, label=Fig.3, caption=Nomogram of multifactorial Cox regression analysis in TCGA-LGG patients, figureFileSmall=PJj+gc1qfRsGoNXIMlXOJQ==, figureFileBig=bcNalAiiNzn/fRgD1KVz5w==, tableContent=null), ArticleFig(id=1241102749992014610, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, language=CN, label=图3, caption=TCGA-LGG患者多因素Cox回归分析Nomogram图, figureFileSmall=PJj+gc1qfRsGoNXIMlXOJQ==, figureFileBig=bcNalAiiNzn/fRgD1KVz5w==, tableContent=null), ArticleFig(id=1241102750122038044, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, language=EN, label=Fig.4, caption=Multifactorial Cox regression analysis of 1,3and 5-year calibration curves in TCGA-LGG patients, figureFileSmall=TRyV3fYd49cRKHPtfpnlAg==, figureFileBig=mpftwO7PVNUE2hyS3anTEA==, tableContent=null), ArticleFig(id=1241102750252061475, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, language=CN, label=图4, caption=TCGA-LGG患者多因素Cox回归分析1、3和5年校准曲线, figureFileSmall=TRyV3fYd49cRKHPtfpnlAg==, figureFileBig=mpftwO7PVNUE2hyS3anTEA==, tableContent=null), ArticleFig(id=1241102750403056431, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, language=EN, label=Fig.5, caption=Cox regression analysis of ROC curves in TCGA-LGG patients, figureFileSmall=zr6sJrgbYraRk0RyghCQgA==, figureFileBig=om38f00TNT47t16vBwnYrg==, tableContent=null), ArticleFig(id=1241102750541468468, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, language=CN, label=图5, caption=TCGA-LGG患者多因素Cox回归分析ROC曲线, figureFileSmall=zr6sJrgbYraRk0RyghCQgA==, figureFileBig=om38f00TNT47t16vBwnYrg==, tableContent=null), ArticleFig(id=1241102750679880507, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, language=EN, label=Fig.6, caption=Survival curves of LGG patients for TCGA training set, internal validation set and CGGA external validation set, figureFileSmall=OdO5eSXzhU+Qhu2wYYKvrA==, figureFileBig=ezK8Hgq4wwSPF99PZ+kTdw==, tableContent=null), ArticleFig(id=1241102750809903939, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, language=CN, label=图6, caption=TCGA训练集、内部验证集和CGGA外部验证集LGG患者生存曲线, figureFileSmall=OdO5eSXzhU+Qhu2wYYKvrA==, figureFileBig=ezK8Hgq4wwSPF99PZ+kTdw==, tableContent=null), ArticleFig(id=1241102751036396367, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, language=EN, label=Table 1, caption=

Key features of the dataset situation

, figureFileSmall=null, figureFileBig=null, tableContent=
数据集组学类型样本量特征数
TCGA-LGGmRNA502691
miRNA5021 524
DNA甲基化502671
CGGA1mRNA172691
CGGA2mRNA420691
), ArticleFig(id=1241102751153836888, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, language=CN, label=表1, caption=

数据集关键特征情况

, figureFileSmall=null, figureFileBig=null, tableContent=
数据集组学类型样本量特征数
TCGA-LGGmRNA502691
miRNA5021 524
DNA甲基化502671
CGGA1mRNA172691
CGGA2mRNA420691
), ArticleFig(id=1241102752684757856, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, language=EN, label=Table 2, caption=

Basic information on TCGA-LGG patient typing

, figureFileSmall=null, figureFileBig=null, tableContent=
项目CS1(n=312)CS2(n=190)χ2/zP
年龄(岁,)40.9±12.446.6±14.34.391<0.001
性别[n(%)]1.9050.168
男性179(57.4)97(51.1)
女性133(42.6)93(48.9)
癌症分级[n(%)]10.498<0.001
WHO G2168(53.8)74(38.9)
WHO G3144(46.2)116(61.1)
生存状况[n(%)]27.604<0.001
存活259(83.0)118(62.1)
死亡53(17.0)72(37.9)
), ArticleFig(id=1241102752764449638, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, language=CN, label=表2, caption=

TCGA-LGG患者亚型的基本资料

, figureFileSmall=null, figureFileBig=null, tableContent=
项目CS1(n=312)CS2(n=190)χ2/zP
年龄(岁,)40.9±12.446.6±14.34.391<0.001
性别[n(%)]1.9050.168
男性179(57.4)97(51.1)
女性133(42.6)93(48.9)
癌症分级[n(%)]10.498<0.001
WHO G2168(53.8)74(38.9)
WHO G3144(46.2)116(61.1)
生存状况[n(%)]27.604<0.001
存活259(83.0)118(62.1)
死亡53(17.0)72(37.9)
), ArticleFig(id=1241102752877695854, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, language=EN, label=Table 3, caption=

Results of multifactorial Cox regression analysis in TCGA-LGG patients

, figureFileSmall=null, figureFileBig=null, tableContent=
变量b(S.E)Waldχ2PHR(95% CI)
年龄0.051(0.008)46.221<0.0011.053(1.037~1.069)
癌症分级1.005(0.203)24.522<0.0012.733(1.836~4.069)
亚型1.166(0.189)38.065<0.0013.210(2.216~4.650)
), ArticleFig(id=1241102752982553456, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, language=CN, label=表3, caption=

TCGA-LGG患者多因素Cox回归分析结果

, figureFileSmall=null, figureFileBig=null, tableContent=
变量b(S.E)Waldχ2PHR(95% CI)
年龄0.051(0.008)46.221<0.0011.053(1.037~1.069)
癌症分级1.005(0.203)24.522<0.0012.733(1.836~4.069)
亚型1.166(0.189)38.065<0.0013.210(2.216~4.650)
), ArticleFig(id=1241102753091605370, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, language=EN, label=Table 4, caption=

Model performance for TCGA training set, internal validation set and CGGA external validation set test set

, figureFileSmall=null, figureFileBig=null, tableContent=
数据集C指数Brier评分log-rankP
TCGA训练集(70%)0.7170.0441e-12
TCGA验证集(30%)0.7210.0665e-05
CGGA10.5740.1791e-02
CGGA20.5720.1281e-04
), ArticleFig(id=1241102753209045887, tenantId=1146029695717560320, journalId=1227665162245664772, articleId=1241102739212652677, language=CN, label=表4, caption=

TCGA训练集、内部验证集和CGGA外部验证集的模型性能

, figureFileSmall=null, figureFileBig=null, tableContent=
数据集C指数Brier评分log-rankP
TCGA训练集(70%)0.7170.0441e-12
TCGA验证集(30%)0.7210.0665e-05
CGGA10.5740.1791e-02
CGGA20.5720.1281e-04
)], attaches=null, journal=Journal(id=1227664546253402114, delFlag=0, nameCn=现代预防医学, nameEn=Modern Preventive Medicine, nameHistory1=null, nameHistory2=null, issn=1003-8507, eissn=null, cn=51-1365/R, coden=null, periodic=3, language=CN, oaType=null, ccby=null, superviseOffice=null, ownerOffice=null, pubOffice=null, editorOffice=null, officeType=null, aims=null, clcCode=null, officeProv=null, officeCity=null, officeAddr=null, officeZip=null, officeEmail=null, officePhone=null, editDirector=null, officeDirector=null, officeDirectorPhone=null, officeStaffNum=null, officeEmpNum=null, coverPicUrl=IeiuPXEZi6AA+k0VfvoiOQ==, journalPrice=null, startedYear=null, abbrevIsoEn=Modern Preventive Medicine, journalRemark=null, publicationField=null, createdTime=1770627636734, updatedTime=1770628902248, createdBy=18614031015, updatedBy=13701087609, firstLetterCn=M, firstLetterEn=M, subjectCode=Life Sciences, subjectName=null, subjectCodeEn=Life Sciences, subjectNameEn=null, picCn=IeiuPXEZi6AA+k0VfvoiOQ==, picEn=/9iTl8/ndms4tBz1fL28Pg==, jcr=null, cjcr=null, exts=[JournalExt(id=1227669854342280188, language=CN, name=现代预防医学, nameHistory1=null, nameHistory2=null, managedBy=, sponsoredBy=, publishedBy=, editorOffice=, officeProv=null, officeCity=null, officeAddr=, officeZip=, editDirector=, officeDirector=null, officePhone=null, coverPicUrl=null, journalRemark=, submitArticleUrl=null, websiteUrl=, createdTime=1770628902278, updatedTime=1770628902278, createdBy=13701087609, updatedBy=13701087609, submissionGuidelinesUrl=, submissionAuthorUrl=http://xdyfyxzz.paperopen.com/#/regist, submissionEditorUrl=http://xdyfyxzz.paperopen.com/#/Login, submissionReviewUrl=http://xdyfyxzz.paperopen.com/#/Login, submissionCeEditorUrl=, submissionAeEditorUrl=, option={"copyright":""}), JournalExt(id=1227669854396806141, language=EN, name=Modern Preventive Medicine, nameHistory1=null, nameHistory2=null, managedBy=, sponsoredBy=, publishedBy=, editorOffice=, officeProv=null, officeCity=null, officeAddr=, officeZip=, editDirector=, officeDirector=null, officePhone=null, coverPicUrl=null, journalRemark=, submitArticleUrl=null, websiteUrl=, createdTime=1770628902291, updatedTime=1770628902291, createdBy=13701087609, updatedBy=13701087609, submissionGuidelinesUrl=, submissionAuthorUrl=http://xdyfyxzz.paperopen.com/#/regist, submissionEditorUrl=http://xdyfyxzz.paperopen.com/#/Login, submissionReviewUrl=http://xdyfyxzz.paperopen.com/#/Login, submissionCeEditorUrl=, submissionAeEditorUrl=, option={"copyright":""})], databaseList=null, tenantJournalId=1227665162245664772, websiteList=[Website(id=1227687234141352800, webName=null, webTitle=null, webDomain=null, webCopyrigh=null, webIpcNo=null, seoTitle=null, seoKeywords=null, seoDescription=null, tenantJournalId=null, journalId=1227665162245664772, journalNameCn=null, journalNameEn=null, grayFlag=null, tenantId=1146029695717560320, platformId=null, journalGroupId=null, journalGroupNameCn=null, journalGroupNameEn=null, type=1, domain=https://castjournals.cast.org.cn/joweb/xdyfyx/CN, language=CN, createTime=1770633045945, createBy=18614031015, updateTime=1770633090526, updateBy=18614031015, name=现代预防医学-中文, tplId=1146099689490845704, title=现代预防医学, delFlag=0, indexPage=/home, props=[WebsiteProps(id=1227687735088051072, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1227687234141352800, code=articleTextType, value=kx, createTime=1770633165380, updateTime=1770633165380, creator=18614031015, updator=18614031015), WebsiteProps(id=1227687735071273853, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1227687234141352800, code=banner, value=null, createTime=1770633165376, updateTime=1770633165376, creator=18614031015, updator=18614031015), WebsiteProps(id=1227687735113216899, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1227687234141352800, code=grayFlag, value=0, createTime=1770633165386, updateTime=1770633165386, creator=18614031015, updator=18614031015), WebsiteProps(id=1227687735062885244, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1227687234141352800, code=logo, value=https://castjournals.cast.org.cn/joweb/xdyfyx/CN/file/pic?fileId=/XB5plC0xuykmQnycvtyrw==, createTime=1770633165374, updateTime=1770633165374, creator=18614031015, updator=18614031015), WebsiteProps(id=1227687735125799813, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1227687234141352800, code=minRunFlag, value=0, createTime=1770633165389, updateTime=1770633165389, creator=18614031015, updator=18614031015), WebsiteProps(id=1227687735083856767, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1227687234141352800, code=picServerUrl, value=https://castjournals.cast.org.cn/joweb/xdyfyx/CN/file/pic, createTime=1770633165379, updateTime=1770633165379, creator=18614031015, updator=18614031015), WebsiteProps(id=1227687735121605508, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1227687234141352800, code=silenceFlag, value=0, createTime=1770633165388, updateTime=1770633165388, creator=18614031015, updator=18614031015), WebsiteProps(id=1227687735079662462, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1227687234141352800, code=staticResourcePath, value=https://castjournals.cast.org.cn/joweb/cast_kjdb_cn_619/, createTime=1770633165378, updateTime=1770633165378, creator=18614031015, updator=18614031015), WebsiteProps(id=1227687735096439681, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1227687234141352800, code=themeColor, value=null, createTime=1770633165382, updateTime=1770633165382, creator=18614031015, updator=18614031015), WebsiteProps(id=1227687735104828290, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1227687234141352800, code=themeStyle, value=null, createTime=1770633165384, updateTime=1770633165384, creator=18614031015, updator=18614031015)]), Website(id=1227687234338485094, webName=null, webTitle=null, webDomain=null, webCopyrigh=null, webIpcNo=null, seoTitle=null, seoKeywords=null, seoDescription=null, tenantJournalId=null, journalId=1227665162245664772, journalNameCn=null, journalNameEn=null, grayFlag=null, tenantId=1146029695717560320, platformId=null, journalGroupId=null, journalGroupNameCn=null, journalGroupNameEn=null, type=1, domain=https://castjournals.cast.org.cn/joweb/xdyfyx/EN, language=EN, createTime=1770633045992, createBy=18614031015, updateTime=1770633115374, updateBy=18614031015, name=现代预防医学-英文, tplId=1146101810881728533, title=Modern Preventive Medicine, delFlag=0, indexPage=/home, props=[WebsiteProps(id=1227687709129507332, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1227687234338485094, code=articleTextType, value=kx, createTime=1770633159191, updateTime=1770633159191, creator=18614031015, updator=18614031015), WebsiteProps(id=1227687709108535809, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1227687234338485094, code=banner, value=null, createTime=1770633159186, updateTime=1770633159186, creator=18614031015, updator=18614031015), WebsiteProps(id=1227687709167256071, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1227687234338485094, code=grayFlag, value=0, createTime=1770633159200, updateTime=1770633159200, creator=18614031015, updator=18614031015), WebsiteProps(id=1227687709095952896, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1227687234338485094, code=logo, value=https://castjournals.cast.org.cn/joweb/xdyfyx/EN/file/pic?fileId=/XB5plC0xuykmQnycvtyrw==, createTime=1770633159183, updateTime=1770633159183, creator=18614031015, updator=18614031015), WebsiteProps(id=1227687709179838985, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1227687234338485094, code=minRunFlag, value=0, createTime=1770633159203, updateTime=1770633159203, creator=18614031015, updator=18614031015), WebsiteProps(id=1227687709121118723, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1227687234338485094, code=picServerUrl, value=https://castjournals.cast.org.cn/joweb/xdyfyx/EN/file/pic, createTime=1770633159189, updateTime=1770633159189, creator=18614031015, updator=18614031015), WebsiteProps(id=1227687709171450376, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1227687234338485094, code=silenceFlag, value=0, createTime=1770633159201, updateTime=1770633159201, creator=18614031015, updator=18614031015), WebsiteProps(id=1227687709112730114, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1227687234338485094, code=staticResourcePath, value=https://castjournals.cast.org.cn/joweb/cast_kjdb_en_623/, createTime=1770633159187, updateTime=1770633159187, creator=18614031015, updator=18614031015), WebsiteProps(id=1227687709133701637, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1227687234338485094, code=themeColor, value=null, createTime=1770633159192, updateTime=1770633159192, creator=18614031015, updator=18614031015), WebsiteProps(id=1227687709154673158, tenantId=1146029695717560320, journalId=null, journalGroupId=null, siteId=1227687234338485094, code=themeStyle, value=null, createTime=1770633159197, updateTime=1770633159197, creator=18614031015, updator=18614031015)])], journalTitle=现代预防医学, weixinUrl=null, journalUrl=http://xdyfyxzz.paperopen.com/, iacademicId=null, status=1, seqNo=null, journalTitleEn=Modern Preventive Medicine, journalPhotoCn=IeiuPXEZi6AA+k0VfvoiOQ==, journalPhotoEn=/9iTl8/ndms4tBz1fL28Pg==, journalFirstLetter=M, journalRecommend=null, journalNew=null, journalCollection=null, jcrJf=null, cjcrJf=null, jcrJfStr=null, cjcrJfStr=null, submissionFirstDecision=null, sciSubjectClassification=null, casSubjectClassification=null, citeScore=null, totalCitationFrequency=null, icpCode=null, psCode=null, advertisingLicenseCode=null, copyrightInformation=null, country=null, option=, provinceCode=null, provinceName=null, collectFlag=false), detailUrlCn=https://castjournals.cast.org.cn/joweb/xdyfyx/CN/10.20043/j.cnki.MPM.202404262, detailUrlEn=https://castjournals.cast.org.cn/joweb/xdyfyx/EN/10.20043/j.cnki.MPM.202404262, pdfUrlCn=https://castjournals.cast.org.cn/joweb/xdyfyx/CN/PDF/10.20043/j.cnki.MPM.202404262, pdfUrlEn=https://castjournals.cast.org.cn/joweb/xdyfyx/EN/PDF/10.20043/j.cnki.MPM.202404262, aliStartDate=null, aliEndDate=null, collectionFlag=false, citedCount=null, citedUrl=null, reference=null)
收藏切换
基于多组学数据构建低级别胶质瘤患者预后预测模型
收藏切换
PDF下载
刘毅蓉 1 , 任月 1 , 秦阳 1 , 武舒琪 1 , 赵晋芳 1, 2 , 罗天娥 1, 2
现代预防医学 | 临床与预防 2024,51(14): 2669-2674
收起
收藏切换
现代预防医学 | 临床与预防 2024, 51(14): 2669-2674
基于多组学数据构建低级别胶质瘤患者预后预测模型
全屏
刘毅蓉1, 任月1, 秦阳1, 武舒琪1, 赵晋芳1, 2, 罗天娥1, 2
作者信息
  • 1.山西医科大学公共卫生学院卫生统计学教研室,山西 太原 030001
  • 2.煤炭环境致病与防治教育部重点实验室
  • 刘毅蓉(1998—),女,硕士在读,研究方向:机器学习在疾病分类和预后预测中的应用

通讯作者:

罗天娥,E-mail:
Prognostic prediction model for patients with low-grade gliomas based on multi-omics data
Yi-rong LIU1, Yue REN1, Yang QIN1, Shu-qi WU1, Jin-fang ZHAO1, 2, Tian-e LUO1, 2
Affiliations
  • Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, Shanxi 030001, China
出版时间: 2024-07-25 doi: 10.20043/j.cnki.MPM.202404262
文章导航
收藏切换
目的

探讨综合聚类方法在识别低级别胶质瘤(LGG)亚型和预后预测中的应用。

方法

采用集合了十种聚类算法的综合聚类算法(MOVICS)对TCGA下载的LGG患者多组学数据进行整合,得到聚类亚型;通过多因素Cox回归分析LGG的预后因素。使用mRNA数据构建随机森林分类预测模型来评估分类性能,用CGGA数据集进行外部验证。

结果

LGG患者经过聚类分为两型,两组生存率差异具有统计学意义(χ2=54.410,P<0.001)。多因素Cox回归分析结果表明,年龄(HR=1.053,95%CI:1.037~1.069)、癌症分级(HR=2.733,95%CI:1.836~4.069)和聚类亚型(HR=3.210,95%CI:2.216~4.650)都是LGG的预后因素,Nomogram图、校准曲线和ROC曲线结果表明模型的预测性能良好。十折交叉验证RF模型的平均预测准确率为87.81%,训练集、内部验证集和两个外部验证集的C指数分别为0.717、0.721、0.574和0.572,Brier评分分别为0.044、0.066、0.179和0.128,两个外部验证数据集的生存差异均有统计学意义(P<0.05)。

结论

综合聚类方法能够有效识别LGG亚型,其亚型是LGG患者的预后因素,并在外部数据集CGGA中得到验证,可为LGG的临床个性化治疗提供重要的理论依据。

多组学聚类  /  低级别胶质瘤  /  预后预测  /  随机森林
Objective

To explore the application of integrated clustering methods in identifying low-grade glioma subtypes and prognostic prediction.

Methods

A comprehensive clustering algorithm (MOVICS), which pools ten clustering algorithms, was used to integrate the multi-omics data of LGG patients downloaded from TCGA to obtain cluster subtypes; prognostic factors of LGG were analyzed by multifactorial Cox regression. A random forest classification prediction model was constructed using mRNA data to evaluate the classification performance and externally validated with the CGGA dataset.

Results

LGG patients were clustered into two subtypes, and the difference in survival between the two groups was statistically significant (χ2=54.410, P<0.001). The results of multifactorial Cox regression analysis showed that age (HR=1.053,95%CI: 1.037-1.069), cancer grade (HR=2.733,95%CI: 1.836-4.069) and cluster typing (HR=3.210,95%CI: 2.216-4.650) were all prognostic factors for LGG, and the results of Nomogram plots, calibration curves and ROC curves indicated good predictive performance of the model. The average prediction accuracy of the ten-fold cross-validated RF model was 87.81%, and the C-indexes of the training set, the internal validation set, and the two external validation sets were 0.717, 0.721, 0.574, and 0.572, and the Brier scores were 0.044, 0.066, 0.179, and 0.128, and the differences in the survival of the two external validation datasets were all statistically significance (P<0.05).

Conclusion

The comprehensive clustering method can effectively identify LGG subtypes, which are prognostic factors for LGG patients, and has been validated in an external dataset, CGGA, which can provide an important theoretical basis for clinical personalized treatment of LGG.

Multi-omics clustering  /  Low-grade glioma  /  Prognosis prediction  /  Random forest
刘毅蓉, 任月, 秦阳, 武舒琪, 赵晋芳, 罗天娥. 基于多组学数据构建低级别胶质瘤患者预后预测模型. 现代预防医学, 2024 , 51 (14) : 2669 -2674 . DOI: 10.20043/j.cnki.MPM.202404262
Yi-rong LIU, Yue REN, Yang QIN, Shu-qi WU, Jin-fang ZHAO, Tian-e LUO. Prognostic prediction model for patients with low-grade gliomas based on multi-omics data[J]. Modern Preventive Medicine, 2024 , 51 (14) : 2669 -2674 . DOI: 10.20043/j.cnki.MPM.202404262
低级别胶质瘤(low-grade glioma,LGG)具有高度异变性,极易发展为最恶性的胶质母细胞瘤(Glioblastoma,GBM),对患者生存提出巨大考验[1]。由于LGG患者预后差异较大,不能仅根据组织学亚型进行预测,因此探讨更好的亚型预测方法十分关键。随着高通量测序技术与基因组芯片技术的飞速发展,获得了多种癌症患者的基因组学、转录组学和蛋白质组学等组学数据[2],可以反映癌症的多种生物学过程[3]。聚类可以对组学数据进行整合分析,在疾病亚型、精准医疗、药物研究等方面具有非常重要的现实意义[4-5]。整合多组学数据有利于对生物医学数据进行全面深入的研究,补充单一组学中缺失或不可靠的信息[6]。由于组学数据通常具有高维度、样本量少、高噪声[7]的特点,研究中选择合适的聚类算法十分关键[8]
MOVICS是一种综合聚类方法,通过综合iClusterBayes、Mocluster、CIMLR、IntNMF、ConsensusClustering、LRAcluster、COCA、PINSplus、SNF和NEMO十种聚类算法,得到聚类亚型。本文使用TCGA的LGG患者多组学数据库,通过MOVICS模型进行聚类,利用多因素Cox回归分析不同亚型对LGG患者预后的影响,绘制Nomogram图、校准曲线和ROC曲线评价模型的预测性能;使用mRNA数据构建随机森林模型验证MOVICS聚类性能,同时用CGGA数据进行外部验证,为LGG的临床个性化治疗提供重要的理论依据。
从UCSCXena网站(https://xenabrowser.net)下载低级别胶质瘤(LGG)癌症患者的基因表达、DNA甲基化、miRNA数据以及临床信息数据(性别、年龄、癌症分级、生存时间和生存状态)。其中count数据经过log转化,DNA甲基化数据剔除未匹配到基因的探针以及对应多个基因名称的探针,再去除重复基因。多组学数据均剔除缺失值超过10%的基因,剩余缺失值用中位数进行插补,数据经过归一化处理。从中国脑胶质瘤基因组图谱计划CGGA(http://www.cgga.org.cn/)下载胶质瘤的两个基因表达数据以及临床信息数据(生存时间、生存状态),mRNA数据经log转化,排除临床数据中WHOIV型胶质瘤,删除缺失数据,去除批次效应。从COSMIC(https://cancer.sanger.ac.uk/)数据库获得733个泛癌相关基因,在mRNA和DNA甲基化数据中筛选泛癌相关基因。
MOVICS是Lu等[9]2020年提出的综合聚类方法。它的输入是多组学数据,输出是综合聚类后最优的分子亚型。MOVICS选择聚类预测指数(Cluster Prediction Index,CPI)和Gap统计量之和为最大值的聚类数作为最佳聚类数。通过综合十种聚类算法(iClusterBayes、moCluster、CIMLR、IntNMF、COCA、NEMO、ConsensusClustering、PINSPlus、SNF和LRA),将患者分为不同的亚型,并使用共识聚类进行组合分类,以高度鲁棒性识别每个亚型。
具体来说,如果指定tmax算法,且2≤tmax≤10,则每个算法计算一个矩阵,其中n为样本个数,当样本ij聚类在同一子类型中时,;否则。在获得指定算法的所有结果后,MOVICS计算共识矩阵,并且CMϵ[0,10],并通过剪影评分计算各亚型之间的样本相似性[10]
随机森林(random forest,RF)是一种集成学习方法,通过构建多个决策树并取其结果的平均值或投票来进行预测,在每个决策树的训练过程中,采用了自助采样法对样本和特征进行随机选择,能够处理变量数量超过观测数量的数据集[11]。模型具有较高的稳定性和泛化能力,可以很好地处理过拟合问题[12],有效处理缺失和有噪声的数据。
CPI:基于重采样的方法将数据重复划分为训练集和测试集。每次重复时,将算法应用于训练集估计系数矩阵,在测试集上使用系数矩阵估计公共基矩阵,重复多次,计算调整后的兰德指数的平均值为CPI[13],一般选择导致CPI最大值的聚类数作为最佳聚类数。Gap统计量[14]:引入参考的测值由MonteCarlo采样的方法获得,通过计算标准差来矫正Gap统计量,一般选择Gap统计量最大值对应的聚类数为最佳聚类数。
选择C指数、Brier评分和log-rankP值来评估模型的分类预测性能。C指数取值在0和1之间,该值越高概率预测的准确性越高;Brier评分取值在0和1之间,Brier分数越低概率预测的准确性越高;检验水准α=0.05。
本研究在R 4.1.3软件中完成,去除批次效应在sva包中实现,聚类分析采用MOVICS包,随机森林预测模型构建在randomForest包中实现。
从TCGA数据库下载LGG的mRNA、miRNA和DNA甲基化数据预处理后整合得到502个患者,mRNA、DNA甲基化数据筛选泛癌相关基因并归一化。CGGA数据库的两个mRNA数据集筛选泛癌相关基因,去批次效应。所用数据集关键特征如表1所示。
使用MOVICS对TCGA的多组学数据进行聚类,聚类数的范围设定为[2,8],结合不同聚类数的CPI和Gap统计量结果,当聚类数为2时CPI和Gap统计量之和达到最大值,模型较优,见图1
最佳聚类数为2,即MOVICS将502名LGG患者分为两个亚型,CS1和CS2,基本资料如表2所示。绘制Kaplan-Meier生存曲线,见图2,两个亚型患者的生存率差异有统计学意义(χ2=54.410,P<0.001),且CS2组生存率较低,预后较差,表明多组学数据聚类可以得到不同预后的LGG患者。
以生存时间和生存状态作为因变量,对年龄、性别(女性=0,男性=1)、癌症分级(WHO G2=1,WHO G3=2)和MOVICS聚类亚型结果(CS1=1,CS2=2)进行多因素Cox逐步回归分析,结果显示年龄、癌症分级、聚类亚型结果具有统计学意义(P<0.05),可作为低级别胶质瘤患者的预后预测因素,见表3
基于多因素Cox回归分析的结果,对LGG患者的预后因素构建Nomogram图,预测LGG患者1、3和5年的生存率。Nomogram图的C指数为0.82。Nomogram图中所有变量Points轴的分数之和在Totalpoints轴上显示,可以直观的估算出LGG患者1、3和5年的生存率,见图3
利用校准曲线评价Nomogram图,预测LGG患者生存率的准确性,以坐标轴原点且斜率为1的标准曲线作为参照。LGG患者的1、3和5年偏差校准曲线接近标准曲线,表明预测的LGG患者的生存率与实际观察到的结果偏差小,有很好的一致性,见图4。绘制1、3和5年的ROC曲线,LGG患者的AUC均大于0.8,表明模型有很好的预测准确性,见图5
利用TCGA数据库中LGG患者的mRNA数据,根据滑动窗口序贯向前特征选择法筛选出145个基因,将这145个基因的表达量以及获得的样本聚类标签整合到一起,构建一个十折交叉验证RF模型。按70%和30%的比例划分训练集和内部验证集,十折交叉验证的平均预测准确率为87.81%。使用CGGA数据库中两个LGG患者数据集进行外部验证。
训练集、内部验证集和外部验证集的C指数、Brier评分和log-rank P值结果如表4所示。训练集、内部验证集的C指数大于0.7,Brier评分小于0.1,模型性能良好;两个CGGA数据集的C指数大于0.5,Brier评分小于0.25,模型性能较好,表明使用的特征选择算法对预测LGG的亚型具有鲁棒性。绘制Kaplan-Meier生存曲线,表明RF模型能够很好地区分该LGG内部验证集和CGGA外部验证集的亚型类型,生存差异均有统计学意义(P<0.05),见图6
LGG具有高度的遗传异质性,传统组织病理学分级中同一类别的预后也存在异质性[15],需要寻求其他分类方法;多组学聚类的方法适用于癌症等遗传驱动疾病的无监督聚类,目的是发现新的疾病亚型,对疾病的诊断和治疗意义重大[16]。本研究在LGG多组学数据的基础上通过MOVICS综合十种聚类结果确定LGG的最终分子亚型,然后通过RF构建预后模型,为临床更准确的癌症诊断和治疗提供了可能。
为了提高聚类的鲁棒性,MOVICS通过借鉴共识集成[17]的思想,对不同的聚类结果进行整合,得到的共识矩阵代表了样本的成对相似性。本文采用综合聚类方法对LGG多组学数据进行整合,最终将TCGA的502个LGG患者分为两型。多因素Cox分析结果显示年龄、癌症分级、聚类亚型具有统计学意义(P<0.05),可作为低级别胶质瘤患者的预后预测因素,预测LGG患者1年、3年和5年总生存期的ROC曲线AUC值都在0.82以上,校准曲线也接近标准曲线,提示模型拥有较高的预测能力。Gao等[18]关于过氧化物酶体相关基因特征预测LGG患者预后的结果中,风险评分和年龄是LGG患者的预后因素;Zhao等[19]研究了铜氧化酶基因预测LGG患者的生存率,结果显示风险评分、年龄和癌症分级是LGG患者的预后因素,本研究与之有相同之处。在LGG患者中,年龄越大,患者的死亡风险越高,生存率越低,预后越差;在癌症分级中,WHO G3级患者的死亡风险是WHO G2级患者2.733倍,在两亚型中,LGG患者CS2组的死亡风险是CS1组患者的3.210倍,且CS2组患者的平均年龄为46.6岁,大于CS1组的40.9岁,WHO G3级患者在CS2组的占比也多于CS1组,故CS2组患者的死亡风险更高,预后较差。这提示临床应对高龄、WHO G3级患者给予一定的重视,且使用多组学数据进行聚类发现的疾病亚型,为LGG患者的诊断和治疗提供了新思路。
构建的随机森林分类预测模型,在TCGA内部验证集表现出较好的分类性能,在两个CGGA外部验证集上,成功预测出样本的亚型标签,亚型之间的生存差异都具有统计学意义,预后差的亚型与TCGA训练集相一致,均为CS2组。通过内部和外部验证,说明该模型具有较好的分类预测性能,为LGG亚型提供了理论参考,具有一定的实际意义。
本研究也具有一定的局限性,第一,在获得聚类标签时,仅用到mRNA和DNA甲基化的泛癌基因,可能会缺失一些特有的关键基因;第二,在构建随机森林分类预测模型时,由于样本量的原因,仅使用了mRNA,今后会试图整合多个组学数据进行构建模型;第三,分类模型验证时,仅用到CGGA数据库,为了检验模型的泛化能力,接下来会寻找其它的数据集进行验证,如GEO数据库。
  • 山西省自然科学基金(201801D121210)
参考文献 引证文献
排序方式:
[1]
蔡祥,王仁东,王世佳,等.胶质母细胞瘤恶性进展中不同细胞亚群的动态轨迹和细胞通讯网络[J].北京大学学报:医学版202456(2):199-206.
Cai X,Wang RD, Wang SJ, et al. Dynamic trajectory and cell communication of different cell clusters in malignant progression of glioblastoma[J]. Journal of Peking University(Health Sciences), 2024, 56(2): 199-206.
[2]
兰宁,张永超,李淼,等.结合正则化方法的区块森林在癌症多组学数据预后预测中的应用研究[J].中国卫生统计202340(3):382-385.
Lan N, Zhang YC, Li M, et al. Application of block forest combined with regularization method in prognostic prediction of cancer multi-omics data[J]. Chinese Journal of Health Statistics, 2023, 40(3): 382-385.
[3]
龙智平,王帆.多组学整合分析的设计及统计方法在肿瘤流行病学研究中的应用[J].中华流行病学杂志202041(5):788-793.
Long ZP, Wang F. Study design and statistical methods used for integrative analysis on multi-omics in cancer epidemiology[J]. Chinese Journal of Epidemiology, 2020, 41(5): 788-793.
[4]
Liu XD, Wang WH, Zhang XL, et al. Metabolism pathway-based subtyping in endometrial cancer: An integrated study by multi-omics analysis and machine learning algorithms[J]. Molecular Therapy. Nucleic Acids, 2024, 35(2): 102155.
[5]
Chakraborty S, Sharma G, Karmakar S, et al. Multi-OMICS approaches in cancer biology: New era in cancer therapy[J]. Biochim Biophys Acta Mol Basis Dis, 2024, 1870(5): 167120.
[6]
钟雅婷,林艳梅,陈定甲,等.多组学数据整合分析和应用研究综述[J].计算机工程与应用202157(23):1-17.
Zhong YT, Lin YM, Chen DJ, et al. Review on integration analysis and application of multi-omics data[J]. Computer Engineering and Applications, 2021, 57(23): 1-17.
[7]
Cai YY, Wang SF. Deeply integrating latent consistent representations in high-noise multi-omics data for cancer subtyping[J]. Briefings in Bioinformatics, 2024, 25(2): bbae061.
[8]
高振博.基于多组学数据的聚类方法研究[D].大连:大连理工大学,2020.
Gao ZB. Research on clustering method based on multi-omics data[D].Dalian: Dalian University of Technology, 2020.
[9]
Lu XF, Meng JL, Zhou YJ, et al. MOVICS: an R package for multi-omics integration and visualization in cancer subtyping[J]. Bioinformatics, 2021, 36(22/23): 5539-5541.
[10]
Guan Y, Yue SY, Chen YD, et al. Molecular cluster mining of adrenocortical carcinoma via Multi-Omics data analysis Aids precise clinical therapy[J]. Cells (Basel, Switzerland), 2022, 11(23): 3784.
[11]
Macaulay BO, Aribisala BS, Akande SA, et al. Breast cancer risk prediction in African women using Random Forest Classifier[J]. Cancer Treatmentand Research Communications, 2021, 28: 100396.
[12]
Halder RK,Uddin MN, Uddin MA, et al. ML-CKDP: machine learning-based chronic kidney disease prediction with smart web application[J]. Journal of Pathology Informatics, 2024, 15: 100371.
[13]
Chalise P, Fridley BL. Integrative clustering of multi-level ’omic data based on non-negative matrix factorization algorithm[J]. PLOS One, 2017, 12(5): e0176278.
[14]
王俊丰,贾晓霞,李志强.基于K-means算法改进的短文本聚类研究与实现[J].信息技术2019(12):76-80.
Wang JF, Jia XX, Li ZQ. Research and implementation of short text clustering based on improved K-means algorithm[J]. Information Technology, 2019(12): 76-80.
[15]
Du ZX, Wang YQ, Liang JQ, et al. Association of glioma CD44 expression with glial dynamics in the tumour microenvironment and patient prognosis[J]. Computational and Structural Biotechnology Journal, 2022, 20: 5203-5217.
[16]
Li L, Zhang WW, Sun YJ, et al. A clinical prognostic model of oxidative stress-related genes linked to tumor immune cell infiltration and the prognosis of ovarian cancer patients[J]. Heliyon, 2024, 10(7): e28442.
[17]
Rappoport N, Shamir R. Multi-omic and multi-view clustering algorithms: review and cancer benchmark[J]. Nucleic Acids Research, 2018, 46(20): 10546-10562.
[18]
Gao DD, Zhou QY, Hou DQ, et al. A novel peroxisome-related gene signature predicts clinical prognosis and is associated with immune microenvironment in low-grade glioma[J]. PeerJ, 2024, 12: e16874.
[19]
Zhao ZR, Ma YH, Liu Y, et al. A cuproptosis-based prognostic model for predicting survival in low-grade glioma[J]. Aging, 2024, 16(10): 8697-8716.
2024年第51卷第14期
PDF下载
49
23
引用本文
BibTeX
文章信息
doi: 10.20043/j.cnki.MPM.202404262
  • 接收时间:2024-04-16
  • 首发时间:2026-03-18
  • 出版时间:2024-07-25
补充材料
相关文章
文章信息
作者
出版历史
  • 收稿日期:2024-04-16
基金
山西省自然科学基金(201801D121210)
作者信息
    1.山西医科大学公共卫生学院卫生统计学教研室,山西 太原 030001
    2.煤炭环境致病与防治教育部重点实验室

通讯作者:

罗天娥,E-mail:
参考文献
分享链接
https://castjournals.cast.org.cn/joweb/xdyfyx/CN/10.20043/j.cnki.MPM.202404262
分享至
全文二维码

扫描看全文

引用本文
BibTeX
本文的引用情况
2种不同金属材料的力学参数

Family
属数
Number of
genus
种数
Number of
species
占总种数比例
Percentage of
total species (%)

Genus
种数
Number of
species
占总种数比例
Percentage of total
species (%)
鹅膏菌科Amanitaceae 2 11 5.26 鹅膏菌属 Amanita 10 4.78
小菇科 Mycenaceae 2 12 5.74 丝盖伞属 Inocybe 5 2.39
多孔菌科 Polyporaceae 8 14 6.70 蜡蘑属 Laccaria 5 2.39
红菇科 Russulaceae 3 23 11.00 小皮伞属 Marasmius 6 2.87
小菇属 Mycena 11 5.26
光柄菇属 Pluteus 5 2.39
红菇属 Russula 17 8.13
栓菌属 Trametes 5 2.39
关闭全屏