基于生物信息学的肝细胞癌患者预后风险模型的建立与验证

Construction and validation of prognostic risk model for patients with hepatocellular carcinoma based on bioinformatics analysis

  • 摘要:
      目的  利用公共数据库构建用于临床治疗肝细胞癌(HCC)的预后风险模型。
      方法  分别从癌症基因组图谱(TCGA)和国际癌症基因组联盟(ICGC)下载HCC以及癌旁正常组织的mRNA表达数据及临床信息。在TCGA队列中筛选与总生存期(OS)相关的差异表达基因(DEGs), 从中抽取2个或3个mRNAs构成一个组合, 对所有组合进行Cox风险比例回归模型构建。通过受试者工作特征(ROC)曲线的曲线下面积(AUC)确定最优基因组合,并进行基于ICGC队列的外部验证; 以TCGA队列的风险评分中位值将患者分为高风险组与低风险组,进行基因集富集分析(GSEA), 并通过pRRophetic R软件包预测HCC患者使用索拉非尼、丝裂霉素、依托泊苷、阿霉素、紫杉醇和顺铂的相对半抑制浓度(IC50)。
      结果  该预后风险模型预测TCGA队列的1、3、5年OS的ROC的AUC分别是0.786、0.713、0.699, 预测ICGC队列的1、3、4年OS的ROC的AUC分别为0.719、0.709、0.766。GSEA表明高风险组患者细胞周期相关通路被激活,胆汁酸代谢被抑制。索拉非尼在低风险组的IC50低于高风险组,而细胞周期相关化疗药物在低风险组的IC50高于高风险组,差异均有统计学意义(P < 0.05)。
      结论  本研究建立并验证了HCC预后风险模型,为HCC患者个体化诊疗方案的制订提供参考依据。

     

    Abstract:
      Objective  To construct a prognostic risk model for clinical treatment of hepatocellular carcinoma (HCC) based on public databases.
      Methods  The mRNA expression data and clinical information of HCC and adjacent normal tissues were downloaded from The Cancer Genome Atlas (TCGA) and International Cancer Genome Consortium (ICGC). Differentially expressed genes (DEGs) related to overall survival (OS) were screened in the TCGA cohort, 2 or 3 mRNAs were selected to form a combination, and Cox risk proportional regression model was constructed for all combinations. The optimal gene combination was determined by the area under the curve (AUC) of the receiver operating characteristic (ROC) curve, and external validation based on ICGC cohort was carried out. The patients were divided into high-risk group and low-risk group according to the median risk score of TCGA cohort, gene set enrichment analysis (GSEA) was performed, and the relative half-inhibitory concentrations (IC50) of sorafenib, mitomycin, etoposide, adriamycin, paclitaxel and cisplatin in HCC patients were predicted by pRRophetic R software package.
      Results  For this prognostic risk model, the AUC of the ROC curve for predicting 1-, 3- and 5-year OS in the TCGA cohort were 0.786, 0.713 and 0.699, respectively, and the AUC of the ROC curve for predicting 1-, 3- and 4-year OS in the ICGC cohort were 0.719, 0.709 and 0.766, respectively. GSEA revealed that cell cycle related pathways were activated and bile acid metabolism was inhibited in the high-risk group. The IC50 of sorafenib in the low-risk group was significantly lower than that in the high-risk group, while the IC50 of cell cycle related chemotherapy drugs in the low-risk group was significantly higher than that in the high-risk group (P < 0.05).
      Conclusion  This study establishes and verifies the prognostic risk model for HCC, and provides a reference for the formulation of individualized diagnosis and treatment plan for HCC patients.

     

/

返回文章
返回