基于公共数据库的甲状腺癌预后风险基因筛选
作者:
通讯作者:
作者单位:

1.湖南师范大学附属第一医院/湖南省人民医院 乳甲外科,湖南 长沙 410005;2.中南大学湘雅医院 老年外科,湖南 长沙 410008

作者简介:

曾杰,湖南师范大学附属第一医院/湖南省人民医院副主任医师,主要从事乳腺及甲状腺疾病临床与基础方面的研究。

基金项目:

湖南省长沙市科技计划基金资助项目(kq1907061);湖南省教育厅优秀青年基金资助项目(20B355)。


Identification of prognostic risk genes for thyroid cancer based on public databases
Author:
Affiliation:

1.Department of Breast and Thyroid Surgery, the First Affiliated Hospital of Hunan Normal University/Hunan Provincial People's Hospital, Changsha 410005, China;2.Department of Geriatric Surgery, Xiangya Hospital, Central South University, Changsha 410008, China

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 音频文件
  • |
  • 视频文件
    摘要:

    背景与目的 甲状腺癌的发病率呈逐年增长趋势,尽管其总体预后较好,但仍有部分患者因复发或转移而死亡。本研究旨在基于公共数据库应用生物信息学方法筛选甲状腺癌的预后风险基因。方法 从癌症RNA测序关系(CRN)数据库下载甲状腺癌的蛋白编码基因RNA-seq数据,筛选甲状腺癌中差异表达的蛋白编码基因。通过DAVID数据库对差异表达的蛋白编码基因进行功能富集分析。用STRING数据库和Cytoscape软件构建差异表达蛋白编码基因之间的相互作用网络,分别用Cytoscape软件中的cytoHubba插件与ClueGO插件筛选核心基因,并对核心基因进行功能预测。用UALCAN数据库验证核心基因在甲状腺癌中的表达水平,通过GEPIA数据库对核心基因进行生存分析,分析核心基因的表达水平对甲状腺癌的生存时间有无影响。结果 筛选共得到913个差异表达的蛋白编码基因。这些基因主要富集于调控小分子GTP酶介导的信号转导、Z膜、结合肌动蛋白和细胞色素P450介导的药物代谢。构建互作网络后,筛选出10个核心基因,分别为TP53、ESR1、FOS、SYP、PPARG、ACTB、GRIA1、NRXN1、HDAC3和KIT,其中TP53得分最高,为62,它们均在甲状腺癌组织中表达下调;预测显示核心基因TP53、ESR1、PPARG可能参与了基因沉默的负性调控,TP53、FOS可能参与了RNA聚合酶II对pri-miRNA的转录调控过程。UALCAN数据库验证结果显示,除TP53外,其余核心基因均在甲状腺癌组织中表达下调(均P<0.05),与CRN数据库中的表达结果一致。生存分析结果显示,KIT的高表达与甲状腺癌患者的无病生存期明显相关(P=0.012),而对其总体生存期无明显影响(P=0.85)。结论 本研究筛选的蛋白编码基因KIT在甲状腺癌组织中呈低表达,其高表达与甲状腺癌的无病生存期密切相关,推测其可能成为甲状腺癌的预后风险标志物或治疗靶点。

    Abstract:

    Background and Aims The incidence of thyroid cancer is increasing over years. Although its overall prognosis is favorable, some patients still die due to recurrence or metastasis. The purpose of this study was to screen the prognostic risk genes for thyroid carcinoma using bioinformatics approaches based on public databases.Methods The protein-coding gene RNA-seq data of thyroid cancer were downloaded from Cancer RNA-Seq Nexus (CRN) database and the differentially expressed protein-coding genes were screened. Then, enrichment analysis of the differentially expressed protein-coding genes was performed using the DAVID database. Protein-protein interaction networks among the differentially expressed protein-coding genes were constructed and analyzed using STRING and Cytoscape. The hub genes and their functional prediction were screened by the Cytohubba and ClueGO plugins, respectively. The expression level of hub genes was verified in thyroid cancer based on the UALCAN database, and survival analysis of hub genes was conducted in the GEPIA database to analyze whether their expression had an impact on the survival time of thyroid cancer.Results A total of 913 differentially expressed protein-coding genes were obtained after screening. These genes were mainly involved in regulation of small GTPase mediated signal transduction, Z disc, actin binding and drug metabolism-cytochrome P450. After construction of interaction networks, 10 hub genes were screened and they were TP53, ESR1, FOS, SYP, PPARG, ACTB, GRIA1, NRXN1, HDAC3 and KIT, of which TP53 had the highest score of 62. All of them were down-regulated in thyroid cancer tissue. Prediction results revealed that TP53, ESR1 and PPARG were probably involved in negative regulation of gene silencing, and TP53 and FOS were probably involved in the process of pri-miRNA transcription by RNA polymerase II. Results of verification in the UALCAN database showed that all except TP53, all other hub genes were down-regulated in thyroid cancer tissues (all P<0.05), which was consistent with the expression results in the CRN database. Results of survival analysis showed that high expression of KIT was significantly associated with disease-free survival of thyroid cancer patients (P=0.012), but had no significant effect on their overall survival (P=0.85).Conclusion The identified protein-coding gene KIT has a low expression in thyroid cancer tissue, and its high expression is closely associated with the disease-free survival of thyroid cancer, which is speculated to be a prognostic risk marker or therapeutic target for thyroid cancer.

    图1 甲状腺癌各临床分期中差异表达蛋白编码基因的Venn图(不同颜色花瓣中的数字代表各临床分期中差异表达蛋白编码基因的数目,图片中央白色区域中的数字为各临床分期中差异表达蛋白编码基因的交集)Fig.1 Venn diagram of the differentially expressed protein-coding genes in thyroid cancer of different clinical stages (the number in petals with different colors representing the number of the differentially expressed protein-coding genes of different clinical stages, and the number in the central region in white color representing the overlap of the differentially expressed protein-coding genes of different clinical stages)
    图2 差异表达蛋白编码基因的功能富集分析(左侧的Y轴显示功能富集分析结果;下方的X轴表示参与BP、CC、MF和KEGG的基因所占的百分比;气泡大小表示参与BP、CC、MF和KEGG的基因数目,气泡越大表示参与的基因越多;气泡颜色代表P值的大小,颜色由红至蓝代表P值越大) A-D:分别显示了这些差异表达基因可能参与的前10个BP、CC、MF和KEGG结果Fig.2 Enrichment analysis of the differentially expressed protein-coding genes (the left Y axis showing the results of enrichment analysis, the inferior X axis showing the percentages of genes involved in BP, CC, MF and KEGG, the bigger of the bubble, the larger number of genes involved; the color of the bubble standing for the level of the P-value, and the level increasing from the red color to blue color) A-D: The results of BP, CC, MF and KEGG of the top ten differentially expressed genes
    图3 差异表达蛋白编码基因的相互作用网络图(图中的节点代表每个差异表达的蛋白编码基因,黄边粉红色节点代表核心基因)Fig.3 Interaction networks of the differentially expressed protein-coding genes (each node indicating a differentially expressed protein-coding gene, and the nodes in pink color with yellow border indicating the hub genes)
    图4 核心基因的功能富集分析(红色字体的节点代表核心基因,绿色和灰色的节点表示核心基因参与的生物学过程)Fig.4 Enrichment analysis of the hub genes (the red nodes standing for the hub genes, and the green and grey nodes standing for the biological processes involving the hub genes)
    图5 基于UALCAN数据库验证核心基因在甲状腺癌组织与癌旁正常组织中的表达水平(蓝色箱型图代表癌旁正常组织,红色箱型图代表甲状腺癌组织)Fig.5 Verification of the hub genes in thyroid tissue and normal adjacent tissue based on the UALCAN database (the blue box diagram representing the normal paracancer tissue and the red box diagram diagram vepresenting the thyroid cancer tissne)
    图6 核心基因的生存曲线(红色曲线代表高表达组,蓝色曲线代表低表达组)Fig.6 survival curves of the hub genes (the red curve representing the high-expression group and the blue curve representing the low-expression group)
    参考文献
    相似文献
    引证文献
引用本文

曾杰,杨秋怡,张志鹏,范培芝,张超杰,廖雯.基于公共数据库的甲状腺癌预后风险基因筛选[J].中国普通外科杂志,2021,30(11):1334-1342.
DOI:10.7659/j. issn.1005-6947.2021.11.009

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
历史
  • 收稿日期:2021-07-16
  • 最后修改日期:2021-10-16
  • 录用日期:
  • 在线发布日期: 2021-12-24