Yang Qingxia, Gong Yaguo
Smart Health Big Data Analysis and Location Services Engineering Lab of Jiangsu Province, Department of Bioinformatics, School of Geographic and Biologic Information, Nanjing University of Posts and Telecommunications, Nanjing, China.
School of Pharmacy, Macau University of Science and Technology, Macau, China.
Front Genet. 2022 Jan 14;12:791349. doi: 10.3389/fgene.2021.791349. eCollection 2021.
Thyroid nodules are present in upto 50% of the population worldwide, and thyroid malignancy occurs in only 5-15% of nodules. Until now, fine-needle biopsy with cytologic evaluation remains the diagnostic choice to determine the risk of malignancy, yet it fails to discriminate as benign or malignant in one-third of cases. In order to improve the diagnostic accuracy and reliability, molecular testing based on transcriptomic data has developed rapidly. However, gene signatures of thyroid nodules identified in a plenty of transcriptomic studies are highly inconsistent and extremely difficult to be applied in clinical application. Therefore, it is highly necessary to identify consistent signatures to discriminate benign or malignant thyroid nodules. In this study, five independent transcriptomic studies were combined to discover the gene signature between benign and malignant thyroid nodules. This combined dataset comprises 150 malignant and 93 benign thyroid samples. Then, there were 279 differentially expressed genes (DEGs) discovered by the feature selection method (Student's test and fold change). And the weighted gene co-expression network analysis (WGCNA) was performed to identify the modules of highly co-expressed genes, and 454 genes in the gray module were discovered as the hub genes. The intersection between DEGs by the feature selection method and hub genes in the WGCNA model was identified as the key genes for thyroid nodules. Finally, four key genes (ST3GAL5, NRCAM, MT1F, and PROS1) participated in the pathogenesis of malignant thyroid nodules were validated using an independent dataset. Moreover, a high-performance classification model for discriminating thyroid nodules was constructed using these key genes. All in all, this study might provide a new insight into the key differentiation of benign and malignant thyroid nodules.
甲状腺结节在全球高达50%的人群中存在,而甲状腺恶性肿瘤仅发生在5%-15%的结节中。到目前为止,细针穿刺活检及细胞学评估仍是确定恶性风险的诊断选择,但在三分之一的病例中它无法区分良性或恶性。为了提高诊断的准确性和可靠性,基于转录组数据的分子检测发展迅速。然而,在大量转录组研究中确定的甲状腺结节基因特征高度不一致,极难应用于临床。因此,非常有必要确定一致的特征来区分良性和恶性甲状腺结节。在本研究中,合并了五项独立的转录组研究以发现良性和恶性甲状腺结节之间的基因特征。这个合并的数据集包括150个恶性和93个良性甲状腺样本。然后,通过特征选择方法(学生检验和倍数变化)发现了279个差异表达基因(DEG)。并进行了加权基因共表达网络分析(WGCNA)以识别高度共表达基因的模块,灰色模块中的454个基因被发现为枢纽基因。特征选择方法确定的DEG与WGCNA模型中的枢纽基因之间的交集被确定为甲状腺结节的关键基因。最后,使用独立数据集验证了参与恶性甲状腺结节发病机制的四个关键基因(ST3GAL5、NRCAM、MT1F和PROS1)。此外,使用这些关键基因构建了一个用于区分甲状腺结节的高性能分类模型。总而言之,本研究可能为良性和恶性甲状腺结节的关键鉴别提供新的见解。