• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用大规模文本挖掘对甲状腺癌亚型进行分子谱分析。

Molecular profiling of thyroid cancer subtypes using large-scale text mining.

作者信息

Wu Chengkun, Schwartz Jean-Marc, Brabant Georg, Nenadic Goran

出版信息

BMC Med Genomics. 2014;7 Suppl 3(Suppl 3):S3. doi: 10.1186/1755-8794-7-S3-S3. Epub 2014 Dec 8.

DOI:10.1186/1755-8794-7-S3-S3
PMID:25521965
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4290788/
Abstract

BACKGROUND

Thyroid cancer is the most common endocrine tumor with a steady increase in incidence. It is classified into multiple histopathological subtypes with potentially distinct molecular mechanisms. Identifying the most relevant genes and biological pathways reported in the thyroid cancer literature is vital for understanding of the disease and developing targeted therapeutics.

RESULTS

We developed a large-scale text mining system to generate a molecular profiling of thyroid cancer subtypes. The system first uses a subtype classification method for the thyroid cancer literature, which employs a scoring scheme to assign different subtypes to articles. We evaluated the classification method on a gold standard derived from the PubMed Supplementary Concept annotations, achieving a micro-average F1-score of 85.9% for primary subtypes. We then used the subtype classification results to extract genes and pathways associated with different thyroid cancer subtypes and successfully unveiled important genes and pathways, including some instances that are missing from current manually annotated databases or most recent review articles.

CONCLUSIONS

Identification of key genes and pathways plays a central role in understanding the molecular biology of thyroid cancer. An integration of subtype context can allow prioritized screening for diagnostic biomarkers and novel molecular targeted therapeutics. Source code used for this study is made freely available online at https://github.com/chengkun-wu/GenesThyCan.

摘要

背景

甲状腺癌是最常见的内分泌肿瘤,其发病率呈稳步上升趋势。它被分为多种组织病理学亚型,可能具有不同的分子机制。识别甲状腺癌文献中最相关的基因和生物学途径对于理解该疾病和开发靶向治疗至关重要。

结果

我们开发了一个大规模文本挖掘系统,以生成甲状腺癌亚型的分子图谱。该系统首先对甲状腺癌文献使用一种亚型分类方法,该方法采用评分方案为文章分配不同的亚型。我们在源自PubMed补充概念注释的金标准上评估了该分类方法,主要亚型的微平均F1分数达到了85.9%。然后,我们使用亚型分类结果提取与不同甲状腺癌亚型相关的基因和途径,并成功揭示了重要的基因和途径,包括一些当前手动注释数据库或最新综述文章中缺失的实例。

结论

关键基因和途径的识别在理解甲状腺癌的分子生物学中起着核心作用。整合亚型背景可以优先筛选诊断生物标志物和新型分子靶向治疗药物。本研究使用的源代码可在https://github.com/chengkun-wu/GenesThyCan上免费在线获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/257d/4290788/84a876a0fae1/1755-8794-7-S3-S3-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/257d/4290788/66914c2edc2d/1755-8794-7-S3-S3-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/257d/4290788/285c053cc436/1755-8794-7-S3-S3-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/257d/4290788/73a7e478e5e8/1755-8794-7-S3-S3-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/257d/4290788/e58059a3001a/1755-8794-7-S3-S3-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/257d/4290788/84a876a0fae1/1755-8794-7-S3-S3-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/257d/4290788/66914c2edc2d/1755-8794-7-S3-S3-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/257d/4290788/285c053cc436/1755-8794-7-S3-S3-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/257d/4290788/73a7e478e5e8/1755-8794-7-S3-S3-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/257d/4290788/e58059a3001a/1755-8794-7-S3-S3-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/257d/4290788/84a876a0fae1/1755-8794-7-S3-S3-5.jpg

相似文献

1
Molecular profiling of thyroid cancer subtypes using large-scale text mining.利用大规模文本挖掘对甲状腺癌亚型进行分子谱分析。
BMC Med Genomics. 2014;7 Suppl 3(Suppl 3):S3. doi: 10.1186/1755-8794-7-S3-S3. Epub 2014 Dec 8.
2
Constructing a molecular interaction network for thyroid cancer via large-scale text mining of gene and pathway events.通过对基因和通路事件进行大规模文本挖掘构建甲状腺癌分子相互作用网络。
BMC Syst Biol. 2015;9 Suppl 6(Suppl 6):S5. doi: 10.1186/1752-0509-9-S6-S5. Epub 2015 Dec 9.
3
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
4
Overview of the BioCreative VI Precision Medicine Track: mining protein interactions and mutations for precision medicine.BioCreative VI 精准医学赛道概述:精准医学中的蛋白质相互作用和突变挖掘。
Database (Oxford). 2019 Jan 1;2019:bay147. doi: 10.1093/database/bay147.
5
Cancer Hallmarks Analytics Tool (CHAT): a text mining approach to organize and evaluate scientific literature on cancer.癌症特征分析工具(CHAT):一种文本挖掘方法,用于组织和评估癌症相关科学文献。
Bioinformatics. 2017 Dec 15;33(24):3973-3981. doi: 10.1093/bioinformatics/btx454.
6
Biomarker identification using text mining.使用文本挖掘进行生物标志物识别。
Comput Math Methods Med. 2012;2012:135780. doi: 10.1155/2012/135780. Epub 2012 Nov 11.
7
Text mining-based word representations for biomedical data analysis and protein-protein interaction networks in machine learning tasks.基于文本挖掘的词表示在生物医学数据分析和机器学习任务中的蛋白质-蛋白质相互作用网络。
PLoS One. 2021 Oct 15;16(10):e0258623. doi: 10.1371/journal.pone.0258623. eCollection 2021.
8
ModEx: A text mining system for extracting mode of regulation of transcription factor-gene regulatory interaction.ModEx:一种用于提取转录因子-基因调控相互作用的调控模式的文本挖掘系统。
J Biomed Inform. 2020 Feb;102:103353. doi: 10.1016/j.jbi.2019.103353. Epub 2019 Dec 16.
9
Cell line name recognition in support of the identification of synthetic lethality in cancer from text.支持从文本中识别癌症合成致死性的细胞系名称识别
Bioinformatics. 2016 Jan 15;32(2):276-82. doi: 10.1093/bioinformatics/btv570. Epub 2015 Oct 1.
10
Combining literature text mining with microarray data: advances for system biology modeling.结合文献文本挖掘和微阵列数据:系统生物学建模的进展。
Brief Bioinform. 2012 Jan;13(1):61-82. doi: 10.1093/bib/bbr018. Epub 2011 Jun 15.

引用本文的文献

1
The correlation between the ultrasound examination parameters and the pathological characteristics of papillary thyroid carcinomas.甲状腺乳头状癌超声检查参数与病理特征的相关性
Pak J Med Sci. 2025 Mar;41(3):848-855. doi: 10.12669/pjms.41.3.10566.
2
The role of genetic and epigenetic modifications as potential biomarkers in the diagnosis and prognosis of thyroid cancer.基因和表观遗传修饰作为甲状腺癌诊断和预后潜在生物标志物的作用。
Front Oncol. 2024 Nov 4;14:1474267. doi: 10.3389/fonc.2024.1474267. eCollection 2024.
3
Genetic Changes in Thyroid Cancers and the Importance of Their Preoperative Detection in Relation to the General Treatment and Determination of the Extent of Surgical Intervention-A Review.

本文引用的文献

1
RET revisited: expanding the oncogenic portfolio.RET 再探:扩展致癌基因谱。
Nat Rev Cancer. 2014 Mar;14(3):173-86. doi: 10.1038/nrc3680.
2
PathNER: a tool for systematic identification of biological pathway mentions in the literature.PathNER:一种用于系统识别文献中生物途径提及内容的工具。
BMC Syst Biol. 2013 Oct 16;7 Suppl 3(Suppl 3):S2. doi: 10.1186/1752-0509-7-S3-S2.
3
An update on molecular biology of thyroid cancers.甲状腺癌的分子生物学研究进展。
甲状腺癌的基因变化及其术前检测在总体治疗和手术干预范围确定方面的重要性——综述
Biomedicines. 2022 Jun 27;10(7):1515. doi: 10.3390/biomedicines10071515.
4
ParaBTM: A Parallel Processing Framework for Biomedical Text Mining on Supercomputers.ParaBTM:一种用于生物医学文本挖掘的超级计算机并行处理框架。
Molecules. 2018 Apr 27;23(5):1028. doi: 10.3390/molecules23051028.
5
Constructing a molecular interaction network for thyroid cancer via large-scale text mining of gene and pathway events.通过对基因和通路事件进行大规模文本挖掘构建甲状腺癌分子相互作用网络。
BMC Syst Biol. 2015;9 Suppl 6(Suppl 6):S5. doi: 10.1186/1752-0509-9-S6-S5. Epub 2015 Dec 9.
6
Diagnostic, Prognostic, and Predictive Molecular Biomarkers and the Utility of Molecular Imaging in Common Gastrointestinal Tumors.常见胃肠道肿瘤的诊断、预后和预测分子生物标志物及分子成像的应用
Biomed Res Int. 2015;2015:890805. doi: 10.1155/2015/890805. Epub 2015 Nov 4.
7
Integrative enrichment analysis: a new computational method to detect dysregulated pathways in heterogeneous samples.整合富集分析:一种检测异质样本中失调通路的新计算方法。
BMC Genomics. 2015 Nov 10;16:918. doi: 10.1186/s12864-015-2188-7.
8
InCoB2014: mining biological data from genomics for transforming industry and health.InCoB2014:从基因组学中挖掘生物数据以推动产业与健康变革。
BMC Genomics. 2014;15 Suppl 9(Suppl 9):I1. doi: 10.1186/1471-2164-15-S9-I1. Epub 2014 Dec 8.
Crit Rev Oncol Hematol. 2014 Jun;90(3):233-52. doi: 10.1016/j.critrevonc.2013.12.007. Epub 2013 Dec 18.
4
DigSee: Disease gene search engine with evidence sentences (version cancer).DigSee:带证据句的疾病基因搜索引擎(癌症版)。
Nucleic Acids Res. 2013 Jul;41(Web Server issue):W510-7. doi: 10.1093/nar/gkt531. Epub 2013 Jun 12.
5
Progress in molecular-based management of differentiated thyroid cancer.分化型甲状腺癌的分子靶向治疗进展。
Lancet. 2013 Mar 23;381(9871):1058-69. doi: 10.1016/S0140-6736(13)60109-9. Epub 2013 Mar 22.
6
DNA methylation signatures identify biologically distinct thyroid cancer subtypes.DNA 甲基化特征可识别生物学上不同的甲状腺癌亚型。
J Clin Endocrinol Metab. 2013 Jul;98(7):2811-21. doi: 10.1210/jc.2012-3566. Epub 2013 May 10.
7
Resveratrol induces differentiation markers expression in anaplastic thyroid carcinoma via activation of Notch1 signaling and suppresses cell growth.白藜芦醇通过激活 Notch1 信号诱导间变性甲状腺癌分化标志物的表达,并抑制细胞生长。
Mol Cancer Ther. 2013 Jul;12(7):1276-87. doi: 10.1158/1535-7163.MCT-12-0841. Epub 2013 Apr 17.
8
MalaCards: an integrated compendium for diseases and their annotation.MalaCards:疾病及其注释的综合纲要。
Database (Oxford). 2013 Apr 12;2013:bat018. doi: 10.1093/database/bat018. Print 2013.
9
Notch pathway is activated by MAPK signaling and influences papillary thyroid cancer proliferation.Notch 通路被 MAPK 信号激活并影响甲状腺乳头状癌的增殖。
Transl Oncol. 2013 Apr;6(2):197-205. doi: 10.1593/tlo.12442. Epub 2013 Apr 1.
10
Increasing incidence of thyroid cancer: controversies explored.甲状腺癌发病率上升:争议探讨。
Nat Rev Endocrinol. 2013 Mar;9(3):178-84. doi: 10.1038/nrendo.2012.257. Epub 2013 Jan 29.