Cancer Hallmarks Analytics Tool (CHAT): a text mining approach to organize and evaluate scientific literature on cancer.

Affiliations

Computer Laboratory.

Language Technology Lab, Department of Theoretical and Applied Linguistics, University of Cambridge, Cambridge CB3 9DA, UK.

Publication information

Bioinformatics. 2017 Dec 15;33(24):3973-3981. doi: 10.1093/bioinformatics/btx454.

DOI:10.1093/bioinformatics/btx454
PMID:29036271
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC5860084/
Abstract

MOTIVATION

To understand the molecular mechanisms involved in cancer development, significant efforts are being invested in cancer research. This has resulted in millions of scientific articles. An efficient and thorough review of the existing literature is crucially important to drive new research. This time-demanding task can be supported by emerging computational approaches based on text mining which offer a great opportunity to organize and retrieve the desired information efficiently from sizable databases. One way to organize existing knowledge on cancer is to utilize the widely accepted framework of the Hallmarks of Cancer. These hallmarks refer to the alterations in cell behaviour that characterize the cancer cell.

RESULTS

We created an extensive Hallmarks of Cancer taxonomy and developed an automatic text mining methodology and a tool (CHAT) capable of retrieving and organizing millions of cancer-related references from PubMed into the taxonomy. The efficiency and accuracy of the tool were evaluated intrinsically as well as extrinsically through case studies. The correlations identified by the tool show that it offers great potential to organize and correctly classify cancer-related literature. Furthermore, the tool can be useful, for example, in identifying hallmarks associated with extrinsic factors, biomarkers and therapeutic targets.
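
The idea of sorting abstracts into a hallmark taxonomy can be illustrated with a toy sketch. This is not CHAT's method, which the abstract does not describe in detail; it is a simple keyword heuristic under stated assumptions. The hallmark names follow the Hallmarks of Cancer framework, while the keyword lists, the function names `classify` and `hallmark_profile`, and the sample abstracts are all hypothetical.

```python
from collections import Counter

# Hypothetical keyword lists, for illustration only; a real system
# would more likely use a trained classifier than keyword matching.
HALLMARK_KEYWORDS = {
    "Inducing angiogenesis": ["angiogenesis", "vegf"],
    "Resisting cell death": ["apoptosis", "necrosis"],
    "Activating invasion and metastasis": ["metastasis", "invasion"],
}


def classify(abstract):
    """Return every hallmark whose keywords appear in the abstract (multi-label)."""
    text = abstract.lower()
    return [
        hallmark
        for hallmark, keywords in HALLMARK_KEYWORDS.items()
        if any(keyword in text for keyword in keywords)
    ]


def hallmark_profile(abstracts):
    """Count how many abstracts were assigned to each hallmark."""
    counts = Counter()
    for abstract in abstracts:
        counts.update(classify(abstract))
    return counts


# Made-up sample abstracts, not taken from PubMed.
abstracts = [
    "VEGF blockade reduced angiogenesis and metastasis in mice.",
    "The compound induced apoptosis in tumour cells.",
    "A survey of hospital staffing levels.",
]
profile = hallmark_profile(abstracts)
for hallmark, count in profile.items():
    print(hallmark, count)
```

An abstract may match several hallmarks or none, so the labels are kept as a list rather than a single class. Counting labels across a query's results gives the kind of per-hallmark profile that could then be compared between search terms.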

AVAILABILITY AND IMPLEMENTATION

CHAT can be accessed at: http://chat.lionproject.net. The corpus of hallmark-annotated PubMed abstracts and the software are available at: http://chat.lionproject.net/about.

CONTACT

simon.baker@cl.cam.ac.uk.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1da6/5860084/8dfcb235ef13/btx454f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1da6/5860084/16c4ef3ef914/btx454f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1da6/5860084/ada2d21605af/btx454f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1da6/5860084/3447c9dbdfb0/btx454f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1da6/5860084/550d0215fdb8/btx454f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1da6/5860084/2440bc327f07/btx454f6.jpg
