整合文本挖掘、数据挖掘和网络分析以识别遗传性乳腺癌趋势。

Integrating text mining, data mining, and network analysis for identifying genetic breast cancer trends.

作者信息

Jurca Gabriela, Addam Omar, Aksac Alper, Gao Shang, Özyer Tansel, Demetrick Douglas, Alhajj Reda

机构信息

Department of Computer Science, University of Calgary, Calgary, AB, Canada.

College of Computer Science and Technology, Jilin University, Changchun, China.

出版信息

BMC Res Notes. 2016 Apr 26;9:236. doi: 10.1186/s13104-016-2023-5.

DOI:10.1186/s13104-016-2023-5

PMID:27112211

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4845430/

Abstract

BACKGROUND

Breast cancer is a serious disease which affects many women and may lead to death. It has received considerable attention from the research community. Thus, biomedical researchers aim to find genetic biomarkers indicative of the disease. Novel biomarkers can be elucidated from the existing literature. However, the vast amount of scientific publications on breast cancer make this a daunting task. This paper presents a framework which investigates existing literature data for informative discoveries. It integrates text mining and social network analysis in order to identify new potential biomarkers for breast cancer.

RESULTS

We utilized PubMed for the testing. We investigated gene-gene interactions, as well as novel interactions such as gene-year, gene-country, and abstract-country to find out how the discoveries varied over time and how overlapping/diverse are the discoveries and the interest of various research groups in different countries.

CONCLUSIONS

Interesting trends have been identified and discussed, e.g., different genes are highlighted in relationship to different countries though the various genes were found to share functionality. Some text analysis based results have been validated against results from other tools that predict gene-gene relations and gene functions.

摘要

背景

乳腺癌是一种严重的疾病，影响着众多女性，甚至可能导致死亡。它受到了研究界的广泛关注。因此，生物医学研究人员旨在寻找指示该疾病的基因生物标志物。可以从现有文献中阐明新的生物标志物。然而，关于乳腺癌的大量科学出版物使得这成为一项艰巨的任务。本文提出了一个框架，用于研究现有文献数据以获得有价值的发现。它整合了文本挖掘和社会网络分析，以识别乳腺癌新的潜在生物标志物。

结果

我们利用PubMed进行测试。我们研究了基因 - 基因相互作用，以及诸如基因 - 年份、基因 - 国家和摘要 - 国家等新的相互作用，以了解发现如何随时间变化，以及不同国家各种研究小组的发现、兴趣的重叠/差异情况。

结论

已识别并讨论了有趣的趋势，例如，尽管发现各种基因具有共享功能，但与不同国家相关的不同基因被突出显示。基于文本分析的一些结果已与其他预测基因 - 基因关系和基因功能的工具的结果进行了验证。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/caf6/4845430/e289b91dab81/13104_2016_2023_Fig1_HTML.jpg

相似文献

Integrating text mining, data mining, and network analysis for identifying genetic breast cancer trends.整合文本挖掘、数据挖掘和网络分析以识别遗传性乳腺癌趋势。

BMC Res Notes. 2016 Apr 26;9:236. doi: 10.1186/s13104-016-2023-5.

Analysis of biological processes and diseases using text mining approaches.使用文本挖掘方法分析生物过程和疾病。

Methods Mol Biol. 2010;593:341-82. doi: 10.1007/978-1-60327-194-3_16.

Application of text mining in the biomedical domain.文本挖掘在生物医学领域的应用。

Methods. 2015 Mar;74:97-106. doi: 10.1016/j.ymeth.2015.01.015. Epub 2015 Jan 30.

Identification of Gene Expression Pattern Related to Breast Cancer Survival Using Integrated TCGA Datasets and Genomic Tools.使用整合的TCGA数据集和基因组工具鉴定与乳腺癌生存相关的基因表达模式

Biomed Res Int. 2015;2015:878546. doi: 10.1155/2015/878546. Epub 2015 Oct 20.

A network-based approach to identify disease-associated gene modules through integrating DNA methylation and gene expression.一种通过整合DNA甲基化和基因表达来识别疾病相关基因模块的基于网络的方法。

Biochem Biophys Res Commun. 2015 Sep 25;465(3):437-42. doi: 10.1016/j.bbrc.2015.08.033. Epub 2015 Aug 14.

Mixture classification model based on clinical markers for breast cancer prognosis.基于临床标志物的乳腺癌预后混合分类模型。

Artif Intell Med. 2010 Feb-Mar;48(2-3):129-37. doi: 10.1016/j.artmed.2009.07.008. Epub 2009 Dec 14.

DISEASES: text mining and data integration of disease-gene associations.疾病：疾病-基因关联的文本挖掘与数据整合

Methods. 2015 Mar;74:83-9. doi: 10.1016/j.ymeth.2014.11.020. Epub 2014 Dec 5.

Text mining for precision medicine: automating disease-mutation relationship extraction from biomedical literature.精准医学的文本挖掘：从生物医学文献中自动提取疾病-突变关系

J Am Med Inform Assoc. 2016 Jul;23(4):766-72. doi: 10.1093/jamia/ocw041. Epub 2016 Apr 27.

Biomedical hypothesis generation by text mining and gene prioritization.通过文本挖掘和基因优先级排序生成生物医学假设。

Protein Pept Lett. 2014;21(8):847-57. doi: 10.2174/09298665113209990063.

Automatic Human-like Mining and Constructing Reliable Genetic Association Database with Deep Reinforcement Learning.利用深度强化学习实现类人自动挖掘与构建可靠的基因关联数据库

Pac Symp Biocomput. 2019;24:112-123.

引用本文的文献

biotextgraph: graphical summarization of functional similarities from textual information.生物文本图：从文本信息中提取功能相似性的图形总结。

Bioinformatics. 2024 Jun 3;40(6). doi: 10.1093/bioinformatics/btae357.

Identifying and Validating Networks of Oncology Biomarkers Mined From the Scientific Literature.识别和验证从科学文献中挖掘出的肿瘤生物标志物网络。

Cancer Inform. 2022 Mar 22;21:11769351221086441. doi: 10.1177/11769351221086441. eCollection 2022.

Epione application: An integrated web‑toolkit of clinical genomics and personalized medicine in systemic lupus erythematosus.Epione 应用：系统性红斑狼疮临床基因组学和个性化医学的综合网络工具包。

Int J Mol Med. 2022 Jan;49(1). doi: 10.3892/ijmm.2021.5063. Epub 2021 Nov 18.

Text Mining for Building Biomedical Networks Using Cancer as a Case Study.基于癌症案例研究的生物医学网络构建的文本挖掘。

Biomolecules. 2021 Sep 29;11(10):1430. doi: 10.3390/biom11101430.

Mining Proteome Research Reports: A Bird's Eye View.挖掘蛋白质组研究报告：鸟瞰视角。

Proteomes. 2021 Jun 10;9(2):29. doi: 10.3390/proteomes9020029.

Demetra Application: An integrated genotype analysis web server for clinical genomics in endometriosis.德梅特拉应用程序：子宫内膜异位症临床基因组学综合基因型分析网络服务器。

Int J Mol Med. 2021 Jun;47(6). doi: 10.3892/ijmm.2021.4948. Epub 2021 Apr 28.

Identification of most influential co-occurring gene suites for gastrointestinal cancer using biomedical literature mining and graph-based influence maximization.利用生物医学文献挖掘和基于图的影响力最大化方法识别对胃肠道癌最具影响力的共现基因集

BMC Med Inform Decis Mak. 2020 Sep 3;20(1):208. doi: 10.1186/s12911-020-01227-6.

Edgetic perturbation signatures represent known and novel cancer biomarkers.边缘扰动特征代表了已知和新的癌症生物标志物。

Sci Rep. 2020 Mar 9;10(1):4350. doi: 10.1038/s41598-020-61422-3.

Tracking knowledge evolution, hotspots and future directions of emerging technologies in cancers research: a bibliometrics review.追踪癌症研究中新兴技术的知识演进、热点及未来方向：一项文献计量学综述

J Cancer. 2019 Jun 2;10(12):2643-2653. doi: 10.7150/jca.32739. eCollection 2019.

Identification of pharmacodynamic biomarker hypotheses through literature analysis with IBM Watson.通过 IBM Watson 进行文献分析识别药效生物标志物假说。

PLoS One. 2019 Apr 8;14(4):e0214619. doi: 10.1371/journal.pone.0214619. eCollection 2019.

本文引用的文献

Regulators of genetic risk of breast cancer identified by integrative network analysis.通过整合网络分析鉴定的乳腺癌遗传风险调控因子。

Nat Genet. 2016 Jan;48(1):12-21. doi: 10.1038/ng.3458. Epub 2015 Nov 30.

Cancer biomarkers: are we ready for the prime time?癌症生物标志物：我们是否已经准备好迎接黄金时代？

Cancers (Basel). 2010 Mar 22;2(1):190-208. doi: 10.3390/cancers2010190.

Activities at the Universal Protein Resource (UniProt).通用蛋白质资源库（UniProt）的活动。

Nucleic Acids Res. 2014 Jan;42(Database issue):D191-8. doi: 10.1093/nar/gkt1140. Epub 2013 Nov 18.

Autonomic nerve development contributes to prostate cancer progression.自主神经发育促进前列腺癌的进展。

Science. 2013 Jul 12;341(6142):1236361. doi: 10.1126/science.1236361.

BeCAS: biomedical concept recognition services and visualization.BeCAS：生物医学概念识别服务和可视化。

Bioinformatics. 2013 Aug 1;29(15):1915-6. doi: 10.1093/bioinformatics/btt317. Epub 2013 Jun 4.

Treatment of estrogen receptor-positive breast cancer.雌激素受体阳性乳腺癌的治疗。

Curr Med Chem. 2013;20(5):596-604. doi: 10.2174/092986713804999303.

Biomedical text mining and its applications in cancer research.生物医学文本挖掘及其在癌症研究中的应用。

J Biomed Inform. 2013 Apr;46(2):200-11. doi: 10.1016/j.jbi.2012.10.007. Epub 2012 Nov 15.

Text-mining solutions for biomedical research: enabling integrative biology.文本挖掘在生物医学研究中的应用：实现综合生物学。

Nat Rev Genet. 2012 Dec;13(12):829-39. doi: 10.1038/nrg3337. Epub 2012 Nov 14.

Gene-disease network analysis reveals functional modules in mendelian, complex and environmental diseases.基因-疾病网络分析揭示了孟德尔、复杂和环境疾病中的功能模块。

PLoS One. 2011;6(6):e20284. doi: 10.1371/journal.pone.0020284. Epub 2011 Jun 14.

Combining literature text mining with microarray data: advances for system biology modeling.结合文献文本挖掘和微阵列数据：系统生物学建模的进展。

Brief Bioinform. 2012 Jan;13(1):61-82. doi: 10.1093/bib/bbr018. Epub 2011 Jun 15.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

整合文本挖掘、数据挖掘和网络分析以识别遗传性乳腺癌趋势。

Integrating text mining, data mining, and network analysis for identifying genetic breast cancer trends.

作者信息

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSIONS

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献