• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

关于真相与路径:在无数文章中追寻点滴信息。

Of truth and pathways: chasing bits of information through myriads of articles.

作者信息

Krauthammer Michael, Kra Pauline, Iossifov Ivan, Gomez Shawn M, Hripcsak George, Hatzivassiloglou Vasileios, Friedman Carol, Rzhetsky Andrey

机构信息

Department of Medical Informatics, Columbia University, New York, NY 10032, USA.

出版信息

Bioinformatics. 2002;18 Suppl 1:S249-57. doi: 10.1093/bioinformatics/18.suppl_1.s249.

DOI:10.1093/bioinformatics/18.suppl_1.s249
PMID:12169554
Abstract

Knowledge on interactions between molecules in living cells is indispensable for theoretical analysis and practical applications in modern genomics and molecular biology. Building such networks relies on the assumption that the correct molecular interactions are known or can be identified by reading a few research articles. However, this assumption does not necessarily hold, as truth is rather an emerging property based on many potentially conflicting facts. This paper explores the processes of knowledge generation and publishing in the molecular biology literature using modelling and analysis of real molecular interaction data. The data analysed in this article were automatically extracted from 50000 research articles in molecular biology using a computer system called GeneWays containing a natural language processing module. The paper indicates that truthfulness of statements is associated in the minds of scientists with the relative importance (connectedness) of substances under study, revealing a potential selection bias in the reporting of research results. Aiming at understanding the statistical properties of the life cycle of biological facts reported in research articles, we formulate a stochastic model describing generation and propagation of knowledge about molecular interactions through scientific publications. We hope that in the future such a model can be useful for automatically producing consensus views of molecular interaction data.

摘要

了解活细胞中分子间的相互作用对于现代基因组学和分子生物学的理论分析及实际应用而言不可或缺。构建此类网络依赖于这样一种假设,即正确的分子相互作用是已知的,或者可以通过阅读几篇研究文章来识别。然而,这一假设不一定成立,因为真相实际上是基于许多可能相互矛盾的事实而产生的一种属性。本文利用对真实分子相互作用数据的建模与分析,探讨了分子生物学文献中的知识生成与发表过程。本文所分析的数据是使用一个名为GeneWays的包含自然语言处理模块的计算机系统,从50000篇分子生物学研究文章中自动提取的。本文指出,在科学家的认知中,陈述的真实性与所研究物质的相对重要性(关联性)相关,这揭示了研究结果报告中存在潜在的选择偏差。为了理解研究文章中所报告的生物学事实生命周期的统计特性,我们构建了一个随机模型,描述通过科学出版物产生和传播分子相互作用知识的过程。我们希望未来这样的模型能够有助于自动生成分子相互作用数据的共识观点。

相似文献

1
Of truth and pathways: chasing bits of information through myriads of articles.关于真相与路径:在无数文章中追寻点滴信息。
Bioinformatics. 2002;18 Suppl 1:S249-57. doi: 10.1093/bioinformatics/18.suppl_1.s249.
2
GeneWays: a system for extracting, analyzing, visualizing, and integrating molecular pathway data.基因途径系统:一个用于提取、分析、可视化和整合分子途径数据的系统。
J Biomed Inform. 2004 Feb;37(1):43-53. doi: 10.1016/j.jbi.2003.10.001.
3
Emergent behavior of growing knowledge about molecular interactions.关于分子相互作用的知识增长的涌现行为。
Nat Biotechnol. 2005 Oct;23(10):1243-7. doi: 10.1038/nbt1005-1243.
4
Extraction of biological interaction networks from scientific literature.从科学文献中提取生物相互作用网络。
Brief Bioinform. 2005 Sep;6(3):263-76. doi: 10.1093/bib/6.3.263.
5
The emerging in-silico scientist: how text-based bioinformatics is bridging biology and artificial intelligence.新兴的虚拟科学家:基于文本的生物信息学如何架起生物学与人工智能之间的桥梁。
IEEE Eng Med Biol Mag. 2004 Mar-Apr;23(2):87-93. doi: 10.1109/memb.2004.1310989.
6
Wnt pathway curation using automated natural language processing: combining statistical methods with partial and full parse for knowledge extraction.使用自动自然语言处理技术对Wnt信号通路进行整理:结合统计方法与部分及完全句法分析进行知识提取。
Bioinformatics. 2005 Apr 15;21(8):1653-8. doi: 10.1093/bioinformatics/bti165. Epub 2004 Nov 25.
7
Discovering patterns to extract protein-protein interactions from full texts.从全文中发现提取蛋白质-蛋白质相互作用的模式。
Bioinformatics. 2004 Dec 12;20(18):3604-12. doi: 10.1093/bioinformatics/bth451. Epub 2004 Jul 29.
8
Probabilistic inference of molecular networks from noisy data sources.从噪声数据源进行分子网络的概率推断。
Bioinformatics. 2004 May 22;20(8):1205-13. doi: 10.1093/bioinformatics/bth061. Epub 2004 Feb 10.
9
Automatic pathway building in biological association networks.生物关联网络中的自动通路构建
BMC Bioinformatics. 2006 Mar 24;7:171. doi: 10.1186/1471-2105-7-171.
10
Extracting human protein interactions from MEDLINE using a full-sentence parser.使用全句解析器从MEDLINE中提取人类蛋白质相互作用。
Bioinformatics. 2004 Mar 22;20(5):604-11. doi: 10.1093/bioinformatics/btg452. Epub 2004 Jan 22.

引用本文的文献

1
Looking at cerebellar malformations through text-mined interactomes of mice and humans.通过挖掘小鼠和人类的互作网络来观察小脑畸形。
PLoS Comput Biol. 2009 Nov;5(11):e1000559. doi: 10.1371/journal.pcbi.1000559. Epub 2009 Nov 6.
2
Are figure legends sufficient? Evaluating the contribution of associated text to biomedical figure comprehension.图注是否足够?评估相关文本对生物医学图理解的贡献。
J Biomed Discov Collab. 2009 Jan 6;4:1. doi: 10.1186/1747-5333-4-1.
3
Multi-dimensional classification of biomedical text: toward automated, practical provision of high-utility text to diverse users.
生物医学文本的多维分类:致力于为不同用户自动提供实用价值高的文本。
Bioinformatics. 2008 Sep 15;24(18):2086-93. doi: 10.1093/bioinformatics/btn381. Epub 2008 Aug 20.
4
A cheminformatic toolkit for mining biomedical knowledge.一种用于挖掘生物医学知识的化学信息学工具包。
Pharm Res. 2007 Oct;24(10):1791-802. doi: 10.1007/s11095-007-9285-5. Epub 2007 Mar 24.
5
New directions in biomedical text annotation: definitions, guidelines and corpus construction.生物医学文本注释的新方向:定义、指南与语料库构建
BMC Bioinformatics. 2006 Jul 25;7:356. doi: 10.1186/1471-2105-7-356.
6
Microparadigms: chains of collective reasoning in publications about molecular interactions.微观范式:关于分子相互作用的出版物中的集体推理链
Proc Natl Acad Sci U S A. 2006 Mar 28;103(13):4940-5. doi: 10.1073/pnas.0600591103. Epub 2006 Mar 16.
7
MachineProse: an ontological framework for scientific assertions.机器散文:一种用于科学断言的本体框架。
J Am Med Inform Assoc. 2006 Mar-Apr;13(2):220-32. doi: 10.1197/jamia.M1910. Epub 2005 Dec 15.
8
Visualizing information across multidimensional post-genomic structured and textual databases.跨多维度后基因组结构化和文本数据库可视化信息。
Bioinformatics. 2005 Apr 15;21(8):1659-67. doi: 10.1093/bioinformatics/bti210. Epub 2004 Dec 14.
9
Kinase pathway database: an integrated protein-kinase and NLP-based protein-interaction resource.激酶途径数据库:一个基于整合蛋白激酶和自然语言处理的蛋白质相互作用资源库。
Genome Res. 2003 Jun;13(6A):1231-43. doi: 10.1101/gr.835903.