Suppr超能文献

生物分子相互作用的自动提取:一种经验方法。

Automatic extraction of biomolecular interactions: an empirical approach.

机构信息

Siemens Corporate Research, Princeton, NJ, USA.

出版信息

BMC Bioinformatics. 2013 Jul 24;14:234. doi: 10.1186/1471-2105-14-234.

Abstract

BACKGROUND

We describe a method for extracting data about how biomolecule pairs interact from texts. This method relies on empirically determined characteristics of sentences. The characteristics are efficient to compute, making this approach to extraction of biomolecular interactions scalable. The results of such interaction mining can support interaction network annotation, question answering, database construction, and other applications.

RESULTS

We constructed a software system to search MEDLINE for sentences likely to describe interactions between given biomolecules. The system extracts a list of the interaction-indicating terms appearing in those sentences, then ranks those terms based on their likelihood of correctly characterizing how the biomolecules interact. The ranking process uses a tf-idf (term frequency-inverse document frequency) based technique using empirically derived knowledge about sentences, and was applied to the MEDLINE literature collection. Software was developed as part of the MetNet toolkit (http://www.metnetdb.org).

CONCLUSIONS

Specific, efficiently computable characteristics of sentences about biomolecular interactions were analyzed to better understand how to use these characteristics to extract how biomolecules interact.The text empirics method that was investigated, though arising from a classical tradition, has yet to be fully explored for the task of extracting biomolecular interactions from the literature. The conclusions we reach about the sentence characteristics investigated in this work, as well as the technique itself, could be used by other systems to provide evidence about putative interactions, thus supporting efforts to maximize the ability of hybrid systems to support such tasks as annotating and constructing interaction networks.

摘要

背景

我们描述了一种从文本中提取生物分子对相互作用信息的方法。该方法依赖于经验确定的句子特征。这些特征计算效率高,使这种生物分子相互作用提取方法具有可扩展性。这种交互挖掘的结果可以支持交互网络注释、问答、数据库构建和其他应用。

结果

我们构建了一个软件系统,用于在 MEDLINE 中搜索可能描述给定生物分子相互作用的句子。该系统提取了这些句子中出现的交互指示项列表,然后根据它们正确描述生物分子相互作用的可能性对这些术语进行排名。排名过程使用了基于 tf-idf(词频-逆文档频率)的技术,并利用了关于句子的经验知识,应用于 MEDLINE 文献集。该软件是 MetNet 工具包(http://www.metnetdb.org)的一部分。

结论

分析了关于生物分子相互作用的句子的具体、高效可计算特征,以更好地理解如何利用这些特征来提取生物分子的相互作用方式。尽管文本实证方法源于经典传统,但尚未充分探索从文献中提取生物分子相互作用的任务。我们在这项工作中调查的句子特征以及该技术本身的结论,可以被其他系统用于提供关于假定相互作用的证据,从而支持最大限度地提高混合系统支持此类任务(如注释和构建交互网络)的能力的努力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/428d/3729816/e74219248bb4/1471-2105-14-234-1.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验