• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

KiPar,一种用于系统检索酵母代谢途径动力学建模参数相关信息的工具。

KiPar, a tool for systematic information retrieval regarding parameters for kinetic modelling of yeast metabolic pathways.

作者信息

Spasic Irena, Simeonidis Evangelos, Messiha Hanan L, Paton Norman W, Kell Douglas B

机构信息

Manchester Centre for Integrative Systems Biology, The University of Manchester, Manchester, UK.

出版信息

Bioinformatics. 2009 Jun 1;25(11):1404-11. doi: 10.1093/bioinformatics/btp175. Epub 2009 Mar 31.

DOI:10.1093/bioinformatics/btp175
PMID:19336445
Abstract

MOTIVATION

Most experimental evidence on kinetic parameters is buried in the literature, whose manual searching is complex, time consuming and partial. These shortcomings become particularly acute in systems biology, where these parameters need to be integrated into detailed, genome-scale, metabolic models. These problems are addressed by KiPar, a dedicated information retrieval system designed to facilitate access to the literature relevant for kinetic modelling of a given metabolic pathway in yeast. Searching for kinetic data in the context of an individual pathway offers modularity as a way of tackling the complexity of developing a full metabolic model. It is also suitable for large-scale mining, since multiple reactions and their kinetic parameters can be specified in a single search request, rather than one reaction at a time, which is unsuitable given the size of genome-scale models.

RESULTS

We developed an integrative approach, combining public data and software resources for the rapid development of large-scale text mining tools targeting complex biological information. The user supplies input in the form of identifiers used in relevant data resources to refer to the concepts of interest, e.g. EC numbers, GO and SBO identifiers. By doing so, the user is freed from providing any other knowledge or terminology concerned with these concepts and their relations, since they are retrieved from these and cross-referenced resources automatically. The terminology acquired is used to index the literature by mapping concepts to their synonyms, and then to textual documents mentioning them. The indexing results and the previously acquired knowledge about relations between concepts are used to formulate complex search queries aiming at documents relevant to the user's information needs. The conceptual approach is demonstrated in the implementation of KiPar. Evaluation reveals that KiPar performs better than a Boolean search. The precision achieved for abstracts (60%) and full-text articles (48%) is considerably better than the baseline precision (44% and 24%, respectively). The baseline recall is improved by 36% for abstracts and by 100% for full text. It appears that full-text articles are a much richer source of information on kinetic data than are their abstracts. Finally, the combined results for abstracts and full text compared with the curated literature provide high values for relative recall (88%) and novelty ratio (92%), suggesting that the system is able to retrieve a high proportion of new documents.

AVAILABILITY

Source code and documentation are available at: (http://www.mcisb.org/resources/kipar/).

摘要

动机

大多数关于动力学参数的实验证据都埋没在文献中,人工检索这些文献复杂、耗时且不全面。在系统生物学中,这些缺点尤为突出,因为需要将这些参数整合到详细的、全基因组规模的代谢模型中。KiPar解决了这些问题,它是一个专门的信息检索系统,旨在方便获取与酵母中给定代谢途径动力学建模相关的文献。在单个途径的背景下搜索动力学数据提供了模块化,作为应对开发完整代谢模型复杂性的一种方式。它也适用于大规模挖掘,因为可以在单个搜索请求中指定多个反应及其动力学参数,而不是一次指定一个反应,鉴于全基因组规模模型的大小,一次指定一个反应是不合适的。

结果

我们开发了一种综合方法,结合公共数据和软件资源,用于快速开发针对复杂生物信息的大规模文本挖掘工具。用户以相关数据资源中使用的标识符的形式提供输入,以指代感兴趣的概念,例如酶委员会编号、基因本体(GO)和系统生物学本体(SBO)标识符。通过这样做,用户无需提供与这些概念及其关系相关的任何其他知识或术语,因为它们会自动从这些资源和交叉引用的资源中检索出来。获取的术语用于通过将概念映射到其同义词,然后映射到提及它们的文本文档来对文献进行索引。索引结果以及先前获取的关于概念之间关系的知识用于制定针对与用户信息需求相关的文档的复杂搜索查询。KiPar的实现展示了这种概念方法。评估表明,KiPar的性能优于布尔搜索。摘要(60%)和全文文章(48%)的精确率明显高于基线精确率(分别为44%和24%)。摘要的基线召回率提高了36%,全文的基线召回率提高了100%。似乎全文文章是关于动力学数据的比其摘要丰富得多的信息来源。最后,与经过整理的文献相比,摘要和全文的综合结果提供了较高的相对召回率(88%)和新颖率(92%),表明该系统能够检索出高比例的新文档。

可用性

源代码和文档可在以下网址获取:(http://www.mcisb.org/resources/kipar/) 。

相似文献

1
KiPar, a tool for systematic information retrieval regarding parameters for kinetic modelling of yeast metabolic pathways.KiPar,一种用于系统检索酵母代谢途径动力学建模参数相关信息的工具。
Bioinformatics. 2009 Jun 1;25(11):1404-11. doi: 10.1093/bioinformatics/btp175. Epub 2009 Mar 31.
2
SysBioMed report: advancing systems biology for medical applications.系统生物医学报告:推动系统生物学在医学应用中的发展。
IET Syst Biol. 2009 May;3(3):131-6. doi: 10.1049/iet-syb.2009.0005.
3
KEGGanim: pathway animations for high-throughput data.KEGGanim:用于高通量数据的通路动画
Bioinformatics. 2008 Feb 15;24(4):588-90. doi: 10.1093/bioinformatics/btm581. Epub 2007 Dec 1.
4
Automated recognition of malignancy mentions in biomedical literature.生物医学文献中恶性肿瘤提及的自动识别。
BMC Bioinformatics. 2006 Nov 7;7:492. doi: 10.1186/1471-2105-7-492.
5
An automated procedure to identify biomedical articles that contain cancer-associated gene variants.一种识别包含癌症相关基因变异的生物医学文章的自动化程序。
Hum Mutat. 2006 Sep;27(9):957-64. doi: 10.1002/humu.20363.
6
BlotBase: a northern blot database.印迹数据库:一个Northern印迹数据库。
Gene. 2008 Dec 31;427(1-2):47-50. doi: 10.1016/j.gene.2008.08.026. Epub 2008 Sep 18.
7
ReMatch: a web-based tool to construct, store and share stoichiometric metabolic models with carbon maps for metabolic flux analysis.ReMatch:一个基于网络的工具,用于构建、存储和共享带有碳图的化学计量代谢模型以进行代谢通量分析。
J Integr Bioinform. 2008 Aug 25;5(2):102. doi: 10.2390/biecoll-jib-2008-102.
8
DESHARKY: automatic design of metabolic pathways for optimal cell growth.DESHARKY:用于优化细胞生长的代谢途径自动设计。
Bioinformatics. 2008 Nov 1;24(21):2554-6. doi: 10.1093/bioinformatics/btn471. Epub 2008 Sep 5.
9
Facts from text: can text mining help to scale-up high-quality manual curation of gene products with ontologies?文本中的事实:文本挖掘能否助力利用本体对基因产物进行大规模高质量人工编目?
Brief Bioinform. 2008 Nov;9(6):466-78. doi: 10.1093/bib/bbn043. Epub 2008 Dec 6.
10
CARMEN - Comparative Analysis and in silico Reconstruction of organism-specific MEtabolic Networks.CARMEN - 特定生物体代谢网络的比较分析与计算机重建
Genet Mol Res. 2010 Aug 24;9(3):1660-72. doi: 10.4238/vol9-3gmr901.

引用本文的文献

1
An automated approach to identify scientific publications reporting pharmacokinetic parameters.一种识别报告药代动力学参数的科学出版物的自动化方法。
Wellcome Open Res. 2021 Apr 21;6:88. doi: 10.12688/wellcomeopenres.16718.1. eCollection 2021.
2
Ion Channel ElectroPhysiology Ontology (ICEPO) - a case study of text mining assisted ontology development.离子通道电生理学本体论(ICEPO)——文本挖掘辅助本体论开发的案例研究。
AMIA Jt Summits Transl Sci Proc. 2016 Jul 20;2016:42-51. eCollection 2016.
3
Event-based text mining for biology and functional genomics.
用于生物学和功能基因组学的基于事件的文本挖掘
Brief Funct Genomics. 2015 May;14(3):213-30. doi: 10.1093/bfgp/elu015. Epub 2014 Jun 6.
4
Automated extraction and semantic analysis of mutation impacts from the biomedical literature.从生物医学文献中自动提取和语义分析突变影响。
BMC Genomics. 2012 Jun 18;13 Suppl 4(Suppl 4):S10. doi: 10.1186/1471-2164-13-S4-S10.
5
Semantic annotation of biological concepts interplaying microbial cellular responses.生物概念的语义标注在微生物细胞反应中的相互作用。
BMC Bioinformatics. 2011 Nov 28;12:460. doi: 10.1186/1471-2105-12-460.
6
KID--an algorithm for fast and efficient text mining used to automatically generate a database containing kinetic information of enzymes.KID 是一种快速高效的文本挖掘算法,用于自动生成包含酶动力学信息的数据库。
BMC Bioinformatics. 2010 Jul 13;11:375. doi: 10.1186/1471-2105-11-375.