Krallinger Martin, Rodriguez-Penagos Carlos, Tendulkar Ashish, Valencia Alfonso
Structural Biology and Biocomputing programme, Spanish National Cancer Center (CNIO), Melchor Fernandez Almagro 3, Madrid, 28029, Spain.
Nucleic Acids Res. 2009 Jul;37(Web Server issue):W160-5. doi: 10.1093/nar/gkp484. Epub 2009 Jun 11.
There is an increasing interest in using literature mining techniques to complement information extracted from annotation databases or generated by bioinformatics applications. Here we present PLAN2L, a web-based online search system that integrates text mining and information extraction techniques to access systematically information useful for analyzing genetic, cellular and molecular aspects of the plant model organism Arabidopsis thaliana. Our system facilitates a more efficient retrieval of information relevant to heterogeneous biological topics, from implications in biological relationships at the level of protein interactions and gene regulation, to sub-cellular locations of gene products and associations to cellular and developmental processes, i.e. cell cycle, flowering, root, leaf and seed development. Beyond single entities, also predefined pairs of entities can be provided as queries for which literature-derived relations together with textual evidences are returned. PLAN2L does not require registration and is freely accessible at http://zope.bioinfo.cnio.es/plan2l.
利用文献挖掘技术来补充从注释数据库中提取的信息或由生物信息学应用程序生成的信息,这一做法正受到越来越多的关注。在此,我们展示了PLAN2L,这是一个基于网络的在线搜索系统,它整合了文本挖掘和信息提取技术,以便系统地获取对分析植物模式生物拟南芥的遗传、细胞和分子方面有用的信息。我们的系统有助于更高效地检索与异质生物学主题相关的信息,这些主题涵盖从蛋白质相互作用和基因调控层面的生物学关系,到基因产物的亚细胞定位以及与细胞和发育过程(即细胞周期、开花、根、叶和种子发育)的关联。除了单个实体之外,还可以提供预定义的实体对作为查询,系统会返回源自文献的关系以及文本证据。PLAN2L无需注册,可通过http://zope.bioinfo.cnio.es/plan2l免费访问。