Department of Pharmacology, Addiction Science and Toxicology, University of Tennessee Health Science, Memphis, TN 38103, USA.
Department of Genetics, Genomics and Informatics, University of Tennessee Health Science, Memphis, TN 38103, USA.
G3 (Bethesda). 2022 May 6;12(5). doi: 10.1093/g3journal/jkac059.
Interpreting and integrating results from omics studies typically requires a comprehensive and time consuming survey of extant literature. GeneCup is a literature mining web service that retrieves sentences containing user-provided gene symbols and keywords from PubMed abstracts. The keywords are organized into an ontology and can be extended to include results from human genome-wide association studies. We provide a drug addiction keyword ontology that contains over 300 keywords as an example. The literature search is conducted by querying the PubMed server using a programming interface, which is followed by retrieving abstracts from a local copy of the PubMed archive. The main results presented to the user are sentences where gene symbol and keywords co-occur. These sentences are presented through an interactive graphical interface or as tables. All results are linked to the original abstract in PubMed. In addition, a convolutional neural network is employed to distinguish sentences describing systemic stress from those describing cellular stress. The automated and comprehensive search strategy provided by GeneCup facilitates the integration of new discoveries from omic studies with existing literature. GeneCup is free and open source software. The source code of GeneCup and the link to a running instance is available at https://github.com/hakangunturkun/GeneCup.
从组学研究中解释和整合结果通常需要对现有文献进行全面且耗时的调查。GeneCup 是一种文献挖掘网络服务,可从 PubMed 摘要中检索包含用户提供的基因符号和关键字的句子。关键字被组织到一个本体中,并且可以扩展到包括人类全基因组关联研究的结果。我们提供了一个药物成瘾关键字本体,其中包含 300 多个关键字作为示例。文献搜索是通过使用编程接口查询 PubMed 服务器来进行的,然后从 PubMed 存档的本地副本中检索摘要。主要向用户呈现的结果是基因符号和关键字共同出现的句子。这些句子通过交互式图形界面或表格呈现。所有结果都链接到 PubMed 中的原始摘要。此外,还使用卷积神经网络来区分描述全身应激的句子和描述细胞应激的句子。GeneCup 提供的自动化和全面的搜索策略有助于将组学研究的新发现与现有文献相结合。GeneCup 是免费的开源软件。GeneCup 的源代码和运行实例的链接可在 https://github.com/hakangunturkun/GeneCup 上获得。