• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用语义分析预测新的人类基因本体论注释。

Predicting novel human gene ontology annotations using semantic analysis.

机构信息

Department of Computer Science, Wayne State University, 5143 Cass Ave., Detroit, MI 48202, USA.

出版信息

IEEE/ACM Trans Comput Biol Bioinform. 2010 Jan-Mar;7(1):91-9. doi: 10.1109/TCBB.2008.29.

DOI:10.1109/TCBB.2008.29
PMID:20150671
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3712327/
Abstract

The correct interpretation of many molecular biology experiments depends in an essential way on the accuracy and consistency of the existing annotation databases. Such databases are meant to act as repositories for our biological knowledge as we acquire and refine it. Hence, by definition, they are incomplete at any given time. In this paper, we describe a technique that improves our previous method for predicting novel GO annotations by extracting implicit semantic relationships between genes and functions. In this work, we use a vector space model and a number of weighting schemes in addition to our previous latent semantic indexing approach. The technique described here is able to take into consideration the hierarchical structure of the Gene Ontology (GO) and can weight differently GO terms situated at different depths. The prediction abilities of 15 different weighting schemes are compared and evaluated. Nine such schemes were previously used in other problem domains, while six of them are introduced in this paper. The best weighting scheme was a novel scheme, n2tn. Out of the top 50 functional annotations predicted using this weighting scheme, we found support in the literature for 84 percent of them, while 6 percent of the predictions were contradicted by the existing literature. For the remaining 10 percent, we did not find any relevant publications to confirm or contradict the predictions. The n2tn weighting scheme also outperformed the simple binary scheme used in our previous approach.

摘要

许多分子生物学实验的正确解释在很大程度上依赖于现有注释数据库的准确性和一致性。这些数据库旨在作为我们获取和完善生物知识的知识库。因此,从定义上讲,它们在任何给定的时间都是不完整的。在本文中,我们描述了一种技术,通过提取基因和功能之间的隐含语义关系,改进了我们以前用于预测新的 GO 注释的方法。在这项工作中,我们除了使用先前的潜在语义索引方法之外,还使用了向量空间模型和几种加权方案。这里描述的技术能够考虑到基因本体论 (GO) 的层次结构,并可以对位于不同深度的 GO 术语进行不同的加权。比较和评估了 15 种不同加权方案的预测能力。其中 9 种方案以前在其他问题领域中使用过,而其中 6 种是本文中引入的。最佳加权方案是一种新方案 n2tn。在使用这种加权方案预测的前 50 个功能注释中,我们在文献中找到了 84%的注释的支持,而 6%的预测与现有文献相矛盾。对于剩下的 10%,我们没有找到任何相关的出版物来证实或反驳这些预测。n2tn 加权方案也优于我们以前方法中使用的简单二进制方案。

相似文献

1
Predicting novel human gene ontology annotations using semantic analysis.利用语义分析预测新的人类基因本体论注释。
IEEE/ACM Trans Comput Biol Bioinform. 2010 Jan-Mar;7(1):91-9. doi: 10.1109/TCBB.2008.29.
2
A semantic analysis of the annotations of the human genome.人类基因组注释的语义分析。
Bioinformatics. 2005 Aug 15;21(16):3416-21. doi: 10.1093/bioinformatics/bti538. Epub 2005 Jun 14.
3
GOChase-II: correcting semantic inconsistencies from Gene Ontology-based annotations for gene products.GOChase-II:纠正基于基因本体论注释的基因产物中的语义不一致性。
BMC Bioinformatics. 2011 Feb 15;12 Suppl 1(Suppl 1):S40. doi: 10.1186/1471-2105-12-S1-S40.
4
Computational algorithms to predict Gene Ontology annotations.预测基因本体注释的计算算法。
BMC Bioinformatics. 2015;16 Suppl 6(Suppl 6):S4. doi: 10.1186/1471-2105-16-S6-S4. Epub 2015 Apr 17.
5
GOcats: A tool for categorizing Gene Ontology into subgraphs of user-defined concepts.GOcats:一个将基因本体论分类为用户定义概念子图的工具。
PLoS One. 2020 Jun 11;15(6):e0233311. doi: 10.1371/journal.pone.0233311. eCollection 2020.
6
A relation based measure of semantic similarity for Gene Ontology annotations.一种基于关系的基因本体注释语义相似度度量方法。
BMC Bioinformatics. 2008 Nov 4;9:468. doi: 10.1186/1471-2105-9-468.
7
GO functional similarity clustering depends on similarity measure, clustering method, and annotation completeness.GO 功能相似性聚类取决于相似性度量、聚类方法和注释完整性。
BMC Bioinformatics. 2019 Mar 27;20(1):155. doi: 10.1186/s12859-019-2752-2.
8
Novelty Indicator for Enhanced Prioritization of Predicted Gene Ontology Annotations.新型指标提高预测基因本体论注释的优先级。
IEEE/ACM Trans Comput Biol Bioinform. 2018 May-Jun;15(3):954-965. doi: 10.1109/TCBB.2017.2695459. Epub 2017 Apr 18.
9
Information theory applied to the sparse gene ontology annotation network to predict novel gene function.信息论应用于稀疏基因本体注释网络以预测新的基因功能。
Bioinformatics. 2007 Jul 1;23(13):i529-38. doi: 10.1093/bioinformatics/btm195.
10
Ontology-Based Prediction and Prioritization of Gene Functional Annotations.基于本体的基因功能注释预测与优先级排序
IEEE/ACM Trans Comput Biol Bioinform. 2016 Mar-Apr;13(2):248-60. doi: 10.1109/TCBB.2015.2459694.

引用本文的文献

1
HEC-ASD: a hybrid ensemble-based classification model for predicting autism spectrum disorder disease genes.HEC-ASD:一种基于混合集成的自闭症谱系障碍疾病基因预测分类模型。
BMC Bioinformatics. 2022 Dec 21;23(1):554. doi: 10.1186/s12859-022-05099-7.
2
Gene function finding through cross-organism ensemble learning.通过跨物种集成学习进行基因功能发现。
BioData Min. 2021 Feb 12;14(1):14. doi: 10.1186/s13040-021-00239-w.
3
A Literature Review of Gene Function Prediction by Modeling Gene Ontology.基于基因本体建模的基因功能预测文献综述

本文引用的文献

1
Ontological analysis of gene expression data: current tools, limitations, and open problems.基因表达数据的本体分析:当前工具、局限性及开放问题
Bioinformatics. 2005 Sep 15;21(18):3587-95. doi: 10.1093/bioinformatics/bti565. Epub 2005 Jun 30.
2
A semantic analysis of the annotations of the human genome.人类基因组注释的语义分析。
Bioinformatics. 2005 Aug 15;21(16):3416-21. doi: 10.1093/bioinformatics/bti538. Epub 2005 Jun 14.
3
Textpresso: an ontology-based information retrieval and extraction system for biological literature.
Front Genet. 2020 Apr 24;11:400. doi: 10.3389/fgene.2020.00400. eCollection 2020.
4
Protein Function Prediction Using Deep Restricted Boltzmann Machines.使用深度受限玻尔兹曼机进行蛋白质功能预测。
Biomed Res Int. 2017;2017:1729301. doi: 10.1155/2017/1729301. Epub 2017 Jun 28.
5
NoGOA: predicting noisy GO annotations using evidences and sparse representation.NoGOA:利用证据和稀疏表示预测有噪声的基因本体注释
BMC Bioinformatics. 2017 Jul 21;18(1):350. doi: 10.1186/s12859-017-1764-z.
6
Predicting protein function via downward random walks on a gene ontology.通过在基因本体上进行向下随机游走预测蛋白质功能。
BMC Bioinformatics. 2015 Aug 27;16:271. doi: 10.1186/s12859-015-0713-y.
7
A method of searching for related literature on protein structure analysis by considering a user's intention.一种通过考虑用户意图来搜索蛋白质结构分析相关文献的方法。
BMC Bioinformatics. 2015;16 Suppl 7(Suppl 7):S4. doi: 10.1186/1471-2105-16-S7-S4. Epub 2015 Apr 23.
8
Hierarchical ensemble methods for protein function prediction.用于蛋白质功能预测的分层集成方法。
ISRN Bioinform. 2014 May 4;2014:901419. doi: 10.1155/2014/901419. eCollection 2014.
9
Computational algorithms to predict Gene Ontology annotations.预测基因本体注释的计算算法。
BMC Bioinformatics. 2015;16 Suppl 6(Suppl 6):S4. doi: 10.1186/1471-2105-16-S6-S4. Epub 2015 Apr 17.
10
Gene function prediction based on the Gene Ontology hierarchical structure.基于基因本体层次结构的基因功能预测
PLoS One. 2014 Sep 5;9(9):e107187. doi: 10.1371/journal.pone.0107187. eCollection 2014.
Textpresso:一个基于本体的生物文献信息检索与提取系统。
PLoS Biol. 2004 Nov;2(11):e309. doi: 10.1371/journal.pbio.0020309. Epub 2004 Sep 21.
4
Gene clustering by latent semantic indexing of MEDLINE abstracts.通过MEDLINE摘要的潜在语义索引进行基因聚类。
Bioinformatics. 2005 Jan 1;21(1):104-15. doi: 10.1093/bioinformatics/bth464. Epub 2004 Aug 12.
5
Onto-Tools: an ensemble of web-accessible, ontology-based tools for the functional design and interpretation of high-throughput gene expression experiments.Onto-Tools:一套基于本体的网络工具,用于高通量基因表达实验的功能设计与解读。
Nucleic Acids Res. 2004 Jul 1;32(Web Server issue):W449-56. doi: 10.1093/nar/gkh409.
6
B.E.A.R. GeneInfo: a tool for identifying gene-related biomedical publications through user modifiable queries.B.E.A.R.基因信息:一种通过用户可修改查询来识别与基因相关的生物医学出版物的工具。
BMC Bioinformatics. 2004 Apr 29;5:46. doi: 10.1186/1471-2105-5-46.
7
FatiGO: a web tool for finding significant associations of Gene Ontology terms with groups of genes.FatiGO:一个用于查找基因本体术语与基因组之间显著关联的网络工具。
Bioinformatics. 2004 Mar 1;20(4):578-80. doi: 10.1093/bioinformatics/btg455. Epub 2004 Jan 22.
8
GOTree Machine (GOTM): a web-based platform for interpreting sets of interesting genes using Gene Ontology hierarchies.基因本体树状图机器(GOTM):一个基于网络的平台,用于使用基因本体层次结构解释感兴趣的基因集。
BMC Bioinformatics. 2004 Feb 18;5:16. doi: 10.1186/1471-2105-5-16.
9
GOstat: find statistically overrepresented Gene Ontologies within a group of genes.GOstat:在一组基因中查找统计学上过度富集的基因本体。
Bioinformatics. 2004 Jun 12;20(9):1464-5. doi: 10.1093/bioinformatics/bth088. Epub 2004 Feb 12.
10
Genomics. Microarrays--guilt by association.基因组学。微阵列——关联有罪推断。
Science. 2003 Oct 10;302(5643):240-1. doi: 10.1126/science.1090887.