• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于基因本体层次结构保持哈希的基因功能预测。

Gene function prediction based on Gene Ontology Hierarchy Preserving Hashing.

机构信息

College of Computer and Information Science, Southwest University, Chongqing 400715, China.

School of Electrical and Information Engineering, Beijing University of Civil Engineering and Architecture, Beijing 100044, China; Beijing Key Laboratory of Intelligent Processing for Building Big Data, Beijing 100044, China.

出版信息

Genomics. 2019 May;111(3):334-342. doi: 10.1016/j.ygeno.2018.02.008. Epub 2018 Feb 23.

DOI:10.1016/j.ygeno.2018.02.008
PMID:29477548
Abstract

Gene Ontology (GO) uses structured vocabularies (or terms) to describe the molecular functions, biological roles, and cellular locations of gene products in a hierarchical ontology. GO annotations associate genes with GO terms and indicate the given gene products carrying out the biological functions described by the relevant terms. However, predicting correct GO annotations for genes from a massive set of GO terms as defined by GO is a difficult challenge. To combat with this challenge, we introduce a Gene Ontology Hierarchy Preserving Hashing (HPHash) based semantic method for gene function prediction. HPHash firstly measures the taxonomic similarity between GO terms. It then uses a hierarchy preserving hashing technique to keep the hierarchical order between GO terms, and to optimize a series of hashing functions to encode massive GO terms via compact binary codes. After that, HPHash utilizes these hashing functions to project the gene-term association matrix into a low-dimensional one and performs semantic similarity based gene function prediction in the low-dimensional space. Experimental results on three model species (Homo sapiens, Mus musculus and Rattus norvegicus) for interspecies gene function prediction show that HPHash performs better than other related approaches and it is robust to the number of hash functions. In addition, we also take HPHash as a plugin for BLAST based gene function prediction. From the experimental results, HPHash again significantly improves the prediction performance. The codes of HPHash are available at: http://mlda.swu.edu.cn/codes.php?name=HPHash.

摘要

GO 术语(GO)使用结构化词汇(或术语)来描述基因产物的分子功能、生物功能和细胞位置,其采用分层本体。GO 注释将基因与 GO 术语相关联,并指出具有相关术语所描述的生物功能的特定基因产物。然而,从 GO 定义的大量 GO 术语中预测基因的正确 GO 注释是一项具有挑战性的任务。为了解决这个挑战,我们引入了一种基于基因本体论层次保持哈希(HPHash)的语义方法用于基因功能预测。HPHash 首先测量 GO 术语之间的分类相似性。然后,它使用层次保持哈希技术来保持 GO 术语之间的层次顺序,并优化一系列哈希函数,通过紧凑的二进制代码对大量 GO 术语进行编码。之后,HPHash 利用这些哈希函数将基因-术语关联矩阵投影到低维空间中,并在低维空间中进行基于语义相似性的基因功能预测。在三种模式物种(人类、小鼠和大鼠)之间进行物种间基因功能预测的实验结果表明,HPHash 的性能优于其他相关方法,并且对哈希函数的数量具有鲁棒性。此外,我们还将 HPHash 作为基于 BLAST 的基因功能预测的插件。从实验结果来看,HPHash 再次显著提高了预测性能。HPHash 的代码可在以下网址获取:http://mlda.swu.edu.cn/codes.php?name=HPHash。

相似文献

1
Gene function prediction based on Gene Ontology Hierarchy Preserving Hashing.基于基因本体层次结构保持哈希的基因功能预测。
Genomics. 2019 May;111(3):334-342. doi: 10.1016/j.ygeno.2018.02.008. Epub 2018 Feb 23.
2
HashGO: hashing gene ontology for protein function prediction.HashGO:用于蛋白质功能预测的基因本体哈希法
Comput Biol Chem. 2017 Dec;71:264-273. doi: 10.1016/j.compbiolchem.2017.09.010. Epub 2017 Oct 4.
3
NMFGO: Gene Function Prediction via Nonnegative Matrix Factorization with Gene Ontology.NMFGO:基于基因本体论的非负矩阵分解进行基因功能预测。
IEEE/ACM Trans Comput Biol Bioinform. 2020 Jan-Feb;17(1):238-249. doi: 10.1109/TCBB.2018.2861379. Epub 2018 Jul 30.
4
NoGOA: predicting noisy GO annotations using evidences and sparse representation.NoGOA:利用证据和稀疏表示预测有噪声的基因本体注释
BMC Bioinformatics. 2017 Jul 21;18(1):350. doi: 10.1186/s12859-017-1764-z.
5
Interspecies gene function prediction using semantic similarity.基于语义相似性的跨物种基因功能预测
BMC Syst Biol. 2016 Dec 23;10(Suppl 4):121. doi: 10.1186/s12918-016-0361-5.
6
NoisyGOA: Noisy GO annotations prediction using taxonomic and semantic similarity.NoisyGOA:利用分类学和语义相似性预测有噪声的基因本体注释
Comput Biol Chem. 2016 Dec;65:203-211. doi: 10.1016/j.compbiolchem.2016.09.005. Epub 2016 Sep 13.
7
Predicting functions of maize proteins using graph convolutional network.利用图卷积网络预测玉米蛋白的功能。
BMC Bioinformatics. 2020 Dec 16;21(Suppl 16):420. doi: 10.1186/s12859-020-03745-6.
8
A relation based measure of semantic similarity for Gene Ontology annotations.一种基于关系的基因本体注释语义相似度度量方法。
BMC Bioinformatics. 2008 Nov 4;9:468. doi: 10.1186/1471-2105-9-468.
9
Anc2vec: embedding gene ontology terms by preserving ancestors relationships.Anc2vec:通过保留祖先关系来嵌入基因本体论术语。
Brief Bioinform. 2022 Mar 10;23(2). doi: 10.1093/bib/bbac003.
10
Isoform function prediction by Gene Ontology embedding.通过基因本体论嵌入进行同工型功能预测。
Bioinformatics. 2022 Sep 30;38(19):4581-4588. doi: 10.1093/bioinformatics/btac576.

引用本文的文献

1
Identification of compounds to promote diabetic wound healing based on transcriptome signature.基于转录组特征鉴定促进糖尿病伤口愈合的化合物
Front Pharmacol. 2025 Jun 2;16:1576056. doi: 10.3389/fphar.2025.1576056. eCollection 2025.
2
Exploring the Potential Molecular Mechanism of Sijunzi Decoction in the Treatment of Non-Segmental Vitiligo Based on Network Pharmacology and Molecular Docking.基于网络药理学和分子对接技术探索四君子汤治疗非节段型白癜风的潜在分子机制
Clin Cosmet Investig Dermatol. 2023 Apr 1;16:821-836. doi: 10.2147/CCID.S403732. eCollection 2023.
3
Multilingual translation for zero-shot biomedical classification using BioTranslator.
基于 BioTranslator 的零样本生物医学分类的多语言翻译。
Nat Commun. 2023 Feb 10;14(1):738. doi: 10.1038/s41467-023-36476-2.
4
PFP-GO: Integrating protein sequence, domain and protein-protein interaction information for protein function prediction using ranked GO terms.PFP-GO:利用排序后的基因本体(GO)术语整合蛋白质序列、结构域和蛋白质-蛋白质相互作用信息以进行蛋白质功能预测。
Front Genet. 2022 Sep 29;13:969915. doi: 10.3389/fgene.2022.969915. eCollection 2022.
5
Finding Gene Associations by Text Mining and Annotating it with Gene Ontology.通过文本挖掘发现基因关联,并使用基因本体论对其进行注释。
Methods Mol Biol. 2022;2496:71-90. doi: 10.1007/978-1-0716-2305-3_4.
6
Integrated proteomic and metabolomic analyses reveal significant changes in chloroplasts and mitochondria of pepper (Capsicum annuum L.) during Sclerotium rolfsii infection.整合蛋白质组学和代谢组学分析揭示了辣椒疫霉感染期间辣椒(Capsicum annuum L.)叶绿体和线粒体的显著变化。
J Microbiol. 2022 May;60(5):511-525. doi: 10.1007/s12275-022-1603-4. Epub 2022 Mar 31.
7
Transcriptome sequencing and lncRNA-miRNA-mRNA network construction in cardiac fibrosis and heart failure.心脏纤维化和心力衰竭中转录组测序和 lncRNA-miRNA-mRNA 网络构建。
Bioengineered. 2022 Mar;13(3):7118-7133. doi: 10.1080/21655979.2022.2045839.
8
A novel gene functional similarity calculation model by utilizing the specificity of terms and relationships in gene ontology.一种新的基因功能相似性计算模型,利用基因本体论中术语和关系的特异性。
BMC Bioinformatics. 2022 Jan 20;23(Suppl 1):47. doi: 10.1186/s12859-022-04557-6.
9
Assigning protein function from domain-function associations using DomFun.基于域-功能关联来分配蛋白质功能,使用 DomFun。
BMC Bioinformatics. 2022 Jan 15;23(1):43. doi: 10.1186/s12859-022-04565-6.
10
Proteomic Analysis of Human Serum for Patients at Different Pathological Stages of Hepatic Fibrosis.肝纤维化不同病理阶段患者血清蛋白质组学分析。
Biomed Res Int. 2021 Nov 28;2021:3580090. doi: 10.1155/2021/3580090. eCollection 2021.