• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用功能一致性评分进行缺失基因鉴定。

Missing gene identification using functional coherence scores.

机构信息

Department of Computer Science, Purdue University, West Lafayette, IN, 47907, USA.

Department of Biological Sciences, Purdue University, West Lafayette IN, 47907, USA.

出版信息

Sci Rep. 2016 Aug 24;6:31725. doi: 10.1038/srep31725.

DOI:10.1038/srep31725
PMID:27552989
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4995438/
Abstract

Reconstructing metabolic and signaling pathways is an effective way of interpreting a genome sequence. A challenge in a pathway reconstruction is that often genes in a pathway cannot be easily found, reflecting current imperfect information of the target organism. In this work, we developed a new method for finding missing genes, which integrates multiple features, including gene expression, phylogenetic profile, and function association scores. Particularly, for considering function association between candidate genes and neighboring proteins to the target missing gene in the network, we used Co-occurrence Association Score (CAS) and PubMed Association Score (PAS), which are designed for capturing functional coherence of proteins. We showed that adding CAS and PAS substantially improve the accuracy of identifying missing genes in the yeast enzyme-enzyme network compared to the cases when only the conventional features, gene expression, phylogenetic profile, were used. Finally, it was also demonstrated that the accuracy improves by considering indirect neighbors to the target enzyme position in the network using a proper network-topology-based weighting scheme.

摘要

重建代谢和信号通路是解释基因组序列的有效方法。在通路重建中面临的一个挑战是,通路中的基因往往不容易被找到,这反映了目标生物目前不完善的信息。在这项工作中,我们开发了一种新的方法来寻找缺失的基因,该方法整合了多个特征,包括基因表达、系统发育谱和功能关联分数。特别是,为了考虑候选基因与目标缺失基因在网络中的邻近蛋白质之间的功能关联,我们使用了共现关联分数(Co-occurrence Association Score,CAS)和 PubMed 关联分数(PubMed Association Score,PAS),这两种分数是为了捕捉蛋白质的功能一致性而设计的。结果表明,与仅使用传统特征(基因表达、系统发育谱)相比,在酵母酶-酶网络中添加 CAS 和 PAS 可以显著提高识别缺失基因的准确性。最后,还通过使用适当的基于网络拓扑的加权方案来考虑网络中目标酶位置的间接邻居,证明了准确性的提高。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b43/4995438/43a0b5a3dbfc/srep31725-f8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b43/4995438/7e149c8bcbe5/srep31725-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b43/4995438/f8bfc167e856/srep31725-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b43/4995438/b4ef7d4a3deb/srep31725-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b43/4995438/f93ed30db9f0/srep31725-f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b43/4995438/32fe94b11fe6/srep31725-f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b43/4995438/e95ca51ba166/srep31725-f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b43/4995438/a55e96f90bee/srep31725-f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b43/4995438/43a0b5a3dbfc/srep31725-f8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b43/4995438/7e149c8bcbe5/srep31725-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b43/4995438/f8bfc167e856/srep31725-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b43/4995438/b4ef7d4a3deb/srep31725-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b43/4995438/f93ed30db9f0/srep31725-f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b43/4995438/32fe94b11fe6/srep31725-f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b43/4995438/e95ca51ba166/srep31725-f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b43/4995438/a55e96f90bee/srep31725-f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b43/4995438/43a0b5a3dbfc/srep31725-f8.jpg

相似文献

1
Missing gene identification using functional coherence scores.利用功能一致性评分进行缺失基因鉴定。
Sci Rep. 2016 Aug 24;6:31725. doi: 10.1038/srep31725.
2
Supervised enzyme network inference from the integration of genomic data and chemical information.基于基因组数据与化学信息整合的监督式酶网络推断
Bioinformatics. 2005 Jun;21 Suppl 1:i468-77. doi: 10.1093/bioinformatics/bti1012.
3
Quantitative inference of dynamic regulatory pathways via microarray data.通过微阵列数据对动态调控途径进行定量推断。
BMC Bioinformatics. 2005 Mar 7;6:44. doi: 10.1186/1471-2105-6-44.
4
Sample scale-free gene regulatory network using gene ontology.使用基因本体论的无标度基因调控网络示例。
Conf Proc IEEE Eng Med Biol Soc. 2006;2006:5523-6. doi: 10.1109/IEMBS.2006.259261.
5
Inferring gene regulatory networks using differential evolution with local search heuristics.使用带有局部搜索启发式算法的差分进化来推断基因调控网络。
IEEE/ACM Trans Comput Biol Bioinform. 2007 Oct-Dec;4(4):634-47. doi: 10.1109/TCBB.2007.1058.
6
Correlation and prediction of gene expression level from amino acid and dipeptide composition of its protein.基于蛋白质的氨基酸和二肽组成对基因表达水平进行相关性分析与预测。
BMC Bioinformatics. 2005 Mar 17;6:59. doi: 10.1186/1471-2105-6-59.
7
Biological network mapping and source signal deduction.生物网络映射与源信号推导。
Bioinformatics. 2007 Jul 15;23(14):1783-91. doi: 10.1093/bioinformatics/btm246. Epub 2007 May 11.
8
Unraveling networks of co-regulated genes on the sole basis of genome sequences.仅基于基因组序列揭示共同调控基因的网络。
Nucleic Acids Res. 2011 Aug;39(15):6340-58. doi: 10.1093/nar/gkr264. Epub 2011 May 13.
9
Genetic Interaction Motif Finding by expectation maximization--a novel statistical model for inferring gene modules from synthetic lethality.通过期望最大化进行遗传相互作用基序发现——一种从合成致死性推断基因模块的新型统计模型。
BMC Bioinformatics. 2005 Dec 6;6:288. doi: 10.1186/1471-2105-6-288.
10
Using large-scale perturbations in gene network reconstruction.在基因网络重建中使用大规模扰动。
BMC Bioinformatics. 2005 Jan 19;6:11. doi: 10.1186/1471-2105-6-11.

引用本文的文献

1
Urea-based mutualistic transfer of nitrogen in biological soil crusts.生物土壤结皮中基于尿素的氮互惠转移
ISME J. 2025 Jan 2;19(1). doi: 10.1093/ismejo/wrae246.
2
Computational identification of protein-protein interactions in model plant proteomes.计算鉴定模式植物蛋白质组中的蛋白质-蛋白质相互作用。
Sci Rep. 2019 Jun 19;9(1):8740. doi: 10.1038/s41598-019-45072-8.

本文引用的文献

1
Computational protein function predictions.计算蛋白质功能预测
Methods. 2016 Jan 15;93:1-2. doi: 10.1016/j.ymeth.2016.01.001.
2
KEGG as a reference resource for gene and protein annotation.KEGG作为基因和蛋白质注释的参考资源。
Nucleic Acids Res. 2016 Jan 4;44(D1):D457-62. doi: 10.1093/nar/gkv1070. Epub 2015 Oct 17.
3
Pathway Tools version 19.0 update: software for pathway/genome informatics and systems biology.通路工具19.0版本更新:用于通路/基因组信息学和系统生物学的软件。
Brief Bioinform. 2016 Sep;17(5):877-90. doi: 10.1093/bib/bbv079. Epub 2015 Oct 10.
4
Integrated protein function prediction by mining function associations, sequences, and protein-protein and gene-gene interaction networks.通过挖掘功能关联、序列以及蛋白质-蛋白质和基因-基因相互作用网络进行综合蛋白质功能预测。
Methods. 2016 Jan 15;93:84-91. doi: 10.1016/j.ymeth.2015.09.011. Epub 2015 Sep 11.
5
The GOA database: gene Ontology annotation updates for 2015.基因本体注释数据库(GOA):2015年基因本体注释更新
Nucleic Acids Res. 2015 Jan;43(Database issue):D1057-63. doi: 10.1093/nar/gku1113. Epub 2014 Nov 6.
6
Large-scale binding ligand prediction by improved patch-based method Patch-Surfer2.0.通过改进的基于补丁的方法Patch-Surfer2.0进行大规模结合配体预测。
Bioinformatics. 2015 Mar 1;31(5):707-13. doi: 10.1093/bioinformatics/btu724. Epub 2014 Oct 29.
7
PFP/ESG: automated protein function prediction servers enhanced with Gene Ontology visualization tool.PFP/ESG:通过基因本体可视化工具增强的自动化蛋白质功能预测服务器。
Bioinformatics. 2015 Jan 15;31(2):271-2. doi: 10.1093/bioinformatics/btu646. Epub 2014 Oct 1.
8
Profiling the orphan enzymes.鉴定孤儿酶。
Biol Direct. 2014 Jun 6;9:10. doi: 10.1186/1745-6150-9-10.
9
Optimizing genome-scale network reconstructions.优化基因组规模的网络重建。
Nat Biotechnol. 2014 May;32(5):447-52. doi: 10.1038/nbt.2870.
10
Dead end metabolites--defining the known unknowns of the E. coli metabolic network.终产物代谢物——定义大肠杆菌代谢网络的已知未知物。
PLoS One. 2013 Sep 23;8(9):e75210. doi: 10.1371/journal.pone.0075210. eCollection 2013.