定量同线性评分提高了同源性推断和基因家族的划分。

Quantitative synteny scoring improves homology inference and partitioning of gene families.

出版信息

BMC Bioinformatics. 2013;14 Suppl 15(Suppl 15):S12. doi: 10.1186/1471-2105-14-S15-S12. Epub 2013 Oct 15.

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3852004/

Abstract

BACKGROUND

Clustering sequences into families has long been an important step in characterization of genes and proteins. There are many algorithms developed for this purpose, most of which are based on either direct similarity between gene pairs or some sort of network structure, where weights on edges of constructed graphs are based on similarity. However, conserved synteny is an important signal that can help distinguish homology and it has not been utilized to its fullest potential.

RESULTS

Here, we present GenFamClust, a pipeline that combines the network properties of sequence similarity and synteny to assess homology relationship and merge known homologs into groups of gene families. GenFamClust identifies homologs in a more informed and accurate manner as compared to similarity based approaches. We tested our method against the Neighborhood Correlation method on two diverse datasets consisting of fully sequenced genomes of eukaryotes and synthetic data.

CONCLUSIONS

The results obtained from both datasets confirm that synteny helps determine homology and GenFamClust improves on Neighborhood Correlation method. The accuracy as well as the definition of synteny scores is the most valuable contribution of GenFamClust.

摘要

背景

将序列聚类成家族一直是基因和蛋白质特征描述的重要步骤。为此目的开发了许多算法，其中大多数基于基因对之间的直接相似性或某种网络结构，其中构建图的边的权重基于相似性。然而，保守的同线性是可以帮助区分同源性的重要信号，但尚未充分利用。

结果

在这里，我们提出了 GenFamClust，这是一个结合了序列相似性和同线性的网络特性的管道，用于评估同源关系并将已知的同源物合并成基因家族组。与基于相似性的方法相比，GenFamClust 以更明智和更准确的方式识别同源物。我们在包含真核生物全序列基因组和合成数据的两个不同数据集上对我们的方法进行了测试。

结论

来自两个数据集的结果均证实，同线性有助于确定同源性，而 GenFamClust 则优于邻居相关性方法。准确性以及同线性得分的定义是 GenFamClust 最有价值的贡献。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/24f5/3852004/7cb07aa39d0c/1471-2105-14-S15-S12-1.jpg

相似文献

Quantitative synteny scoring improves homology inference and partitioning of gene families.

BMC Bioinformatics. 2013;14 Suppl 15(Suppl 15):S12. doi: 10.1186/1471-2105-14-S15-S12. Epub 2013 Oct 15.

GenFamClust: an accurate, synteny-aware and reliable homology inference algorithm.

BMC Evol Biol. 2016 Jun 4;16(1):120. doi: 10.1186/s12862-016-0684-2.

Using shared genomic synteny and shared protein functions to enhance the identification of orthologous gene pairs.

Bioinformatics. 2005 Mar;21(6):703-10. doi: 10.1093/bioinformatics/bti045. Epub 2004 Sep 30.

fagin: synteny-based phylostratigraphy and finer classification of young genes.

BMC Bioinformatics. 2019 Aug 27;20(1):440. doi: 10.1186/s12859-019-3023-y.

SynBlast: assisting the analysis of conserved synteny information.

BMC Bioinformatics. 2008 Aug 24;9:351. doi: 10.1186/1471-2105-9-351.

Identification of conserved gene clusters in multiple genomes based on synteny and homology.

BMC Bioinformatics. 2011 Oct 5;12 Suppl 9(Suppl 9):S18. doi: 10.1186/1471-2105-12-S9-S18.

Techniques for multi-genome synteny analysis to overcome assembly limitations.

Genome Inform. 2006;17(2):152-61.

Assessing the evolutionary rate of positional orthologous genes in prokaryotes using synteny data.

BMC Evol Biol. 2007 Nov 29;7:237. doi: 10.1186/1471-2148-7-237.

PLoS Comput Biol. 2008 May 16;4(4):e1000063. doi: 10.1371/journal.pcbi.1000063.

Synteny conservation between the Prunus genome and both the present and ancestral Arabidopsis genomes.

BMC Genomics. 2006 Apr 14;7:81. doi: 10.1186/1471-2164-7-81.

引用本文的文献

B Cell Receptor Activation Predominantly Regulates AKT-mTORC1/2 Substrates Functionally Related to RNA Processing.

PLoS One. 2016 Aug 3;11(8):e0160255. doi: 10.1371/journal.pone.0160255. eCollection 2016.

GenFamClust: an accurate, synteny-aware and reliable homology inference algorithm.

BMC Evol Biol. 2016 Jun 4;16(1):120. doi: 10.1186/s12862-016-0684-2.

本文引用的文献

A tight link between orthologs and bidirectional best hits in bacterial and archaeal genomes.

Genome Biol Evol. 2012;4(12):1286-94. doi: 10.1093/gbe/evs100.

PHYRN: a robust method for phylogenetic analysis of highly divergent sequences.

PLoS One. 2012;7(4):e34261. doi: 10.1371/journal.pone.0034261. Epub 2012 Apr 13.

High-quality sequence clustering guided by network topology and multiple alignment likelihood.

Bioinformatics. 2012 Apr 15;28(8):1078-85. doi: 10.1093/bioinformatics/bts098. Epub 2012 Feb 25.

MCScanX: a toolkit for detection and evolutionary analysis of gene synteny and collinearity.

Nucleic Acids Res. 2012 Apr;40(7):e49. doi: 10.1093/nar/gkr1293. Epub 2012 Jan 4.

ALF--a simulation framework for genome evolution.

Mol Biol Evol. 2012 Apr;29(4):1115-23. doi: 10.1093/molbev/msr268. Epub 2011 Dec 8.

Identification of conserved gene clusters in multiple genomes based on synteny and homology.

BMC Bioinformatics. 2011 Oct 5;12 Suppl 9(Suppl 9):S18. doi: 10.1186/1471-2105-12-S9-S18.

Ensembl 2012.

Nucleic Acids Res. 2012 Jan;40(Database issue):D84-90. doi: 10.1093/nar/gkr991. Epub 2011 Nov 15.

Computational methods for Gene Orthology inference.

Brief Bioinform. 2011 Sep;12(5):379-91. doi: 10.1093/bib/bbr030. Epub 2011 Jun 19.

Ultra-fast sequence clustering from similarity networks with SiLiX.

BMC Bioinformatics. 2011 Apr 22;12:116. doi: 10.1186/1471-2105-12-116.

CYNTENATOR: progressive gene order alignment of 17 vertebrate genomes.

PLoS One. 2010 Jan 28;5(1):e8861. doi: 10.1371/journal.pone.0008861.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

定量同线性评分提高了同源性推断和基因家族的划分。

Quantitative synteny scoring improves homology inference and partitioning of gene families.

出版信息

BMC Bioinformatics. 2013;14 Suppl 15(Suppl 15):S12. doi: 10.1186/1471-2105-14-S15-S12. Epub 2013 Oct 15.

DOI:10.1186/1471-2105-14-S15-S12

PMID:24564516

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3852004/

Abstract

BACKGROUND

RESULTS

CONCLUSIONS

摘要

定量同线性评分提高了同源性推断和基因家族的划分。

Quantitative synteny scoring improves homology inference and partitioning of gene families.

出版信息

BACKGROUND

RESULTS

CONCLUSIONS

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

定量同线性评分提高了同源性推断和基因家族的划分。

Quantitative synteny scoring improves homology inference and partitioning of gene families.

出版信息

BACKGROUND

RESULTS

CONCLUSIONS

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献