• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于鉴定高度保守元件及对囊泡虫超门进行进化分析的方法。

A method for identification of highly conserved elements and evolutionary analysis of superphylum Alveolata.

作者信息

Rubanov Lev I, Seliverstov Alexandr V, Zverkov Oleg A, Lyubetsky Vassily A

机构信息

Institute for Information Transmission Problems (Kharkevich Institute), Russian Academy of Sciences, Bolshoi Karetnyi per. 19, Building 1, Moscow, 127051, Russia.

出版信息

BMC Bioinformatics. 2016 Sep 20;17:385. doi: 10.1186/s12859-016-1257-5.

DOI:10.1186/s12859-016-1257-5
PMID:27645252
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5028923/
Abstract

BACKGROUND

Perfectly or highly conserved DNA elements were found in vertebrates, invertebrates, and plants by various methods. However, little is known about such elements in protists. The evolutionary distance between apicomplexans can be very high, in particular, due to the positive selection pressure on them. This complicates the identification of highly conserved elements in alveolates, which is overcome by the proposed algorithm.

RESULTS

A novel algorithm is developed to identify highly conserved DNA elements. It is based on the identification of dense subgraphs in a specially built multipartite graph (whose parts correspond to genomes). Specifically, the algorithm does not rely on genome alignments, nor pre-identified perfectly conserved elements; instead, it performs a fast search for pairs of words (in different genomes) of maximum length with the difference below the specified edit distance. Such pair defines an edge whose weight equals the maximum (or total) length of words assigned to its ends. The graph composed of these edges is then compacted by merging some of its edges and vertices. The dense subgraphs are identified by a cellular automaton-like algorithm; each subgraph defines a cluster composed of similar inextensible words from different genomes. Almost all clusters are considered as predicted highly conserved elements. The algorithm is applied to the nuclear genomes of the superphylum Alveolata, and the corresponding phylogenetic tree is built and discussed.

CONCLUSION

We proposed an algorithm for the identification of highly conserved elements. The multitude of identified elements was used to infer the phylogeny of Alveolata.

摘要

背景

通过各种方法在脊椎动物、无脊椎动物和植物中发现了完全或高度保守的DNA元件。然而,关于原生生物中的此类元件却知之甚少。顶复门生物之间的进化距离可能非常大,特别是由于它们受到正选择压力。这使得在肺泡虫中鉴定高度保守元件变得复杂,而本文提出的算法克服了这一难题。

结果

开发了一种用于鉴定高度保守DNA元件的新算法。它基于在专门构建的多部分图(其部分对应于基因组)中识别密集子图。具体而言,该算法不依赖于基因组比对,也不依赖于预先确定的完全保守元件;相反,它快速搜索不同基因组中长度最大且差异低于指定编辑距离的单词对。这样的单词对定义了一条边,其权重等于分配给其两端的单词的最大(或总)长度。然后通过合并其一些边和顶点来压缩由这些边组成的图。通过类似细胞自动机的算法识别密集子图;每个子图定义一个由来自不同基因组的相似不可扩展单词组成的簇。几乎所有簇都被视为预测的高度保守元件。该算法应用于肺泡虫超门的核基因组,并构建和讨论了相应的系统发育树。

结论

我们提出了一种鉴定高度保守元件的算法。所鉴定的大量元件被用于推断肺泡虫的系统发育。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c78f/5028923/58e2e370c902/12859_2016_1257_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c78f/5028923/399fd8b5d64e/12859_2016_1257_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c78f/5028923/bf0f9aee9cd2/12859_2016_1257_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c78f/5028923/2ee86e16dc29/12859_2016_1257_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c78f/5028923/52a542147390/12859_2016_1257_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c78f/5028923/58e2e370c902/12859_2016_1257_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c78f/5028923/399fd8b5d64e/12859_2016_1257_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c78f/5028923/bf0f9aee9cd2/12859_2016_1257_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c78f/5028923/2ee86e16dc29/12859_2016_1257_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c78f/5028923/52a542147390/12859_2016_1257_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c78f/5028923/58e2e370c902/12859_2016_1257_Fig5_HTML.jpg

相似文献

1
A method for identification of highly conserved elements and evolutionary analysis of superphylum Alveolata.一种用于鉴定高度保守元件及对囊泡虫超门进行进化分析的方法。
BMC Bioinformatics. 2016 Sep 20;17:385. doi: 10.1186/s12859-016-1257-5.
2
Diversity of extracellular proteins during the transition from the 'proto-apicomplexan' alveolates to the apicomplexan obligate parasites.从“原始顶复门”肺泡虫向顶复门专性寄生虫转变过程中细胞外蛋白质的多样性。
Parasitology. 2016 Jan;143(1):1-17. doi: 10.1017/S0031182015001213. Epub 2015 Nov 20.
3
Colponemids represent multiple ancient alveolate lineages.科隆体菌目代表了多个古老的有孔虫类谱系。
Curr Biol. 2013 Dec 16;23(24):2546-52. doi: 10.1016/j.cub.2013.10.062. Epub 2013 Dec 5.
4
Statistical power of phylo-HMM for evolutionarily conserved element detection.用于检测进化保守元件的系统发育隐马尔可夫模型的统计功效。
BMC Bioinformatics. 2007 Oct 5;8:374. doi: 10.1186/1471-2105-8-374.
5
Primary Structure of 28S rRNA Gene Confirms Monophyly of Free-Living Heterotrophic and Phototrophic Apicomplexans (Alveolata).28S rRNA基因的一级结构证实了自由生活的异养和光合顶复门生物(囊泡虫类)的单系性。
Biochemistry (Mosc). 2015 Nov;80(11):1492-9. doi: 10.1134/S0006297915110115.
6
The chloroplast genome of a Symbiodinium sp. clade C3 isolate.一种 C3 类共生藻的叶绿体基因组。
Protist. 2014 Jan;165(1):1-13. doi: 10.1016/j.protis.2013.09.006. Epub 2013 Oct 17.
7
Multiple organism algorithm for finding ultraconserved elements.用于寻找超保守元件的多生物算法。
BMC Bioinformatics. 2008 Jan 11;9:15. doi: 10.1186/1471-2105-9-15.
8
The evolution of ultraconserved elements with different phylogenetic origins.具有不同进化起源的超保守元件的进化。
BMC Evol Biol. 2012 Dec 5;12:236. doi: 10.1186/1471-2148-12-236.
9
A novel feature-based method for whole genome phylogenetic analysis without alignment: application to HEV genotyping and subtyping.一种用于全基因组系统发育分析的无需比对的基于特征的新方法:在戊型肝炎病毒基因分型和亚型分析中的应用。
Biochem Biophys Res Commun. 2008 Apr 4;368(2):223-30. doi: 10.1016/j.bbrc.2008.01.070. Epub 2008 Jan 28.
10
Reducing long-branch effects in multi-protein data uncovers a close relationship between Alveolata and Rhizaria.减少多蛋白数据中的长枝效应揭示了囊泡虫类和有孔虫界之间的密切关系。
Mol Phylogenet Evol. 2016 Aug;101:1-7. doi: 10.1016/j.ympev.2016.04.033. Epub 2016 Apr 27.

引用本文的文献

1
Bioinformatics Screening of Genes Specific for Well-Regenerating Vertebrates Reveals c-answer, a Regulator of Brain Development and Regeneration.生物信息学筛选特定于再生能力强的脊椎动物的基因揭示了 c-answer,这是一种大脑发育和再生的调控因子。
Cell Rep. 2019 Oct 22;29(4):1027-1040.e6. doi: 10.1016/j.celrep.2019.09.038.
2
Dicyemida and Orthonectida: Two Stories of Body Plan Simplification.双胚虫纲和直泳虫纲:身体结构简化的两个故事。
Front Genet. 2019 May 24;10:443. doi: 10.3389/fgene.2019.00443. eCollection 2019.
3
Non-Mendelian assortment of homologous autosomes of different sizes in males is the ancestral state in the Caenorhabditis lineage.

本文引用的文献

1
Regulation of Expression and Evolution of Genes in Plastids of Rhodophytic Branch.红藻分支质体中基因的表达调控与进化
Life (Basel). 2016 Jan 29;6(1):7. doi: 10.3390/life6010007.
2
Algorithms for reconstruction of chromosomal structures.染色体结构重建算法。
BMC Bioinformatics. 2016 Jan 19;17:40. doi: 10.1186/s12859-016-0878-z.
3
Dynamic distributions of long double-stranded RNA in Tetrahymena during nuclear development and genome rearrangements.长双链RNA在四膜虫核发育和基因组重排过程中的动态分布
在雄性中,不同大小的同源常染色体的非 Mendelian 分配是线虫属的祖先状态。
Sci Rep. 2017 Oct 9;7(1):12819. doi: 10.1038/s41598-017-13215-4.
4
Highly Conserved Elements and Chromosome Structure Evolution in Mitochondrial Genomes in Ciliates.纤毛虫线粒体基因组中的高度保守元件与染色体结构进化
Life (Basel). 2017 Feb 27;7(1):9. doi: 10.3390/life7010009.
5
Molecular Phylogenetics 2016.《分子系统发育学2016》
Biomed Res Int. 2016;2016:9029306. doi: 10.1155/2016/9029306. Epub 2016 Dec 29.
J Cell Sci. 2016 Mar 1;129(5):1046-58. doi: 10.1242/jcs.178236. Epub 2016 Jan 14.
4
Characterization and annotation of Babesia orientalis apicoplast genome.东方巴贝斯虫顶质体基因组的特征分析与注释
Parasit Vectors. 2015 Oct 16;8:543. doi: 10.1186/s13071-015-1158-x.
5
A Database of Plastid Protein Families from Red Algae and Apicomplexa and Expression Regulation of the moeB Gene.红藻和顶复门生物质体蛋白家族数据库及moeB基因的表达调控
Biomed Res Int. 2015;2015:510598. doi: 10.1155/2015/510598. Epub 2015 May 31.
6
Comparative Analysis of Apicoplast-Targeted Protein Extension Lengths in Apicomplexan Parasites.顶复门寄生虫中靶向质体蛋白延伸长度的比较分析
Biomed Res Int. 2015;2015:452958. doi: 10.1155/2015/452958. Epub 2015 May 31.
7
The Organellar Genomes of Chromera and Vitrella, the Phototrophic Relatives of Apicomplexan Parasites.Chromera 和 Vitrella 的细胞器基因组,是顶复门寄生虫的光合亲属。
Annu Rev Microbiol. 2015;69:129-44. doi: 10.1146/annurev-micro-091014-104449. Epub 2015 Jun 18.
8
Plastid genome-based phylogeny pinpointed the origin of the green-colored plastid in the dinoflagellate Lepidodinium chlorophorum.基于质体基因组的系统发育分析确定了绿藻甲藻中绿色质体的起源。
Genome Biol Evol. 2015 Apr 2;7(4):1133-40. doi: 10.1093/gbe/evv060.
9
Rfam 12.0: updates to the RNA families database.Rfam 12.0:RNA家族数据库的更新
Nucleic Acids Res. 2015 Jan;43(Database issue):D130-7. doi: 10.1093/nar/gku1063. Epub 2014 Nov 11.
10
Sequence and annotation of the apicoplast genome of the human pathogen Babesia microti.人类病原体微小巴贝斯虫顶质体基因组的序列与注释
PLoS One. 2014 Oct 3;9(10):e107939. doi: 10.1371/journal.pone.0107939. eCollection 2014.