Detection of Highly Divergent Tandem Repeats in the Rice Genome.

Affiliations

Institute of Bioengineering, Research Center of Biotechnology of the Russian Academy of Sciences, Bld.2, 33 Leninsky Ave., 119071 Moscow, Russia.

MEPhI (Moscow Engineering Physics Institute), National Research Nuclear University, 31 Kashirskoye Shosse, 115409 Moscow, Russia.

Publication information

Genes (Basel). 2021 Mar 25;12(4):473. doi: 10.3390/genes12040473.

Abstract

Bioinformatics approaches for identifying highly divergent tandem repeats (TRs) in eukaryotic genomes are currently lacking. Here, we developed a new mathematical method to search for TRs, which uses a novel algorithm for constructing multiple alignments based on the generation of random position weight matrices (RPWMs), and applied it to detect TRs with repeat units 2 to 50 nucleotides long in the rice genome. The RPWM method could find highly divergent TRs even in the presence of insertions or deletions. Comparison of the RPWM algorithm with other methods of TR identification showed that RPWM could detect TRs in which the average number of base substitutions per nucleotide (x) was between 1.5 and 3.2, whereas the T-REKS and TRF methods could not detect divergent TRs with x > 1.5. Applied to the rice genome, the RPWM method revealed that TRs occupied 5% of the genome and that most of them had repeat units 2 or 3 bases long. Using RPWM, we also found a correlation of TRs with dispersed repeats and transposons, suggesting that some transposons originated from TRs. Thus, the novel RPWM algorithm is an effective tool for finding highly divergent TRs in genomes.
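The abstract does not spell out the RPWM procedure, so the following is only a rough illustration of the general idea of scoring a sequence for periodicity with a position-specific matrix and judging the score against shuffled sequences. It is not the authors' algorithm: it handles no insertions or deletions, it builds the matrix from column counts rather than from random matrices, and every name in it (`periodic_counts`, `periodicity_score`, `z_score`, `mutate`) is a hypothetical helper introduced for this sketch.

```python
import math
import random
from collections import Counter

BASES = "ACGT"

def periodic_counts(seq, period):
    """Count bases per column when seq is cut into consecutive units of `period`."""
    cols = [Counter() for _ in range(period)]
    for i, base in enumerate(seq):
        cols[i % period][base] += 1
    return cols

def periodicity_score(seq, period):
    """Log-likelihood ratio of column-specific vs. overall base frequencies."""
    background = Counter(seq)
    n = len(seq)
    score = 0.0
    for col in periodic_counts(seq, period):
        col_total = sum(col.values())
        for base, count in col.items():
            score += count * math.log((count / col_total) / (background[base] / n))
    return score

def z_score(seq, period, shuffles=200, seed=0):
    """Compare the observed score with scores of shuffled copies of seq."""
    rng = random.Random(seed)
    observed = periodicity_score(seq, period)
    letters = list(seq)
    null = []
    for _ in range(shuffles):
        rng.shuffle(letters)
        null.append(periodicity_score("".join(letters), period))
    mean = sum(null) / shuffles
    sd = math.sqrt(sum((s - mean) ** 2 for s in null) / (shuffles - 1))
    return (observed - mean) / sd

def mutate(seq, subs_per_base, rng):
    """Apply round(subs_per_base * len(seq)) substitutions at random positions
    (an assumed, simplified model of the divergence level x)."""
    letters = list(seq)
    for _ in range(round(subs_per_base * len(letters))):
        i = rng.randrange(len(letters))
        letters[i] = rng.choice([b for b in BASES if b != letters[i]])
    return "".join(letters)

if __name__ == "__main__":
    rng = random.Random(1)
    perfect = "ACGTTGCA" * 50            # toy tandem repeat, unit length 8
    for x in (0.0, 0.5, 1.5):
        diverged = mutate(perfect, x, rng)
        print(x, round(z_score(diverged, 8), 1))
```

In this toy setting the Z-score falls as the number of substitutions per nucleotide grows, which is the kind of signal loss that makes highly divergent repeats hard to detect without a more sensitive alignment procedure.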

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a0b2/8064497/b9e622001ce2/genes-12-00473-g001.jpg
