• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

全局重复图谱算法中符号 DNA 序列到频域的直接映射。

Direct mapping of symbolic DNA sequence into frequency domain in global repeat map algorithm.

机构信息

Faculty of Science, University of Zagreb, Bijenička 32 and Croatian Academy of Sciences and Arts, Zrinski trg 11, 10000 Zagreb, Croatia.

出版信息

Nucleic Acids Res. 2013 Jan 7;41(1):e17. doi: 10.1093/nar/gks721. Epub 2012 Sep 12.

DOI:10.1093/nar/gks721
PMID:22977183
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3592446/
Abstract

The main feature of global repeat map (GRM) algorithm (www.hazu.hr/grm/software/win/grm2012.exe) is its ability to identify a broad variety of repeats of unbounded length that can be arbitrarily distant in sequences as large as human chromosomes. The efficacy is due to the use of complete set of a K-string ensemble which enables a new method of direct mapping of symbolic DNA sequence into frequency domain, with straightforward identification of repeats as peaks in GRM diagram. In this way, we obtain very fast, efficient and highly automatized repeat finding tool. The method is robust to substitutions and insertions/deletions, as well as to various complexities of the sequence pattern. We present several case studies of GRM use, in order to illustrate its capabilities: identification of α-satellite tandem repeats and higher order repeats (HORs), identification of Alu dispersed repeats and of Alu tandems, identification of Period 3 pattern in exons, implementation of 'magnifying glass' effect, identification of complex HOR pattern, identification of inter-tandem transitional dispersed repeat sequences and identification of long segmental duplications. GRM algorithm is convenient for use, in particular, in cases of large repeat units, of highly mutated and/or complex repeats, and of global repeat maps for large genomic sequences (chromosomes and genomes).

摘要

全局重复图谱 (GRM) 算法(www.hazu.hr/grm/software/win/grm2012.exe)的主要特点是能够识别各种长度不限、在人类染色体大小的序列中可以任意远的重复。其功效归因于使用了完整的 K-字符串集合,这使得将符号 DNA 序列直接映射到频域的新方法成为可能,并且可以直接在 GRM 图谱中识别重复。通过这种方式,我们获得了非常快速、高效和高度自动化的重复发现工具。该方法对替换、插入/缺失以及序列模式的各种复杂性具有鲁棒性。我们展示了几个 GRM 使用的案例研究,以说明其功能:α-卫星串联重复和高阶重复(HOR)的识别、Alu 分散重复和 Alu 串联的识别、外显子中周期 3 模式的识别、“放大镜”效应的实现、复杂 HOR 模式的识别、串联过渡分散重复序列的识别以及长片段重复的识别。GRM 算法便于使用,特别是在重复单元较大、高度突变和/或复杂重复以及大基因组序列(染色体和基因组)的全局重复图谱的情况下。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4c/3592446/ab77a6f6eae8/gks721f14.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4c/3592446/448323a3d78a/gks721f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4c/3592446/339c63c5ceed/gks721f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4c/3592446/a8e13f5215c9/gks721f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4c/3592446/eb0df03b16da/gks721f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4c/3592446/49b0b8f531fd/gks721f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4c/3592446/e06845d3113a/gks721f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4c/3592446/e1ab96bec5d2/gks721f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4c/3592446/d1bbd36ce5eb/gks721f8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4c/3592446/d49dc82d174d/gks721f9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4c/3592446/013f9db1c976/gks721f10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4c/3592446/1869e001c863/gks721f11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4c/3592446/57e3ca822167/gks721f12.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4c/3592446/eac46a388806/gks721f13.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4c/3592446/ab77a6f6eae8/gks721f14.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4c/3592446/448323a3d78a/gks721f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4c/3592446/339c63c5ceed/gks721f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4c/3592446/a8e13f5215c9/gks721f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4c/3592446/eb0df03b16da/gks721f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4c/3592446/49b0b8f531fd/gks721f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4c/3592446/e06845d3113a/gks721f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4c/3592446/e1ab96bec5d2/gks721f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4c/3592446/d1bbd36ce5eb/gks721f8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4c/3592446/d49dc82d174d/gks721f9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4c/3592446/013f9db1c976/gks721f10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4c/3592446/1869e001c863/gks721f11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4c/3592446/57e3ca822167/gks721f12.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4c/3592446/eac46a388806/gks721f13.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4c/3592446/ab77a6f6eae8/gks721f14.jpg

相似文献

1
Direct mapping of symbolic DNA sequence into frequency domain in global repeat map algorithm.全局重复图谱算法中符号 DNA 序列到频域的直接映射。
Nucleic Acids Res. 2013 Jan 7;41(1):e17. doi: 10.1093/nar/gks721. Epub 2012 Sep 12.
2
Key-string algorithm--novel approach to computational analysis of repetitive sequences in human centromeric DNA.密钥串算法——人类着丝粒DNA中重复序列计算分析的新方法。
Croat Med J. 2003 Aug;44(4):386-406.
3
Large tandem, higher order repeats and regularly dispersed repeat units contribute substantially to divergence between human and chimpanzee Y chromosomes.大片段串联重复、高次重复以及规则散布的重复单位导致了人类和黑猩猩 Y 染色体之间的显著差异。
J Mol Evol. 2011 Jan;72(1):34-55. doi: 10.1007/s00239-010-9401-8. Epub 2010 Nov 20.
4
ColorHOR--novel graphical algorithm for fast scan of alpha satellite higher-order repeats and HOR annotation for GenBank sequence of human genome.ColorHOR——用于快速扫描α卫星高阶重复序列的新型图形算法以及人类基因组GenBank序列的HOR注释
Bioinformatics. 2005 Apr 1;21(7):846-52. doi: 10.1093/bioinformatics/bti072. Epub 2004 Oct 27.
5
Global Repeat Map (GRM): Advantageous Method for Discovery of Largest Higher-Order Repeats (HORs) in Neuroblastoma Breakpoint Family (NBPF) Genes, in Hornerin Exon and in Chromosome 21 Centromere.全球重复图谱(GRM):在神经母细胞瘤断裂点家族(NBPF)基因、Hornerin 外显子和 21 号染色体着丝粒中发现最大高阶重复(HOR)的有利方法。
Prog Mol Subcell Biol. 2021;60:203-234. doi: 10.1007/978-3-030-74889-0_8.
6
Discovery of 33mer in chromosome 21 - the largest alpha satellite higher order repeat unit among all human somatic chromosomes.发现 21 号染色体中的 33mer-所有人类体细胞染色体中最大的α卫星高级重复单元。
Sci Rep. 2019 Sep 2;9(1):12629. doi: 10.1038/s41598-019-49022-2.
7
Key-string segmentation algorithm and higher-order repeat 16mer (54 copies) in human alpha satellite DNA in chromosome 7.7号染色体人类α卫星DNA中的密钥串分割算法和高阶重复16聚体(54个拷贝)
J Theor Biol. 2003 Mar 7;221(1):29-37. doi: 10.1006/jtbi.2003.3165.
8
Hierarchical structure of cascade of primary and secondary periodicities in Fourier power spectrum of alphoid higher order repeats.α卫星高阶重复序列傅里叶功率谱中一级和二级周期性级联的层次结构。
BMC Bioinformatics. 2008 Nov 3;9:466. doi: 10.1186/1471-2105-9-466.
9
Precise identification of cascading alpha satellite higher order repeats in T2T-CHM13 assembly of human chromosome 3.在人类3号染色体的T2T-CHM13组装中对级联α卫星高阶重复序列进行精确鉴定。
Croat Med J. 2024 Jun 13;65(3):209-219. doi: 10.3325/cmj.2024.65.209.
10
Consensus higher order repeats and frequency of string distributions in human genome.人类基因组中一致性的高级重复序列和串分布频率。
Curr Genomics. 2007 Apr;8(2):93-111. doi: 10.2174/138920207780368169.

引用本文的文献

1
Precise Identification of Higher-Order Repeats (HORs) in T2T-CHM13 Assembly of Human Chromosome 21-Novel 52mer HOR and Failures of Hg38 Assembly.人类21号染色体T2T-CHM13组装中高阶重复序列(HORs)的精确鉴定——新型52聚体HOR及Hg38组装的失败
Genes (Basel). 2025 Jul 27;16(8):885. doi: 10.3390/genes16080885.
2
Neuroblastoma Breakpoint Family 3mer Higher Order Repeats/Olduvai Triplet Pattern in the Complete Genome of Human and Nonhuman Primates and Relation to Cognitive Capacity.神经母细胞瘤断点家族3聚体高阶重复序列/人类和非人类灵长类动物全基因组中的奥杜威三联体模式及其与认知能力的关系
Genes (Basel). 2024 Dec 13;15(12):1598. doi: 10.3390/genes15121598.
3

本文引用的文献

1
Variation in array size, monomer composition and expression of the macrosatellite DXZ4.重复 DNA 序列大小、单体组成和 macrosatellite DXZ4 表达的变化。
PLoS One. 2011 Apr 22;6(4):e18969. doi: 10.1371/journal.pone.0018969.
2
Characterization of DXZ4 conservation in primates implies important functional roles for CTCF binding, array expression and tandem repeat organization on the X chromosome.描述灵长类动物 DXZ4 保守性的特征表明,CTCF 结合、基因簇表达和串联重复组织在 X 染色体上具有重要的功能作用。
Genome Biol. 2011;12(4):R37. doi: 10.1186/gb-2011-12-4-r37. Epub 2011 Apr 13.
3
Intragene higher order repeats in neuroblastoma breakpoint family genes distinguish humans from chimpanzees.
Efficient genome monomer higher-order structure annotation and identification using the GRMhor algorithm.
使用GRMhor算法进行高效的基因组单体高阶结构注释与识别。
Bioinform Adv. 2024 Nov 28;4(1):vbae191. doi: 10.1093/bioadv/vbae191. eCollection 2024.
4
Novel Cascade Alpha Satellite HORs in Orangutan Chromosome 13 Assembly: Discovery of the 59mer HOR-The largest Unit in Primates-And the Missing Triplet 45/27/18 HOR in Human T2T-CHM13v2.0 Assembly.新型串联 Alpha 卫星 HOR 在猩猩 13 号染色体组装中的发现:59 碱基对 HOR 的发现——灵长类动物中最大的单位——以及人类 T2T-CHM13v2.0 组装中缺失的三核苷酸 45/27/18 HOR。
Int J Mol Sci. 2024 Jul 11;25(14):7596. doi: 10.3390/ijms25147596.
5
Precise identification of cascading alpha satellite higher order repeats in T2T-CHM13 assembly of human chromosome 3.在人类3号染色体的T2T-CHM13组装中对级联α卫星高阶重复序列进行精确鉴定。
Croat Med J. 2024 Jun 13;65(3):209-219. doi: 10.3325/cmj.2024.65.209.
6
Novel Concept of Alpha Satellite Cascading Higher-Order Repeats (HORs) and Precise Identification of 15mer and 20mer Cascading HORs in Complete T2T-CHM13 Assembly of Human Chromosome 15.人类15号染色体完整T2T-CHM13组装体中α卫星级联高阶重复序列(HORs)的新概念及15聚体和20聚体级联HORs的精确鉴定
Int J Mol Sci. 2024 Apr 16;25(8):4395. doi: 10.3390/ijms25084395.
7
Tandem NBPF 3mer HORs (Olduvai triplets) in Neanderthal and two novel HOR tandem arrays in human chromosome 1 T2T-CHM13 assembly.尼安德特人和人类 1 号染色体 T2T-CHM13 组装体中串联 NBPF3mer HOR(奥杜瓦伊三联体)和两个新的 HOR 串联阵列。
Sci Rep. 2023 Sep 2;13(1):14420. doi: 10.1038/s41598-023-41517-3.
8
Tandemly repeated NBPF HOR copies (Olduvai triplets): Possible impact on human brain evolution.串联重复的 NBPF HOR 拷贝(奥杜威三联体):可能对人类大脑进化的影响。
Life Sci Alliance. 2022 Oct 19;6(1). doi: 10.26508/lsa.202101306. Print 2023 Jan.
9
Advances in the computational analysis of SARS-COV2 genome.严重急性呼吸综合征冠状病毒2(SARS-CoV-2)基因组的计算分析进展
Nonlinear Dyn. 2021;106(2):1525-1555. doi: 10.1007/s11071-021-06836-y. Epub 2021 Aug 27.
10
Global Repeat Map (GRM): Advantageous Method for Discovery of Largest Higher-Order Repeats (HORs) in Neuroblastoma Breakpoint Family (NBPF) Genes, in Hornerin Exon and in Chromosome 21 Centromere.全球重复图谱(GRM):在神经母细胞瘤断裂点家族(NBPF)基因、Hornerin 外显子和 21 号染色体着丝粒中发现最大高阶重复(HOR)的有利方法。
Prog Mol Subcell Biol. 2021;60:203-234. doi: 10.1007/978-3-030-74889-0_8.
神经母细胞瘤断点家族基因中的基因内高级重复区分了人类和黑猩猩。
Mol Biol Evol. 2011 Jun;28(6):1877-92. doi: 10.1093/molbev/msr009. Epub 2011 Jan 27.
4
Large tandem, higher order repeats and regularly dispersed repeat units contribute substantially to divergence between human and chimpanzee Y chromosomes.大片段串联重复、高次重复以及规则散布的重复单位导致了人类和黑猩猩 Y 染色体之间的显著差异。
J Mol Evol. 2011 Jan;72(1):34-55. doi: 10.1007/s00239-010-9401-8. Epub 2010 Nov 20.
5
Expression, tandem repeat copy number variation and stability of four macrosatellite arrays in the human genome.人类基因组中四个大卫星重复序列的表达、串联重复拷贝数变异和稳定性。
BMC Genomics. 2010 Nov 15;11:632. doi: 10.1186/1471-2164-11-632.
6
Variable tandem repeats accelerate evolution of coding and regulatory sequences.可变串联重复加速编码和调控序列的进化。
Annu Rev Genet. 2010;44:445-77. doi: 10.1146/annurev-genet-072610-155046.
7
Genomics of long-range regulatory elements.长程调控元件的基因组学。
Annu Rev Genomics Hum Genet. 2010;11:1-23. doi: 10.1146/annurev-genom-082509-141651.
8
Genome-wide analysis of tandem repeats in Daphnia pulex--a comparative approach.对水蚤属内串联重复序列的全基因组分析——一种比较方法。
BMC Genomics. 2010 Apr 30;11:277. doi: 10.1186/1471-2164-11-277.
9
Diverse cis factors controlling Alu retrotransposition: what causes Alu elements to die?控制Alu逆转录转座的多种顺式作用因子:是什么导致Alu元件失活?
Genome Res. 2009 Apr;19(4):545-55. doi: 10.1101/gr.089789.108. Epub 2009 Mar 9.
10
Detection of tandem repeats in DNA sequences based on parametric spectral estimation.基于参数谱估计的DNA序列串联重复检测。
IEEE Trans Inf Technol Biomed. 2009 Sep;13(5):747-55. doi: 10.1109/TITB.2008.920626. Epub 2008 Mar 21.