
Centromere reference models for human chromosomes X and Y satellite arrays.

Authors

Miga Karen H, Newton Yulia, Jain Miten, Altemose Nicolas, Willard Huntington F, Kent W James

Affiliation

Duke Institute for Genome Sciences & Policy, Duke University, Durham, North Carolina 27708, USA;

Publication information

Genome Res. 2014 Apr;24(4):697-707. doi: 10.1101/gr.159624.113. Epub 2014 Feb 5.

DOI:10.1101/gr.159624.113
PMID:24501022
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3975068/
Abstract

The human genome sequence remains incomplete, with multimegabase-sized gaps representing the endogenous centromeres and other heterochromatic regions. Available sequence-based studies within these sites in the genome have demonstrated a role in centromere function and chromosome pairing, necessary to ensure proper chromosome segregation during cell division. A common genomic feature of these regions is the enrichment of long arrays of near-identical tandem repeats, known as satellite DNAs, which offer a limited number of variant sites to differentiate individual repeat copies across millions of bases. This substantial sequence homogeneity challenges available assembly strategies and, as a result, centromeric regions are omitted from ongoing genomic studies. To address this problem, we utilize monomer sequence and ordering information obtained from whole-genome shotgun reads to model two haploid human satellite arrays on chromosomes X and Y, resulting in an initial characterization of 3.83 Mb of centromeric DNA within an individual genome. To further expand the utility of each centromeric reference sequence model, we evaluate sites within the arrays for short-read mappability and chromosome specificity. Because satellite DNAs evolve in a concerted manner, we use these centromeric assemblies to assess the extent of sequence variation among 366 individuals from distinct human populations. We thus identify two satellite array variants in both X and Y centromeres, as determined by array length and sequence composition. This study provides an initial sequence characterization of a regional centromere and establishes a foundation to extend genomic characterization to these sites as well as to other repeat-rich regions within complex genomes.
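As a rough illustration of what modeling an array from "monomer sequence and ordering information" could look like, the sketch below counts which monomer variants directly follow one another within reads and then generates a linear array by a weighted random walk over those observed adjacencies. This is a simplified, first-order stand-in chosen for illustration, not the authors' pipeline. The function names, the monomer labels (`m1`, `m2`, `m3`, `m3v`), and the toy reads are all hypothetical.

```python
import random
from collections import Counter, defaultdict


def build_transitions(reads):
    """Count how often monomer variant b directly follows variant a within a read.

    Each read is assumed to be already decomposed into an ordered list of
    monomer-variant labels (that decomposition step is not shown here).
    """
    transitions = defaultdict(Counter)
    for read in reads:
        for a, b in zip(read, read[1:]):
            transitions[a][b] += 1
    return transitions


def generate_array(transitions, start, n_monomers, seed=0):
    """Generate a linear array model by a random walk over observed adjacencies.

    The next monomer is drawn in proportion to how often it was seen
    following the current one. The walk stops early if the current monomer
    has no observed successor.
    """
    rng = random.Random(seed)
    array = [start]
    while len(array) < n_monomers:
        successors = transitions.get(array[-1])
        if not successors:
            break
        labels = list(successors)
        weights = [successors[label] for label in labels]
        array.append(rng.choices(labels, weights=weights, k=1)[0])
    return array


# Hypothetical toy input: four reads, each an ordered run of monomer labels.
reads = [
    ["m1", "m2", "m3"],
    ["m2", "m3", "m1"],
    ["m3", "m1", "m2"],
    ["m2", "m3v", "m1"],
]

transitions = build_transitions(reads)
model = generate_array(transitions, start="m1", n_monomers=12)
print(model)
```

Every adjacent pair in the generated array is one that was observed in at least one read, so the model reflects local ordering and variant frequencies without claiming to recover the true long-range arrangement of the array.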

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2976/3975068/49f8db1f9bbe/697fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2976/3975068/f0481e0b0eb0/697fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2976/3975068/e934c8d83bb4/697fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2976/3975068/709ef4ab90fa/697fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2976/3975068/678aecf5fd99/697fig5.jpg
