Suppr超能文献

个体人类基因组内复杂串联重复的景观。

A landscape of complex tandem repeats within individual human genomes.

机构信息

Department of Computational Biology and Medical Sciences, The University of Tokyo, 277-8561, Chiba, Japan.

出版信息

Nat Commun. 2023 Sep 14;14(1):5530. doi: 10.1038/s41467-023-41262-1.

Abstract

Markedly expanded tandem repeats (TRs) have been correlated with ~60 diseases. TR diversity has been considered a clue toward understanding missing heritability. However, haplotype-resolved long TRs remain mostly hidden or blacked out because their complex structures (TRs composed of various units and minisatellites containing >10-bp units) make them difficult to determine accurately with existing methods. Here, using a high-precision algorithm to determine complex TR structures from long, accurate reads of PacBio HiFi, an investigation of 270 Japanese control samples yields several genome-wide findings. Approximately 322,000 TRs are difficult to impute from the surrounding single-nucleotide variants. Greater genetic divergence of TR loci is significantly correlated with more events of younger replication slippage. Complex TRs are more abundant than single-unit TRs, and a tendency for complex TRs to consist of <10-bp units and single-unit TRs to be minisatellites is statistically significant at loci with ≥500-bp TRs. Of note, 8909 loci with extended TRs (>100b longer than the mode) contain several known disease-associated TRs and are considered candidates for association with disorders. Overall, complex TRs and minisatellites are found to be abundant and diverse, even in genetically small Japanese populations, yielding insights into the landscape of long TRs.

摘要

明显扩展的串联重复(TRs)与~60 种疾病相关。TR 多样性被认为是理解遗传缺失的线索。然而,由于其复杂的结构(由各种单元组成的 TR 和包含>10 个碱基单元的微卫星),使得现有的方法很难准确确定单倍型解析的长 TR,因此这些长 TR 仍然大多隐藏或被掩盖。在这里,使用一种高精度算法来从 PacBio HiFi 的长而准确的读取中确定复杂的 TR 结构,对 270 个日本对照样本的调查产生了一些全基因组发现。大约有 322,000 个 TR 很难从周围的单核苷酸变体推断出来。TR 位点的遗传差异越大,复制滑动的年轻事件就越多。复杂的 TR 比单单元 TR 更丰富,并且在 TR 长度≥500bp 的位点上,复杂 TR 倾向于由<10-bp 单元组成,而单单元 TR 倾向于是微卫星,这在统计学上具有显著意义。值得注意的是,8909 个具有扩展 TR(比模式长>100bp)的位点包含几个已知与疾病相关的 TR,被认为是与疾病相关的候选者。总体而言,复杂的 TR 和微卫星在遗传上较小的日本人群中也很丰富和多样,这为长 TR 的全景提供了深入的了解。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d16/10502081/614935045947/41467_2023_41262_Fig1_HTML.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验