• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

极端富集的 VNTR 相关多态性存在于人类亚端粒区:具有最多 VNTR 的基因主要在大脑中表达。

Extreme enrichment of VNTR-associated polymorphicity in human subtelomeres: genes with most VNTRs are predominantly expressed in the brain.

机构信息

Department of Clinical Genetics, Amsterdam Neuroscience, Vrije Universiteit Amsterdam, Amsterdam UMC, Amsterdam, The Netherlands.

Delft Bioinformatics Lab, Delft University of Technology, Delft, The Netherlands.

出版信息

Transl Psychiatry. 2020 Nov 2;10(1):369. doi: 10.1038/s41398-020-01060-5.

DOI:10.1038/s41398-020-01060-5
PMID:33139705
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7608644/
Abstract

The human genome harbors numerous structural variants (SVs) which, due to their repetitive nature, are currently underexplored in short-read whole-genome sequencing approaches. Using single-molecule, real-time (SMRT) long-read sequencing technology in combination with FALCON-Unzip, we generated a de novo assembly of the diploid genome of a 115-year-old Dutch cognitively healthy woman. We combined this assembly with two previously published haploid assemblies (CHM1 and CHM13) and the GRCh38 reference genome to create a compendium of SVs that occur across five independent human haplotypes using the graph-based multi-genome aligner REVEAL. Across these five haplotypes, we detected 31,680 euchromatic SVs (>50 bp). Of these, ~62% were comprised of repetitive sequences with 'variable number tandem repeats' (VNTRs), ~10% were mobile elements (Alu, L1, and SVA), while the remaining variants were inversions and indels. We observed that VNTRs with GC-content >60% and repeat patterns longer than 15 bp were 21-fold enriched in the subtelomeric regions (within 5 Mb of the ends of chromosome arms). VNTR lengths can expand to exceed a critical length which is associated with impaired gene transcription. The genes that contained most VNTRs, of which PTPRN2 and DLGAP2 are the most prominent examples, were found to be predominantly expressed in the brain and associated with a wide variety of neurological disorders. Repeat-induced variation represents a sizeable fraction of the genetic variation in human genomes and should be included in investigations of genetic factors associated with phenotypic traits, specifically those associated with neurological disorders. We make available the long and short-read sequence data of the supercentenarian genome, and a compendium of SVs as identified across 5 human haplotypes.

摘要

人类基因组中存在大量结构变异(SV),由于其重复性质,目前在短读长全基因组测序方法中尚未得到充分探索。我们使用单分子实时(SMRT)长读测序技术结合 FALCON-Unzip,对一位 115 岁的荷兰认知健康女性的二倍体基因组进行了从头组装。我们将该组装与之前发表的两个单倍体组装(CHM1 和 CHM13)和 GRCh38 参考基因组结合起来,使用基于图形的多基因组比对器 REVEAL,创建了一个跨越五个独立人类单倍型的 SV 简编。在这五个单倍型中,我们检测到了 31680 个常染色质 SV(>50bp)。其中,约 62%由具有“可变数量串联重复”(VNTR)的重复序列组成,约 10%为移动元件(Alu、L1 和 SVA),其余变体为倒位和插入缺失。我们观察到,GC 含量>60%且重复模式超过 15bp 的 VNTR 在端粒区域(染色体臂末端 5Mb 内)富集了 21 倍。VNTR 长度可以扩展到超过与基因转录受损相关的临界长度。包含最多 VNTR 的基因,其中 PTPRN2 和 DLGAP2 是最突出的例子,被发现主要在大脑中表达,并与多种神经疾病有关。重复诱导的变异代表人类基因组遗传变异的相当大一部分,应包括在与表型特征相关的遗传因素研究中,特别是与神经疾病相关的因素。我们提供了超级百岁老人基因组的长读和短读序列数据,以及跨越 5 个人类单倍型的 SV 简编。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a0a0/7608644/76337aea39f8/41398_2020_1060_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a0a0/7608644/a82894c5d37c/41398_2020_1060_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a0a0/7608644/6cd9f13d19ad/41398_2020_1060_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a0a0/7608644/b3e370c46305/41398_2020_1060_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a0a0/7608644/13e3cdc95a5d/41398_2020_1060_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a0a0/7608644/a9e51fc81380/41398_2020_1060_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a0a0/7608644/4c7a6a4c7a5c/41398_2020_1060_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a0a0/7608644/76337aea39f8/41398_2020_1060_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a0a0/7608644/a82894c5d37c/41398_2020_1060_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a0a0/7608644/6cd9f13d19ad/41398_2020_1060_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a0a0/7608644/b3e370c46305/41398_2020_1060_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a0a0/7608644/13e3cdc95a5d/41398_2020_1060_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a0a0/7608644/a9e51fc81380/41398_2020_1060_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a0a0/7608644/4c7a6a4c7a5c/41398_2020_1060_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a0a0/7608644/76337aea39f8/41398_2020_1060_Fig7_HTML.jpg

相似文献

1
Extreme enrichment of VNTR-associated polymorphicity in human subtelomeres: genes with most VNTRs are predominantly expressed in the brain.极端富集的 VNTR 相关多态性存在于人类亚端粒区:具有最多 VNTR 的基因主要在大脑中表达。
Transl Psychiatry. 2020 Nov 2;10(1):369. doi: 10.1038/s41398-020-01060-5.
2
A pipeline for local assembly of minisatellite alleles from single-molecule sequencing data.一种用于从单分子测序数据中进行小卫星等位基因本地组装的流程。
Bioinformatics. 2017 Mar 1;33(5):650-653. doi: 10.1093/bioinformatics/btw687.
3
Genome-wide prediction of human VNTRs.人类可变数目串联重复序列(VNTRs)的全基因组预测
Genomics. 2005 Jan;85(1):24-35. doi: 10.1016/j.ygeno.2004.10.009.
4
Profiling variable-number tandem repeat variation across populations using repeat-pangenome graphs.利用重复泛基因组图对人群进行可变串联重复序列变异分析。
Nat Commun. 2021 Jul 12;12(1):4250. doi: 10.1038/s41467-021-24378-0.
5
Characterizing the Major Structural Variant Alleles of the Human Genome.人类基因组主要结构变异等位基因的特征。
Cell. 2019 Jan 24;176(3):663-675.e19. doi: 10.1016/j.cell.2018.12.019. Epub 2019 Jan 17.
6
A phenome-wide association study identifies effects of copy-number variation of VNTRs and multicopy genes on multiple human traits.全基因组关联研究鉴定了 VNTRs 和多拷贝基因的拷贝数变异对多种人类特征的影响。
Am J Hum Genet. 2022 Jun 2;109(6):1065-1076. doi: 10.1016/j.ajhg.2022.04.016. Epub 2022 May 23.
7
NovoGraph: Human genome graph construction from multiple long-read assemblies.NovoGraph:基于多个长读长组装构建人类基因组图谱。
F1000Res. 2018 Sep 3;7:1391. doi: 10.12688/f1000research.15895.2. eCollection 2018.
8
Discovery and genotyping of structural variation from long-read haploid genome sequence data.从长读单倍体基因组序列数据中发现结构变异并进行基因分型。
Genome Res. 2017 May;27(5):677-685. doi: 10.1101/gr.214007.116. Epub 2016 Nov 28.
9
The motif composition of variable number tandem repeats impacts gene expression.可变数串联重复的基序组成影响基因表达。
Genome Res. 2023 Apr;33(4):511-524. doi: 10.1101/gr.276768.122. Epub 2023 Apr 10.
10
Human-specific tandem repeat expansion and differential gene expression during primate evolution.人类特有的串联重复扩展和灵长类动物进化过程中的差异基因表达。
Proc Natl Acad Sci U S A. 2019 Nov 12;116(46):23243-23253. doi: 10.1073/pnas.1912175116. Epub 2019 Oct 28.

引用本文的文献

1
Trisomic rescue via allele-specific multiple chromosome cleavage using CRISPR-Cas9 in trisomy 21 cells.在21三体细胞中通过使用CRISPR-Cas9的等位基因特异性多染色体切割进行三体拯救。
PNAS Nexus. 2025 Feb 18;4(2):pgaf022. doi: 10.1093/pnasnexus/pgaf022. eCollection 2025 Feb.
2
Genome-wide investigation of VNTR motif polymorphisms in 8,222 genomes: Implications for biological regulation and human traits.对8222个基因组中VNTR基序多态性的全基因组研究:对生物调控和人类性状的影响。
Cell Genom. 2024 Dec 11;4(12):100699. doi: 10.1016/j.xgen.2024.100699. Epub 2024 Nov 27.
3
Characterizing tandem repeat complexities across long-read sequencing platforms with TREAT and .

本文引用的文献

1
SciPy 1.0: fundamental algorithms for scientific computing in Python.SciPy 1.0:Python 中的科学计算基础算法。
Nat Methods. 2020 Mar;17(3):261-272. doi: 10.1038/s41592-019-0686-2. Epub 2020 Feb 3.
2
Multi-platform discovery of haplotype-resolved structural variation in human genomes.多平台发现人类基因组中单体型分辨率结构变异。
Nat Commun. 2019 Apr 16;10(1):1784. doi: 10.1038/s41467-018-08148-z.
3
Characterizing the Major Structural Variant Alleles of the Human Genome.人类基因组主要结构变异等位基因的特征。
使用 TREAT 和 对长读测序平台上的串联重复复杂度进行特征分析。
Genome Res. 2024 Nov 20;34(11):1942-1953. doi: 10.1101/gr.279351.124.
4
A comprehensive tandem repeat catalog of the human genome.人类基因组的全面串联重复序列目录。
medRxiv. 2024 Jun 20:2024.06.19.24309173. doi: 10.1101/2024.06.19.24309173.
5
Advances in the discovery and analyses of human tandem repeats.人类串联重复序列的发现和分析进展。
Emerg Top Life Sci. 2023 Dec 14;7(3):361-381. doi: 10.1042/ETLS20230074.
6
Characterization of genome-wide STR variation in 6487 human genomes.6487 个人类基因组中全基因组 STR 变异的特征。
Nat Commun. 2023 Apr 12;14(1):2092. doi: 10.1038/s41467-023-37690-8.
7
The motif composition of variable number tandem repeats impacts gene expression.可变数串联重复的基序组成影响基因表达。
Genome Res. 2023 Apr;33(4):511-524. doi: 10.1101/gr.276768.122. Epub 2023 Apr 10.
8
Research on Voxel-Based Features Detection and Analysis of Alzheimer's Disease Using Random Survey Support Vector Machine.基于随机调查支持向量机的阿尔茨海默病体素特征检测与分析研究
Front Neuroinform. 2022 Mar 28;16:856295. doi: 10.3389/fninf.2022.856295. eCollection 2022.
9
A hidden layer of structural variation in transposable elements reveals potential genetic modifiers in human disease-risk loci.转座元件结构变异的隐藏层揭示了人类疾病风险位点的潜在遗传修饰因子。
Genome Res. 2022 Apr;32(4):656-670. doi: 10.1101/gr.275515.121. Epub 2022 Mar 24.
10
Sweet Taste Preference: Relationships with Other Tastes, Liking for Sugary Foods and Exploratory Genome-Wide Association Analysis in Subjects with Metabolic Syndrome.甜味偏好:与其他味觉的关系、对含糖食物的喜好以及代谢综合征患者的全基因组关联探索性分析
Biomedicines. 2021 Dec 31;10(1):79. doi: 10.3390/biomedicines10010079.
Cell. 2019 Jan 24;176(3):663-675.e19. doi: 10.1016/j.cell.2018.12.019. Epub 2019 Jan 17.
4
Moderate-to-severe asthma in individuals of European ancestry: a genome-wide association study.欧洲裔人群中中重度哮喘的全基因组关联研究。
Lancet Respir Med. 2019 Jan;7(1):20-34. doi: 10.1016/S2213-2600(18)30389-8. Epub 2018 Dec 11.
5
Somatic APP gene recombination in Alzheimer's disease and normal neurons.阿尔茨海默病和正常神经元中的体细胞 APP 基因重组。
Nature. 2018 Nov;563(7733):639-645. doi: 10.1038/s41586-018-0718-6. Epub 2018 Nov 21.
6
The NHGRI-EBI GWAS Catalog of published genome-wide association studies, targeted arrays and summary statistics 2019.NHGRI-EBI GWAS Catalog 于 2019 年发布的已发表全基因组关联研究、靶向基因芯片和汇总统计数据
Nucleic Acids Res. 2019 Jan 8;47(D1):D1005-D1012. doi: 10.1093/nar/gky1120.
7
Genetic influences on susceptibility to rheumatoid arthritis in African-Americans.遗传因素对非裔美国人类风湿关节炎易感性的影响。
Hum Mol Genet. 2019 Mar 1;28(5):858-874. doi: 10.1093/hmg/ddy395.
8
Centenarian controls increase variant effect sizes by an average twofold in an extreme case-extreme control analysis of Alzheimer's disease.百岁老人对照分析发现,在阿尔茨海默病的极端对照极端控制分析中,变异效应平均增加了一倍。
Eur J Hum Genet. 2019 Feb;27(2):244-253. doi: 10.1038/s41431-018-0273-5. Epub 2018 Sep 26.
9
Characterization of a Human-Specific Tandem Repeat Associated with Bipolar Disorder and Schizophrenia.与双相情感障碍和精神分裂症相关的串联重复序列的特征分析。
Am J Hum Genet. 2018 Sep 6;103(3):421-430. doi: 10.1016/j.ajhg.2018.07.011. Epub 2018 Aug 9.
10
Lack of association between variable number tandem repeat and circadian rhythm sleep-wake disorders.可变数目串联重复序列与昼夜节律性睡眠-觉醒障碍之间无关联。
Hum Genome Var. 2018 Jul 12;5:17. doi: 10.1038/s41439-018-0017-7. eCollection 2018.