• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

全基因组范围内短串联重复序列的体细胞镶嵌性检测。

Genome-wide detection of somatic mosaicism at short tandem repeats.

机构信息

Department of Computer Science and Engineering, University of California San Diego, 9500 Gilman Drive, La Jolla, CA, 92093, United States.

Department of Medicine, University of California San Diego, 9500 Gilman Drive, La Jolla, CA, 92093, United States.

出版信息

Bioinformatics. 2024 Aug 2;40(8). doi: 10.1093/bioinformatics/btae485.

DOI:10.1093/bioinformatics/btae485
PMID:39078205
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11319640/
Abstract

MOTIVATION

Somatic mosaicism has been implicated in several developmental disorders, cancers, and other diseases. Short tandem repeats (STRs) consist of repeated sequences of 1-6 bp and comprise >1 million loci in the human genome. Somatic mosaicism at STRs is known to play a key role in the pathogenicity of loci implicated in repeat expansion disorders and is highly prevalent in cancers exhibiting microsatellite instability. While a variety of tools have been developed to genotype germline variation at STRs, a method for systematically identifying mosaic STRs is lacking.

RESULTS

We introduce prancSTR, a novel method for detecting mosaic STRs from individual high-throughput sequencing datasets. prancSTR is designed to detect loci characterized by a single high-frequency mosaic allele, but can also detect loci with multiple mosaic alleles. Unlike many existing mosaicism detection methods for other variant types, prancSTR does not require a matched control sample as input. We show that prancSTR accurately identifies mosaic STRs in simulated data, demonstrate its feasibility by identifying candidate mosaic STRs in Illumina whole genome sequencing data derived from lymphoblastoid cell lines for individuals sequenced by the 1000 Genomes Project, and evaluate the use of prancSTR on Element and PacBio data. In addition to prancSTR, we present simTR, a novel simulation framework which simulates raw sequencing reads with realistic error profiles at STRs.

AVAILABILITY AND IMPLEMENTATION

prancSTR and simTR are freely available at https://github.com/gymrek-lab/trtools. Detailed documentation is available at https://trtools.readthedocs.io/.

摘要

动机

体细胞镶嵌现象与多种发育障碍、癌症和其他疾病有关。短串联重复序列(STRs)由 1-6 个碱基的重复序列组成,在人类基因组中包含>100 万个位点。已知 STRs 的体细胞镶嵌现象在重复扩展障碍相关位点的致病性中起关键作用,并且在表现出微卫星不稳定的癌症中高度普遍存在。虽然已经开发了多种工具来对 STR 中的种系变异进行基因分型,但缺乏系统识别镶嵌 STR 的方法。

结果

我们引入了 prancSTR,这是一种从单个高通量测序数据集中检测镶嵌 STR 的新方法。prancSTR 旨在检测具有单个高频镶嵌等位基因的位点,但也可以检测具有多个镶嵌等位基因的位点。与许多用于其他变异类型的现有镶嵌性检测方法不同,prancSTR 不需要输入匹配的对照样本。我们表明 prancSTR 可以准确地在模拟数据中识别镶嵌 STR,通过在 1000 基因组计划个体的 Illumina 全基因组测序数据中识别候选镶嵌 STR 来证明其可行性,并评估其在 Element 和 PacBio 数据上的应用。除了 prancSTR,我们还提出了 simTR,这是一种新的模拟框架,它可以在 STR 上模拟具有真实错误分布的原始测序读数。

可用性和实现

prancSTR 和 simTR 可在 https://github.com/gymrek-lab/trtools 上免费获得。详细文档可在 https://trtools.readthedocs.io/ 上获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2687/11319640/254bb2aacdd3/btae485f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2687/11319640/8e210e61fb16/btae485f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2687/11319640/21ecd88c9461/btae485f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2687/11319640/254bb2aacdd3/btae485f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2687/11319640/8e210e61fb16/btae485f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2687/11319640/21ecd88c9461/btae485f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2687/11319640/254bb2aacdd3/btae485f3.jpg

相似文献

1
Genome-wide detection of somatic mosaicism at short tandem repeats.全基因组范围内短串联重复序列的体细胞镶嵌性检测。
Bioinformatics. 2024 Aug 2;40(8). doi: 10.1093/bioinformatics/btae485.
2
Genome-wide detection of somatic mosaicism at short tandem repeats.全基因组范围内短串联重复序列的体细胞嵌合现象检测。
bioRxiv. 2023 Nov 23:2023.11.22.568371. doi: 10.1101/2023.11.22.568371.
3
Rapid detection of expanded short tandem repeats in personal genomics using hybrid sequencing.利用混合测序技术快速检测个人基因组中的扩展短串联重复序列。
Bioinformatics. 2014 Mar 15;30(6):815-22. doi: 10.1093/bioinformatics/btt647. Epub 2013 Nov 8.
4
STRAS:a snakemake pipeline for genome-wide short tandem repeats annotation and score.STRAS:一个用于全基因组短串联重复序列注释和评分的 SnakeMake 管道。
Hum Genet. 2024 Jun;143(6):735-738. doi: 10.1007/s00439-024-02662-5. Epub 2024 Mar 20.
5
Accurate typing of short tandem repeats from genome-wide sequencing data and its applications.从全基因组测序数据中准确分型短串联重复序列及其应用。
Genome Res. 2015 May;25(5):736-49. doi: 10.1101/gr.185892.114. Epub 2015 Mar 30.
6
Dante: genotyping of known complex and expanded short tandem repeats.对已知的复杂和扩展短串联重复序列进行基因分型。
Bioinformatics. 2019 Apr 15;35(8):1310-1317. doi: 10.1093/bioinformatics/bty791.
7
Profiling of Short-Tandem-Repeat Disease Alleles in 12,632 Human Whole Genomes.12632个人类全基因组中短串联重复疾病等位基因的分析
Am J Hum Genet. 2017 Nov 2;101(5):700-715. doi: 10.1016/j.ajhg.2017.09.013.
8
LongTR: genome-wide profiling of genetic variation at tandem repeats from long reads.LongTR:从长读段中进行串联重复的全基因组遗传变异分析。
Genome Biol. 2024 Jul 4;25(1):176. doi: 10.1186/s13059-024-03319-2.
9
TRTools: a toolkit for genome-wide analysis of tandem repeats.TRTools:用于串联重复序列全基因组分析的工具包。
Bioinformatics. 2021 May 5;37(5):731-733. doi: 10.1093/bioinformatics/btaa736.
10
WarpSTR: determining tandem repeat lengths using raw nanopore signals.WarpSTR:使用原始纳米孔信号确定串联重复序列长度。
Bioinformatics. 2023 Jun 1;39(6). doi: 10.1093/bioinformatics/btad388.

引用本文的文献

1
ONT in Clinical Diagnostics of Repeat Expansion Disorders: Detection and Reporting Challenges.光学纳米断层扫描技术在重复序列扩张疾病临床诊断中的应用:检测与报告面临的挑战
Int J Mol Sci. 2025 Mar 18;26(6):2725. doi: 10.3390/ijms26062725.
2
Mosaicism in Short Tandem Repeat Disorders: A Clinical Perspective.短串联重复序列疾病中的嵌合现象:临床视角
Genes (Basel). 2025 Feb 13;16(2):216. doi: 10.3390/genes16020216.
3
Nanopore sequencing with unique molecular identifiers enables accurate mutation analysis and haplotyping in the complex lipoprotein(a) KIV-2 VNTR.

本文引用的文献

1
A deep population reference panel of tandem repeat variation.串联重复变异的深度人群参考面板。
Nat Commun. 2023 Oct 23;14(1):6711. doi: 10.1038/s41467-023-42278-3.
2
Sequencing by avidity enables high accuracy with low reagent consumption.高亲和力测序可实现高精度和低试剂消耗。
Nat Biotechnol. 2024 Jan;42(1):132-138. doi: 10.1038/s41587-023-01750-7. Epub 2023 May 25.
3
Control-independent mosaic single nucleotide variant detection with DeepMosaic.DeepMosaic 实现了与对照无关的嵌合体单核苷酸变异检测。
纳米孔测序结合独特的分子标识符可实现复杂脂蛋白(a) KIV-2 VNTR 中的精确突变分析和单倍型分型。
Genome Med. 2024 Oct 8;16(1):117. doi: 10.1186/s13073-024-01391-8.
Nat Biotechnol. 2023 Jun;41(6):870-877. doi: 10.1038/s41587-022-01559-w. Epub 2023 Jan 2.
4
High-coverage whole-genome sequencing of the expanded 1000 Genomes Project cohort including 602 trios.对扩展的 1000 基因组项目队列进行高覆盖率全基因组测序,包括 602 个三核苷酸重复序列。
Cell. 2022 Sep 1;185(18):3426-3440.e19. doi: 10.1016/j.cell.2022.08.004.
5
PrecisionFDA Truth Challenge V2: Calling variants from short and long reads in difficult-to-map regions.精准FDA真相挑战V2:在难以映射的区域中从短读长和长读长中识别变异体。
Cell Genom. 2022 May 11;2(5). doi: 10.1016/j.xgen.2022.100129. Epub 2022 Apr 27.
6
Somatic mosaicism reveals clonal distributions of neocortical development.体细胞镶嵌现象揭示了新皮层发育的克隆分布。
Nature. 2022 Apr;604(7907):689-696. doi: 10.1038/s41586-022-04602-7. Epub 2022 Apr 20.
7
MONTAGE: a new tool for high-throughput detection of mosaic copy number variation.蒙太奇:一种用于高通量检测嵌合拷贝数变异的新工具。
BMC Genomics. 2021 Feb 24;22(1):133. doi: 10.1186/s12864-021-07395-7.
8
TRTools: a toolkit for genome-wide analysis of tandem repeats.TRTools:用于串联重复序列全基因组分析的工具包。
Bioinformatics. 2021 May 5;37(5):731-733. doi: 10.1093/bioinformatics/btaa736.
9
Comprehensive analysis of indels in whole-genome microsatellite regions and microsatellite instability across 21 cancer types.对21种癌症类型的全基因组微卫星区域中的插入缺失和微卫星不稳定性进行综合分析。
Genome Res. 2020 Mar 24;30(3):334-46. doi: 10.1101/gr.255026.119.
10
Accurate detection of mosaic variants in sequencing data without matched controls.无对照匹配情况下测序数据中嵌合体变异的准确检测。
Nat Biotechnol. 2020 Mar;38(3):314-319. doi: 10.1038/s41587-019-0368-8. Epub 2020 Jan 6.