• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

TRGT-denovo:串联重复突变的准确检测

TRGT-denovo: accurate detection of tandem repeat mutations.

作者信息

Mokveld T, Dolzhenko E, Dashnow H, Nicholas T J, Sasani T, van der Sanden B, Jadhav B, Pedersen B, Kronenberg Z, Tucci A, Sharp A J, Quinlan A R, Gilissen C, Hoischen A, Eberle M A

机构信息

PacBio, Menlo Park, CA.

Univ. of Utah, Salt Lake City, UT.

出版信息

bioRxiv. 2024 Jul 19:2024.07.16.600745. doi: 10.1101/2024.07.16.600745.

DOI:10.1101/2024.07.16.600745
PMID:39071386
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11275785/
Abstract

MOTIVATION

Identifying tandem repeat (TR) mutations on a genome-wide scale is essential for understanding genetic variability and its implications in rare diseases. While PacBio HiFi sequencing data enhances the accessibility of the genome's TR regions for genotyping, simple calling strategies often generate an excess of likely false positives, which can obscure true positive findings, particularly as the number of surveyed genomic regions increases.

RESULTS

We developed TRGT-denovo, a computational method designed to accurately identify all types of TR mutations-including expansions, contractions, and compositional changes-within family trios. TRGT-denovo directly interrogates read evidence, allowing for the detection of subtle variations often overlooked in variant call format (VCF) files. TRGT-denovo improves the precision and specificity of mutation (DNM) identification, reducing the number of candidates by an order of magnitude compared to genotype-based approaches. In our experiments involving eight rare disease trios previously studiedTRGT-denovo correctly reclassified all false positive DNM candidates as true negatives. Using an expanded repeat catalog, it identified new candidates, of which 95% (19/20) were experimentally validated, demonstrating its effectiveness in minimizing likely false positives while maintaining high sensitivity for true discoveries.

AVAILABILITY AND IMPLEMENTATION

Built in Rust, TRGT-denovo is available as source code and a pre-compiled Linux binary along with a user guide at: https://github.com/PacificBiosciences/trgt-denovo.

摘要

动机

在全基因组范围内识别串联重复(TR)突变对于理解遗传变异性及其在罕见病中的影响至关重要。虽然PacBio HiFi测序数据提高了基因组TR区域用于基因分型的可及性,但简单的调用策略通常会产生过多可能的假阳性结果,这可能会掩盖真正的阳性发现,尤其是随着被调查基因组区域数量的增加。

结果

我们开发了TRGT-denovo,这是一种计算方法,旨在准确识别家系三联体中的所有类型的TR突变,包括扩增、收缩和组成变化。TRGT-denovo直接审查读取证据,从而能够检测变异调用格式(VCF)文件中经常被忽视的细微变异。TRGT-denovo提高了突变(DNM)识别的精度和特异性,与基于基因型的方法相比,将候选数量减少了一个数量级。在我们涉及八个先前研究过的罕见病三联体的实验中,TRGT-denovo将所有假阳性DNM候选者正确地重新分类为真阴性。使用扩展的重复目录,它识别出了新的候选者,其中95%(19/20)经过实验验证,证明了其在最大限度减少可能的假阳性同时保持对真实发现的高灵敏度方面的有效性。

可用性和实现方式

TRGT-denovo用Rust编写,可作为源代码、预编译的Linux二进制文件以及用户指南获取,网址为:https://github.com/PacificBiosciences/trgt-denovo。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a9ee/11275785/0d0893081e88/nihpp-2024.07.16.600745v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a9ee/11275785/6ee7865532ea/nihpp-2024.07.16.600745v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a9ee/11275785/cc6496391009/nihpp-2024.07.16.600745v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a9ee/11275785/0d0893081e88/nihpp-2024.07.16.600745v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a9ee/11275785/6ee7865532ea/nihpp-2024.07.16.600745v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a9ee/11275785/cc6496391009/nihpp-2024.07.16.600745v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a9ee/11275785/0d0893081e88/nihpp-2024.07.16.600745v1-f0003.jpg

相似文献

1
TRGT-denovo: accurate detection of tandem repeat mutations.TRGT-denovo:串联重复突变的准确检测
bioRxiv. 2024 Jul 19:2024.07.16.600745. doi: 10.1101/2024.07.16.600745.
2
Diagnostic test accuracy and cost-effectiveness of tests for codeletion of chromosomal arms 1p and 19q in people with glioma.染色体臂 1p 和 19q 缺失的检测在胶质瘤患者中的诊断准确性和成本效益。
Cochrane Database Syst Rev. 2022 Mar 2;3(3):CD013387. doi: 10.1002/14651858.CD013387.pub2.
3
Magnetic resonance perfusion for differentiating low-grade from high-grade gliomas at first presentation.首次就诊时磁共振灌注成像用于鉴别低级别与高级别胶质瘤
Cochrane Database Syst Rev. 2018 Jan 22;1(1):CD011551. doi: 10.1002/14651858.CD011551.pub2.
4
Guaiac-based faecal occult blood tests versus faecal immunochemical tests for colorectal cancer screening in average-risk individuals.基于愈创木脂的粪便潜血试验与粪便免疫化学试验用于一般风险人群结直肠癌筛查。
Cochrane Database Syst Rev. 2022 Jun 6;6(6):CD009276. doi: 10.1002/14651858.CD009276.pub2.
5
Low-complexity manual nucleic acid amplification tests for pulmonary tuberculosis in children.用于儿童肺结核的低复杂度手动核酸扩增检测
Cochrane Database Syst Rev. 2025 Jun 25;6(6):CD015806. doi: 10.1002/14651858.CD015806.pub2.
6
Doppler trans-thoracic echocardiography for detection of pulmonary hypertension in adults.经胸多普勒超声心动图用于检测成人肺动脉高压。
Cochrane Database Syst Rev. 2022 May 9;5(5):CD012809. doi: 10.1002/14651858.CD012809.pub2.
7
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
8
Development and Validation of a Convolutional Neural Network Model to Predict a Pathologic Fracture in the Proximal Femur Using Abdomen and Pelvis CT Images of Patients With Advanced Cancer.利用晚期癌症患者腹部和骨盆 CT 图像建立卷积神经网络模型预测股骨近端病理性骨折的研究
Clin Orthop Relat Res. 2023 Nov 1;481(11):2247-2256. doi: 10.1097/CORR.0000000000002771. Epub 2023 Aug 23.
9
123I-MIBG scintigraphy and 18F-FDG-PET imaging for diagnosing neuroblastoma.用于诊断神经母细胞瘤的123I-间碘苄胍闪烁扫描术和18F-氟代脱氧葡萄糖正电子发射断层显像
Cochrane Database Syst Rev. 2015 Sep 29;2015(9):CD009263. doi: 10.1002/14651858.CD009263.pub2.
10
Characterization and visualization of tandem repeats at genome scale.基因组水平上串联重复序列的特征化和可视化。
Nat Biotechnol. 2024 Oct;42(10):1606-1614. doi: 10.1038/s41587-023-02057-3. Epub 2024 Jan 2.

本文引用的文献

1
A phenome-wide association study of methylated GC-rich repeats identifies a GCC repeat expansion in AFF3 associated with intellectual disability.全基因组关联研究表明甲基化 GC 丰富重复序列与智力障碍相关联,该研究确定了 AFF3 中的 GCC 重复扩展。
Nat Genet. 2024 Nov;56(11):2322-2332. doi: 10.1038/s41588-024-01917-1. Epub 2024 Sep 23.
2
Analysis and benchmarking of small and large genomic variants across tandem repeats.串联重复序列中小的和大的基因组变异的分析与基准测试。
Nat Biotechnol. 2025 Mar;43(3):431-442. doi: 10.1038/s41587-024-02225-z. Epub 2024 Apr 26.
3
Short tandem repeat mutations regulate gene expression in colorectal cancer.
短串联重复突变调节结直肠癌中的基因表达。
Sci Rep. 2024 Feb 9;14(1):3331. doi: 10.1038/s41598-024-53739-0.
4
HiPhase: jointly phasing small, structural, and tandem repeat variants from HiFi sequencing.HiPhase:从 HiFi 测序中联合相位小结构和串联重复变体。
Bioinformatics. 2024 Feb 1;40(2). doi: 10.1093/bioinformatics/btae042.
5
Characterization and visualization of tandem repeats at genome scale.基因组水平上串联重复序列的特征化和可视化。
Nat Biotechnol. 2024 Oct;42(10):1606-1614. doi: 10.1038/s41587-023-02057-3. Epub 2024 Jan 2.
6
Comprehensive de novo mutation discovery with HiFi long-read sequencing.利用 HiFi 长读测序进行全面的从头突变发现。
Genome Med. 2023 May 8;15(1):34. doi: 10.1186/s13073-023-01183-6.
7
Recurrent repeat expansions in human cancer genomes.人类癌症基因组中的重复重复扩展。
Nature. 2023 Jan;613(7942):96-102. doi: 10.1038/s41586-022-05515-1. Epub 2022 Dec 14.
8
30 years of repeat expansion disorders: What have we learned and what are the remaining challenges?30 年的重复扩展障碍研究:我们学到了什么,还有哪些挑战?
Am J Hum Genet. 2021 May 6;108(5):764-785. doi: 10.1016/j.ajhg.2021.03.011. Epub 2021 Apr 2.
9
Fast gap-affine pairwise alignment using the wavefront algorithm.基于波前算法的快速间隙亲和双序列比对。
Bioinformatics. 2021 May 1;37(4):456-463. doi: 10.1093/bioinformatics/btaa777.
10
The role of de novo mutations in adult-onset neurodegenerative disorders.新生突变在成年发病的神经退行性疾病中的作用。
Acta Neuropathol. 2019 Feb;137(2):183-207. doi: 10.1007/s00401-018-1939-3. Epub 2018 Nov 26.