TRiCoLOR: tandem repeat profiling using whole-genome long-read sequencing data.

Affiliations

Department of Experimental and Clinical Medicine, University of Florence, Viale Pieraccini 6, Florence 50134, Italy.

European Molecular Biology Laboratory (EMBL), GeneCore, Meyerhofstraße 1, Heidelberg 69117, Germany.

Publication information

Gigascience. 2020 Oct 7;9(10). doi: 10.1093/gigascience/giaa101.

DOI:10.1093/gigascience/giaa101
PMID:33034633
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7539535/
Abstract

BACKGROUND

Tandem repeat sequences are widespread in the human genome, and their expansions cause multiple repeat-mediated disorders. Genome-wide discovery approaches are needed to fully elucidate their roles in health and disease, but resolving tandem repeat variation accurately remains a challenging task. While traditional mapping-based approaches using short-read data have severe limitations in the size and type of tandem repeats they can resolve, recent third-generation sequencing technologies exhibit substantially higher sequencing error rates, which complicates repeat resolution.

RESULTS

We developed TRiCoLOR, a freely available tool for tandem repeat profiling using error-prone long reads from third-generation sequencing technologies. The method can identify repetitive regions in sequencing data without prior knowledge of their motifs or locations and can resolve repeat multiplicity and period size in a haplotype-specific manner. The tool also includes methods to interactively visualize the identified repeats and to trace their Mendelian consistency in pedigrees.
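
To make "period size" and "multiplicity" concrete, the toy Python sketch below scans a sequence for its longest perfect tandem repeat without being told the motif. It is not TRiCoLOR's algorithm: the abstract does not describe the implementation, and this exact-match scan ignores the sequencing errors that real long reads contain. The function name `find_tandem_repeat` and its parameters `max_period` and `min_copies` are hypothetical, chosen only for illustration.

```python
def find_tandem_repeat(seq, max_period=6, min_copies=3):
    """Return (start, motif, copies) for the longest perfect tandem repeat.

    Illustrative only: exact matching, no error tolerance, no haplotypes.
    Returns None when no motif is repeated at least `min_copies` times.
    """
    best = None  # (span_length, start, motif, copies)
    n = len(seq)
    # Try short periods first so ties resolve to the smallest period.
    for period in range(1, max_period + 1):
        for start in range(n - period + 1):
            motif = seq[start:start + period]
            copies = 1
            pos = start + period
            # Extend while the next window repeats the motif exactly.
            while seq[pos:pos + period] == motif:
                copies += 1
                pos += period
            span = copies * period
            if copies >= min_copies and (best is None or span > best[0]):
                best = (span, start, motif, copies)
    return None if best is None else best[1:]
```

For example, `find_tandem_repeat("ACGT" + "CAG" * 5 + "TTGA")` returns `(4, "CAG", 5)`: a motif with period size 3 and multiplicity 5, starting at offset 4.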

CONCLUSIONS

TRiCoLOR demonstrates excellent performance and improved sensitivity and specificity compared with alternative tools on synthetic data. For real human whole-genome sequencing data, TRiCoLOR achieves high validation rates, suggesting its suitability to identify tandem repeat variation in personal genomes.

Figure 1: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8202/7539535/eea1ec6ed351/giaa101fig1.jpg
Figure 2: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8202/7539535/623b0ff05618/giaa101fig2.jpg
