• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Clair3-trio:使用三对三深度神经网络在家庭三对体中进行高性能纳米孔长读变异调用。

Clair3-trio: high-performance Nanopore long-read variant calling in family trios with trio-to-trio deep neural networks.

机构信息

Department of Computer Science, The University of Hong Kong, Hong Kong, China.

出版信息

Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac301.

DOI:10.1093/bib/bbac301
PMID:35849103
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9487642/
Abstract

Accurate identification of genetic variants from family child-mother-father trio sequencing data is important in genomics. However, state-of-the-art approaches treat variant calling from trios as three independent tasks, which limits their calling accuracy for Nanopore long-read sequencing data. For better trio variant calling, we introduce Clair3-Trio, the first variant caller tailored for family trio data from Nanopore long-reads. Clair3-Trio employs a Trio-to-Trio deep neural network model, which allows it to input the trio sequencing information and output all of the trio's predicted variants within a single model to improve variant calling. We also present MCVLoss, a novel loss function tailor-made for variant calling in trios, leveraging the explicit encoding of the Mendelian inheritance. Clair3-Trio showed comprehensive improvement in experiments. It predicted far fewer Mendelian inheritance violation variations than current state-of-the-art methods. We also demonstrated that our Trio-to-Trio model is more accurate than competing architectures. Clair3-Trio is accessible as a free, open-source project at https://github.com/HKU-BAL/Clair3-Trio.

摘要

从家系(child-mother-father 三人组)测序数据中准确识别遗传变异对于基因组学很重要。然而,最先进的方法将三人组的变异调用视为三个独立的任务,这限制了它们对 Nanopore 长读测序数据的调用准确性。为了更好地进行三人组变异调用,我们引入了 Clair3-Trio,这是第一个专门针对 Nanopore 长读测序的家系三人组数据的变异调用程序。 Clair3-Trio 采用了 Trio-to-Trio 深度神经网络模型,允许它输入三人组测序信息,并在单个模型中输出三人组的所有预测变异,以提高变异调用的准确性。我们还提出了 MCVLoss,这是一种专门针对三人组变异调用的新型损失函数,利用 Mendelian 遗传的显式编码。Clair3-Trio 在实验中表现出了全面的改进。它预测的孟德尔遗传违反变异比当前最先进的方法要少得多。我们还证明了我们的 Trio-to-Trio 模型比竞争架构更准确。Clair3-Trio 可在 https://github.com/HKU-BAL/Clair3-Trio 上免费获取,是一个开源项目。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4baf/9487642/271dcce01bfe/bbac301f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4baf/9487642/283197fe472d/bbac301f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4baf/9487642/05ba34ef61be/bbac301f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4baf/9487642/dd65a60cc9ed/bbac301f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4baf/9487642/b5d20f0e06cf/bbac301f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4baf/9487642/271dcce01bfe/bbac301f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4baf/9487642/283197fe472d/bbac301f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4baf/9487642/05ba34ef61be/bbac301f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4baf/9487642/dd65a60cc9ed/bbac301f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4baf/9487642/b5d20f0e06cf/bbac301f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4baf/9487642/271dcce01bfe/bbac301f5.jpg

相似文献

1
Clair3-trio: high-performance Nanopore long-read variant calling in family trios with trio-to-trio deep neural networks.Clair3-trio:使用三对三深度神经网络在家庭三对体中进行高性能纳米孔长读变异调用。
Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac301.
2
Benchmarking reveals superiority of deep learning variant callers on bacterial nanopore sequence data.基准测试显示深度学习变异调用程序在细菌纳米孔测序数据上的优越性。
Elife. 2024 Oct 10;13:RP98300. doi: 10.7554/eLife.98300.
3
Boosting variant-calling performance with multi-platform sequencing data using Clair3-MP.使用 Clair3-MP 结合多平台测序数据提高变异calling 性能。
BMC Bioinformatics. 2023 Aug 3;24(1):308. doi: 10.1186/s12859-023-05434-6.
4
dv-trio: a family-based variant calling pipeline using DeepVariant.dv-trio:一种基于家系的使用 DeepVariant 的变异calling 流程。
Bioinformatics. 2020 Jun 1;36(11):3549-3551. doi: 10.1093/bioinformatics/btaa116.
5
A Comparison of Structural Variant Calling from Short-Read and Nanopore-Based Whole-Genome Sequencing Using Optical Genome Mapping as a Benchmark.基于光学基因组图谱作为基准的短读长和纳米孔全基因组测序的结构变异调用比较。
Genes (Basel). 2024 Jul 16;15(7):925. doi: 10.3390/genes15070925.
6
NanoSNP: a progressive and haplotype-aware SNP caller on low-coverage nanopore sequencing data.NanoSNP:一种针对低覆盖度纳米孔测序数据的渐进式、单体型感知 SNP 调用程序。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btac824.
7
Haplotype-aware variant calling with PEPPER-Margin-DeepVariant enables high accuracy in nanopore long-reads.使用 PEPPER-Margin-DeepVariant 进行单体型感知变异调用可实现纳米孔长读段的高精度。
Nat Methods. 2021 Nov;18(11):1322-1332. doi: 10.1038/s41592-021-01299-w. Epub 2021 Nov 1.
8
miniSNV: accurate and fast single nucleotide variant calling from nanopore sequencing data.miniSNV:从纳米孔测序数据中进行准确快速的单核苷酸变异calling。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae473.
9
Evaluation of the Available Variant Calling Tools for Oxford Nanopore Sequencing in Breast Cancer.评估适用于乳腺癌牛津纳米孔测序的变异调用工具。
Genes (Basel). 2022 Sep 3;13(9):1583. doi: 10.3390/genes13091583.
10
ECNano: A cost-effective workflow for target enrichment sequencing and accurate variant calling on 4800 clinically significant genes using a single MinION flowcell.ECNano:一种经济有效的工作流程,使用单个 MinION 流动池对 4800 个具有临床意义的基因进行靶向富集测序和准确的变异调用。
BMC Med Genomics. 2022 Mar 4;15(1):43. doi: 10.1186/s12920-022-01190-3.

引用本文的文献

1
Indel calling from ONT sequencing data of family trios via sparse attention and 3D convolution.通过稀疏注意力和3D卷积从家系三联体的ONT测序数据中进行插入缺失检测。
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf430.
2
Splice-modulating antisense oligonucleotides targeting a pathogenic intronic variant in adult polyglucosan body disease correct mis-splicing and restore enzyme activity in patient cells.靶向成人多糖体病致病性内含子变异体的剪接调节反义寡核苷酸可纠正患者细胞中的错误剪接并恢复酶活性。
Nucleic Acids Res. 2025 Jul 8;53(13). doi: 10.1093/nar/gkaf658.
3
Feasibility of long-read sequencing to identify molecular alterations in an Indonesian cohort of locally advanced to advanced nasopharyngeal cancer.

本文引用的文献

1
Symphonizing pileup and full-alignment for deep learning-based long-read variant calling.基于深度学习的长读变异调用的交响乐堆积和全对齐。
Nat Comput Sci. 2022 Dec;2(12):797-803. doi: 10.1038/s43588-022-00387-x. Epub 2022 Dec 19.
2
Benchmarking challenging small variants with linked and long reads.使用连锁读段和长读段对具有挑战性的小变异进行基准测试。
Cell Genom. 2022 May;2(5). doi: 10.1016/j.xgen.2022.100128.
3
Haplotype-aware variant calling with PEPPER-Margin-DeepVariant enables high accuracy in nanopore long-reads.使用 PEPPER-Margin-DeepVariant 进行单体型感知变异调用可实现纳米孔长读段的高精度。
长读长测序用于识别印度尼西亚局部晚期至晚期鼻咽癌队列分子改变的可行性
Sci Rep. 2025 Jul 1;15(1):21087. doi: 10.1038/s41598-025-06096-5.
4
Benchmarking Nanopore Sequencing for CLN2 (TPP1) Mutation Detection: Integrating Rapid Genomics and Orthogonal Validation for Precision Diagnostics.用于CLN2(TPP1)突变检测的纳米孔测序基准测试:整合快速基因组学和正交验证以实现精准诊断。
Int J Mol Sci. 2025 May 23;26(11):5037. doi: 10.3390/ijms26115037.
5
Overcoming limitations to customize DeepVariant for domesticated animals with TrioTrain.利用TrioTrain克服限制以定制适用于家养动物的DeepVariant。
Genome Res. 2025 Aug 1;35(8):1859-1874. doi: 10.1101/gr.279542.124.
6
Investigating the Performance of Oxford Nanopore Long-Read Sequencing with Respect to Illumina Microarrays and Short-Read Sequencing.研究牛津纳米孔长读长测序相对于Illumina微阵列和短读长测序的性能。
Int J Mol Sci. 2025 May 8;26(10):4492. doi: 10.3390/ijms26104492.
7
Development and validation of a carnitine cycle and transport disorders (CCD) panel: an ONT-compatible multi-gene diagnostic kit for newborn and selective screening.肉碱循环与转运障碍(CCD)检测板的开发与验证:一种适用于新生儿和选择性筛查的与纳米孔测序技术(ONT)兼容的多基因诊断试剂盒。
Orphanet J Rare Dis. 2025 May 26;20(1):250. doi: 10.1186/s13023-025-03775-4.
8
An Oxford Nanopore Technologies-Based Sequencing Assay for Molecular Diagnosis of Phenylketonuria and Variant Frequencies in a Turkish Cohort.一种基于牛津纳米孔技术的测序分析方法用于苯丙酮尿症的分子诊断及土耳其人群中的变异频率研究
Int J Genomics. 2025 Apr 25;2025:5552662. doi: 10.1155/ijog/5552662. eCollection 2025.
9
Bidirectional disruption of transcripts causes broad methylation defects in pseudohypoparathyroidism type 1B.转录本的双向破坏在1B型假甲状旁腺功能减退症中导致广泛的甲基化缺陷。
Proc Natl Acad Sci U S A. 2025 Apr 22;122(16):e2423271122. doi: 10.1073/pnas.2423271122. Epub 2025 Apr 18.
10
Long-read whole-genome sequencing-based concurrent haplotyping and aneuploidy profiling of single cells.基于长读长全基因组测序的单细胞并发单倍型分型和非整倍体分析
Nucleic Acids Res. 2025 Mar 20;53(6). doi: 10.1093/nar/gkaf247.
Nat Methods. 2021 Nov;18(11):1322-1332. doi: 10.1038/s41592-021-01299-w. Epub 2021 Nov 1.
4
NanoCaller for accurate detection of SNPs and indels in difficult-to-map regions from long-read sequencing by haplotype-aware deep neural networks.利用基于单倍型感知的深度神经网络,从长读测序中对难以映射区域中的 SNPs 和 indels 进行精确检测的 NanoCaller。
Genome Biol. 2021 Sep 6;22(1):261. doi: 10.1186/s13059-021-02472-2.
5
Long-read genome sequencing for the molecular diagnosis of neurodevelopmental disorders.用于神经发育障碍分子诊断的长读长基因组测序
HGG Adv. 2021 Apr 8;2(2). doi: 10.1016/j.xhgg.2021.100023. Epub 2021 Jan 16.
6
Twelve years of SAMtools and BCFtools.SAMtools 和 BCFtools 十二年。
Gigascience. 2021 Feb 16;10(2). doi: 10.1093/gigascience/giab008.
7
Best practices for variant calling in clinical sequencing.临床测序中变异调用的最佳实践。
Genome Med. 2020 Oct 26;12(1):91. doi: 10.1186/s13073-020-00791-w.
8
Nanopore sequencing and the Shasta toolkit enable efficient de novo assembly of eleven human genomes.纳米孔测序和 Shasta 工具包可实现 11 个人类基因组的高效从头组装。
Nat Biotechnol. 2020 Sep;38(9):1044-1053. doi: 10.1038/s41587-020-0503-6. Epub 2020 May 4.
9
dv-trio: a family-based variant calling pipeline using DeepVariant.dv-trio:一种基于家系的使用 DeepVariant 的变异calling 流程。
Bioinformatics. 2020 Jun 1;36(11):3549-3551. doi: 10.1093/bioinformatics/btaa116.
10
Longshot enables accurate variant calling in diploid genomes from single-molecule long read sequencing.Longshot 可通过单分子长读测序对二倍体基因组进行准确的变异调用。
Nat Commun. 2019 Oct 11;10(1):4660. doi: 10.1038/s41467-019-12493-y.