• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过稀疏注意力和3D卷积从家系三联体的ONT测序数据中进行插入缺失检测。

Indel calling from ONT sequencing data of family trios via sparse attention and 3D convolution.

作者信息

Shi Ying, Wu Chenxu, Luo Shifu, Zhang Songming, Wang Wenjian, Li Jinyan

机构信息

School of Computer and Information Technology, Shanxi University, Taiyuan, 030006, Shanxi Province, China.

Faculty of Computer Science and Control Engineering, Shenzhen University of Advanced Technology, Shenzhen, 518000, Guangdong, China.

出版信息

Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf430.

DOI:10.1093/bib/bbaf430
PMID:40828510
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12363238/
Abstract

Accurate calling of parental-child SNPs and Indels in family trios is very helpful for understanding genetic traits and diseases. Indel calling is even more important than SNP calling, as Indels may have led to substantial changes in protein structures that affect more of the traits of the organism. However, the best Indel calling methods have recall rates below 85%, precision below 92%, and F1 below 88% on $60\times $ ONT Q20 data, much lower than their SNP calling's recall performance of 99.87%, precision of 99.86%, and F1 of 99.86%. Difficulties in Indels calling include how to distinguish sequencing errors from genuine Indels and how to optimize the Mendelian genetic model. This work proposes sparse attention learning for high-performance calling of Indels from family-trios' ONT long-read sequencing data, while still maintaining exceptional performance on SNP calling. Key steps include a sparsely connected attention network to convert fully aligned data cubes into essential features, and a deep learning on these features via ResNet and 3D convolutional blocks to enable accurate detection of family-trio variants. This attention network is in fact a dual attention network to aggregate both channel and spatial information, capable of selecting sub-cubes of critical channels and base locations that are resistant to the confounding effects of sequencing errors. Comparing with the current best-performing trio-variant detection method, our F1 is 5.6%-14.19% higher, recall is 7.07%-18.67% higher, and precision is 3.85%-7.87% higher on ONT Q20 datasets. Case studies of indel-dense regions in chromosome 20, including the centromere and disease-associated genes, demonstrate the significant impact of indel variations on disease pathogenesis, providing novel perspectives for future personalized and targeted therapies.

摘要

准确识别家系三联体中的亲子单核苷酸多态性(SNP)和插入缺失(Indel)对于理解遗传特征和疾病非常有帮助。Indel识别比SNP识别更为重要,因为Indel可能导致蛋白质结构发生重大变化,从而影响生物体的更多性状。然而,在60×ONT Q20数据上,最佳的Indel识别方法召回率低于85%,精确率低于92%,F1值低于88%,远低于其SNP识别的召回性能(99.87%)、精确率(99.86%)和F1值(99.86%)。Indel识别的困难包括如何区分测序错误和真正的Indel,以及如何优化孟德尔遗传模型。这项工作提出了稀疏注意力学习方法,用于从家系三联体的ONT长读长测序数据中进行高性能的Indel识别,同时在SNP识别方面仍保持优异性能。关键步骤包括一个稀疏连接的注意力网络,将完全对齐的数据块转换为基本特征,并通过残差网络(ResNet)和3D卷积块对这些特征进行深度学习,以实现对家系三联体变异的准确检测。这个注意力网络实际上是一个双注意力网络,用于聚合通道和空间信息,能够选择关键通道和碱基位置的子块,以抵抗测序错误的混杂影响。与当前性能最佳的三联体变异检测方法相比,在ONT Q20数据集上,我们的F1值高5.6%-14.19%,召回率高7.07%-18.67%,精确率高3.85%-7.87%。对20号染色体上Indel密集区域(包括着丝粒和疾病相关基因)的案例研究表明,Indel变异对疾病发病机制有重大影响,为未来的个性化和靶向治疗提供了新的视角。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b571/12363238/d613097abb7e/bbaf430f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b571/12363238/91ea1992fe77/bbaf430f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b571/12363238/799fdb3661b3/bbaf430f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b571/12363238/5f9cf6e9a7cd/bbaf430f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b571/12363238/62b744311c70/bbaf430f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b571/12363238/f6b49bf1d7a0/bbaf430f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b571/12363238/d613097abb7e/bbaf430f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b571/12363238/91ea1992fe77/bbaf430f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b571/12363238/799fdb3661b3/bbaf430f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b571/12363238/5f9cf6e9a7cd/bbaf430f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b571/12363238/62b744311c70/bbaf430f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b571/12363238/f6b49bf1d7a0/bbaf430f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b571/12363238/d613097abb7e/bbaf430f6.jpg

相似文献

1
Indel calling from ONT sequencing data of family trios via sparse attention and 3D convolution.通过稀疏注意力和3D卷积从家系三联体的ONT测序数据中进行插入缺失检测。
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf430.
2
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
3
Short-Term Memory Impairment短期记忆障碍
4
Sexual Harassment and Prevention Training性骚扰与预防培训
5
The impact of bioinformatic choices on variant identification accuracy.生物信息学选择对变异识别准确性的影响。
Microbiol Spectr. 2025 Aug 15:e0123225. doi: 10.1128/spectrum.01232-25.
6
Anterior Approach Total Ankle Arthroplasty with Patient-Specific Cut Guides.使用患者特异性截骨导向器的前路全踝关节置换术。
JBJS Essent Surg Tech. 2025 Aug 15;15(3). doi: 10.2106/JBJS.ST.23.00027. eCollection 2025 Jul-Sep.
7
The Black Book of Psychotropic Dosing and Monitoring.《精神药物剂量与监测黑皮书》
Psychopharmacol Bull. 2024 Jul 8;54(3):8-59.
8
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗:一项网状荟萃分析。
Cochrane Database Syst Rev. 2017 Dec 22;12(12):CD011535. doi: 10.1002/14651858.CD011535.pub2.
9
Technological aids for the rehabilitation of memory and executive functioning in children and adolescents with acquired brain injury.脑损伤儿童和青少年记忆与执行功能康复的技术辅助手段。
Cochrane Database Syst Rev. 2016 Jul 1;7(7):CD011020. doi: 10.1002/14651858.CD011020.pub2.
10
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.系统性药理学治疗慢性斑块状银屑病:网络荟萃分析。
Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.

本文引用的文献

1
Repun: an accurate small variant representation unification method for multiple sequencing platforms.Repun:一种用于多种测序平台的准确小型变异表示统一方法。
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbae613.
2
miniSNV: accurate and fast single nucleotide variant calling from nanopore sequencing data.miniSNV:从纳米孔测序数据中进行准确快速的单核苷酸变异calling。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae473.
3
SVDF: enhancing structural variation detect from long-read sequencing via automatic filtering strategies.
SVDF:通过自动过滤策略增强长读测序中的结构变异检测
Brief Bioinform. 2024 May 23;25(4). doi: 10.1093/bib/bbae336.
4
Relationship between sex biases in gene expression and sex biases in autism and Alzheimer's disease.基因表达中的性别偏见与自闭症和老年痴呆症中的性别偏见之间的关系。
Biol Sex Differ. 2024 Jun 7;15(1):47. doi: 10.1186/s13293-024-00622-2.
5
Symphonizing pileup and full-alignment for deep learning-based long-read variant calling.基于深度学习的长读变异调用的交响乐堆积和全对齐。
Nat Comput Sci. 2022 Dec;2(12):797-803. doi: 10.1038/s43588-022-00387-x. Epub 2022 Dec 19.
6
Indigenous Australian genomes show deep structure and rich novel variation.澳大利亚原住民基因组显示出深度结构和丰富的新变异。
Nature. 2023 Dec;624(7992):593-601. doi: 10.1038/s41586-023-06831-w. Epub 2023 Dec 13.
7
The landscape of genomic structural variation in Indigenous Australians.澳大利亚原住民的基因组结构变异景观。
Nature. 2023 Dec;624(7992):602-610. doi: 10.1038/s41586-023-06842-7. Epub 2023 Dec 13.
8
PARP14 is a PARP with both ADP-ribosyl transferase and hydrolase activities.PARP14 是一种具有 ADP-ribosyl 转移酶和水解酶活性的 PARP。
Sci Adv. 2023 Sep 15;9(37):eadi2687. doi: 10.1126/sciadv.adi2687. Epub 2023 Sep 13.
9
Identification of a novel frameshift mutation in the SCNN1B causing Liddle syndrome.在SCNN1B中鉴定出导致利德尔综合征的一种新型移码突变。
Sci Bull (Beijing). 2023 Feb 26;68(4):383-387. doi: 10.1016/j.scib.2023.02.006. Epub 2023 Feb 7.
10
Benchmarking challenging small variants with linked and long reads.使用连锁读段和长读段对具有挑战性的小变异进行基准测试。
Cell Genom. 2022 May;2(5). doi: 10.1016/j.xgen.2022.100128.