• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于泛基因组图的稀疏索引的 DNA 序列比对方法。

DNA sequences alignment method using sparse index on pan-genome graph.

机构信息

School of Computer Science, University of Science and Technology of China, Heifei, Anhui 230027, P. R. China.

Key Laboratory on High Performance Computing, Anhui Province, P. R. China.

出版信息

J Bioinform Comput Biol. 2024 Aug;22(4):2450019. doi: 10.1142/S0219720024500197. Epub 2024 Aug 31.

DOI:10.1142/S0219720024500197
PMID:39215522
Abstract

The graph of sequences represents the genetic variations of pan-genome concisely and space-efficiently than multiple linear reference genome. In order to accelerate aligning reads to the graph, an index of graph-based reference genomes is used to obtain candidate locations. However, the potential combinatorial explosion of nodes on the sequence graph leads to increasing the index space and maximum memory usage of alignment process considerably, especially for large-scale datasets. For this, existing methods typically attempt to prune complex regions, or extend the length of seeds, which sacrifices the recall of alignment algorithm despite reducing space usage slightly. We present the and alignment algorithm , capable of indexing and aligning at the lower memory cost. SIG builds the non-overlapping minimizers index inside nodes of sequence graph and SIG-Aligner filters out most of the false positive matches by the method based on the pigeonhole principle. Compared to Giraffe, the results of computational experiments show that SIG achieves a significant reduction in index memory space ranging from 50% to 75% for the human pan-genome graphs, while still preserving superior or comparable accuracy of alignment and the faster alignment time.

摘要

序列图比多个线性参考基因组更简洁、更有效地表示泛基因组的遗传变异。为了加速将读取序列与图谱对齐,使用基于图谱的参考基因组索引来获取候选位置。然而,序列图谱上节点的潜在组合爆炸会导致索引空间和对齐过程的最大内存使用量大大增加,尤其是对于大规模数据集。为此,现有方法通常试图修剪复杂区域,或延长种子的长度,这会牺牲对齐算法的召回率,尽管略微减少了空间使用。我们提出了 和 算法,能够以较低的内存成本进行索引和对齐。SIG 在序列图的节点内构建非重叠最小化索引,SIG-Aligner 通过基于鸽笼原理的方法过滤掉大多数假阳性匹配。与 Giraffe 相比,计算实验的结果表明,SIG 实现了指数级的索引内存空间显著减少,范围从人类泛基因组图谱的 50%到 75%,同时仍然保持了优越或相当的对齐准确性和更快的对齐时间。

相似文献

1
DNA sequences alignment method using sparse index on pan-genome graph.基于泛基因组图的稀疏索引的 DNA 序列比对方法。
J Bioinform Comput Biol. 2024 Aug;22(4):2450019. doi: 10.1142/S0219720024500197. Epub 2024 Aug 31.
2
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
3
Doppler trans-thoracic echocardiography for detection of pulmonary hypertension in adults.经胸多普勒超声心动图用于检测成人肺动脉高压。
Cochrane Database Syst Rev. 2022 May 9;5(5):CD012809. doi: 10.1002/14651858.CD012809.pub2.
4
Coding genomes with gapped pattern graph convolutional network.使用带间隙模式图卷积网络对基因组进行编码。
Bioinformatics. 2024 Mar 29;40(4). doi: 10.1093/bioinformatics/btae188.
5
Assessing the comparative effects of interventions in COPD: a tutorial on network meta-analysis for clinicians.评估慢性阻塞性肺疾病干预措施的比较效果:面向临床医生的网状Meta分析教程
Respir Res. 2024 Dec 21;25(1):438. doi: 10.1186/s12931-024-03056-x.
6
Diagnostic test accuracy and cost-effectiveness of tests for codeletion of chromosomal arms 1p and 19q in people with glioma.染色体臂 1p 和 19q 缺失的检测在胶质瘤患者中的诊断准确性和成本效益。
Cochrane Database Syst Rev. 2022 Mar 2;3(3):CD013387. doi: 10.1002/14651858.CD013387.pub2.
7
Magnetic resonance perfusion for differentiating low-grade from high-grade gliomas at first presentation.首次就诊时磁共振灌注成像用于鉴别低级别与高级别胶质瘤
Cochrane Database Syst Rev. 2018 Jan 22;1(1):CD011551. doi: 10.1002/14651858.CD011551.pub2.
8
A rapid and systematic review of the clinical effectiveness and cost-effectiveness of paclitaxel, docetaxel, gemcitabine and vinorelbine in non-small-cell lung cancer.对紫杉醇、多西他赛、吉西他滨和长春瑞滨在非小细胞肺癌中的临床疗效和成本效益进行的快速系统评价。
Health Technol Assess. 2001;5(32):1-195. doi: 10.3310/hta5320.
9
Non-invasive diagnostic tests for Helicobacter pylori infection.幽门螺杆菌感染的非侵入性诊断测试。
Cochrane Database Syst Rev. 2018 Mar 15;3(3):CD012080. doi: 10.1002/14651858.CD012080.pub2.
10
Nivolumab for adults with Hodgkin's lymphoma (a rapid review using the software RobotReviewer).纳武单抗用于成人霍奇金淋巴瘤(使用RobotReviewer软件进行的快速综述)
Cochrane Database Syst Rev. 2018 Jul 12;7(7):CD012556. doi: 10.1002/14651858.CD012556.pub2.