• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

哈比:一种用于处理单倍型标记连接读段数据的流程。

Harpy: a pipeline for processing haplotagging linked-read data.

作者信息

Dimens Pavel V, Franckowiak Ryan P, Iqbal Azwad, Grenier Jennifer K, Munn Paul R, Therkildsen Nina Overgaard

机构信息

Department of Natural Resources and the Environment, Cornell University, Ithaca, NY 14853, United States.

Genomics Innovation Hub, Biotechnology Resource Center, Cornell University, Ithaca, NY 14853, United States.

出版信息

Bioinform Adv. 2025 Jun 5;5(1):vbaf133. doi: 10.1093/bioadv/vbaf133. eCollection 2025.

DOI:10.1093/bioadv/vbaf133
PMID:40575478
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12198493/
Abstract

MOTIVATION

Haplotagging is a method for linked-read sequencing, which leverages the cost-effectiveness and throughput of short-read sequencing while retaining part of the long-range haplotype information captured by long-read sequencing. Despite its utility and advantages over similar methods, existing linked-read analytical pipelines are incompatible with haplotagging data.

RESULTS

We describe Harpy, a modular and user-friendly software pipeline for processing all stages of haplotagged linked-read data, from raw sequence data to phased genotypes and structural variant detection.

AVAILABILITY AND IMPLEMENTATION

https://github.com/pdimens/harpy.

摘要

动机

单倍型标记是一种用于连接读取测序的方法,它利用了短读长测序的成本效益和通量,同时保留了长读长测序所捕获的部分长程单倍型信息。尽管它具有实用性且优于类似方法,但现有的连接读取分析流程与单倍型标记数据不兼容。

结果

我们描述了Harpy,这是一个模块化且用户友好的软件流程,用于处理单倍型标记连接读取数据的各个阶段,从原始序列数据到定相基因型和结构变异检测。

可用性和实现方式

https://github.com/pdimens/harpy 。

相似文献

1
Harpy: a pipeline for processing haplotagging linked-read data.哈比:一种用于处理单倍型标记连接读段数据的流程。
Bioinform Adv. 2025 Jun 5;5(1):vbaf133. doi: 10.1093/bioadv/vbaf133. eCollection 2025.
2
SAKit: An all-in-one analysis pipeline for identifying novel proteins resulting from variant events at both large and small scales.SAKit:一种用于鉴定由大尺度和小尺度变异事件产生的新型蛋白质的一体化分析管道。
J Bioinform Comput Biol. 2024 Oct;22(5):2450022. doi: 10.1142/S0219720024500227. Epub 2024 Oct 1.
3
Enhanced identification of novel pathogenic variants in hereditary hearing loss through physical phasing with integrated short and long-read sequencing data.通过整合短读长和长读长测序数据进行物理定相,增强遗传性听力损失中新的致病变异的识别。
Mol Genet Genomics. 2025 Jun 23;300(1):61. doi: 10.1007/s00438-025-02256-4.
4
Antidepressants for pain management in adults with chronic pain: a network meta-analysis.抗抑郁药治疗成人慢性疼痛的疼痛管理:一项网络荟萃分析。
Health Technol Assess. 2024 Oct;28(62):1-155. doi: 10.3310/MKRT2948.
5
Polaris: Polarization of ancestral and derived polymorphic alleles for inferences of extended haplotype homozygosity in human populations.北极星:通过祖先和衍生多态性等位基因的极化来推断人类群体中扩展单倍型纯合性。
Bioinformatics. 2025 Jun 2;41(6). doi: 10.1093/bioinformatics/btaf171.
6
BonoboFlow: viral genome assembly and haplotype reconstruction from nanopore reads.倭黑猩猩流程:基于纳米孔测序 reads 的病毒基因组组装与单倍型重建
Bioinform Adv. 2025 May 13;5(1):vbaf115. doi: 10.1093/bioadv/vbaf115. eCollection 2025.
7
SetBERT: the deep learning platform for contextualized embeddings and explainable predictions from high-throughput sequencing.SetBERT:用于从高通量测序中进行上下文嵌入和可解释预测的深度学习平台。
Bioinformatics. 2025 Jul 1;41(7). doi: 10.1093/bioinformatics/btaf370.
8
Diagnostic test accuracy and cost-effectiveness of tests for codeletion of chromosomal arms 1p and 19q in people with glioma.染色体臂 1p 和 19q 缺失的检测在胶质瘤患者中的诊断准确性和成本效益。
Cochrane Database Syst Rev. 2022 Mar 2;3(3):CD013387. doi: 10.1002/14651858.CD013387.pub2.
9
Factors that impact on the use of mechanical ventilation weaning protocols in critically ill adults and children: a qualitative evidence-synthesis.影响重症成人和儿童机械通气撤机方案使用的因素:一项定性证据综合分析
Cochrane Database Syst Rev. 2016 Oct 4;10(10):CD011812. doi: 10.1002/14651858.CD011812.pub2.
10
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.

本文引用的文献

1
A survey of algorithms for the detection of genomic structural variants from long-read sequencing data.长读测序数据中基因组结构变异检测算法研究综述。
Nat Methods. 2023 Aug;20(8):1143-1158. doi: 10.1038/s41592-023-01932-w. Epub 2023 Jun 29.
2
: fast and scalable deconvolution of linked-read sequencing data.:对连接读长测序数据进行快速且可扩展的反卷积分析
Bioinform Adv. 2022 Sep 26;2(1):vbac068. doi: 10.1093/bioadv/vbac068. eCollection 2022.
3
Strobealign: flexible seed size enables ultra-fast and accurate read alignment.Strobealign:灵活的种子大小可实现超快速和准确的读取对齐。
Genome Biol. 2022 Dec 15;23(1):260. doi: 10.1186/s13059-022-02831-7.
4
Read-Based Phasing and Analysis of Phased Variants with WhatsHap.基于读段的相位分析和 WhatsHap 分析相变异构体。
Methods Mol Biol. 2023;2590:127-138. doi: 10.1007/978-1-0716-2819-5_8.
5
Haplotype tagging reveals parallel formation of hybrid races in two butterfly species.单体型标记揭示了两种蝴蝶物种中杂种种族的平行形成。
Proc Natl Acad Sci U S A. 2021 Jun 22;118(25). doi: 10.1073/pnas.2015005118.
6
Twelve years of SAMtools and BCFtools.SAMtools 和 BCFtools 十二年。
Gigascience. 2021 Feb 16;10(2). doi: 10.1093/gigascience/giab008.
7
Haplotype threading: accurate polyploid phasing from long reads.单体型连接:长读长准确进行多倍体相位分析。
Genome Biol. 2020 Sep 21;21(1):252. doi: 10.1186/s13059-020-02158-1.
8
Ultralow-input single-tube linked-read library method enables short-read second-generation sequencing systems to routinely generate highly accurate and economical long-range sequencing information.超微量单管连接读取文库方法使短读长第二代测序系统能够常规地生成高度准确和经济的长程测序信息。
Genome Res. 2020 Jun;30(6):898-909. doi: 10.1101/gr.260380.119. Epub 2020 Jun 15.
9
Efficient and unique cobarcoding of second-generation sequencing reads from long DNA molecules enabling cost-effective and accurate sequencing, haplotyping, and de novo assembly.高效且独特的第二代测序读长 DNA 分子 cobarcoding,实现经济高效、准确的测序、单倍型分析和从头组装。
Genome Res. 2019 May;29(5):798-808. doi: 10.1101/gr.245126.118. Epub 2019 Apr 2.
10
fastp: an ultra-fast all-in-one FASTQ preprocessor.fastp:一个超快速的一体化 FASTQ 预处理程序。
Bioinformatics. 2018 Sep 1;34(17):i884-i890. doi: 10.1093/bioinformatics/bty560.