• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

HPTAS:一种无比对的单倍型分型算法,专注于利用转录组数据进行等位基因特异性研究。

HPTAS: An Alignment-Free Haplotype Phasing Algorithm Focused on Allele-Specific Studies Using Transcriptome Data.

作者信息

Wang Jianan, Sun Zhenyuan, Wang Guohua, Miao Yan

机构信息

Faculty of Computing, Harbin Institute of Technology, No. 92 Xidazhi Street, Nangang District, Harbin 150001, China.

College of Computer and Control Engineering, Northeast Forestry University, No. 26 He Xing Road, Xiangfang District, Harbin 150040, China.

出版信息

Int J Mol Sci. 2025 Jun 13;26(12):5700. doi: 10.3390/ijms26125700.

DOI:10.3390/ijms26125700
PMID:40565162
Abstract

Haplotype phasing refers to determining the haplotype sequences inherited from each parent in a diploid organism. It is a critical process for various downstream analyses, and numerous haplotype phasing methods for genomic single nucleotide polymorphisms (SNPs) have been developed. Allele-specific (AS) expression and alternative splicing play key roles in diverse biological processes. AS studies usually focus more on exonic SNPs, and multiple phased SNPs need to be combined to obtain better inferences. In this paper, we introduce an alignment-free algorithm HPTAS for haplotype phasing in AS studies. Instead of using sequence alignment to count the number of reads covering SNPs, HPTAS constructs a mapping structure from transcriptome annotations and SNPs and employs a k-mer-based approach to derive phasing counts from RNA-seq data. Using both next-generation sequencing (NGS) and the third-generation sequencing (TGS) NA12878 RNA-seq data and comparing with the most advanced algorithm in the field, we have demonstrated that HPTAS achieves high phasing accuracy and performance and that transcriptome data indeed facilitates the phasing of exonic SNPs. With the continued advancement of sequencing technology and the improvement in transcriptome annotations, HPTAS may serve as a foundation for future haplotype phasing methods.

摘要

单倍型定相是指确定二倍体生物中从每个亲本遗传而来的单倍型序列。它是各种下游分析的关键过程,并且已经开发出了许多用于基因组单核苷酸多态性(SNP)的单倍型定相方法。等位基因特异性(AS)表达和可变剪接在多种生物学过程中起关键作用。AS研究通常更关注外显子SNP,需要组合多个定相的SNP以获得更好的推断。在本文中,我们介绍了一种用于AS研究中单倍型定相的无比对算法HPTAS。HPTAS不是使用序列比对来计算覆盖SNP的 reads 数量,而是从转录组注释和SNP构建一个映射结构,并采用基于k-mer的方法从RNA-seq数据中得出定相计数。使用下一代测序(NGS)和第三代测序(TGS)的NA12878 RNA-seq数据,并与该领域最先进的算法进行比较,我们证明了HPTAS实现了高定相准确性和性能,并且转录组数据确实有助于外显子SNP的定相。随着测序技术的不断进步和转录组注释的改进,HPTAS可能成为未来单倍型定相方法的基础。

相似文献

1
HPTAS: An Alignment-Free Haplotype Phasing Algorithm Focused on Allele-Specific Studies Using Transcriptome Data.HPTAS:一种无比对的单倍型分型算法,专注于利用转录组数据进行等位基因特异性研究。
Int J Mol Sci. 2025 Jun 13;26(12):5700. doi: 10.3390/ijms26125700.
2
Diagnostic test accuracy and cost-effectiveness of tests for codeletion of chromosomal arms 1p and 19q in people with glioma.染色体臂 1p 和 19q 缺失的检测在胶质瘤患者中的诊断准确性和成本效益。
Cochrane Database Syst Rev. 2022 Mar 2;3(3):CD013387. doi: 10.1002/14651858.CD013387.pub2.
3
Specific correlation between the major chromosome 10q26 haplotype conferring risk for age-related macular degeneration and the expression of .赋予年龄相关性黄斑变性风险的主要染色体10q26单倍型与……的表达之间的特定相关性。 (注:原文中“the expression of”后面缺少具体内容)
Mol Vis. 2017 Jun 14;23:318-333. eCollection 2017.
4
Assessing the comparative effects of interventions in COPD: a tutorial on network meta-analysis for clinicians.评估慢性阻塞性肺疾病干预措施的比较效果:面向临床医生的网状Meta分析教程
Respir Res. 2024 Dec 21;25(1):438. doi: 10.1186/s12931-024-03056-x.
5
Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.利用预后信息为乳腺癌患者选择辅助性全身治疗的成本效益
Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.
6
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.系统性药理学治疗慢性斑块状银屑病:网络荟萃分析。
Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.
7
Antidepressants for pain management in adults with chronic pain: a network meta-analysis.抗抑郁药治疗成人慢性疼痛的疼痛管理:一项网络荟萃分析。
Health Technol Assess. 2024 Oct;28(62):1-155. doi: 10.3310/MKRT2948.
8
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗:一项网状Meta分析。
Cochrane Database Syst Rev. 2020 Jan 9;1(1):CD011535. doi: 10.1002/14651858.CD011535.pub3.
9
Interventions to reduce harm from continued tobacco use.减少持续吸烟危害的干预措施。
Cochrane Database Syst Rev. 2016 Oct 13;10(10):CD005231. doi: 10.1002/14651858.CD005231.pub3.
10
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗:一项网状荟萃分析。
Cochrane Database Syst Rev. 2017 Dec 22;12(12):CD011535. doi: 10.1002/14651858.CD011535.pub2.

本文引用的文献

1
ASAS-EGB: A statistical framework for estimating allele-specific alternative splicing events using transcriptome data.ASAS-EGB:一种基于转录组数据估计等位基因特异性剪接事件的统计框架。
Comput Biol Med. 2023 Jun;160:106981. doi: 10.1016/j.compbiomed.2023.106981. Epub 2023 Apr 28.
2
Stan: A Probabilistic Programming Language.斯坦:一种概率编程语言。
J Stat Softw. 2017;76. doi: 10.18637/jss.v076.i01. Epub 2017 Jan 11.
3
Kmer2SNP: Reference-Free Heterozygous SNP Calling Using k-mer Frequency Distributions.Kmer2SNP:基于 k-mer 频率分布的无参考杂合 SNP 调用。
Methods Mol Biol. 2022;2493:257-265. doi: 10.1007/978-1-0716-2293-3_16.
4
Towards accurate, contiguous and complete alignment-based polyploid phasing algorithms.迈向基于比对的准确、连续且完整的多倍体定相算法。
Genomics. 2022 May;114(3):110369. doi: 10.1016/j.ygeno.2022.110369. Epub 2022 Apr 26.
5
Extended haplotype-phasing of long-read de novo genome assemblies using Hi-C.利用 Hi-C 对长读从头基因组组装进行扩展单倍型相位分析。
Nat Commun. 2021 Apr 28;12(1):1935. doi: 10.1038/s41467-020-20536-y.
6
Haplotype threading: accurate polyploid phasing from long reads.单体型连接:长读长准确进行多倍体相位分析。
Genome Biol. 2020 Sep 21;21(1):252. doi: 10.1186/s13059-020-02158-1.
7
Improved haplotype inference by exploiting long-range linking and allelic imbalance in RNA-seq datasets.通过利用RNA测序数据集的长程连接和等位基因不平衡改进单倍型推断
Nat Commun. 2020 Sep 16;11(1):4662. doi: 10.1038/s41467-020-18320-z.
8
DCHap: A Divide-and-Conquer Haplotype Phasing Algorithm for Third-Generation Sequences.DCHap:一种用于第三代测序的分治单倍型相位算法。
IEEE/ACM Trans Comput Biol Bioinform. 2022 May-Jun;19(3):1277-1284. doi: 10.1109/TCBB.2020.3005673. Epub 2022 Jun 3.
9
BYASE: a Python library for estimating gene and isoform level allele-specific expression.BYASE:一个用于估计基因和异构体水平等位基因特异性表达的 Python 库。
Bioinformatics. 2020 Dec 8;36(19):4955-4956. doi: 10.1093/bioinformatics/btaa636.
10
Exploring effective approaches for haplotype block phasing.探索单倍型块相位的有效方法。
BMC Bioinformatics. 2019 Oct 30;20(1):540. doi: 10.1186/s12859-019-3095-8.