• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

HapIso:一种从长单分子读数中准确重建单倍型特异性异构体的方法。

HapIso: An Accurate Method for the Haplotype- Specific Isoforms Reconstruction From Long Single-Molecule Reads.

作者信息

Mangul Serghei, Yang Taegyun Harry, Hormozdiari Farhad, Dainis Alexandra Marie, Tseng Elizabeth, Ashley Euan A, Zelikovsky Alex, Eskin Eleazar

出版信息

IEEE Trans Nanobioscience. 2017 Mar;16(2):108-115. doi: 10.1109/TNB.2017.2675981. Epub 2017 Mar 17.

DOI:10.1109/TNB.2017.2675981
PMID:28328508
Abstract

Sequencing of RNA provides the possibility to study an individual's transcriptome landscape and determine allelic expression ratios. Single-molecule protocols generate multi-kilobase reads longer than most transcripts, allowing sequencing of complete haplotype isoforms. This allows partitioning the reads into two parental haplotypes. While the read length of the single-molecule protocols is long, the relatively high error rate limits the ability to accurately detect the genetic variants and assemble them into the haplotype-specific isoforms. In this paper, we present Haplotype-specific Isoform reconstruction (HapIso), a method able to tolerate the relatively high error rate of the single-molecule platform and partition the isoform reads into the parental alleles. Phasing the reads according to the allele of origin allows our method to efficiently distinguish between the read errors and the true biological mutations. HapIso uses a k -means clustering algorithm aiming to group the reads into two meaningful clusters maximizing the similarity of the reads within the cluster and minimizing the similarity of the reads from different clusters. Each cluster corresponds to a parental haplotype. We used the family pedigree information to evaluate our approach. Experimental validation suggests that HapIso is able to tolerate the relatively high error rate and accurately partition the reads into the parental alleles of the isoform transcripts. We also applied HapIso to novel clinical single-molecule RNA-Seq data to estimate allele-specific expression of genes of interest. Our method was able to correct reads and determine Glu1883Lys point mutation of clinical significance validated by GeneDx HCM panel. Furthermore, our method is the first method able to reconstruct the haplotype-specific isoforms from long single-molecule reads.

摘要

RNA测序为研究个体转录组图谱和确定等位基因表达比率提供了可能。单分子技术方案能生成比大多数转录本更长的数千碱基长读段,从而实现完整单倍型异构体的测序。这使得可以将读段划分为两个亲本单倍型。虽然单分子技术方案的读段长度很长,但相对较高的错误率限制了准确检测遗传变异并将其组装成单倍型特异性异构体的能力。在本文中,我们提出了单倍型特异性异构体重建方法(HapIso),该方法能够容忍单分子平台相对较高的错误率,并将异构体读段划分为亲本等位基因。根据起源等位基因对读段进行定相,使我们的方法能够有效区分读段错误和真正的生物学突变。HapIso使用k均值聚类算法,旨在将读段分组为两个有意义的簇,使簇内读段的相似性最大化,不同簇间读段的相似性最小化。每个簇对应一个亲本单倍型。我们利用家系信息评估我们的方法。实验验证表明,HapIso能够容忍相对较高的错误率,并准确地将读段划分为异构体转录本的亲本等位基因。我们还将HapIso应用于新的临床单分子RNA测序数据,以估计感兴趣基因的等位基因特异性表达。我们的方法能够校正读段,并确定经GeneDx HCM检测板验证具有临床意义的Glu1883Lys点突变。此外,我们的方法是第一种能够从长单分子读段重建单倍型特异性异构体的方法。

相似文献

1
HapIso: An Accurate Method for the Haplotype- Specific Isoforms Reconstruction From Long Single-Molecule Reads.HapIso:一种从长单分子读数中准确重建单倍型特异性异构体的方法。
IEEE Trans Nanobioscience. 2017 Mar;16(2):108-115. doi: 10.1109/TNB.2017.2675981. Epub 2017 Mar 17.
2
Haplotype and isoform specific expression estimation using multi-mapping RNA-seq reads.利用多映射 RNA-seq reads 进行单倍型和异构体特异性表达估计。
Genome Biol. 2011;12(2):R13. doi: 10.1186/gb-2011-12-2-r13. Epub 2011 Feb 10.
3
Long Single-Molecule Reads Can Resolve the Complexity of the Influenza Virus Composed of Rare, Closely Related Mutant Variants.长单分子读数能够解析由罕见、密切相关的突变变体组成的流感病毒的复杂性。
J Comput Biol. 2017 Jun;24(6):558-570. doi: 10.1089/cmb.2016.0146. Epub 2016 Nov 30.
4
GenHap: a novel computational method based on genetic algorithms for haplotype assembly.GenHap:一种基于遗传算法的新型单倍型组装计算方法。
BMC Bioinformatics. 2019 Apr 18;20(Suppl 4):172. doi: 10.1186/s12859-019-2691-y.
5
Information transduction capacity reduces the uncertainties in annotation-free isoform discovery and quantification.信息转导能力降低了无注释异构体发现和定量中的不确定性。
Nucleic Acids Res. 2017 Sep 6;45(15):e143. doi: 10.1093/nar/gkx585.
6
HapCHAT: adaptive haplotype assembly for efficiently leveraging high coverage in long reads.HapCHAT:高效利用长读长覆盖度的自适应单倍型组装
BMC Bioinformatics. 2018 Jul 3;19(1):252. doi: 10.1186/s12859-018-2253-8.
7
Improving RNA-Seq expression estimation by modeling isoform- and exon-specific read sequencing rate.通过对异构体和外显子特异性读段测序率进行建模来改进RNA测序表达估计。
BMC Bioinformatics. 2015 Oct 16;16:332. doi: 10.1186/s12859-015-0750-6.
8
Freddie: annotation-independent detection and discovery of transcriptomic alternative splicing isoforms using long-read sequencing.弗雷迪:使用长读测序进行注释独立的转录组可变剪接异构体的检测和发现。
Nucleic Acids Res. 2023 Jan 25;51(2):e11. doi: 10.1093/nar/gkac1112.
9
DCHap: A Divide-and-Conquer Haplotype Phasing Algorithm for Third-Generation Sequences.DCHap:一种用于第三代测序的分治单倍型相位算法。
IEEE/ACM Trans Comput Biol Bioinform. 2022 May-Jun;19(3):1277-1284. doi: 10.1109/TCBB.2020.3005673. Epub 2022 Jun 3.
10
Ryūtō: network-flow based transcriptome reconstruction.龙童:基于网络流的转录组重构。
BMC Bioinformatics. 2019 Apr 16;20(1):190. doi: 10.1186/s12859-019-2786-5.

引用本文的文献

1
Interrogating the Human Diplome: Computational Methods, Emerging Applications, and Challenges.人类 diplome 的探究:计算方法、新兴应用和挑战。
Methods Mol Biol. 2023;2590:1-30. doi: 10.1007/978-1-0716-2819-5_1.
2
Reviving the Transcriptome Studies: An Insight Into the Emergence of Single-Molecule Transcriptome Sequencing.复兴转录组学研究:洞察单分子转录组测序的兴起
Front Genet. 2019 Apr 26;10:384. doi: 10.3389/fgene.2019.00384. eCollection 2019.
3
Deciphering highly similar multigene family transcripts from Iso-Seq data with IsoCon.
从 Iso-Seq 数据中破译高度相似的多基因家族转录本,使用 IsoCon。
Nat Commun. 2018 Nov 2;9(1):4601. doi: 10.1038/s41467-018-06910-x.
4
TranscriptClean: variant-aware correction of indels, mismatches and splice junctions in long-read transcripts.转录本清洗:长读长转录本中插入缺失、错配和剪接位点的变体感知校正。
Bioinformatics. 2019 Jan 15;35(2):340-342. doi: 10.1093/bioinformatics/bty483.
5
Recent Advances in Experimental Whole Genome Haplotyping Methods.实验性全基因组单倍型分型方法的最新进展
Int J Mol Sci. 2017 Sep 11;18(9):1944. doi: 10.3390/ijms18091944.