• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

单细胞 DNA 测序数据中的单倍型相位。

Haplotype phasing in single-cell DNA-sequencing data.

机构信息

Department of Computer Science, Princeton University, Princeton, NJ, USA.

Department of Computer Science, Brown University, Providence, RI, USA.

出版信息

Bioinformatics. 2018 Jul 1;34(13):i211-i217. doi: 10.1093/bioinformatics/bty286.

DOI:10.1093/bioinformatics/bty286
PMID:29950014
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6022575/
Abstract

MOTIVATION

Current technologies for single-cell DNA sequencing require whole-genome amplification (WGA), as a single cell contains too little DNA for direct sequencing. Unfortunately, WGA introduces biases in the resulting sequencing data, including non-uniformity in genome coverage and high rates of allele dropout. These biases complicate many downstream analyses, including the detection of genomic variants.

RESULTS

We show that amplification biases have a potential upside: long-range correlations in rates of allele dropout provide a signal for phasing haplotypes at the lengths of amplicons from WGA, lengths which are generally longer than than individual sequence reads. We describe a statistical test to measure concurrent allele dropout between single-nucleotide polymorphisms (SNPs) across multiple sequenced single cells. We use results of this test to perform haplotype assembly across a collection of single cells. We demonstrate that the algorithm predicts phasing between pairs of SNPs with higher accuracy than phasing from reads alone. Using whole-genome sequencing data from only seven neural cells, we obtain haplotype blocks that are orders of magnitude longer than with sequence reads alone (median length 10.2 kb versus 312 bp), with error rates <2%. We demonstrate similar advantages on whole-exome data from 16 cells, where we obtain haplotype blocks with median length 9.2 kb-comparable to typical gene lengths-compared with median lengths of 41 bp with sequence reads alone, with error rates <4%. Our algorithm will be useful for haplotyping of rare alleles and studies of allele-specific somatic aberrations.

AVAILABILITY AND IMPLEMENTATION

Source code is available at https://www.github.com/raphael-group.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

目前用于单细胞 DNA 测序的技术需要全基因组扩增 (WGA),因为单个细胞中的 DNA 太少,无法直接测序。不幸的是,WGA 会在测序数据中引入偏差,包括基因组覆盖的不均匀性和等位基因缺失率高。这些偏差使许多下游分析变得复杂,包括基因组变异的检测。

结果

我们表明,扩增偏差有一个潜在的好处:等位基因缺失率的长程相关性为 WGA 扩增产物的单倍型分相提供了信号,这些长度通常比单个序列读取长。我们描述了一种统计测试方法,用于测量多个测序单细胞中单核苷酸多态性 (SNP) 之间同时发生的等位基因缺失。我们使用该测试的结果在一组单细胞中进行单倍型组装。我们证明,该算法在预测 SNP 对之间的相位方面比单独使用读取的相位具有更高的准确性。仅使用来自七个神经细胞的全基因组测序数据,我们获得了比单独使用序列读取长几个数量级的单倍型块(中位数长度为 10.2kb 与 312bp),错误率<2%。我们在来自 16 个细胞的全外显子组数据上也证明了类似的优势,其中我们获得的单倍型块的中位数长度为 9.2kb-与典型基因长度相当-与单独使用序列读取时的中位数长度 41bp 相比,错误率<4%。我们的算法将有助于稀有等位基因的单倍型分析和等位基因特异性体细胞突变的研究。

可用性和实现

源代码可在 https://www.github.com/raphael-group 上获得。

补充信息

补充数据可在 Bioinformatics 在线获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7034/6022575/3a35a1e7b79e/bty286f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7034/6022575/7d508c08311a/bty286f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7034/6022575/72f974f5940d/bty286f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7034/6022575/3a35a1e7b79e/bty286f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7034/6022575/7d508c08311a/bty286f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7034/6022575/72f974f5940d/bty286f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7034/6022575/3a35a1e7b79e/bty286f3.jpg

相似文献

1
Haplotype phasing in single-cell DNA-sequencing data.单细胞 DNA 测序数据中的单倍型相位。
Bioinformatics. 2018 Jul 1;34(13):i211-i217. doi: 10.1093/bioinformatics/bty286.
2
Integrating read-based and population-based phasing for dense and accurate haplotyping of individual genomes.基于读取和基于群体的相位整合,实现个体基因组的密集和精确单倍型分型。
Bioinformatics. 2019 Jul 15;35(14):i242-i248. doi: 10.1093/bioinformatics/btz329.
3
DCHap: A Divide-and-Conquer Haplotype Phasing Algorithm for Third-Generation Sequences.DCHap:一种用于第三代测序的分治单倍型相位算法。
IEEE/ACM Trans Comput Biol Bioinform. 2022 May-Jun;19(3):1277-1284. doi: 10.1109/TCBB.2020.3005673. Epub 2022 Jun 3.
4
Ultraaccurate genome sequencing and haplotyping of single human cells.单细胞的超高精准基因组测序和单倍型分型。
Proc Natl Acad Sci U S A. 2017 Nov 21;114(47):12512-12517. doi: 10.1073/pnas.1707609114. Epub 2017 Oct 24.
5
WhatsHap: Weighted Haplotype Assembly for Future-Generation Sequencing Reads.WhatsHap:用于下一代测序读数的加权单倍型组装
J Comput Biol. 2015 Jun;22(6):498-509. doi: 10.1089/cmb.2014.0157. Epub 2015 Feb 6.
6
HaploMaker: An improved algorithm for rapid haplotype assembly of genomic sequences.HaploMaker:一种用于快速组装基因组序列单倍型的改进算法。
Gigascience. 2022 May 17;11. doi: 10.1093/gigascience/giac038.
7
HapCUT2: A Method for Phasing Genomes Using Experimental Sequence Data.HapCUT2:一种使用实验序列数据进行基因组相位分析的方法。
Methods Mol Biol. 2023;2590:139-147. doi: 10.1007/978-1-0716-2819-5_9.
8
Leveraging reads that span multiple single nucleotide polymorphisms for haplotype inference from sequencing data.利用跨越多个单核苷酸多态性的读取信息,从测序数据中推断单倍型。
Bioinformatics. 2013 Sep 15;29(18):2245-52. doi: 10.1093/bioinformatics/btt386. Epub 2013 Jul 3.
9
A Long-Read Sequencing Approach for Direct Haplotype Phasing in Clinical Settings.一种在临床环境中直接进行单体型定相的长读测序方法。
Int J Mol Sci. 2020 Dec 1;21(23):9177. doi: 10.3390/ijms21239177.
10
Longshot enables accurate variant calling in diploid genomes from single-molecule long read sequencing.Longshot 可通过单分子长读测序对二倍体基因组进行准确的变异调用。
Nat Commun. 2019 Oct 11;10(1):4660. doi: 10.1038/s41467-019-12493-y.

引用本文的文献

1
Somatic mutation phasing and haplotype extension using linked-reads in multiple myeloma.利用多重骨髓瘤中的连锁读长进行体细胞突变定相和单倍型扩展。
bioRxiv. 2024 Aug 10:2024.08.09.607342. doi: 10.1101/2024.08.09.607342.
2
Single-nucleus DNA sequencing reveals hidden somatic loss-of-heterozygosity in Cerebral Cavernous Malformations.单细胞 DNA 测序揭示脑动静脉畸形中的隐匿性杂合性丢失。
Nat Commun. 2023 Nov 2;14(1):7009. doi: 10.1038/s41467-023-42908-w.
3
Long-read whole-genome analysis of human single cells.人类单细胞的长读全基因组分析。

本文引用的文献

1
Linked-read analysis identifies mutations in single-cell DNA-sequencing data.关联读取分析可鉴定单细胞 DNA 测序数据中的突变。
Nat Genet. 2019 Apr;51(4):749-754. doi: 10.1038/s41588-019-0366-2. Epub 2019 Mar 18.
2
PaSD-qc: quality control for single cell whole-genome sequencing data using power spectral density estimation.PaSD-qc:使用功率谱密度估计进行单细胞全基因组测序数据的质量控制。
Nucleic Acids Res. 2018 Feb 28;46(4):e20. doi: 10.1093/nar/gkx1195.
3
Ultraaccurate genome sequencing and haplotyping of single human cells.单细胞的超高精准基因组测序和单倍型分型。
Nat Commun. 2023 Aug 24;14(1):5164. doi: 10.1038/s41467-023-40898-3.
4
Pairwise comparative analysis of six haplotype assembly methods based on users' experience.基于用户体验的六种单倍型组装方法的两两比较分析。
BMC Genom Data. 2023 Jun 29;24(1):35. doi: 10.1186/s12863-023-01134-5.
5
Single-cell whole-genome sequencing, haplotype analysis in prenatal diagnosis of monogenic diseases.单细胞全基因组测序,单基因疾病产前诊断中的单体型分析。
Life Sci Alliance. 2023 Feb 21;6(5). doi: 10.26508/lsa.202201761. Print 2023 May.
6
Computational methods for chromosome-scale haplotype reconstruction.染色体级别的单倍型重构的计算方法。
Genome Biol. 2021 Apr 12;22(1):101. doi: 10.1186/s13059-021-02328-9.
7
Improved haplotype inference by exploiting long-range linking and allelic imbalance in RNA-seq datasets.通过利用RNA测序数据集的长程连接和等位基因不平衡改进单倍型推断
Nat Commun. 2020 Sep 16;11(1):4662. doi: 10.1038/s41467-020-18320-z.
8
Eleven grand challenges in single-cell data science.单细胞数据科学的 11 大挑战。
Genome Biol. 2020 Feb 7;21(1):31. doi: 10.1186/s13059-020-1926-6.
9
scHaplotyper: haplotype construction and visualization for genetic diagnosis using single cell DNA sequencing data.scHaplotyper:使用单细胞 DNA 测序数据进行遗传诊断的单体型构建和可视化。
BMC Bioinformatics. 2020 Feb 1;21(1):41. doi: 10.1186/s12859-020-3381-5.
Proc Natl Acad Sci U S A. 2017 Nov 21;114(47):12512-12517. doi: 10.1073/pnas.1707609114. Epub 2017 Oct 24.
4
Single-cell whole-genome analyses by Linear Amplification via Transposon Insertion (LIANTI).通过转座子插入线性扩增进行单细胞全基因组分析(LIANTI)。
Science. 2017 Apr 14;356(6334):189-194. doi: 10.1126/science.aak9787.
5
HapCUT2: robust and accurate haplotype assembly for diverse sequencing technologies.HapCUT2:适用于多种测序技术的强大且准确的单倍型组装工具。
Genome Res. 2017 May;27(5):801-812. doi: 10.1101/gr.213462.116. Epub 2016 Dec 9.
6
TruePrime is a novel method for whole-genome amplification from single cells based on TthPrimPol.TruePrime 是一种基于 TthPrimPol 的新型单细胞全基因组扩增方法。
Nat Commun. 2016 Nov 29;7:13296. doi: 10.1038/ncomms13296.
7
Reference-based phasing using the Haplotype Reference Consortium panel.使用单倍型参考联盟面板进行基于参考的定相
Nat Genet. 2016 Nov;48(11):1443-1448. doi: 10.1038/ng.3679. Epub 2016 Oct 3.
8
Direct chromosome-length haplotyping by single-cell sequencing.通过单细胞测序进行直接染色体长度单倍型分型。
Genome Res. 2016 Nov;26(11):1565-1574. doi: 10.1101/gr.209841.116. Epub 2016 Sep 19.
9
Rare variant phasing and haplotypic expression from RNA sequencing with phASER.利用 phASER 进行 RNA 测序的罕见变异相位和单倍型表达。
Nat Commun. 2016 Sep 8;7:12817. doi: 10.1038/ncomms12817.
10
A weighted exact test for mutually exclusive mutations in cancer.一种用于癌症中互斥突变的加权精确检验。
Bioinformatics. 2016 Sep 1;32(17):i736-i745. doi: 10.1093/bioinformatics/btw462.