• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

hsegHMM:基于隐马尔可夫模型的等位基因特异性拷贝数改变分析,考虑了超分割。

hsegHMM: hidden Markov model-based allele-specific copy number alteration analysis accounting for hypersegmentation.

机构信息

Biostatistics Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Bethesda, MD, 20892, USA.

出版信息

BMC Bioinformatics. 2018 Nov 14;19(1):424. doi: 10.1186/s12859-018-2412-y.

DOI:10.1186/s12859-018-2412-y
PMID:30428830
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6236906/
Abstract

BACKGROUND

Somatic copy number alternation (SCNA) is a common feature of the cancer genome and is associated with cancer etiology and prognosis. The allele-specific SCNA analysis of a tumor sample aims to identify the allele-specific copy numbers of both alleles, adjusting for the ploidy and the tumor purity. Next generation sequencing platforms produce abundant read counts at the base-pair resolution across the exome or whole genome which is susceptible to hypersegmentation, a phenomenon where numerous regions with very short length are falsely identified as SCNA.

RESULTS

We propose hsegHMM, a hidden Markov model approach that accounts for hypersegmentation for allele-specific SCNA analysis. hsegHMM provides statistical inference of copy number profiles by using an efficient E-M algorithm procedure. Through simulation and application studies, we found that hsegHMM handles hypersegmentation effectively with a t-distribution as a part of the emission probability distribution structure and a carefully defined state space. We also compared hsegHMM with FACETS which is a current method for allele-specific SCNA analysis. For the application, we use a renal cell carcinoma sample from The Cancer Genome Atlas (TCGA) study.

CONCLUSIONS

We demonstrate the robustness of hsegHMM to hypersegmentation. Furthermore, hsegHMM provides the quantification of uncertainty in identifying allele-specific SCNAs over the entire chromosomes. hsegHMM performs better than FACETS when read depth (coverage) is uneven across the genome.

摘要

背景

体细胞拷贝数改变(SCNA)是癌症基因组的一个常见特征,与癌症的病因和预后有关。肿瘤样本的等位基因特异性 SCNA 分析旨在确定两个等位基因的等位基因特异性拷贝数,同时调整倍性和肿瘤纯度。下一代测序平台在整个外显子或全基因组范围内以碱基分辨率产生丰富的读数,容易出现超分割现象,即大量非常短的区域被错误地识别为 SCNA。

结果

我们提出了 hsegHMM,这是一种用于等位基因特异性 SCNA 分析的隐马尔可夫模型方法,可以解决超分割问题。hsegHMM 通过使用高效的 E-M 算法程序来提供拷贝数谱的统计推断。通过模拟和应用研究,我们发现 hsegHMM 通过将 t 分布作为发射概率分布结构的一部分和精心定义的状态空间来有效地处理超分割。我们还将 hsegHMM 与 FACETS 进行了比较,FACETS 是当前用于等位基因特异性 SCNA 分析的方法。在应用中,我们使用了来自癌症基因组图谱(TCGA)研究的肾细胞癌样本。

结论

我们证明了 hsegHMM 对超分割的稳健性。此外,hsegHMM 提供了在整个染色体上识别等位基因特异性 SCNAs 的不确定性的量化。当基因组范围内的读深度(覆盖度)不均匀时,hsegHMM 的性能优于 FACETS。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/431a/6236906/e43223c6e9d9/12859_2018_2412_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/431a/6236906/592bc4a0e044/12859_2018_2412_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/431a/6236906/3cb58692ec69/12859_2018_2412_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/431a/6236906/a90206376eb8/12859_2018_2412_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/431a/6236906/0b6dcf019e8a/12859_2018_2412_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/431a/6236906/f9474eb0ec58/12859_2018_2412_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/431a/6236906/e43223c6e9d9/12859_2018_2412_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/431a/6236906/592bc4a0e044/12859_2018_2412_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/431a/6236906/3cb58692ec69/12859_2018_2412_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/431a/6236906/a90206376eb8/12859_2018_2412_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/431a/6236906/0b6dcf019e8a/12859_2018_2412_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/431a/6236906/f9474eb0ec58/12859_2018_2412_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/431a/6236906/e43223c6e9d9/12859_2018_2412_Fig6_HTML.jpg

相似文献

1
hsegHMM: hidden Markov model-based allele-specific copy number alteration analysis accounting for hypersegmentation.hsegHMM:基于隐马尔可夫模型的等位基因特异性拷贝数改变分析,考虑了超分割。
BMC Bioinformatics. 2018 Nov 14;19(1):424. doi: 10.1186/s12859-018-2412-y.
2
SAAS-CNV: A Joint Segmentation Approach on Aggregated and Allele Specific Signals for the Identification of Somatic Copy Number Alterations with Next-Generation Sequencing Data.SAAS-CNV:一种基于聚合信号和等位基因特异性信号的联合分割方法,用于通过下一代测序数据识别体细胞拷贝数变异。
PLoS Comput Biol. 2015 Nov 19;11(11):e1004618. doi: 10.1371/journal.pcbi.1004618. eCollection 2015 Nov.
3
FACETS: allele-specific copy number and clonal heterogeneity analysis tool for high-throughput DNA sequencing.FACETS:用于高通量DNA测序的等位基因特异性拷贝数和克隆异质性分析工具。
Nucleic Acids Res. 2016 Sep 19;44(16):e131. doi: 10.1093/nar/gkw520. Epub 2016 Jun 7.
4
Sequenza: allele-specific copy number and mutation profiles from tumor sequencing data.Sequenza:来自肿瘤测序数据的等位基因特异性拷贝数和突变图谱。
Ann Oncol. 2015 Jan;26(1):64-70. doi: 10.1093/annonc/mdu479. Epub 2014 Oct 15.
5
Low-cost and clinically applicable copy number profiling using repeat DNA.使用重复 DNA 进行低成本且临床适用的拷贝数分析。
BMC Genomics. 2022 Aug 17;23(1):599. doi: 10.1186/s12864-022-08681-8.
6
Allele-specific copy-number discovery from whole-genome and whole-exome sequencing.从全基因组和全外显子组测序中发现等位基因特异性拷贝数
Nucleic Acids Res. 2015 Aug 18;43(14):e90. doi: 10.1093/nar/gkv319. Epub 2015 Apr 16.
7
Combining targeted sequencing and ultra-low-pass whole-genome sequencing for accurate somatic copy number alteration detection.结合靶向测序和超低深度全基因组测序进行精确的体细胞拷贝数改变检测。
Funct Integr Genomics. 2021 Mar;21(2):161-169. doi: 10.1007/s10142-021-00767-y. Epub 2021 Feb 4.
8
Comparative analysis of methods for identifying somatic copy number alterations from deep sequencing data.从深度测序数据中识别体细胞拷贝数改变的方法的比较分析。
Brief Bioinform. 2015 Mar;16(2):242-54. doi: 10.1093/bib/bbu004. Epub 2014 Mar 5.
9
Determination of genomic copy number alteration emphasizing a restriction site-based strategy of genome re-sequencing.强调基于限制位点的基因组重测序策略的基因组拷贝数改变的测定。
Bioinformatics. 2013 Nov 15;29(22):2813-21. doi: 10.1093/bioinformatics/btt481. Epub 2013 Aug 20.
10
FACETS: Fraction and Allele-Specific Copy Number Estimates from Tumor Sequencing.FACETS:基于肿瘤测序的片段和等位基因特异性拷贝数估计。
Methods Mol Biol. 2022;2493:89-105. doi: 10.1007/978-1-0716-2293-3_7.

引用本文的文献

1
Identification of Copy Number Alterations from Next-Generation Sequencing Data.从下一代测序数据中鉴定拷贝数改变。
Adv Exp Med Biol. 2022;1361:55-74. doi: 10.1007/978-3-030-91836-1_4.
2
A hidden Markov modeling approach for identifying tumor subclones in next-generation sequencing studies.一种用于鉴定下一代测序研究中肿瘤亚克隆的隐马尔可夫模型方法。
Biostatistics. 2022 Jan 13;23(1):69-82. doi: 10.1093/biostatistics/kxaa013.

本文引用的文献

1
FACETS: allele-specific copy number and clonal heterogeneity analysis tool for high-throughput DNA sequencing.FACETS:用于高通量DNA测序的等位基因特异性拷贝数和克隆异质性分析工具。
Nucleic Acids Res. 2016 Sep 19;44(16):e131. doi: 10.1093/nar/gkw520. Epub 2016 Jun 7.
2
Integrated Genomics for Pinpointing Survival Loci within Arm-Level Somatic Copy Number Alterations.整合基因组学用于在臂水平体细胞拷贝数改变中精准定位生存位点。
Cancer Cell. 2016 May 9;29(5):737-750. doi: 10.1016/j.ccell.2016.03.025.
3
New insights into the performance of human whole-exome capture platforms.
人类全外显子捕获平台性能的新见解。
Nucleic Acids Res. 2015 Jun 23;43(11):e76. doi: 10.1093/nar/gkv216. Epub 2015 Mar 27.
4
Pan-cancer patterns of somatic copy number alteration.体细胞拷贝数改变的泛癌模式
Nat Genet. 2013 Oct;45(10):1134-40. doi: 10.1038/ng.2760.
5
Diverse mechanisms of somatic structural variations in human cancer genomes.人类癌症基因组中体细胞结构变异的多种机制。
Cell. 2013 May 9;153(4):919-29. doi: 10.1016/j.cell.2013.04.010.
6
Allele-specific copy number analysis of tumors.肿瘤的等位基因特异性拷贝数分析。
Proc Natl Acad Sci U S A. 2010 Sep 28;107(39):16910-5. doi: 10.1073/pnas.1009843107. Epub 2010 Sep 13.
7
MixHMM: inferring copy number variation and allelic imbalance using SNP arrays and tumor samples mixed with stromal cells.MixHMM:利用 SNP 阵列和混合有基质细胞的肿瘤样本推断拷贝数变异和等位基因失衡。
PLoS One. 2010 Jun 1;5(6):e10909. doi: 10.1371/journal.pone.0010909.
8
The landscape of somatic copy-number alteration across human cancers.人类癌症中体细胞拷贝数改变的全景。
Nature. 2010 Feb 18;463(7283):899-905. doi: 10.1038/nature08822.
9
Effect of read-mapping biases on detecting allele-specific expression from RNA-sequencing data.RNA-seq 数据中检测等位基因特异性表达的读映射偏倚效应。
Bioinformatics. 2009 Dec 15;25(24):3207-12. doi: 10.1093/bioinformatics/btp579. Epub 2009 Oct 6.
10
Copy number variations and cancer.拷贝数变异与癌症。
Genome Med. 2009 Jun 16;1(6):62. doi: 10.1186/gm62.