• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

来自Affymetrix SNP 6.0基因分型数据的拷贝数变异——常用预测方法的准确性如何?

Copy number aberrations from Affymetrix SNP 6.0 genotyping data-how accurate are commonly used prediction approaches?

作者信息

Pitea Adriana, Kondofersky Ivan, Sass Steffen, Theis Fabian J, Mueller Nikola S, Unger Kristian

机构信息

Institute of Computational Biology, Helmholtz Zentrum München, Neuherberg, Germany.

Research Unit Radiation Cytogenetics, Helmholtz Zentrum München, Neuherberg, Germany.

出版信息

Brief Bioinform. 2020 Jan 17;21(1):272-281. doi: 10.1093/bib/bby096.

DOI:10.1093/bib/bby096
PMID:30351397
Abstract

Copy number aberrations (CNAs) are known to strongly affect oncogenes and tumour suppressor genes. Given the critical role CNAs play in cancer research, it is essential to accurately identify CNAs from tumour genomes. One particular challenge in finding CNAs is the effect of confounding variables. To address this issue, we assessed how commonly used CNA identification algorithms perform on SNP 6.0 genotyping data in the presence of confounding variables. We simulated realistic synthetic data with varying levels of three confounding variables-the tumour purity, the length of a copy number region and the CNA burden (the percentage of CNAs present in a profiled genome)-and evaluated the performance of OncoSNP, ASCAT, GenoCNA, GISTIC and CGHcall. Furthermore, we implemented and assessed CGHcall*, an adjusted version of CGHcall accounting for high CNA burden. Our analysis on synthetic data indicates that tumour purity and the CNA burden strongly influence the performance of all the algorithms. No algorithm can correctly find lost and gained genomic regions across all tumour purities. The length of CNA regions influenced the performance of ASCAT, CGHcall and GISTIC. OncoSNP, GenoCNA and CGHcall* showed little sensitivity. Overall, CGHcall* and OncoSNP showed reasonable performance, particularly in samples with high tumour purity. Our analysis on the HapMap data revealed a good overlap between CGHcall, CGHcall* and GenoCNA results and experimentally validated data. Our exploratory analysis on the TCGA HNSCC data revealed plausible results of CGHcall, CGHcall* and GISTIC in consensus HNSCC CNA regions. Code is available at https://github.com/adspit/PASCAL.

摘要

已知拷贝数变异(CNA)会对癌基因和肿瘤抑制基因产生强烈影响。鉴于CNA在癌症研究中所起的关键作用,从肿瘤基因组中准确识别CNA至关重要。寻找CNA的一个特殊挑战是混杂变量的影响。为解决这个问题,我们评估了常用的CNA识别算法在存在混杂变量的情况下对SNP 6.0基因分型数据的表现。我们模拟了具有不同水平的三个混杂变量——肿瘤纯度、拷贝数区域长度和CNA负担(在一个分析的基因组中存在的CNA的百分比)的逼真合成数据,并评估了OncoSNP、ASCAT、GenoCNA、GISTIC和CGHcall的性能。此外,我们实现并评估了CGHcall*,这是考虑到高CNA负担的CGHcall的一个调整版本。我们对合成数据的分析表明,肿瘤纯度和CNA负担强烈影响所有算法的性能。没有一种算法能够在所有肿瘤纯度水平上正确找到丢失和获得的基因组区域。CNA区域的长度影响了ASCAT、CGHcall和GISTIC的性能。OncoSNP、GenoCNA和CGHcall表现出较低的敏感性。总体而言,CGHcall和OncoSNP表现出合理的性能,特别是在肿瘤纯度高的样本中。我们对HapMap数据的分析揭示了CGHcall、CGHcall和GenoCNA的结果与实验验证数据之间有良好的重叠。我们对TCGA HNSCC数据的探索性分析揭示了CGHcall、CGHcall和GISTIC在共识HNSCC CNA区域的合理结果。代码可在https://github.com/adspit/PASCAL获取。

相似文献

1
Copy number aberrations from Affymetrix SNP 6.0 genotyping data-how accurate are commonly used prediction approaches?来自Affymetrix SNP 6.0基因分型数据的拷贝数变异——常用预测方法的准确性如何?
Brief Bioinform. 2020 Jan 17;21(1):272-281. doi: 10.1093/bib/bby096.
2
Comparison of methods to detect copy number alterations in cancer using simulated and real genotyping data.使用模拟和真实基因分型数据比较癌症中拷贝数改变的检测方法。
BMC Bioinformatics. 2012 Aug 7;13:192. doi: 10.1186/1471-2105-13-192.
3
Comprehensive analysis of copy number aberrations in microsatellite stable colon cancer in view of stromal component.基于基质成分对微卫星稳定型结肠癌中拷贝数畸变的综合分析
Br J Cancer. 2017 Jul 25;117(3):421-431. doi: 10.1038/bjc.2017.208. Epub 2017 Jul 6.
4
Analysis of genomic abnormalities in tumors: a review of available methods for Illumina two-color SNP genotyping and evaluation of performance.肿瘤基因组异常分析:Illumina双色SNP基因分型可用方法综述及性能评估
Cancer Genet. 2013 Apr;206(4):103-15. doi: 10.1016/j.cancergen.2013.03.001. Epub 2013 Apr 25.
5
Hierarchical discovery of large-scale and focal copy number alterations in low-coverage cancer genomes.在低覆盖度癌症基因组中进行大规模和焦点拷贝数改变的层次式发现。
BMC Bioinformatics. 2020 Apr 16;21(1):147. doi: 10.1186/s12859-020-3480-3.
6
Genome-wide identification of significant aberrations in cancer genome.全基因组鉴定癌症基因组中的显著异常。
BMC Genomics. 2012 Jul 27;13:342. doi: 10.1186/1471-2164-13-342.
7
CGHcall: calling aberrations for array CGH tumor profiles.CGHcall:用于阵列比较基因组杂交肿瘤图谱的畸变检测
Bioinformatics. 2007 Apr 1;23(7):892-4. doi: 10.1093/bioinformatics/btm030. Epub 2007 Jan 31.
8
Genome-wide analysis of esophageal adenocarcinoma yields specific copy number aberrations that correlate with prognosis.食管腺癌的全基因组分析产生了与预后相关的特定拷贝数畸变。
Genes Chromosomes Cancer. 2014 Apr;53(4):324-38. doi: 10.1002/gcc.22143. Epub 2014 Jan 21.
9
Focal chromosomal copy number aberrations in cancer-Needles in a genome haystack.癌症中的局灶性染色体拷贝数畸变——基因组大海捞针
Biochim Biophys Acta. 2014 Nov;1843(11):2698-2704. doi: 10.1016/j.bbamcr.2014.08.001. Epub 2014 Aug 7.
10
Improved Uveal Melanoma Copy Number Subtypes Including an Ultra-High-Risk Group.改良的葡萄膜黑色素瘤拷贝数亚型,包括一个超高风险组。
Ophthalmol Sci. 2022 Jan 30;2(2):100121. doi: 10.1016/j.xops.2022.100121. eCollection 2022 Jun.

引用本文的文献

1
Homologous recombination deficiency (HRD) testing landscape: clinical applications and technical validation for routine diagnostics.同源重组缺陷(HRD)检测概况:常规诊断的临床应用与技术验证
Biomark Res. 2025 Feb 21;13(1):31. doi: 10.1186/s40364-025-00740-y.
2
Set-Wise Differential Interaction Between Copy Number Alterations and Gene Expressions of Lower-Grade Glioma Reveals Prognosis-Associated Pathways.低级别胶质瘤拷贝数改变与基因表达之间的集合水平差异相互作用揭示了预后相关通路。
Entropy (Basel). 2020 Dec 18;22(12):1434. doi: 10.3390/e22121434.
3
Accurate quantification of copy-number aberrations and whole-genome duplications in multi-sample tumor sequencing data.
多样本肿瘤测序数据中拷贝数变异和全基因组倍增的精确定量。
Nat Commun. 2020 Sep 2;11(1):4301. doi: 10.1038/s41467-020-17967-y.