• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于区分拷贝数变异和癌症相关改变的分类模型。

A classification model for distinguishing copy number variants from cancer-related alterations.

机构信息

Department of Epidemiology and Biostatistics, Memorial Sloan-Kettering Cancer Center, New York, NY, USA.

出版信息

BMC Bioinformatics. 2010 Jun 2;11:297. doi: 10.1186/1471-2105-11-297.

DOI:10.1186/1471-2105-11-297
PMID:20525196
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2897829/
Abstract

BACKGROUND

Both somatic copy number alterations (CNAs) and germline copy number variants (CNVs) that are prevalent in healthy individuals can appear as recurrent changes in comparative genomic hybridization (CGH) analyses of tumors. In order to identify important cancer genes CNAs and CNVs must be distinguished. Although the Database of Genomic Variants (DGV) contains a list of all known CNVs, there is no standard methodology to use the database effectively.

RESULTS

We develop a prediction model that distinguishes CNVs from CNAs based on the information contained in the DGV and several other variables, including segment's length, height, closeness to a telomere or centromere and occurrence in other patients. The models are fitted on data from glioblastoma and their corresponding normal samples that were collected as part of The Cancer Genome Atlas project and hybridized to Agilent 244 K arrays.

CONCLUSIONS

Using the DGV alone CNVs in the test set can be correctly identified with about 85% accuracy if the outliers are removed before segmentation and with 72% accuracy if the outliers are included, and additional variables improve the prediction by about 2-3% and 12%, respectively. Final models applied to data from ovarian tumors have about 90% accuracy with all the variables and 86% accuracy with the DGV alone.

摘要

背景

在肿瘤比较基因组杂交(CGH)分析中,常见于健康个体的体细胞拷贝数改变(CNAs)和种系拷贝数变异(CNVs)都可能表现为反复出现的变化。为了识别重要的癌症基因,必须区分 CNA 和 CNV。尽管基因组变异数据库(DGV)包含所有已知 CNV 的列表,但目前还没有有效的标准方法来使用该数据库。

结果

我们开发了一种基于 DGV 和其他几个变量(包括片段长度、高度、与端粒或着丝粒的接近程度以及在其他患者中的发生情况)的预测模型,用于区分 CNV 和 CNA。该模型基于癌症基因组图谱项目中收集的胶质母细胞瘤及其相应正常样本的杂交到 Agilent 244K 芯片上的数据进行拟合。

结论

如果在分割前先移除异常值,仅使用 DGV 可以将测试集中的 CNV 正确识别,准确率约为 85%;如果包含异常值,准确率约为 72%。额外的变量分别提高了约 2-3%和 12%的预测准确性。最终应用于卵巢肿瘤数据的模型在包含所有变量时准确率约为 90%,仅使用 DGV 时准确率约为 86%。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d745/2897829/146b42ead1ba/1471-2105-11-297-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d745/2897829/6693db97f483/1471-2105-11-297-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d745/2897829/146b42ead1ba/1471-2105-11-297-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d745/2897829/6693db97f483/1471-2105-11-297-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d745/2897829/146b42ead1ba/1471-2105-11-297-2.jpg

相似文献

1
A classification model for distinguishing copy number variants from cancer-related alterations.用于区分拷贝数变异和癌症相关改变的分类模型。
BMC Bioinformatics. 2010 Jun 2;11:297. doi: 10.1186/1471-2105-11-297.
2
Distinguishing somatic and germline copy number events in cancer patient DNA hybridized to whole-genome SNP genotyping arrays.区分癌症患者DNA中与全基因组SNP基因分型阵列杂交的体细胞和生殖系拷贝数事件。
Methods Mol Biol. 2013;973:355-72. doi: 10.1007/978-1-62703-281-0_22.
3
High-resolution array-CGH analysis on 46,XX patients affected by early onset primary ovarian insufficiency discloses new genes involved in ovarian function.对 46,XX 早发性卵巢功能不全患者进行高分辨率 array-CGH 分析揭示了新的参与卵巢功能的基因。
Hum Reprod. 2019 Mar 1;34(3):574-583. doi: 10.1093/humrep/dey389.
4
CopyMap: localization and calling of copy number variation by joint analysis of hybridization data from multiple individuals.CopyMap:通过对来自多个个体的杂交数据进行联合分析来定位和调用拷贝数变异。
Bioinformatics. 2010 Nov 1;26(21):2776-7. doi: 10.1093/bioinformatics/btq515. Epub 2010 Oct 5.
5
Copy number alterations and copy number variation in cancer: close encounters of the bad kind.癌症中的拷贝数改变与拷贝数变异:不良类型的密切接触。
Cytogenet Genome Res. 2008;123(1-4):176-82. doi: 10.1159/000184706. Epub 2009 Mar 11.
6
Benign copy number changes in clinical cytogenetic diagnostics by array CGH.通过阵列比较基因组杂交技术在临床细胞遗传学诊断中检测良性拷贝数变化
Cytogenet Genome Res. 2008;123(1-4):94-101. doi: 10.1159/000184696. Epub 2009 Mar 11.
7
Computational methods for identification of recurrent copy number alteration patterns by array CGH.通过阵列比较基因组杂交鉴定复发性拷贝数改变模式的计算方法。
Cytogenet Genome Res. 2008;123(1-4):343-51. doi: 10.1159/000184726. Epub 2009 Mar 11.
8
Determining frequent patterns of copy number alterations in cancer.确定癌症中拷贝数改变的频繁模式。
PLoS One. 2010 Aug 12;5(8):e12028. doi: 10.1371/journal.pone.0012028.
9
Integration of genomic copy number variations and chemotherapy-response biomarkers in pediatric sarcoma.基因组拷贝数变异与儿童肉瘤化疗反应生物标志物的整合。
BMC Med Genomics. 2019 Jan 31;12(Suppl 1):23. doi: 10.1186/s12920-018-0456-5.
10
Across array comparative genomic hybridization: a strategy to reduce reference channel hybridizations.全基因组芯片比较杂交:一种减少参考通道杂交的策略。
Genes Chromosomes Cancer. 2008 Nov;47(11):994-1004. doi: 10.1002/gcc.20605.

引用本文的文献

1
Novel liquid biopsy CNV biomarkers in malignant melanoma.恶性黑色素瘤新型液体活检 CNV 生物标志物。
Sci Rep. 2024 Jul 9;14(1):15786. doi: 10.1038/s41598-024-65928-y.
2
Multiclass Cancer Prediction Based on Copy Number Variation Using Deep Learning.基于深度学习的拷贝数变异的多癌症预测。
Comput Intell Neurosci. 2022 Jun 9;2022:4742986. doi: 10.1155/2022/4742986. eCollection 2022.
3
Pan-Cancer Classification Based on Self-Normalizing Neural Networks and Feature Selection.基于自归一化神经网络和特征选择的泛癌分类

本文引用的文献

1
Population-genetic nature of copy number variations in the human genome.人类基因组中拷贝数变异的群体遗传学性质。
Hum Mol Genet. 2010 Mar 1;19(5):761-73. doi: 10.1093/hmg/ddp541. Epub 2009 Dec 5.
2
Copy number variation at 1q21.1 associated with neuroblastoma.1q21.1处的拷贝数变异与神经母细胞瘤相关。
Nature. 2009 Jun 18;459(7249):987-91. doi: 10.1038/nature08035.
3
Association of a germ-line copy number variation at 2p24.3 and risk for aggressive prostate cancer.2p24.3处种系拷贝数变异与侵袭性前列腺癌风险的关联。
Front Bioeng Biotechnol. 2020 Aug 4;8:766. doi: 10.3389/fbioe.2020.00766. eCollection 2020.
4
DNA Copy Number Variations as Markers of Mutagenic Impact.DNA 拷贝数变异作为诱变影响的标志物。
Int J Mol Sci. 2019 Sep 24;20(19):4723. doi: 10.3390/ijms20194723.
5
Identifying Patients with Atrioventricular Septal Defect in Down Syndrome Populations by Using Self-Normalizing Neural Networks and Feature Selection.利用自归一化神经网络和特征选择在唐氏综合征人群中识别患有房室间隔缺损的患者。
Genes (Basel). 2018 Apr 12;9(4):208. doi: 10.3390/genes9040208.
6
A robust topology-based algorithm for gene expression profiling.一种用于基因表达谱分析的基于拓扑结构的强大算法。
ISRN Bioinform. 2012 Nov 11;2012:381023. doi: 10.5402/2012/381023. eCollection 2012.
7
A specific missense mutation in GTF2I occurs at high frequency in thymic epithelial tumors.GTF2I中的一种特定错义突变在胸腺上皮肿瘤中高频出现。
Nat Genet. 2014 Aug;46(8):844-9. doi: 10.1038/ng.3016. Epub 2014 Jun 29.
8
Clonality: an R package for testing clonal relatedness of two tumors from the same patient based on their genomic profiles.克隆性:一个 R 包,用于根据两个来自同一患者的肿瘤的基因组图谱,测试它们的克隆相关性。
Bioinformatics. 2011 Jun 15;27(12):1698-9. doi: 10.1093/bioinformatics/btr267. Epub 2011 May 5.
Cancer Res. 2009 Mar 15;69(6):2176-9. doi: 10.1158/0008-5472.CAN-08-3151. Epub 2009 Mar 3.
4
Genetic association analysis of copy-number variation (CNV) in human disease pathogenesis.人类疾病发病机制中拷贝数变异(CNV)的遗传关联分析。
Genomics. 2009 Jan;93(1):22-6. doi: 10.1016/j.ygeno.2008.08.012. Epub 2008 Oct 19.
5
Comprehensive genomic characterization defines human glioblastoma genes and core pathways.全面的基因组特征分析确定了人类胶质母细胞瘤的基因和核心通路。
Nature. 2008 Oct 23;455(7216):1061-8. doi: 10.1038/nature07385. Epub 2008 Sep 4.
6
Small deletion variants have stable breakpoints commonly associated with alu elements.小缺失变异体具有通常与alu元件相关的稳定断点。
PLoS One. 2008 Aug 29;3(8):e3104. doi: 10.1371/journal.pone.0003104.
7
Rare structural variants disrupt multiple genes in neurodevelopmental pathways in schizophrenia.罕见结构变异破坏精神分裂症神经发育通路中的多个基因。
Science. 2008 Apr 25;320(5875):539-43. doi: 10.1126/science.1155174. Epub 2008 Mar 27.
8
Chromosomal breakpoints in primary colon cancer cluster at sites of structural variants in the genome.原发性结肠癌中的染色体断点聚集在基因组结构变异的位点。
Cancer Res. 2008 Mar 1;68(5):1284-95. doi: 10.1158/0008-5472.CAN-07-2864.
9
The fine-scale and complex architecture of human copy-number variation.人类拷贝数变异的精细尺度与复杂结构。
Am J Hum Genet. 2008 Mar;82(3):685-95. doi: 10.1016/j.ajhg.2007.12.010. Epub 2008 Jan 24.
10
Variation of CNV distribution in five different ethnic populations.五个不同种族人群中拷贝数变异(CNV)分布的差异。
Cytogenet Genome Res. 2007;118(1):19-30. doi: 10.1159/000106437.