• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于隐马尔可夫模型的算法,用于使用阵列 CGH 数据识别肿瘤亚型。

A hidden Markov model-based algorithm for identifying tumour subtype using array CGH data.

机构信息

Department of Pathology, School of Medicine and Health Sciences, University of North Dakota, Grand Forks, ND 58201, USA.

出版信息

BMC Genomics. 2011 Dec 23;12 Suppl 5(Suppl 5):S10. doi: 10.1186/1471-2164-12-S5-S10.

DOI:10.1186/1471-2164-12-S5-S10
PMID:22369459
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3287492/
Abstract

BACKGROUND

The recent advancement in array CGH (aCGH) research has significantly improved tumor identification using DNA copy number data. A number of unsupervised learning methods have been proposed for clustering aCGH samples. Two of the major challenges for developing aCGH sample clustering are the high spatial correlation between aCGH markers and the low computing efficiency. A mixture hidden Markov model based algorithm was developed to address these two challenges.

RESULTS

The hidden Markov model (HMM) was used to model the spatial correlation between aCGH markers. A fast clustering algorithm was implemented and real data analysis on glioma aCGH data has shown that it converges to the optimal cluster rapidly and the computation time is proportional to the sample size. Simulation results showed that this HMM based clustering (HMMC) method has a substantially lower error rate than NMF clustering. The HMMC results for glioma data were significantly associated with clinical outcomes.

CONCLUSIONS

We have developed a fast clustering algorithm to identify tumor subtypes based on DNA copy number aberrations. The performance of the proposed HMMC method has been evaluated using both simulated and real aCGH data. The software for HMMC in both R and C++ is available in ND INBRE website http://ndinbre.org/programs/bioinformatics.php.

摘要

背景

近年来 array CGH(aCGH)研究的进展显著提高了使用 DNA 拷贝数数据进行肿瘤识别的能力。已经提出了许多无监督学习方法来对 aCGH 样本进行聚类。开发 aCGH 样本聚类的两个主要挑战是 aCGH 标记之间的高度空间相关性和低计算效率。基于混合隐马尔可夫模型的算法被开发出来以解决这两个挑战。

结果

隐马尔可夫模型(HMM)用于对 aCGH 标记之间的空间相关性进行建模。实现了一种快速聚类算法,对 glioma aCGH 数据的实际数据分析表明,它能够快速收敛到最优聚类,并且计算时间与样本大小成正比。模拟结果表明,这种基于 HMM 的聚类(HMMC)方法的错误率明显低于 NMF 聚类。glioma 数据的 HMMC 结果与临床结果显著相关。

结论

我们开发了一种快速聚类算法,基于 DNA 拷贝数异常来识别肿瘤亚型。使用模拟和真实的 aCGH 数据评估了所提出的 HMMC 方法的性能。R 和 C++中的 HMMC 软件可在 ND INBRE 网站 http://ndinbre.org/programs/bioinformatics.php 上获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fd74/3287492/86bb565c58a5/1471-2164-12-S5-S10-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fd74/3287492/7ace6452925d/1471-2164-12-S5-S10-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fd74/3287492/fddc07d585e4/1471-2164-12-S5-S10-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fd74/3287492/03efa8cf8677/1471-2164-12-S5-S10-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fd74/3287492/f8947a423c0e/1471-2164-12-S5-S10-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fd74/3287492/cd745350ec5b/1471-2164-12-S5-S10-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fd74/3287492/86bb565c58a5/1471-2164-12-S5-S10-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fd74/3287492/7ace6452925d/1471-2164-12-S5-S10-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fd74/3287492/fddc07d585e4/1471-2164-12-S5-S10-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fd74/3287492/03efa8cf8677/1471-2164-12-S5-S10-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fd74/3287492/f8947a423c0e/1471-2164-12-S5-S10-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fd74/3287492/cd745350ec5b/1471-2164-12-S5-S10-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fd74/3287492/86bb565c58a5/1471-2164-12-S5-S10-6.jpg

相似文献

1
A hidden Markov model-based algorithm for identifying tumour subtype using array CGH data.基于隐马尔可夫模型的算法,用于使用阵列 CGH 数据识别肿瘤亚型。
BMC Genomics. 2011 Dec 23;12 Suppl 5(Suppl 5):S10. doi: 10.1186/1471-2164-12-S5-S10.
2
A probe-density-based analysis method for array CGH data: simulation, normalization and centralization.一种基于探针密度的阵列比较基因组杂交(array CGH)数据分析方法:模拟、标准化和中心化
Bioinformatics. 2008 Aug 15;24(16):1749-56. doi: 10.1093/bioinformatics/btn321. Epub 2008 Jul 4.
3
Continuous-index hidden Markov modelling of array CGH copy number data.阵列比较基因组杂交拷贝数数据的连续索引隐马尔可夫模型
Bioinformatics. 2007 Apr 15;23(8):1006-14. doi: 10.1093/bioinformatics/btm059. Epub 2007 Feb 19.
4
A shifting level model algorithm that identifies aberrations in array-CGH data.一种用于识别阵列-CGH 数据中异常的移位水平模型算法。
Biostatistics. 2010 Apr;11(2):265-80. doi: 10.1093/biostatistics/kxp051. Epub 2009 Nov 30.
5
A Hidden Markov Model to estimate population mixture and allelic copy-numbers in cancers using Affymetrix SNP arrays.一种使用Affymetrix SNP阵列估计癌症群体混合和等位基因拷贝数的隐马尔可夫模型。
BMC Bioinformatics. 2007 Nov 9;8:434. doi: 10.1186/1471-2105-8-434.
6
A latent class model with hidden Markov dependence for array CGH data.一种用于阵列比较基因组杂交(array CGH)数据的具有隐藏马尔可夫依赖性的潜在类别模型。
Biometrics. 2009 Dec;65(4):1296-305. doi: 10.1111/j.1541-0420.2009.01226.x.
7
Integrating copy number polymorphisms into array CGH analysis using a robust HMM.使用稳健的隐马尔可夫模型将拷贝数多态性整合到阵列比较基因组杂交分析中。
Bioinformatics. 2006 Jul 15;22(14):e431-9. doi: 10.1093/bioinformatics/btl238.
8
Accurate confidence aware clustering of array CGH tumor profiles.准确的置信度感知的 array CGH 肿瘤谱聚类。
Bioinformatics. 2010 Jan 1;26(1):6-14. doi: 10.1093/bioinformatics/btp603. Epub 2009 Oct 21.
9
Model-based clustering of array CGH data.基于模型的阵列比较基因组杂交数据聚类
Bioinformatics. 2009 Jun 15;25(12):i30-8. doi: 10.1093/bioinformatics/btp205.
10
BioHMM: a heterogeneous hidden Markov model for segmenting array CGH data.BioHMM:一种用于分割阵列比较基因组杂交数据的异构隐马尔可夫模型。
Bioinformatics. 2006 May 1;22(9):1144-6. doi: 10.1093/bioinformatics/btl089. Epub 2006 Mar 13.

引用本文的文献

1
Mesolimbic Neurobehavioral Mechanisms of Reward Motivation in Anorexia Nervosa: A Multimodal Imaging Study.神经性厌食症中奖励动机的中脑边缘神经行为机制:一项多模态成像研究
Front Psychiatry. 2022 Mar 7;13:806327. doi: 10.3389/fpsyt.2022.806327. eCollection 2022.
2
Prediction of the number of activated genes in multiple independent Cd(+2)- and As(+3)-induced malignant transformations of human urothelial cells (UROtsa).预测在人尿道上皮细胞(UROtsa)的多次独立镉(+2)和砷(+3)诱导的恶性转化中激活基因的数量。
PLoS One. 2014 Jan 22;9(1):e85614. doi: 10.1371/journal.pone.0085614. eCollection 2014.
3
An HMM-based algorithm for evaluating rates of receptor-ligand binding kinetics from thermal fluctuation data.

本文引用的文献

1
An algorithm for classifying tumors based on genomic aberrations and selecting representative tumor models.基于基因组异常对肿瘤进行分类并选择代表性肿瘤模型的算法。
BMC Med Genomics. 2010 Jun 22;3:23. doi: 10.1186/1755-8794-3-23.
2
Personalized medicine and inhibition of EGFR signaling in lung cancer.肺癌中的个性化医疗与表皮生长因子受体信号传导抑制
N Engl J Med. 2009 Sep 3;361(10):1018-20. doi: 10.1056/NEJMe0905763. Epub 2009 Aug 19.
3
A 2009 update on the treatment of patients with hormone receptor-positive breast cancer.2009年激素受体阳性乳腺癌患者治疗的最新进展。
基于隐马尔可夫模型的算法,用于从热波动数据评估受体-配体结合动力学速率。
Bioinformatics. 2013 Jun 15;29(12):1511-8. doi: 10.1093/bioinformatics/btt180. Epub 2013 Apr 18.
4
Genomic signatures and gene networking: challenges and promises.基因组特征和基因网络:挑战与前景。
BMC Genomics. 2011 Dec 23;12 Suppl 5(Suppl 5):I1. doi: 10.1186/1471-2164-12-S5-I1.
Clin Breast Cancer. 2009 Jun;9 Suppl 1:S6-S17. doi: 10.3816/CBC.2009.s.001.
4
A single-array preprocessing method for estimating full-resolution raw copy numbers from all Affymetrix genotyping arrays including GenomeWideSNP 5 & 6.一种用于从包括GenomeWideSNP 5和6在内的所有Affymetrix基因分型阵列估计全分辨率原始拷贝数的单阵列预处理方法。
Bioinformatics. 2009 Sep 1;25(17):2149-56. doi: 10.1093/bioinformatics/btp371. Epub 2009 Jun 17.
5
Model-based clustering of array CGH data.基于模型的阵列比较基因组杂交数据聚类
Bioinformatics. 2009 Jun 15;25(12):i30-8. doi: 10.1093/bioinformatics/btp205.
6
Mutations and response to epidermal growth factor receptor inhibitors.突变与对表皮生长因子受体抑制剂的反应
Clin Cancer Res. 2009 Feb 15;15(4):1133-9. doi: 10.1158/1078-0432.CCR-08-0905.
7
Comparative genomic hybridisation arrays: high-throughput tools to determine targeted therapy in breast cancer.比较基因组杂交阵列:用于确定乳腺癌靶向治疗的高通量工具。
Pathobiology. 2008;75(2):63-74. doi: 10.1159/000123844. Epub 2008 Jun 10.
8
Classification of human cancers based on DNA copy number amplification modeling.基于DNA拷贝数扩增建模的人类癌症分类
BMC Med Genomics. 2008 May 14;1:15. doi: 10.1186/1755-8794-1-15.
9
Marked genomic differences characterize primary and secondary glioblastoma subtypes and identify two distinct molecular and clinical secondary glioblastoma entities.显著的基因组差异是原发性和继发性胶质母细胞瘤亚型的特征,并确定了两种不同的分子和临床继发性胶质母细胞瘤实体。
Cancer Res. 2006 Dec 1;66(23):11502-13. doi: 10.1158/0008-5472.CAN-06-2072. Epub 2006 Nov 17.
10
DNA copy number amplification profiling of human neoplasms.人类肿瘤的DNA拷贝数扩增图谱分析
Oncogene. 2006 Nov 23;25(55):7324-32. doi: 10.1038/sj.onc.1209717. Epub 2006 Jun 5.