• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

iPcc:一种用于准确发现和预测疾病类别的新型特征提取方法。

iPcc: a novel feature extraction method for accurate disease class discovery and prediction.

机构信息

MOH Key Laboratory of Systems Biology of Pathogens, Institute of Pathogen Biology, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing 100730, China.

出版信息

Nucleic Acids Res. 2013 Aug;41(14):e143. doi: 10.1093/nar/gkt343. Epub 2013 Jun 12.

DOI:10.1093/nar/gkt343
PMID:23761440
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3737526/
Abstract

Gene expression profiling has gradually become a routine procedure for disease diagnosis and classification. In the past decade, many computational methods have been proposed, resulting in great improvements on various levels, including feature selection and algorithms for classification and clustering. In this study, we present iPcc, a novel method from the feature extraction perspective to further propel gene expression profiling technologies from bench to bedside. We define 'correlation feature space' for samples based on the gene expression profiles by iterative employment of Pearson's correlation coefficient. Numerical experiments on both simulated and real gene expression data sets demonstrate that iPcc can greatly highlight the latent patterns underlying noisy gene expression data and thus greatly improve the robustness and accuracy of the algorithms currently available for disease diagnosis and classification based on gene expression profiles.

摘要

基因表达谱分析逐渐成为疾病诊断和分类的常规程序。在过去的十年中,已经提出了许多计算方法,在特征选择以及分类和聚类的算法等各个层面都取得了很大的改进。在本研究中,我们提出了 iPcc,这是一种从特征提取角度出发的新方法,旨在进一步推动基因表达谱分析技术从实验室走向临床。我们基于基因表达谱通过迭代使用皮尔逊相关系数为样本定义了“相关特征空间”。基于模拟和真实基因表达数据集的数值实验表明,iPcc 可以极大地突出嘈杂基因表达数据背后的潜在模式,从而极大地提高当前基于基因表达谱进行疾病诊断和分类的算法的稳健性和准确性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/890c/3737526/09e84f598a50/gkt343f9p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/890c/3737526/e59bb5fe2559/gkt343f1p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/890c/3737526/a6a581c501b2/gkt343f2p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/890c/3737526/468ab734272d/gkt343f3p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/890c/3737526/cce710e02ba7/gkt343f4p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/890c/3737526/b34aba348d54/gkt343f5p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/890c/3737526/72ef4515f3e3/gkt343f6p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/890c/3737526/13a6337eb01a/gkt343f7p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/890c/3737526/74f0d49a3a1a/gkt343f8p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/890c/3737526/09e84f598a50/gkt343f9p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/890c/3737526/e59bb5fe2559/gkt343f1p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/890c/3737526/a6a581c501b2/gkt343f2p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/890c/3737526/468ab734272d/gkt343f3p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/890c/3737526/cce710e02ba7/gkt343f4p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/890c/3737526/b34aba348d54/gkt343f5p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/890c/3737526/72ef4515f3e3/gkt343f6p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/890c/3737526/13a6337eb01a/gkt343f7p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/890c/3737526/74f0d49a3a1a/gkt343f8p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/890c/3737526/09e84f598a50/gkt343f9p.jpg

相似文献

1
iPcc: a novel feature extraction method for accurate disease class discovery and prediction.iPcc:一种用于准确发现和预测疾病类别的新型特征提取方法。
Nucleic Acids Res. 2013 Aug;41(14):e143. doi: 10.1093/nar/gkt343. Epub 2013 Jun 12.
2
A unified computational model for revealing and predicting subtle subtypes of cancers.揭示和预测癌症微妙亚型的统一计算模型。
BMC Bioinformatics. 2012 May 1;13:70. doi: 10.1186/1471-2105-13-70.
3
SamCluster: an integrated scheme for automatic discovery of sample classes using gene expression profile.SamCluster:一种利用基因表达谱自动发现样本类别的综合方案。
Bioinformatics. 2003 May 1;19(7):811-7. doi: 10.1093/bioinformatics/btg095.
4
Pathway activity inference for multiclass disease classification through a mathematical programming optimisation framework.通过数学规划优化框架进行多类疾病分类的通路活性推断
BMC Bioinformatics. 2014 Dec 5;15(1):390. doi: 10.1186/s12859-014-0390-2.
5
Reliable classification of two-class cancer data using evolutionary algorithms.使用进化算法对两类癌症数据进行可靠分类。
Biosystems. 2003 Nov;72(1-2):111-29. doi: 10.1016/s0303-2647(03)00138-2.
6
Simultaneous gene clustering and subset selection for sample classification via MDL.通过最小描述长度实现用于样本分类的同步基因聚类和子集选择
Bioinformatics. 2003 Jun 12;19(9):1100-9. doi: 10.1093/bioinformatics/btg039.
7
Cancer classification from the gene expression profiles by Discriminant Kernel-PLS.基于判别核偏最小二乘法从基因表达谱进行癌症分类。
J Bioinform Comput Biol. 2010 Dec;8 Suppl 1:147-60. doi: 10.1142/s0219720010005130.
8
Bagging to improve the accuracy of a clustering procedure.通过装袋法提高聚类过程的准确性。
Bioinformatics. 2003 Jun 12;19(9):1090-9. doi: 10.1093/bioinformatics/btg038.
9
SC(3): Triple spectral clustering-based consensus clustering framework for class discovery from cancer gene expression profiles.SC(3):基于三重谱聚类的共识聚类框架,用于从癌症基因表达谱中发现类别。
IEEE/ACM Trans Comput Biol Bioinform. 2012 Nov-Dec;9(6):1751-65. doi: 10.1109/TCBB.2012.108.
10
A multiple kernel support vector machine scheme for feature selection and rule extraction from gene expression data of cancer tissue.一种用于从癌组织基因表达数据中进行特征选择和规则提取的多核支持向量机方案。
Artif Intell Med. 2007 Oct;41(2):161-75. doi: 10.1016/j.artmed.2007.07.008. Epub 2007 Sep 11.

引用本文的文献

1
A Heuristic Machine Learning-Based Optimization Technique to Predict Lung Cancer Patient Survival.基于启发式机器学习的肺癌患者生存预测优化技术。
Comput Intell Neurosci. 2023 Feb 2;2023:4506488. doi: 10.1155/2023/4506488. eCollection 2023.
2
A Protein Interaction Information-based Generative Model for Enhancing Gene Clustering.基于蛋白质相互作用信息的基因聚类增强生成模型。
Sci Rep. 2020 Jan 20;10(1):665. doi: 10.1038/s41598-020-57437-5.
3
High-Order Correlation Integration for Single-Cell or Bulk RNA-seq Data Analysis.用于单细胞或批量RNA测序数据分析的高阶相关性整合

本文引用的文献

1
MULTIPLE CLUSTERS, TYPES, AND DIMENSIONS FROM ITERATIVE INTERCOLUMNAR CORRELATIONAL ANALYSIS.迭代柱间相关分析得出的多个聚类、类型和维度
Multivariate Behav Res. 1968 Oct 1;3(4):465-77. doi: 10.1207/s15327906mbr0304_6.
2
ellipsoidFN: a tool for identifying a heterogeneous set of cancer biomarkers based on gene expressions.椭球 FN:一种基于基因表达识别异质癌症生物标志物的工具。
Nucleic Acids Res. 2013 Feb 1;41(4):e53. doi: 10.1093/nar/gks1288. Epub 2012 Dec 22.
3
Reducing the time requirement of k-means algorithm.降低 K-Means 算法的时间要求。
Front Genet. 2019 Apr 26;10:371. doi: 10.3389/fgene.2019.00371. eCollection 2019.
4
Parsimonious Gene Correlation Network Analysis (PGCNA): a tool to define modular gene co-expression for refined molecular stratification in cancer.简约基因关联网络分析(PGCNA):一种定义模块化基因共表达的工具,用于癌症的精细分子分层。
NPJ Syst Biol Appl. 2019 Apr 11;5:13. doi: 10.1038/s41540-019-0090-7. eCollection 2019.
5
Random Subspace Aggregation for Cancer Prediction with Gene Expression Profiles.基于基因表达谱的癌症预测随机子空间聚合
Biomed Res Int. 2016;2016:4596326. doi: 10.1155/2016/4596326. Epub 2016 Nov 24.
6
Integrating heterogeneous genomic data to accurately identify disease subtypes.整合异质基因组数据以准确识别疾病亚型。
BMC Med Genomics. 2015 Nov 20;8:78. doi: 10.1186/s12920-015-0154-5.
7
Unravelling personalized dysfunctional gene network of complex diseases based on differential network model.基于差异网络模型解析复杂疾病的个性化功能失调基因网络
J Transl Med. 2015 Jun 13;13:189. doi: 10.1186/s12967-015-0546-5.
8
Machine learning applications in cancer prognosis and prediction.机器学习在癌症预后和预测中的应用。
Comput Struct Biotechnol J. 2014 Nov 15;13:8-17. doi: 10.1016/j.csbj.2014.11.005. eCollection 2015.
PLoS One. 2012;7(12):e49946. doi: 10.1371/journal.pone.0049946. Epub 2012 Dec 11.
4
Accounting for noise when clustering biological data.当对生物数据进行聚类时,要考虑噪声的影响。
Brief Bioinform. 2013 Jul;14(4):423-36. doi: 10.1093/bib/bbs057. Epub 2012 Oct 14.
5
A unified computational model for revealing and predicting subtle subtypes of cancers.揭示和预测癌症微妙亚型的统一计算模型。
BMC Bioinformatics. 2012 May 1;13:70. doi: 10.1186/1471-2105-13-70.
6
Combining multiple approaches for gene microarray classification.结合多种方法进行基因微阵列分类。
Bioinformatics. 2012 Apr 15;28(8):1151-7. doi: 10.1093/bioinformatics/bts108. Epub 2012 Mar 5.
7
Defining the structure of the general stress regulon of Bacillus subtilis using targeted microarray analysis and random forest classification.利用靶向基因芯片分析和随机森林分类定义枯草芽孢杆菌的一般应激调控组的结构。
Microbiology (Reading). 2012 Mar;158(Pt 3):696-707. doi: 10.1099/mic.0.055434-0. Epub 2011 Dec 15.
8
Characterizing the impact of smoking and lung cancer on the airway transcriptome using RNA-Seq.利用 RNA-Seq 技术描绘吸烟和肺癌对气道转录组的影响。
Cancer Prev Res (Phila). 2011 Jun;4(6):803-17. doi: 10.1158/1940-6207.CAPR-11-0212.
9
Genome-wide RNA-seq analysis of human and mouse platelet transcriptomes.人类和小鼠血小板转录组的全基因组 RNA-seq 分析。
Blood. 2011 Oct 6;118(14):e101-11. doi: 10.1182/blood-2011-03-339705. Epub 2011 May 19.
10
Genome-wide expression profiling of five mouse models identifies similarities and differences with human psoriasis.五种小鼠模型的全基因组表达谱分析鉴定了与人类银屑病的相似性和差异。
PLoS One. 2011 Apr 4;6(4):e18266. doi: 10.1371/journal.pone.0018266.