• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于主成分分析的无监督特征提取应用于出芽酵母的时间周期性基因表达。

Principal component analysis based unsupervised feature extraction applied to budding yeast temporally periodic gene expression.

作者信息

Taguchi Y-H

机构信息

Department of Physics, Chuo University, 1-13-27 Kasuga, Bunkyo-ku, Tokyo, 112-8551 Japan.

出版信息

BioData Min. 2016 Jun 29;9:22. doi: 10.1186/s13040-016-0101-9. eCollection 2016.

DOI:10.1186/s13040-016-0101-9
PMID:27366210
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4928327/
Abstract

BACKGROUND

The recently proposed principal component analysis (PCA) based unsupervised feature extraction (FE) has successfully been applied to various bioinformatics problems ranging from biomarker identification to the screening of disease causing genes using gene expression/epigenetic profiles. However, the conditions required for its successful use and the mechanisms involved in how it outperforms other supervised methods is unknown, because PCA based unsupervised FE has only been applied to challenging (i.e. not well known) problems.

RESULTS

In this study, PCA based unsupervised FE was applied to an extensively studied organism, i.e., budding yeast. When applied to two gene expression profiles expected to be temporally periodic, yeast metabolic cycle (YMC) and yeast cell division cycle (YCDC), PCA based unsupervised FE outperformed simple but powerful conventional methods, with sinusoidal fitting with regards to several aspects: (i) feasible biological term enrichment without assuming periodicity for YMC; (ii) identification of periodic profiles whose period was half as long as the cell division cycle for YMC; and (iii) the identification of no more than 37 genes associated with the enrichment of biological terms related to cell division cycle for the integrated analysis of seven YCDC profiles, for which sinusoidal fittings failed. The explantation for differences between methods used and the necessary conditions required were determined by comparing PCA based unsupervised FE with fittings to various periodic (artificial, thus pre-defined) profiles. Furthermore, four popular unsupervised clustering algorithms applied to YMC were not as successful as PCA based unsupervised FE.

CONCLUSIONS

PCA based unsupervised FE is a useful and effective unsupervised method to investigate YMC and YCDC. This study identified why the unsupervised method without pre-judged criteria outperformed supervised methods requiring human defined criteria.

摘要

背景

最近提出的基于主成分分析(PCA)的无监督特征提取(FE)已成功应用于各种生物信息学问题,从生物标志物识别到使用基因表达/表观遗传谱筛选致病基因。然而,其成功应用所需的条件以及它优于其他监督方法的机制尚不清楚,因为基于PCA的无监督FE仅应用于具有挑战性的(即不太知名的)问题。

结果

在本研究中,基于PCA的无监督FE应用于一种经过广泛研究的生物体,即芽殖酵母。当应用于预期具有时间周期性的两个基因表达谱,即酵母代谢周期(YMC)和酵母细胞分裂周期(YCDC)时,基于PCA的无监督FE在几个方面优于简单但强大的传统方法,即正弦拟合:(i)对于YMC,无需假设周期性即可进行可行的生物学术语富集;(ii)识别周期为YMC细胞分裂周期一半的周期性谱;(iii)对于七个YCDC谱的综合分析,识别与细胞分裂周期相关的生物学术语富集相关的不超过37个基因,而正弦拟合在这些方面失败。通过将基于PCA的无监督FE与各种周期性(人工的,因此是预先定义的)谱的拟合进行比较,确定了所用方法之间差异的解释以及所需的必要条件。此外,应用于YMC的四种流行的无监督聚类算法不如基于PCA的无监督FE成功。

结论

基于PCA的无监督FE是研究YMC和YCDC的一种有用且有效的无监督方法。本研究确定了为什么没有预先判断标准的无监督方法优于需要人为定义标准的监督方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e87/4928327/6252d78a7744/13040_2016_101_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e87/4928327/e71a8ae4c215/13040_2016_101_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e87/4928327/02ff6b71d266/13040_2016_101_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e87/4928327/330209285710/13040_2016_101_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e87/4928327/9b0a6a1035cb/13040_2016_101_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e87/4928327/2ca4bb8c6784/13040_2016_101_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e87/4928327/d55ed5f5bc84/13040_2016_101_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e87/4928327/cceaa3d01be0/13040_2016_101_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e87/4928327/faee330f4c8e/13040_2016_101_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e87/4928327/ce4fc73bf086/13040_2016_101_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e87/4928327/2927a3b4800e/13040_2016_101_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e87/4928327/6252d78a7744/13040_2016_101_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e87/4928327/e71a8ae4c215/13040_2016_101_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e87/4928327/02ff6b71d266/13040_2016_101_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e87/4928327/330209285710/13040_2016_101_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e87/4928327/9b0a6a1035cb/13040_2016_101_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e87/4928327/2ca4bb8c6784/13040_2016_101_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e87/4928327/d55ed5f5bc84/13040_2016_101_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e87/4928327/cceaa3d01be0/13040_2016_101_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e87/4928327/faee330f4c8e/13040_2016_101_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e87/4928327/ce4fc73bf086/13040_2016_101_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e87/4928327/2927a3b4800e/13040_2016_101_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e87/4928327/6252d78a7744/13040_2016_101_Fig11_HTML.jpg

相似文献

1
Principal component analysis based unsupervised feature extraction applied to budding yeast temporally periodic gene expression.基于主成分分析的无监督特征提取应用于出芽酵母的时间周期性基因表达。
BioData Min. 2016 Jun 29;9:22. doi: 10.1186/s13040-016-0101-9. eCollection 2016.
2
Principal component analysis-based unsupervised feature extraction applied to in silico drug discovery for posttraumatic stress disorder-mediated heart disease.基于主成分分析的无监督特征提取应用于创伤后应激障碍介导的心脏病的计算机辅助药物发现。
BMC Bioinformatics. 2015 Apr 30;16:139. doi: 10.1186/s12859-015-0574-4.
3
Tensor Decomposition-Based Unsupervised Feature Extraction Applied to Single-Cell Gene Expression Analysis.基于张量分解的无监督特征提取应用于单细胞基因表达分析
Front Genet. 2019 Sep 19;10:864. doi: 10.3389/fgene.2019.00864. eCollection 2019.
4
Tensor decomposition-based and principal-component-analysis-based unsupervised feature extraction applied to the gene expression and methylation profiles in the brains of social insects with multiple castes.基于张量分解和主成分分析的无监督特征提取方法在具有多等级的社会性昆虫的大脑基因表达和甲基化谱中的应用。
BMC Bioinformatics. 2018 May 8;19(Suppl 4):99. doi: 10.1186/s12859-018-2068-7.
5
Tensor-Decomposition-Based Unsupervised Feature Extraction Applied to Prostate Cancer Multiomics Data.基于张量分解的无监督特征提取在前列腺癌多组学数据中的应用。
Genes (Basel). 2020 Dec 11;11(12):1493. doi: 10.3390/genes11121493.
6
Identification of genes associated with altered gene expression and m6A profiles during hypoxia using tensor decomposition based unsupervised feature extraction.利用基于张量分解的无监督特征提取方法鉴定缺氧条件下改变基因表达和 m6A 谱的相关基因。
Sci Rep. 2021 Apr 26;11(1):8909. doi: 10.1038/s41598-021-87779-7.
7
Projection in genomic analysis: A theoretical basis to rationalize tensor decomposition and principal component analysis as feature selection tools.基因组分析中的投影:张量分解和主成分分析作为特征选择工具的理论基础。
PLoS One. 2022 Sep 29;17(9):e0275472. doi: 10.1371/journal.pone.0275472. eCollection 2022.
8
Identification of aberrant gene expression associated with aberrant promoter methylation in primordial germ cells between E13 and E16 rat F3 generation vinclozolin lineage.鉴定E13至E16大鼠F3代乙烯菌核利谱系原始生殖细胞中与异常启动子甲基化相关的异常基因表达。
BMC Bioinformatics. 2015;16 Suppl 18(Suppl 18):S16. doi: 10.1186/1471-2105-16-S18-S16. Epub 2015 Dec 9.
9
Principal Components Analysis Based Unsupervised Feature Extraction Applied to Gene Expression Analysis of Blood from Dengue Haemorrhagic Fever Patients.基于主成分分析的无监督特征提取在登革出血热患者血液基因表达分析中的应用。
Sci Rep. 2017 Mar 9;7:44016. doi: 10.1038/srep44016.
10
Universal Nature of Drug Treatment Responses in Drug-Tissue-Wide Model-Animal Experiments Using Tensor Decomposition-Based Unsupervised Feature Extraction.基于张量分解的无监督特征提取在药物-组织-全模型动物实验中药物治疗反应的普遍性质
Front Genet. 2020 Aug 20;11:695. doi: 10.3389/fgene.2020.00695. eCollection 2020.

引用本文的文献

1
Projection in genomic analysis: A theoretical basis to rationalize tensor decomposition and principal component analysis as feature selection tools.基因组分析中的投影:张量分解和主成分分析作为特征选择工具的理论基础。
PLoS One. 2022 Sep 29;17(9):e0275472. doi: 10.1371/journal.pone.0275472. eCollection 2022.
2
Metabolomics Approach on Non-Targeted Screening of 50 PPCPs in Lettuce and Maize.代谢组学方法对生菜和玉米中 50 种 PPCPs 的非靶向筛查
Molecules. 2022 Jul 23;27(15):4711. doi: 10.3390/molecules27154711.
3
PCA-based unsupervised feature extraction for gene expression analysis of COVID-19 patients.

本文引用的文献

1
Identification of More Feasible MicroRNA-mRNA Interactions within Multiple Cancers Using Principal Component Analysis Based Unsupervised Feature Extraction.使用基于主成分分析的无监督特征提取方法识别多种癌症中更可行的微小RNA-信使核糖核酸相互作用
Int J Mol Sci. 2016 May 10;17(5):696. doi: 10.3390/ijms17050696.
2
Identification of aberrant gene expression associated with aberrant promoter methylation in primordial germ cells between E13 and E16 rat F3 generation vinclozolin lineage.鉴定E13至E16大鼠F3代乙烯菌核利谱系原始生殖细胞中与异常启动子甲基化相关的异常基因表达。
BMC Bioinformatics. 2015;16 Suppl 18(Suppl 18):S16. doi: 10.1186/1471-2105-16-S18-S16. Epub 2015 Dec 9.
3
基于 PCA 的无监督特征提取在 COVID-19 患者基因表达分析中的应用。
Sci Rep. 2021 Aug 30;11(1):17351. doi: 10.1038/s41598-021-95698-w.
4
Prediction and Analysis of Skin Cancer Progression using Genomics Profiles of Patients.利用患者基因组特征预测和分析皮肤癌进展。
Sci Rep. 2019 Oct 31;9(1):15790. doi: 10.1038/s41598-019-52134-4.
5
Tensor Decomposition-Based Unsupervised Feature Extraction Applied to Single-Cell Gene Expression Analysis.基于张量分解的无监督特征提取应用于单细胞基因表达分析
Front Genet. 2019 Sep 19;10:864. doi: 10.3389/fgene.2019.00864. eCollection 2019.
6
Two-Stage Hybrid Gene Selection Using Mutual Information and Genetic Algorithm for Cancer Data Classification.基于互信息和遗传算法的两阶段混合基因选择在癌症数据分类中的应用。
J Med Syst. 2019 Jun 17;43(8):235. doi: 10.1007/s10916-019-1372-8.
7
Drug candidate identification based on gene expression of treated cells using tensor decomposition-based unsupervised feature extraction for large-scale data.基于张量分解的无监督特征提取的处理细胞基因表达的药物候选物识别用于大规模数据。
BMC Bioinformatics. 2019 Feb 4;19(Suppl 13):388. doi: 10.1186/s12859-018-2395-8.
8
Tensor Decomposition-Based Unsupervised Feature Extraction Can Identify the Universal Nature of Sequence-Nonspecific Off-Target Regulation of mRNA Mediated by MicroRNA Transfection.基于张量分解的无监督特征提取能够识别由微小RNA转染介导的mRNA序列非特异性脱靶调控的普遍特性。
Cells. 2018 Jun 4;7(6):54. doi: 10.3390/cells7060054.
9
Tensor decomposition-based and principal-component-analysis-based unsupervised feature extraction applied to the gene expression and methylation profiles in the brains of social insects with multiple castes.基于张量分解和主成分分析的无监督特征提取方法在具有多等级的社会性昆虫的大脑基因表达和甲基化谱中的应用。
BMC Bioinformatics. 2018 May 8;19(Suppl 4):99. doi: 10.1186/s12859-018-2068-7.
10
Tensor decomposition-based unsupervised feature extraction identifies candidate genes that induce post-traumatic stress disorder-mediated heart diseases.基于张量分解的无监督特征提取可识别出诱发创伤后应激障碍介导的心脏病的候选基因。
BMC Med Genomics. 2017 Dec 21;10(Suppl 4):67. doi: 10.1186/s12920-017-0302-1.
Comprehensive analysis of transcriptome and metabolome analysis in Intrahepatic Cholangiocarcinoma and Hepatocellular Carcinoma.
肝内胆管癌和肝细胞癌中转录组和代谢组的综合分析
Sci Rep. 2015 Nov 5;5:16294. doi: 10.1038/srep16294.
4
Principal component analysis-based unsupervised feature extraction applied to in silico drug discovery for posttraumatic stress disorder-mediated heart disease.基于主成分分析的无监督特征提取应用于创伤后应激障碍介导的心脏病的计算机辅助药物发现。
BMC Bioinformatics. 2015 Apr 30;16:139. doi: 10.1186/s12859-015-0574-4.
5
TINAGL1 and B3GALNT1 are potential therapy target genes to suppress metastasis in non-small cell lung cancer.TINAGL1和B3GALNT1是抑制非小细胞肺癌转移的潜在治疗靶基因。
BMC Genomics. 2014;15 Suppl 9(Suppl 9):S2. doi: 10.1186/1471-2164-15-S9-S2. Epub 2014 Dec 8.
6
Cyclebase 3.0: a multi-organism database on cell-cycle regulation and phenotypes.细胞周期数据库3.0:一个关于细胞周期调控和表型的多物种数据库。
Nucleic Acids Res. 2015 Jan;43(Database issue):D1140-4. doi: 10.1093/nar/gku1092. Epub 2014 Nov 5.
7
STRING v10: protein-protein interaction networks, integrated over the tree of life.STRING v10:整合了整个生命之树的蛋白质-蛋白质相互作用网络。
Nucleic Acids Res. 2015 Jan;43(Database issue):D447-52. doi: 10.1093/nar/gku1003. Epub 2014 Oct 28.
8
Comparison of hepatocellular carcinoma miRNA expression profiling as evaluated by next generation sequencing and microarray.通过下一代测序和微阵列评估的肝细胞癌miRNA表达谱比较
PLoS One. 2014 Sep 12;9(9):e106314. doi: 10.1371/journal.pone.0106314. eCollection 2014.
9
Universal disease biomarker: can a fixed set of blood microRNAs diagnose multiple diseases?通用疾病生物标志物:一组固定的血液微小RNA能诊断多种疾病吗?
BMC Res Notes. 2014 Aug 30;7:581. doi: 10.1186/1756-0500-7-581.
10
Genes associated with genotype-specific DNA methylation in squamous cell carcinoma as candidate drug targets.与鳞状细胞癌中基因型特异性DNA甲基化相关的基因作为候选药物靶点。
BMC Syst Biol. 2014;8 Suppl 1(Suppl 1):S4. doi: 10.1186/1752-0509-8-S1-S4. Epub 2014 Jan 24.