• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

多 TGDR:一种微阵列实验中多类分类的正则化方法。

Multi-TGDR: a regularization method for multi-class classification in microarray experiments.

机构信息

Division of Clinical Epidemiology, First Hospital of the Jilin University, Changchun, Jilin, China ; Center for Clinical and Translational Science, The Rockefeller University, New York, New York, United States of America.

出版信息

PLoS One. 2013 Nov 19;8(11):e78302. doi: 10.1371/journal.pone.0078302. eCollection 2013.

DOI:10.1371/journal.pone.0078302
PMID:24260109
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3833980/
Abstract

BACKGROUND

As microarray technology has become mature and popular, the selection and use of a small number of relevant genes for accurate classification of samples has arisen as a hot topic in the circles of biostatistics and bioinformatics. However, most of the developed algorithms lack the ability to handle multiple classes, arguably a common application. Here, we propose an extension to an existing regularization algorithm, called Threshold Gradient Descent Regularization (TGDR), to specifically tackle multi-class classification of microarray data. When there are several microarray experiments addressing the same/similar objectives, one option is to use a meta-analysis version of TGDR (Meta-TGDR), which considers the classification task as a combination of classifiers with the same structure/model while allowing the parameters to vary across studies. However, the original Meta-TGDR extension did not offer a solution to the prediction on independent samples. Here, we propose an explicit method to estimate the overall coefficients of the biomarkers selected by Meta-TGDR. This extension permits broader applicability and allows a comparison between the predictive performance of Meta-TGDR and TGDR using an independent testing set.

RESULTS

Using real-world applications, we demonstrated the proposed multi-TGDR framework works well and the number of selected genes is less than the sum of all individualized binary TGDRs. Additionally, Meta-TGDR and TGDR on the batch-effect adjusted pooled data approximately provided same results. By adding Bagging procedure in each application, the stability and good predictive performance are warranted.

CONCLUSIONS

Compared with Meta-TGDR, TGDR is less computing time intensive, and requires no samples of all classes in each study. On the adjusted data, it has approximate same predictive performance with Meta-TGDR. Thus, it is highly recommended.

摘要

背景

随着微阵列技术的成熟和普及,选择和使用少量相关基因来准确分类样本已成为生物统计学和生物信息学领域的热门话题。然而,大多数开发的算法缺乏处理多类别的能力,可以说是一种常见的应用。在这里,我们提出了一种扩展现有正则化算法的方法,称为阈值梯度下降正则化(TGDR),专门用于处理微阵列数据的多类分类。当有几个微阵列实验针对相同/相似的目标时,一种选择是使用 TGDR 的元分析版本(Meta-TGDR),它将分类任务视为具有相同结构/模型的分类器的组合,同时允许参数在研究中变化。然而,原始的 Meta-TGDR 扩展并没有为独立样本的预测提供解决方案。在这里,我们提出了一种显式方法来估计 Meta-TGDR 选择的生物标志物的总体系数。这种扩展允许更广泛的适用性,并允许使用独立测试集比较 Meta-TGDR 和 TGDR 的预测性能。

结果

使用真实应用,我们证明了所提出的多 TGDR 框架运行良好,并且选择的基因数量少于所有个体化二元 TGDR 的总和。此外,经过批次效应调整后,Meta-TGDR 和 TGDR 在汇总数据上的结果大致相同。通过在每个应用程序中添加 Bagging 过程,可以保证稳定性和良好的预测性能。

结论

与 Meta-TGDR 相比,TGDR 的计算时间更短,并且不需要每个研究中的所有类别的样本。在调整后的数据上,它具有与 Meta-TGDR 近似相同的预测性能。因此,强烈推荐使用。

相似文献

1
Multi-TGDR: a regularization method for multi-class classification in microarray experiments.多 TGDR:一种微阵列实验中多类分类的正则化方法。
PLoS One. 2013 Nov 19;8(11):e78302. doi: 10.1371/journal.pone.0078302. eCollection 2013.
2
Multi-TGDR, a multi-class regularization method, identifies the metabolic profiles of hepatocellular carcinoma and cirrhosis infected with hepatitis B or hepatitis C virus.多组元 TGDR 是一种多类正则化方法,用于识别乙型肝炎或丙型肝炎病毒感染所致肝细胞癌和肝硬化的代谢特征。
BMC Bioinformatics. 2014 Apr 4;15:97. doi: 10.1186/1471-2105-15-97.
3
GEE-TGDR: A Longitudinal Feature Selection Algorithm and Its Application to lncRNA Expression Profiles for Psoriasis Patients Treated with Immune Therapies.GEE-TGDR:一种纵向特征选择算法及其在免疫治疗银屑病患者长链非编码 RNA 表达谱中的应用。
Biomed Res Int. 2021 Apr 9;2021:8862895. doi: 10.1155/2021/8862895. eCollection 2021.
4
Combining clinical and genomic covariates via Cov-TGDR.通过Cov-TGDR结合临床和基因组协变量。
Cancer Inform. 2007 Oct 15;3:371-8.
5
Clustering threshold gradient descent regularization: with applications to microarray studies.聚类阈值梯度下降正则化:及其在微阵列研究中的应用
Bioinformatics. 2007 Feb 15;23(4):466-72. doi: 10.1093/bioinformatics/btl632. Epub 2006 Dec 20.
6
On the analysis of glycomics mass spectrometry data via the regularized area under the ROC curve.通过正则化ROC曲线下面积对糖组学质谱数据进行分析。
BMC Bioinformatics. 2007 Dec 12;8:477. doi: 10.1186/1471-2105-8-477.
7
Integrative Interaction Analysis using Threshold Gradient Directed Regularization.使用阈值梯度定向正则化的综合交互分析
Appl Stoch Models Bus Ind. 2019 Mar-Apr;35(2):354-375. doi: 10.1002/asmb.2342. Epub 2018 May 29.
8
An integrated approach for identifying wrongly labelled samples when performing classification in microarray data.一种在微阵列数据分析中进行分类时识别错误标记样本的综合方法。
PLoS One. 2012;7(10):e46700. doi: 10.1371/journal.pone.0046700. Epub 2012 Oct 17.
9
Sparse logistic regression with a L1/2 penalty for gene selection in cancer classification.基于 L1/2 罚项的稀疏逻辑回归在癌症分类中的基因选择。
BMC Bioinformatics. 2013 Jun 19;14:198. doi: 10.1186/1471-2105-14-198.
10
Management of well-differentiated thyroglossal remnant thyroid carcinoma: time to close the debate? Report of five new cases and proposal of a definitive algorithm for treatment.高分化甲状腺舌管残余甲状腺癌的管理:是时候结束争论了吗?5例新病例报告及确定性治疗算法建议
Ann Surg Oncol. 2006 May;13(5):745-52. doi: 10.1245/ASO.2006.05.022. Epub 2006 Mar 16.

引用本文的文献

1
GEE-TGDR: A Longitudinal Feature Selection Algorithm and Its Application to lncRNA Expression Profiles for Psoriasis Patients Treated with Immune Therapies.GEE-TGDR:一种纵向特征选择算法及其在免疫治疗银屑病患者长链非编码 RNA 表达谱中的应用。
Biomed Res Int. 2021 Apr 9;2021:8862895. doi: 10.1155/2021/8862895. eCollection 2021.
2
Feature Selection for Longitudinal Data by Using Sign Averages to Summarize Gene Expression Values over Time.通过使用符号平均值来总结基因表达值随时间的变化进行纵向数据的特征选择。
Biomed Res Int. 2019 Mar 19;2019:1724898. doi: 10.1155/2019/1724898. eCollection 2019.
3
A longitudinal feature selection method identifies relevant genes to distinguish complicated injury and uncomplicated injury over time.一种纵向特征选择方法可识别出相关基因,以区分随时间推移的复杂损伤和简单损伤。
BMC Med Inform Decis Mak. 2018 Dec 7;18(Suppl 5):115. doi: 10.1186/s12911-018-0685-8.
4
To select relevant features for longitudinal gene expression data by extending a pathway analysis method.通过扩展一种通路分析方法来选择纵向基因表达数据的相关特征。
F1000Res. 2018 Jul 31;7:1166. doi: 10.12688/f1000research.15357.1. eCollection 2018.
5
Identification of subtype-specific prognostic signatures using Cox models with redundant gene elimination.使用具有冗余基因消除功能的Cox模型鉴定亚型特异性预后特征。
Oncol Lett. 2018 Jun;15(6):8545-8555. doi: 10.3892/ol.2018.8418. Epub 2018 Apr 4.
6
Identification of Genes Discriminating Multiple Sclerosis Patients from Controls by Adapting a Pathway Analysis Method.通过改进通路分析方法鉴别区分多发性硬化症患者与对照的基因
PLoS One. 2016 Nov 15;11(11):e0165543. doi: 10.1371/journal.pone.0165543. eCollection 2016.
7
Classification of Non-Small Cell Lung Cancer Using Significance Analysis of Microarray-Gene Set Reduction Algorithm.使用微阵列基因集缩减算法的显著性分析对非小细胞肺癌进行分类
Biomed Res Int. 2016;2016:2491671. doi: 10.1155/2016/2491671. Epub 2016 Jun 30.
8
Meta-analysis derived atopic dermatitis (MADAD) transcriptome defines a robust AD signature highlighting the involvement of atherosclerosis and lipid metabolism pathways.荟萃分析得出的特应性皮炎(MADAD)转录组定义了一个强大的特应性皮炎特征,突出了动脉粥样硬化和脂质代谢途径的参与。
BMC Med Genomics. 2015 Oct 12;8:60. doi: 10.1186/s12920-015-0133-x.
9
Visualization-aided classification ensembles discriminate lung adenocarcinoma and squamous cell carcinoma samples using their gene expression profiles.可视化辅助分类集成通过基因表达谱区分肺腺癌和鳞状细胞癌样本。
PLoS One. 2014 Oct 15;9(10):e110052. doi: 10.1371/journal.pone.0110052. eCollection 2014.
10
Multi-TGDR, a multi-class regularization method, identifies the metabolic profiles of hepatocellular carcinoma and cirrhosis infected with hepatitis B or hepatitis C virus.多组元 TGDR 是一种多类正则化方法,用于识别乙型肝炎或丙型肝炎病毒感染所致肝细胞癌和肝硬化的代谢特征。
BMC Bioinformatics. 2014 Apr 4;15:97. doi: 10.1186/1471-2105-15-97.

本文引用的文献

1
Meta-analysis derived (MAD) transcriptome of psoriasis defines the "core" pathogenesis of disease.基于荟萃分析(MAD)的银屑病转录组定义了疾病的“核心”发病机制。
PLoS One. 2012;7(9):e44274. doi: 10.1371/journal.pone.0044274. Epub 2012 Sep 5.
2
Expanding the psoriasis disease profile: interrogation of the skin and serum of patients with moderate-to-severe psoriasis.扩大银屑病疾病谱:对中重度银屑病患者的皮肤和血清进行检测。
J Invest Dermatol. 2012 Nov;132(11):2552-64. doi: 10.1038/jid.2012.184. Epub 2012 Jul 5.
3
Frozen robust multiarray analysis (fRMA).冻融稳健多阵列分析(fRMA)。
Biostatistics. 2010 Apr;11(2):242-53. doi: 10.1093/biostatistics/kxp059. Epub 2010 Jan 22.
4
Global gene expression analysis reveals evidence for decreased lipid biosynthesis and increased innate immunity in uninvolved psoriatic skin.全基因组表达分析揭示了未受累银屑病皮肤中脂质生物合成减少和固有免疫增加的证据。
J Invest Dermatol. 2009 Dec;129(12):2795-804. doi: 10.1038/jid.2009.173. Epub 2009 Jul 2.
5
Filtering genes for cluster and network analysis.为聚类和网络分析筛选基因。
BMC Bioinformatics. 2009 Jun 23;10:193. doi: 10.1186/1471-2105-10-193.
6
Regularized gene selection in cancer microarray meta-analysis.癌症微阵列荟萃分析中的正则化基因选择
BMC Bioinformatics. 2009 Jan 1;10:1. doi: 10.1186/1471-2105-10-1.
7
Type I interferon: potential therapeutic target for psoriasis?I型干扰素:银屑病的潜在治疗靶点?
PLoS One. 2008 Jul 16;3(7):e2737. doi: 10.1371/journal.pone.0002737.
8
A comparison of methods for multiclass support vector machines.多类支持向量机方法的比较
IEEE Trans Neural Netw. 2002;13(2):415-25. doi: 10.1109/72.991427.
9
A parsimonious threshold-independent protein feature selection method through the area under receiver operating characteristic curve.一种基于受试者工作特征曲线下面积的简约且与阈值无关的蛋白质特征选择方法。
Bioinformatics. 2007 Oct 15;23(20):2788-94. doi: 10.1093/bioinformatics/btm442. Epub 2007 Sep 18.
10
A review of feature selection techniques in bioinformatics.生物信息学中特征选择技术综述。
Bioinformatics. 2007 Oct 1;23(19):2507-17. doi: 10.1093/bioinformatics/btm344. Epub 2007 Aug 24.