多组元 TGDR 是一种多类正则化方法，用于识别乙型肝炎或丙型肝炎病毒感染所致肝细胞癌和肝硬化的代谢特征。

Multi-TGDR, a multi-class regularization method, identifies the metabolic profiles of hepatocellular carcinoma and cirrhosis infected with hepatitis B or hepatitis C virus.

机构信息

Division of Clinical Epidemiology, First Hospital of the Jilin University, 71Xinmin Street, Changchun, Jilin 130021, China.

出版信息

BMC Bioinformatics. 2014 Apr 4;15:97. doi: 10.1186/1471-2105-15-97.

DOI:10.1186/1471-2105-15-97

PMID:24707821

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4234477/

Abstract

BACKGROUND

Over the last decade, metabolomics has evolved into a mainstream enterprise utilized by many laboratories globally. Like other "omics" data, metabolomics data has the characteristics of a smaller sample size compared to the number of features evaluated. Thus the selection of an optimal subset of features with a supervised classifier is imperative. We extended an existing feature selection algorithm, threshold gradient descent regularization (TGDR), to handle multi-class classification of "omics" data, and proposed two such extensions referred to as multi-TGDR. Both multi-TGDR frameworks were used to analyze a metabolomics dataset that compares the metabolic profiles of hepatocellular carcinoma (HCC) infected with hepatitis B (HBV) or C virus (HCV) with that of cirrhosis induced by HBV/HCV infection; the goal was to improve early-stage diagnosis of HCC.

RESULTS

We applied two multi-TGDR frameworks to the HCC metabolomics data that determined TGDR thresholds either globally across classes, or locally for each class. Multi-TGDR global model selected 45 metabolites with a 0% misclassification rate (the error rate on the training data) and had a 3.82% 5-fold cross-validation (CV-5) predictive error rate. Multi-TGDR local selected 48 metabolites with a 0% misclassification rate and a 5.34% CV-5 error rate.

CONCLUSIONS

One important advantage of multi-TGDR local is that it allows inference for determining which feature is related specifically to the class/classes. Thus, we recommend multi-TGDR local be used because it has similar predictive performance and requires the same computing time as multi-TGDR global, but may provide class-specific inference.

摘要

背景

在过去的十年中，代谢组学已经发展成为一个被许多全球实验室使用的主流领域。与其他“组学”数据一样，代谢组学数据的特点是评估的特征数量比样本量小。因此，选择具有监督分类器的最佳特征子集是至关重要的。我们扩展了现有的特征选择算法，阈值梯度下降正则化（TGDR），以处理“组学”数据的多类分类，并提出了两种扩展，称为多-TGDR。这两种多-TGDR 框架都用于分析一个代谢组学数据集，该数据集比较了乙型肝炎（HBV）或丙型肝炎（HCV）感染的肝细胞癌（HCC）的代谢谱与 HBV/HCV 感染引起的肝硬化的代谢谱；目标是改善 HCC 的早期诊断。

结果

我们将两种多-TGDR 框架应用于 HCC 代谢组学数据，这些框架确定了要么在全局范围内跨类别的 TGDR 阈值，要么在每个类别的局部范围内确定 TGDR 阈值。多-TGDR 全局模型选择了 45 个代谢物，其错误率（训练数据上的错误率）为 0%，5 倍交叉验证（CV-5）预测错误率为 3.82%。多-TGDR 局部选择了 48 个代谢物，错误率为 0%，CV-5 错误率为 5.34%。

结论

多-TGDR 局部的一个重要优势是它允许进行推断，以确定哪个特征与特定的类/类有关。因此，我们建议使用多-TGDR 局部，因为它具有相似的预测性能，并且需要与多-TGDR 全局相同的计算时间，但可能提供类特定的推断。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7fe6/4234477/14e42090a8f7/1471-2105-15-97-1.jpg

相似文献

Multi-TGDR, a multi-class regularization method, identifies the metabolic profiles of hepatocellular carcinoma and cirrhosis infected with hepatitis B or hepatitis C virus.

BMC Bioinformatics. 2014 Apr 4;15:97. doi: 10.1186/1471-2105-15-97.

Multi-TGDR: a regularization method for multi-class classification in microarray experiments.

PLoS One. 2013 Nov 19;8(11):e78302. doi: 10.1371/journal.pone.0078302. eCollection 2013.

Targeted metabolic profiling of hepatocellular carcinoma and hepatitis C using LC-MS/MS.

Electrophoresis. 2013 Oct;34(19):2910-7. doi: 10.1002/elps.201300029. Epub 2013 Sep 1.

Patterns of hepatocellular carcinoma development in hepatitis B virus and hepatitis C virus related cirrhosis.

Antiviral Res. 2001 Nov;52(2):199-207. doi: 10.1016/s0166-3542(01)00185-1.

Hepatic expression of the proliferative marker Ki-67 and p53 protein in HBV or HCV cirrhosis in relation to dysplastic liver cell changes and hepatocellular carcinoma.

J Viral Hepat. 2005 Nov;12(6):635-41. doi: 10.1111/j.1365-2893.2005.00635.x.

Does dual infection by hepatitis B and C viruses play an important role in the pathogenesis of hepatocellular carcinoma in Japan?

Cancer. 1997 Dec 1;80(11):2060-7.

Impact of occult hepatitis B virus infection and prior hepatitis B virus infection on development of hepatocellular carcinoma in patients with liver cirrhosis due to hepatitis C virus.

Scand J Gastroenterol. 2008;43(7):849-56. doi: 10.1080/00365520801935459.

Integrated metabolomic profiling of hepatocellular carcinoma in hepatitis C cirrhosis through GC/MS and UPLC/MS-MS.

Liver Int. 2014 Oct;34(9):1428-44. doi: 10.1111/liv.12541. Epub 2014 Apr 28.

Concurrent hepatitis B and C virus infection and risk of hepatocellular carcinoma in cirrhosis. A prospective study.

Cancer. 1994 Nov 1;74(9):2442-8. doi: 10.1002/1097-0142(19941101)74:9<2442::aid-cncr2820740909>3.0.co;2-#.

Minor role of hepatitis B and C virus infection in the etiology of hepatocellular carcinoma in a low-endemic area.

Scand J Gastroenterol. 1996 Aug;31(8):809-13. doi: 10.3109/00365529609010357.

引用本文的文献

GEE-TGDR: A Longitudinal Feature Selection Algorithm and Its Application to lncRNA Expression Profiles for Psoriasis Patients Treated with Immune Therapies.

Biomed Res Int. 2021 Apr 9;2021:8862895. doi: 10.1155/2021/8862895. eCollection 2021.

Feature Selection for Longitudinal Data by Using Sign Averages to Summarize Gene Expression Values over Time.

Biomed Res Int. 2019 Mar 19;2019:1724898. doi: 10.1155/2019/1724898. eCollection 2019.

The metabolic fingerprints of HCV and HBV infections studied by Nuclear Magnetic Resonance Spectroscopy.

Sci Rep. 2019 Mar 11;9(1):4128. doi: 10.1038/s41598-019-40028-4.

A longitudinal feature selection method identifies relevant genes to distinguish complicated injury and uncomplicated injury over time.

BMC Med Inform Decis Mak. 2018 Dec 7;18(Suppl 5):115. doi: 10.1186/s12911-018-0685-8.

To select relevant features for longitudinal gene expression data by extending a pathway analysis method.

F1000Res. 2018 Jul 31;7:1166. doi: 10.12688/f1000research.15357.1. eCollection 2018.

Deciphering hepatocellular carcinoma through metabolomics: from biomarker discovery to therapy evaluation.

Cancer Manag Res. 2018 Apr 11;10:715-734. doi: 10.2147/CMAR.S156837. eCollection 2018.

Robust Regression Analysis of GCMS Data Reveals Differential Rewiring of Metabolic Networks in Hepatitis B and C Patients.

Metabolites. 2017 Oct 8;7(4):51. doi: 10.3390/metabo7040051.

Identification of Subtype-Specific Prognostic Genes for Early-Stage Lung Adenocarcinoma and Squamous Cell Carcinoma Patients Using an Embedded Feature Selection Algorithm.

PLoS One. 2015 Jul 30;10(7):e0134630. doi: 10.1371/journal.pone.0134630. eCollection 2015.

Visualization-aided classification ensembles discriminate lung adenocarcinoma and squamous cell carcinoma samples using their gene expression profiles.

PLoS One. 2014 Oct 15;9(10):e110052. doi: 10.1371/journal.pone.0110052. eCollection 2014.

本文引用的文献

Multi-TGDR: a regularization method for multi-class classification in microarray experiments.

PLoS One. 2013 Nov 19;8(11):e78302. doi: 10.1371/journal.pone.0078302. eCollection 2013.

Stable feature selection and classification algorithms for multiclass microarray data.

Biol Direct. 2012 Oct 2;7:33. doi: 10.1186/1745-6150-7-33.

Meta-analysis derived (MAD) transcriptome of psoriasis defines the "core" pathogenesis of disease.

PLoS One. 2012;7(9):e44274. doi: 10.1371/journal.pone.0044274. Epub 2012 Sep 5.

Serum metabolic profiling study of hepatocellular carcinoma infected with hepatitis B or hepatitis C virus by using liquid chromatography-mass spectrometry.

J Proteome Res. 2012 Nov 2;11(11):5433-42. doi: 10.1021/pr300683a. Epub 2012 Sep 28.

Computational and statistical analysis of protein mass spectrometry data.

PLoS Comput Biol. 2012 Jan;8(1):e1002296. doi: 10.1371/journal.pcbi.1002296. Epub 2012 Jan 26.

Bioinformatic-driven search for metabolic biomarkers in disease.

J Clin Bioinforma. 2011 Jan 20;1(1):2. doi: 10.1186/2043-9113-1-2.

Serum 27-nor-5β-cholestane-3,7,12,24,25 pentol glucuronide discovered by metabolomics as potential diagnostic biomarker for epithelium ovarian cancer.

J Proteome Res. 2011 May 6;10(5):2625-32. doi: 10.1021/pr200173q. Epub 2011 Apr 19.

Enhanced detection of early hepatocellular carcinoma by serum SELDI-TOF proteomic signature combined with alpha-fetoprotein marker.

Ann Surg Oncol. 2010 Sep;17(9):2518-25. doi: 10.1245/s10434-010-1038-8. Epub 2010 Mar 31.

Metabolic profiling using principal component analysis, discriminant partial least squares, and genetic algorithms.

Talanta. 2006 Feb 28;68(5):1683-91. doi: 10.1016/j.talanta.2005.08.042. Epub 2005 Sep 19.

HMDB: a knowledgebase for the human metabolome.

Nucleic Acids Res. 2009 Jan;37(Database issue):D603-10. doi: 10.1093/nar/gkn810. Epub 2008 Oct 25.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

多组元 TGDR 是一种多类正则化方法，用于识别乙型肝炎或丙型肝炎病毒感染所致肝细胞癌和肝硬化的代谢特征。

Multi-TGDR, a multi-class regularization method, identifies the metabolic profiles of hepatocellular carcinoma and cirrhosis infected with hepatitis B or hepatitis C virus.

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSIONS

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献