• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

将稀疏Cox模型与纵向转录组学数据聚类相结合用于创伤预后分析。

Coupling sparse Cox models with clustering of longitudinal transcriptomics data for trauma prognosis.

作者信息

Constantino Cláudia S, Carvalho Alexandra M, Vinga Susana

机构信息

INESC-ID, Instituto Superior Técnico, ULisboa, R. Alves Redol 9, Lisbon, 1000-029, Portugal.

Instituto de Telecomunicações, Instituto Superior Técnico, ULisboa, Av. Rovisco Pais 1, Lisbon, 1049-001, Portugal.

出版信息

BioData Min. 2021 Apr 14;14(1):25. doi: 10.1186/s13040-021-00257-8.

DOI:10.1186/s13040-021-00257-8
PMID:33853663
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8048345/
Abstract

BACKGROUND

Longitudinal gene expression analysis and survival modeling have been proved to add valuable biological and clinical knowledge. This study proposes a novel framework to discover gene signatures and patterns in a high-dimensional time series transcriptomics data and to assess their association with hospital length of stay.

METHODS

We investigated a longitudinal and high-dimensional gene expression dataset from 168 blunt-force trauma patients followed during the first 28 days after injury. To model the length of stay, an initial dimensionality reduction step was performed by applying Cox regression with elastic net regularization using gene expression data from the first hospitalization days. Also, a novel methodology to impute missing values to the genes selected previously was proposed. We then applied multivariate time series (MTS) clustering to analyse gene expression over time and to stratify patients with similar trajectories. The validation of the patients' partitions obtained by MTS clustering was performed using Kaplan-Meier curves and log-rank tests.

RESULTS

We were able to unravel 22 genes strongly associated with hospital's discharge. Their expression values in the first days after trauma showed to be good predictors of the length of stay. The proposed mixed imputation method allowed to achieve a complete dataset of short time series with a minimum loss of information for the 28 days of follow-up. MTS clustering enabled to group patients with similar genes trajectories and, notably, with similar discharge days from the hospital. Patients within each cluster have comparable genes' trajectories and may have an analogous response to injury.

CONCLUSION

The proposed framework was able to tackle the joint analysis of time-to-event information with longitudinal multivariate high-dimensional data. The application to length of stay and transcriptomics data revealed a strong relationship between gene expression trajectory and patients' recovery, which may improve trauma patient's management by healthcare systems. The proposed methodology can be easily adapted to other medical data, towards more effective clinical decision support systems for health applications.

摘要

背景

纵向基因表达分析和生存建模已被证明能增加有价值的生物学和临床知识。本研究提出了一个新颖的框架,用于在高维时间序列转录组学数据中发现基因特征和模式,并评估它们与住院时间的关联。

方法

我们研究了168例钝器伤患者在受伤后前28天的纵向高维基因表达数据集。为了对住院时间进行建模,通过应用带有弹性网络正则化的Cox回归,使用首次住院日的基因表达数据进行了初始降维步骤。此外,还提出了一种新方法来填补先前选择的基因的缺失值。然后,我们应用多变量时间序列(MTS)聚类来分析基因表达随时间的变化,并对具有相似轨迹的患者进行分层。使用Kaplan-Meier曲线和对数秩检验对通过MTS聚类获得的患者分区进行验证。

结果

我们能够找出22个与出院密切相关的基因。它们在创伤后最初几天的表达值显示是住院时间的良好预测指标。所提出的混合填补方法能够获得一个完整的短时间序列数据集,在28天的随访中信息损失最小。MTS聚类能够将具有相似基因轨迹的患者分组,特别是具有相似出院日期的患者。每个聚类中的患者具有可比的基因轨迹,并且可能对损伤有类似的反应。

结论

所提出的框架能够处理事件发生时间信息与纵向多变量高维数据的联合分析。应用于住院时间和转录组学数据揭示了基因表达轨迹与患者恢复之间的密切关系,这可能会改善医疗系统对创伤患者的管理。所提出的方法可以很容易地适用于其他医学数据,以建立更有效的健康应用临床决策支持系统。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba4b/8048345/637ef80b6d27/13040_2021_257_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba4b/8048345/df5506ad46e7/13040_2021_257_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba4b/8048345/8b52bd9dc0e0/13040_2021_257_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba4b/8048345/d24427d69ead/13040_2021_257_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba4b/8048345/fe79891793c9/13040_2021_257_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba4b/8048345/c9411e5e1973/13040_2021_257_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba4b/8048345/8fedaedc483f/13040_2021_257_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba4b/8048345/1258272c3063/13040_2021_257_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba4b/8048345/637ef80b6d27/13040_2021_257_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba4b/8048345/df5506ad46e7/13040_2021_257_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba4b/8048345/8b52bd9dc0e0/13040_2021_257_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba4b/8048345/d24427d69ead/13040_2021_257_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba4b/8048345/fe79891793c9/13040_2021_257_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba4b/8048345/c9411e5e1973/13040_2021_257_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba4b/8048345/8fedaedc483f/13040_2021_257_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba4b/8048345/1258272c3063/13040_2021_257_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba4b/8048345/637ef80b6d27/13040_2021_257_Fig8_HTML.jpg

相似文献

1
Coupling sparse Cox models with clustering of longitudinal transcriptomics data for trauma prognosis.将稀疏Cox模型与纵向转录组学数据聚类相结合用于创伤预后分析。
BioData Min. 2021 Apr 14;14(1):25. doi: 10.1186/s13040-021-00257-8.
2
A comparative study on the unified model based multifactor dimensionality reduction methods for identifying gene-gene interactions associated with the survival phenotype.基于统一模型的多因素降维方法识别与生存表型相关的基因-基因相互作用的比较研究。
BioData Min. 2021 Mar 1;14(1):17. doi: 10.1186/s13040-021-00248-9.
3
Multiple Imputation based Clustering Validation (MIV) for Big Longitudinal Trial Data with Missing Values in eHealth.用于电子健康领域中存在缺失值的大型纵向试验数据的基于多重填补的聚类验证(MIV)
J Med Syst. 2016 Jun;40(6):146. doi: 10.1007/s10916-016-0499-0. Epub 2016 Apr 28.
4
Multi-cancer samples clustering via graph regularized low-rank representation method under sparse and symmetric constraints.基于稀疏和对称约束的图正则化低秩表示方法的多癌样本聚类。
BMC Bioinformatics. 2019 Dec 30;20(Suppl 22):718. doi: 10.1186/s12859-019-3231-5.
5
Performance Evaluation of Missing-Value Imputation Clustering Based on a Multivariate Gaussian Mixture Model.基于多元高斯混合模型的缺失值插补聚类性能评估
PLoS One. 2016 Aug 23;11(8):e0161112. doi: 10.1371/journal.pone.0161112. eCollection 2016.
6
Survival marker genes of colorectal cancer derived from consistent transcriptomic profiling.基于一致的转录组分析的结直肠癌生存标记基因。
BMC Genomics. 2018 Dec 11;19(Suppl 8):857. doi: 10.1186/s12864-018-5193-9.
7
Joint model-based clustering of nonlinear longitudinal trajectories and associated time-to-event data analysis, linked by latent class membership: with application to AIDS clinical studies.基于联合模型的非线性纵向轨迹聚类以及相关的事件发生时间数据分析,通过潜在类别成员关系进行关联:应用于艾滋病临床研究。
Lifetime Data Anal. 2018 Oct;24(4):699-718. doi: 10.1007/s10985-017-9409-0. Epub 2017 Oct 27.
8
Histone deactylase gene expression profiles are associated with outcomes in blunt trauma patients.组蛋白去乙酰化酶基因表达谱与钝性创伤患者的预后相关。
J Trauma Acute Care Surg. 2016 Jan;80(1):26-32; discussion 32-3. doi: 10.1097/TA.0000000000000896.
9
Towards clustering of incomplete microarray data without the use of imputation.迈向无需插补的不完整微阵列数据聚类
Bioinformatics. 2007 Jan 1;23(1):107-13. doi: 10.1093/bioinformatics/btl555. Epub 2006 Oct 31.
10
Chronic consequences of acute injuries: worse survival after discharge.急性损伤的慢性后果:出院后生存率下降。
J Trauma Acute Care Surg. 2012 Sep;73(3):699-703. doi: 10.1097/TA.0b013e318253b5db.

本文引用的文献

1
Structured sparsity regularization for analyzing high-dimensional omics data.结构稀疏正则化分析高维组学数据。
Brief Bioinform. 2021 Jan 18;22(1):77-87. doi: 10.1093/bib/bbaa122.
2
Multi-omics Data Integration, Interpretation, and Its Application.多组学数据整合、解读及其应用
Bioinform Biol Insights. 2020 Jan 31;14:1177932219899051. doi: 10.1177/1177932219899051. eCollection 2020.
3
Vanin 1: Its Physiological Function and Role in Diseases.范宁 1:其生理功能及在疾病中的作用。
Int J Mol Sci. 2019 Aug 9;20(16):3891. doi: 10.3390/ijms20163891.
4
Twiner: correlation-based regularization for identifying common cancer gene signatures.Twiner:基于相关性的正则化方法,用于鉴定常见癌症基因特征。
BMC Bioinformatics. 2019 Jun 25;20(1):356. doi: 10.1186/s12859-019-2937-8.
5
Comparison of Gaussian Processes Methods to Linear methods for Imputation of Sparse Physiological Time Series.用于稀疏生理时间序列插补的高斯过程方法与线性方法的比较
Annu Int Conf IEEE Eng Med Biol Soc. 2018 Jul;2018:4106-4109. doi: 10.1109/EMBC.2018.8513303.
6
Time-series clustering of cage-level sea lice data.基于笼具水平的海虱数据的时间序列聚类。
PLoS One. 2018 Sep 25;13(9):e0204319. doi: 10.1371/journal.pone.0204319. eCollection 2018.
7
Cox model with interval-censored covariate in cohort studies.队列研究中具有区间删失协变量的Cox模型。
Biom J. 2018 Jul;60(4):797-814. doi: 10.1002/bimj.201700090. Epub 2018 May 18.
8
Clustering gene expression time series data using an infinite Gaussian process mixture model.使用无限高斯过程混合模型对基因表达时间序列数据进行聚类。
PLoS Comput Biol. 2018 Jan 16;14(1):e1005896. doi: 10.1371/journal.pcbi.1005896. eCollection 2018 Jan.
9
Application of time series analysis in modelling and forecasting emergency department visits in a medical centre in Southern Taiwan.时间序列分析在台湾南部某医疗中心急诊就诊建模与预测中的应用
BMJ Open. 2017 Dec 1;7(11):e018628. doi: 10.1136/bmjopen-2017-018628.
10
Computational dynamic approaches for temporal omics data with applications to systems medicine.用于时间组学数据的计算动力学方法及其在系统医学中的应用
BioData Min. 2017 Jun 17;10:20. doi: 10.1186/s13040-017-0140-x. eCollection 2017.