School of Information Systems, Computing, Brunel University, Uxbridge, West London, UK.
J Biomed Inform. 2013 Apr;46(2):266-74. doi: 10.1016/j.jbi.2012.11.003. Epub 2012 Nov 29.
Clinical trials are typically conducted over a population within a defined time period in order to illuminate certain characteristics of a health issue or disease process. These cross-sectional studies give us a 'snapshot' of this disease process over a large number of people but do not allow us to model the temporal nature of disease, thereby allowing for modelling detailed prognostic predictions. The aim of this paper is to explore an extension of the temporal bootstrap to identify intermediate stages in a disease process and sub-categories of the disease exhibiting subtly different symptoms. Our approach is compared to a strawman method and investigated in its ability to explain the dynamics of progression on biomedical data from three diseases: Glaucoma, Breast Cancer and Parkinson's disease. We focus on creating reliable time-series models from large amounts of historical cross-sectional data using the temporal bootstrap technique. Two issues are explored: how to build time-series models from cross-sectional data, and how to automatically identify different disease states along these trajectories, as well as the transitions between them. Our approach of relabeling trajectories allows us to explore the temporal nature of how diseases progress even when time-series data is not available (if the cross-sectional study is large enough). We intend to expand this research to deal with multiple studies where we can combine both cross-sectional and longitudinal datasets and to focus on the junctions of the trajectories as key stages in the progression of disease.
临床试验通常在特定时间段内针对特定人群进行,旨在阐明健康问题或疾病过程的某些特征。这些横断面研究为我们提供了大量人群中疾病过程的“快照”,但不允许我们对疾病的时间性质进行建模,从而无法进行详细预后预测的建模。本文旨在探讨时间 bootstrap 的扩展,以识别疾病过程中的中间阶段和表现出略微不同症状的疾病亚类。我们的方法与一个假想方法进行了比较,并研究了其在解释来自三种疾病(青光眼、乳腺癌和帕金森病)的生物医学数据的进展动态方面的能力。我们专注于使用时间 bootstrap 技术从大量历史横断面数据中创建可靠的时间序列模型。探讨了两个问题:如何从横断面数据构建时间序列模型,以及如何自动识别这些轨迹上的不同疾病状态以及它们之间的转变。我们对轨迹重新标记的方法允许我们探索疾病进展的时间性质,即使没有时间序列数据(如果横断面研究足够大)。我们打算扩展这项研究,以处理可以结合横断面和纵向数据集的多项研究,并关注轨迹的交点作为疾病进展的关键阶段。