Luo Lola, Small Dylan, Stewart Walter F, Roy Jason A
University of Pennsylvania.
Sutter Health.
EGEMS (Wash DC). 2013 Dec 18;1(3):1040. doi: 10.13063/2327-9214.1040. eCollection 2013.
Chronic diseases are often described by stages of severity. Clinical decisions about what to do are influenced by the stage, whether a patient is progressing, and the rate of progression. For chronic kidney disease (CKD), relatively little is known about the transition rates between stages. To address this, we used electronic health records (EHR) data on a large primary care population, which should have the advantage of having both sufficient follow-up time and sample size to reliably estimate transition rates for CKD. However, EHR data have some features that threaten the validity of any analysis. In particular, the timing and frequency of laboratory values and clinical measurements are not determined a priori by research investigators, but rather, depend on many factors, including the current health of the patient. We developed an approach for estimating CKD stage transition rates using hidden Markov models (HMMs), when the level of information and observation time vary among individuals. To estimate the HMMs in a computationally manageable way, we used a "discretization" method to transform daily data into intervals of 30 days, 90 days, or 180 days. We assessed the accuracy and computation time of this method via simulation studies. We also used simulations to study the effect of informative observation times on the estimated transition rates. Our simulation results showed good performance of the method, even when missing data are non-ignorable. We applied the methods to EHR data from over 60,000 primary care patients who have chronic kidney disease (stage 2 and above). We estimated transition rates between six underlying disease states. The results were similar for men and women.
慢性病通常根据严重程度阶段来描述。关于如何治疗的临床决策受疾病阶段、患者是否病情进展以及进展速度的影响。对于慢性肾脏病(CKD),人们对各阶段之间的转变率了解相对较少。为解决这一问题,我们使用了来自大量初级保健人群的电子健康记录(EHR)数据,这类数据的优势在于有足够的随访时间和样本量,能够可靠地估计CKD的转变率。然而,EHR数据具有一些会威胁到任何分析有效性的特征。特别是,实验室检查值和临床测量的时间及频率并非由研究人员预先确定,而是取决于许多因素,包括患者当前的健康状况。当个体间的信息水平和观察时间不同时,我们开发了一种使用隐马尔可夫模型(HMM)来估计CKD阶段转变率的方法。为了以可计算的方式估计HMM,我们使用了一种“离散化”方法,将每日数据转换为30天、90天或180天的时间间隔。我们通过模拟研究评估了该方法的准确性和计算时间。我们还使用模拟来研究信息丰富的观察时间对估计转变率的影响。我们的模拟结果表明,即使缺失数据不可忽略,该方法仍具有良好的性能。我们将这些方法应用于来自60000多名患有慢性肾脏病(2期及以上)的初级保健患者的EHR数据。我们估计了六种潜在疾病状态之间的转变率。男性和女性的结果相似。