Lazic Stanley E, Mason Sarah L, Michell Andrew W, Barker Roger A
Cambridge Computational Biology Institute, Department of Applied Mathematics and Theoretical Physics, University of Cambridge, UK.
BMC Med Res Methodol. 2009 May 27;9:32. doi: 10.1186/1471-2288-9-32.
It is often desirable to observe how a disease progresses over time in individual patients, rather than graphing group averages; and since multiple outcomes are typically recorded on each patient, it would be advantageous to visualise disease progression on multiple variables simultaneously.
A variety of vector plots and a path plot have been developed for this purpose, and data from a longitudinal Huntington's disease study are used to illustrate the utility of these graphical methods for exploratory data analysis.
Initial and final values for three outcome variables can be easily visualised per patient, along with the change in these variables over time. In addition to the disease trajectory, the path individual patients take from initial to final observation can be traced. Categorical variables can be coded with different types of vectors or paths (e.g. different colours, line types, line thickness) and separate panels can be used to include further categorical or continuous variables, allowing clear visualisation of further information for each individual. In addition, summary statistics such as mean vectors, bivariate interquartile ranges and convex polygons can be included to assist in interpreting trajectories, comparing groups, and detecting multivariate outliers.
Vector and path plots are useful graphical methods for exploratory data analysis when individual-level information on multiple variables over time is desired, and they have several advantages over plotting each variable separately.
通常希望观察个体患者疾病随时间的进展情况,而不是绘制组平均值图表;而且由于通常会记录每个患者的多个结果,因此能够同时在多个变量上直观显示疾病进展情况将很有帮助。
为此开发了多种向量图和路径图,并使用来自一项亨廷顿舞蹈病纵向研究的数据来说明这些图形方法在探索性数据分析中的实用性。
可以轻松直观显示每位患者三个结果变量的初始值和最终值,以及这些变量随时间的变化情况。除了疾病轨迹外,还可以追踪个体患者从初始观察到最终观察所采取的路径。分类变量可以用不同类型的向量或路径进行编码(例如不同颜色、线型、线宽),并且可以使用单独的面板来纳入更多分类或连续变量,从而清晰地直观显示每个个体的更多信息。此外,还可以纳入诸如平均向量、双变量四分位距和凸多边形等汇总统计量,以帮助解释轨迹、比较组以及检测多变量异常值。
当需要随时间获取多个变量的个体层面信息时,向量图和路径图是探索性数据分析中有用的图形方法,并且它们相对于分别绘制每个变量具有多个优势。