Bhushan V, Malloy M J, Engler M M, Engler M B, Drown D, Kane J P
Cardiovascular Research Institute, University of California, San Francisco 94143.
J Med Syst. 1993 Aug;17(3-4):187-93. doi: 10.1007/BF00996944.
Exploratory data analysis (EDA) software facilitates unstructured, iterative, open-ended exploration of complex datasets with the aid of multiple linked graphical displays. We are investigating relationships between plasma lipoproteins and coronary artery disease by retrospective analysis of 1677 consecutive UCSF Lipid Clinic patients. Our preliminary experience is with Data Deck 3.0, although several additional software programs (JMP 2.0, Systat 5.1, Minitab 8.0, StatView 4.0) are mentioned. Lipid diagnoses (751 women and 925 men) were 22% primary hypercholesterolemia, 19% combined hyperlipidemia, 3% dysbetalipoproteinemia, 15% endogenous lipemia, 4% mixed lipemia, 5% elevated Lp(a), and 32% no major lipid abnormality. We found the Macintosh platform (68030) to be flexible and powerful for analysis of moderate-size (less than 1 MB) clinical datasets. High-resolution color monitors (1024 x 768 pixels), fast hard disks (< 18 ms), and moderate amounts of system memory (8+ MB) facilitate exploratory analysis.