Atkinson E Neely, Follen Michele
Department of Biostatistics and Applied Math, the University of Texas M.D. Anderson Cancer Center, Center for Biomedical Engineering, Unit 193, 1515 Holcombe Blvd., Houston, TX 77030, USA.
Gynecol Oncol. 2005 Dec;99(3 Suppl 1):S76-83. doi: 10.1016/j.ygyno.2005.07.048. Epub 2005 Oct 7.
The use of interactive dynamic graphics has become a common practice for the exploration of multidimensional data sets. The availability of powerful and inexpensive hardware and software for graphical computing makes the use of such techniques feasible for the examination of complex forms of data. This paper describes some simple techniques, which were implemented in the LISP-STAT environment, for the visualization of functional data arising from studies of optical technologies used for the detection of cervical intraepithelial neoplasia or squamous intraepithelial lesions.
The methods demonstrated have been implemented in software coded in LISP-STAT, a free statistical computing package available for most computer systems. The data used in this paper are drawn from a previous study in which fluorescence spectra were measured at cervical sites in cervical screening patients, using excitation wavelengths of 337 nm, 380 nm, and 460 nm. The goal of the project is to explore biographical variables to better understand the biology of fluorescence.
A total of 199 measurements were taken in 55 women with normal Pap smears. The data are recorded as spectra showing emission intensity versus emission wavelength in nanometers. Covariates available for analysis are current smoker vs. nonsmoker, premenopausal vs. postmenopausal, tissue type (columnar, squamous, and transition zone), and age in years. Although the optical measurements show consistent changes between normal and abnormal tissue in individual patients, there is wide variation in the intensity of the measurements between patients, even for normal tissue. Patient age affects the fluorescence spectrum, with intensity increasing as age increases. Menopausal status affects the fluorescence spectra in a way that coincides with age. Smoking and race do not appear to affect the spectra in this sample of patients.
The use of interactive graphical techniques permits the data analyst to examine multidimensional data in intuitive ways. These explorations allow non-statisticians to explore the data in a perceptive manner that may lead to new approaches in algorithm development for optical technologies.
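As a rough illustration of the grouped comparison that underlies this kind of exploration, the sketch below averages spectra within each level of a covariate. It is written in Python rather than LISP-STAT and is not the authors' software; the record layout, the field names (`menopausal`, `age`, `intensity`), the function `mean_spectrum_by_group`, and all numeric values are assumptions made up for the example.

```python
from collections import defaultdict
from statistics import mean


def mean_spectrum_by_group(records, key):
    """Average the spectra of all records that share the same covariate value."""
    groups = defaultdict(list)
    for rec in records:
        groups[rec[key]].append(rec["intensity"])
    # zip(*spectra) walks the grouped spectra one emission wavelength at a time.
    return {
        level: [mean(values) for values in zip(*spectra)]
        for level, spectra in groups.items()
    }


# Synthetic placeholder values, NOT data from the study.
emission_nm = [400, 450, 500]
records = [
    {"menopausal": "pre", "age": 30, "intensity": [1.0, 2.0, 1.0]},
    {"menopausal": "pre", "age": 40, "intensity": [2.0, 4.0, 2.0]},
    {"menopausal": "post", "age": 60, "intensity": [3.0, 6.0, 3.0]},
]

means = mean_spectrum_by_group(records, "menopausal")
for level, spectrum in means.items():
    print(level, dict(zip(emission_nm, spectrum)))
```

Passing a different key, such as a hypothetical `smoker` or `tissue` field, would give the corresponding per-group mean spectra, which could then be overlaid in any plotting tool.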