Breen Vivienne, Kasabov Nikola, Kamat Ashish M, Jacobson Elsie, Suttie James M, O'Sullivan Paul J, Kavalieris Laimonis, Darling David G
Auckland University of Technology, Auckland, New Zealand.
M. D. Anderson Cancer Center, University of Texas, Houston, TX, USA.
BMC Med Res Methodol. 2015 May 12;15:45. doi: 10.1186/s12874-015-0036-8.
Comparing the relative utility of diagnostic tests is challenging when available datasets are small, partial or incomplete. The analytical leverage associated with a large sample size can be gained by integrating several small datasets to enable effective and accurate across-dataset comparisons. Accordingly, we propose a methodology for a holistic comparative analysis and ranking of cancer diagnostic tests through dataset integration and imputation of missing values, using urothelial carcinoma (UC) as a case study.
Five datasets comprising samples from 939 subjects, including 89 with UC, where up to four diagnostic tests (cytology, NMP22®, UroVysion® Fluorescence In-Situ Hybridization (FISH) and Cxbladder Detect) were integrated into a single dataset containing all measured records and missing values. The tests were firstly ranked using three criteria: sensitivity, specificity and a standard variable (feature) ranking method popularly known as signal-to-noise ratio (SNR) index derived from the mean values for all subjects clinically known to have UC versus healthy subjects. Secondly, step-wise unsupervised and supervised imputation (the latter accounting for the 'clinical truth' as determined by cystoscopy) was performed using personalized modelling, k-nearest-neighbour methods, multiple logistic regression and multilayer perceptron neural networks. All imputation models were cross-validated by comparing their post-imputation predictive accuracy for UC with their pre-imputation accuracy. Finally, the post-imputation tests were re-ranked using the same three criteria.
In both measured and imputed data sets, Cxbladder Detect ranked higher for sensitivity, and urine cytology a higher specificity, when compared with other UC tests. Cxbladder Detect consistently ranked higher than FISH and all other tests when SNR analyses were performed on measured, unsupervised and supervised imputed datasets. Supervised imputation resulted in a smaller cross-validation error. Cxbladder Detect was robust to imputation showing a 2% difference in its predictive versus clinical accuracy, outperforming FISH, NMP22 and cytology.
All data analysed, pre- and post-imputation showed that Cxbladder Detect had higher SNR and outperformed all other comparator tests, including FISH. The methodology developed and validated for comparative ranking of the diagnostic tests for detecting UC, may be further applied to other cancer diagnostic datasets across population groups and multiple datasets.
当可用数据集规模小、不完整或部分缺失时,比较诊断测试的相对效用具有挑战性。通过整合多个小数据集,可以获得与大样本量相关的分析优势,从而实现有效且准确的跨数据集比较。因此,我们提出一种方法,通过数据集整合和缺失值插补,对癌症诊断测试进行全面的比较分析和排序,并以尿路上皮癌(UC)为例进行研究。
五个数据集包含939名受试者的样本,其中89例为UC患者,最多四项诊断测试(细胞学检查、NMP22®、UroVysion®荧光原位杂交(FISH)和Cxbladder Detect)被整合到一个包含所有测量记录和缺失值的单一数据集中。首先使用三个标准对测试进行排序:敏感性、特异性以及一种常用的标准变量(特征)排序方法,即信噪比(SNR)指数,该指数由所有临床确诊为UC的受试者与健康受试者的平均值得出。其次,使用个性化建模、k近邻方法、多元逻辑回归和多层感知器神经网络进行逐步无监督和有监督插补(后者考虑了膀胱镜检查确定的“临床真相”)。通过比较插补后UC的预测准确性与其插补前的准确性,对所有插补模型进行交叉验证。最后,使用相同的三个标准对插补后的测试重新排序。
在测量数据集和插补数据集中,与其他UC测试相比,Cxbladder Detect的敏感性排名更高,而尿液细胞学检查的特异性更高。在对测量数据集、无监督插补数据集和有监督插补数据集进行SNR分析时,Cxbladder Detect的排名始终高于FISH和所有其他测试。有监督插补导致较小的交叉验证误差。Cxbladder Detect对插补具有稳健性,其预测准确性与临床准确性的差异为2%,优于FISH、NMP22和细胞学检查。
所有分析的数据,无论是插补前还是插补后,均显示Cxbladder Detect具有更高的SNR,并且优于所有其他对比测试,包括FISH。为检测UC的诊断测试的比较排名而开发和验证的方法,可进一步应用于跨人群组和多个数据集的其他癌症诊断数据集。