Pattern Recognition Lab, Department of Computer Science, Friedrich-Alexander University Erlangen-Nuremberg, Martensstrasse 3, 91058 Erlangen, Germany.
Comput Med Imaging Graph. 2011 Apr;35(3):227-36. doi: 10.1016/j.compmedimag.2010.11.004. Epub 2010 Dec 3.
In this paper, a method is described to automatically estimate the visible body region of a computed tomography (CT) volume image. In order to quantify the body region, a body coordinate (BC) axis is used that runs in longitudinal direction. Its origin and unit length are patient-specific and depend on anatomical landmarks. The body region of a test volume is estimated by registering it only along the longitudinal axis to a set of reference CT volume images with known body coordinates. During these 1D registrations, an axial image slice of the test volume is compared to an axial slice of a reference volume by extracting a descriptor from both slices and measuring the similarity of the descriptors. A slice descriptor consists of histograms of visual words. Visual words are code words of a quantized feature space and can be thought of as classes of image patches with similar appearance. A slice descriptor is formed by sampling a slice on a regular 2D grid and extracting a Speeded Up Robust Features (SURF) descriptor at each sample point. The codebook, or visual vocabulary, is generated in a training step by clustering SURF descriptors. Each SURF descriptor extracted from a slice is classified into the closest visual word (or cluster center) and counted in a histogram. A slice is finally described by a spatial pyramid of such histograms. We introduce an extension of the SURF descriptors to an arbitrary number of dimensions (N-SURF). Here, we make use of 2-SURF and 3-SURF descriptors. Cross-validation on 84 datasets shows the robustness of the results. The body portion can be estimated with an average error of 15.5mm within 9s. Possible applications of this method are automatic labeling of medical image databases and initialization of subsequent image analysis algorithms.
本文描述了一种自动估算计算机断层扫描(CT)容积图像可见体区的方法。为了量化体区,使用了沿纵轴方向延伸的体坐标系(BC),其原点和单位长度是患者特有的,取决于解剖学标志。通过仅沿着纵轴将测试体积注册到一组具有已知体坐标的参考 CT 容积图像,来估算测试体积的体区。在这些一维注册过程中,通过从两个切片中提取描述符并测量描述符的相似性,比较测试体积的轴向图像切片和参考体积的轴向切片。切片描述符由视觉单词的直方图组成。视觉单词是量化特征空间的码字,可以看作是具有相似外观的图像补丁类。通过在规则的 2D 网格上对切片进行采样并在每个采样点提取 Speeded Up Robust Features(SURF)描述符,形成切片描述符。代码本,或视觉词汇,通过聚类 SURF 描述符在训练步骤中生成。从切片中提取的每个 SURF 描述符被分类到最近的视觉单词(或聚类中心)并在直方图中计数。最后,通过这样的直方图的空间金字塔来描述切片。我们将 SURF 描述符扩展到任意数量的维度(N-SURF)。这里,我们使用 2-SURF 和 3-SURF 描述符。在 84 个数据集上的交叉验证表明了结果的稳健性。该方法可以在 9 秒内以平均误差 15.5mm 的精度估算体部。这种方法的可能应用是自动标记医学图像数据库和初始化后续图像分析算法。