Zellmann Stefan, Schulze Jrgen P, Lang Ulrich
IEEE Trans Vis Comput Graph. 2021 Mar;27(3):1904-1915. doi: 10.1109/TVCG.2019.2938957. Epub 2021 Jan 28.
While k-d trees are known to be effective for spatial indexing of sparse 3-d volume data, full reconstruction, e.g. due to changes to the alpha transfer function during rendering, is usually a costly operation with this hierarchical data structure. In a recent publication we showed how to port a clever state of the art k-d tree construction algorithm to a multi-core CPU architecture and by means of thorough optimization we were able to obtain interactive reconstruction rates for moderately sized to large data sets. The construction scheme is based on maintaining partial summed-volume tables that fit in the L1 cache of the multi-core CPU and that allow for fast occupancy queries. In this work we propose a GPU implementation of the parallel k-d tree construction algorithm and compare it with the original multi-core CPU implementation. We conduct a thorough comparative study that outlines performance and scalability of our implementation.
虽然已知k-d树对于稀疏三维体数据的空间索引很有效,但由于渲染期间alpha传递函数的变化等原因导致的完全重建,对于这种分层数据结构来说通常是一项代价高昂的操作。在最近的一篇论文中,我们展示了如何将一种先进的k-d树构建算法移植到多核CPU架构上,并且通过全面优化,我们能够为中等大小到大型数据集获得交互式重建速率。该构建方案基于维护适合多核CPU的L1缓存的部分求和体积表,并且允许进行快速占用查询。在这项工作中,我们提出了并行k-d树构建算法的GPU实现,并将其与原始的多核CPU实现进行比较。我们进行了一项全面的比较研究,概述了我们实现的性能和可扩展性。