Here the same scene visualized using OpenCL octree raycasting. Eventhough the persistent threads method is not used, its around 20-100% faster than the previous splatting (depending on the scene). The code is ported from my previous CUDA raycaster. Hardware is again the GTX580M.
Each method has its advantages though. While octree raycasting works better for static objects, splatting is better for dynamic scenes.