We have released optimized PyTorch and Taichi voxelization implementations, which improve speed by 1.5x and 3.5x, respectively. However, I assume this may not fully meet your needs, as it does not significantly optimize GPU memory usage, and Taichi does not support gradient computation for training.
Regrettably, we found that the CUDA implementation from GaussianFormer is not strictly correct, so our plans have changed from what we originally intended.
Originally posted by @npurson in #14
Hello,
In a previous post, you responded to me with the above. Could you expand on the statement 'the CUDA implementation from GaussianFormer is not strictly correct'. Is it that you're saying their implementation is not mathematically correct, per se or that it is not optimised for usage in a model which is self-supervised using rendering?
Thank you
Originally posted by @npurson in #14
Hello,
In a previous post, you responded to me with the above. Could you expand on the statement 'the CUDA implementation from GaussianFormer is not strictly correct'. Is it that you're saying their implementation is not mathematically correct, per se or that it is not optimised for usage in a model which is self-supervised using rendering?
Thank you