aipixel / GPS-Gaussian

[CVPR 2024 Highlight] The official repo for “GPS-Gaussian: Generalizable Pixel-wise 3D Gaussian Splatting for Real-time Human Novel View Synthesis”

Home Page:https://shunyuanzheng.github.io/GPS-Gaussian

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

scale head

lebron12332 opened this issue · comments

hello. Excellent work. I noticed that you truncated the scale, as shown in the following code, why do this operation?

scale head

scale_out = torch.clamp_max(self.scale_head(out), 0.01)

Hi, thanks for your interest.

If not clamp the max value of the scale of Gaussians, it is quite possible to be out of memory at the beginning of the training. Also, we do not need extremely large Gaussians since the point clouds unprotected from depth maps are dense.

Thank you. I have another related question. I can understand that using Gaussian expression is more efficient than nerf, but why can it also improve the performance, such as the quality of rendered images? What is your opinion on this? such as your task.

In my opinion, efficient rendering enables us to generate complete views of humans to optimize perceptual metrics like LPIPS in general human-related works, which remains a challenge in NeRF-based methods. But our work benefits more from its differentiable capability, especially to the `position' of points.