Awesome 3D Gaussian Splatting Resources

A curated list of papers and open-source resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months. If you have any additions or suggestions, feel free to contribute. Additional resources like blog posts, videos, etc. are also welcome.

Added 18 papers: Z-Splat, Dual-Camera, StylizedGS, Hash3D, Revisiting Densification, Gaussian Pancakes, 3D-aware Deformable Gaussians, SpikeNVS, Zero-shot PC completion, SplatPose, DreamScene360, RealmDreamer, Gaussian-ILC, Reinforcement Learning with GGS, GoMAvatar, OccGaussian, LoopGaussian, Review

April 11, 2024

Code release of latentSplat

April 9, 2024

Added 1 paper: EgoLifter

April 8, 2024

Added 3 papers: Robust Gaussian Splatting, SC4D, and MM-Gaussian

April 5, 2024

Added 5 papers: Surface Reconstruction, TCLC-GS, GaSpCT, OmniGS, and Per-Gaussian Embedding
Fixes

April 2, 2024

Added 11 papers: HO, SGD, HGS, Snap-it, InstantSplat, 3DGSR, MM3DGS, HAHA, CityGaussian, Mirror-3DGS, and Feature Splatting

March 30, 2024

Added 8 papers: Modeling uncertainty, GRM, Gamba, CoherentGS, TOGS, SA-GS, and GaussianCube

March 27, 2024

Added Other Implementation: 360-gaussian-splatting
CVPR '24 labels added
Added 5 papers: Comp4D, DreamPolisher, DN-Splatter, 2D GS, and Octree-GS

March 26, 2024

Added 13 papers: latentSplat, GS on the Move, RadSplat, Mini-Splatting, SyncTweedies, HAC, STAG4D, EndoGSLAM, Pixel-GS, Semantic Gaussians, Gaussian in the Wild, CG-SLAM, and GSDF

March 24, 2024:

Added paper: Gaussian Frosting

March 20, 2024:

Added 4 papers: GVGEN, HUGS, RGBD GS-ICP SLAM, and High-Fidelity SLAM

March 19, 2024:

Added Pointrix
Added 3DGS tutorial by the original authors
Added GauStudio
Added 23 papers: Touch-GS, GGRt, FDGaussian, SWAG, Den-SOFT, Gaussian-Flow, View-Consistent 3D Editing, BAGS, GeoGaussian, GS-Pose, Analytic-Splatting, Seamless 3D Maps, Texture-GS, Recent Advances in 3DGS, Compact 3DGS for Dense Visual SLAM, BrightDreamer, 3DGS-Reloc, Beyond Uncertainty, Motion-Aware 3DGS, Fed3DGS, GaussNav, 3DGS-Calib, and NEDS-SLAM

March 17, 2024:

Update repo name and link for 3DGS.cpp (originally VulkanSplatting)

March 16, 2024:

SplatTV
Added 6 papers: GaussianGrasper, new splitting algorithm, Controllable Text-to-3D Generation, Spring-Mass 3DGS, Hyper-3DGS, and DreamScene

March 14, 2024:

Added 6 papers: SemGauss, StyleGaussian, Gaussian Splatting in Style, GaussCtrl, GaussianImage, and RAIN-GS

March 8, 2024:

Tutorial: How to capture images for 3DGS
Added 6 papers: SplattingAvatar, DNGaussian, Radiative Gaussians, BAGS, GSEdit, and ManiGaussian

March 8, 2024:

Added 3DGStream Viewer

March 6, 2024:

1 paper added: Splat-Nav

March 5, 2024:

1 paper added: 3DGStream
Code releases
New viewer added

March 2, 2024:

1 paper added: 3D Gaussian Model for Animation and Texturing
New section: Courses that also teach 3DGS.

February 28, 2024:

VastGaussian

February 27, 2024:

2 papers added: Spec-Gaussian and GEA
SC-GS code released

February 24, 2024:

2 papers added: Identifying unnecessary Gaussians and Gaussian Pro

February 23, 2024:

Corrected Authors and updated abstract for EndoGS: Deformable Endoscopic Tissues Reconstruction with Gaussian Splatting

February 21, 2024:

Added one paper: Reshaping SLAM: a Survey

February 20, 2024:

GaussianObject code released
Added one paper: GaussianHair

February 19, 2024:

Blog post added: NeRFs vs. 3DGS.

February 16, 2024:

2 papers added: IM-3D and GES
GaMeS code released

February 14, 2024:

Added viewer: VulkanSplatting - cross-platform, high performance 3DGS renderer in C++ and Vulkan Compute

February 13, 2024:

Code releases: (16th Jan 2024) Real-time Photorealistic Dynamic Scene Representation and Rendering with 4D Gaussian Splatting
3 papers added: 3DGala, ImplicitDeepFake, and 3D Gaussians as a New Vision Era.

February 9, 2024:

1 paper added: HeadStudio

February 8, 2024:

3 papers added: Rig3DGS, Mesh-based GS, and LGM

February 6, 2024:

Added 2 papers: SGS-SLAM and 4D Gaussian Splatting

February 5, 2024:

Moved SWAGS to Dynamics and Deformation section
Added 2 papers: GaussianObject and GaMeSh
GS++ renamed to Optimal Projection

February 2, 2024:

Added 6 papers: VR-GS, Segment Anything, Gaussian Splashing, GS++, 360-GS, and StopThePop
TRIPS code release

January 30, 2024:

Code changes: GaussianAvatars code changed to private

January 29, 2024:

Added 2 papers: LIV-GaussMap and TIP-Editor

January 26, 2024:

Removed retracted paper: Animatable 3D Gaussians for High-fidelity Synthesis of Human Motions
3 papers added: EndoGaussians, PSAvatar, and GauU-Scene

January 25, 2024:

Added viewer: Splatapult - 3d gaussian splatting renderer in C++ and OpenGL, works with OpenXR for tethered VR

January 24, 2024:

Added utility: GSOPs (Gaussian Splat Operators) for SideFX Houdini
Code releases: GaussianAvatars

January 23, 2024:

3 papers added: Amortized Gen3D, Deformable Endoscopic Tissues, Fast dynamic 3D Object Generation
Code releases: Animatable Avatars, Compressed 3D Gaussians, GaussianAvatar

January 13, 2024:

4 papers added: CoSSegGaussians, TRIPS, Gaussian Shadow Casting for Neural Characters and DISTWAR

January 9, 2024:

1 paper added: A Survey on 3D Gaussian Splatting (The first survey)

January 8, 2024:

4 papers added: SWAGS (added paper from 2023 which I forgot to add before), first review paper, compressed 3DGS, and an application paper for Characterizing Satellite Geometry.

January 7, 2024:

1 Open source implementation: taichi-splatting - work is originally derived off Taichi 3D Gaussian Splatting, with significant re-organisation and changes.

January 5, 2024:

3 papers added: FMGS, PEGASUS, and Repaint123.

January 2, 2024:

1 paper added: Street Gaussians.

January 2, 2024:

Deblurring Gaussians paper link updated.
SAGA code released.
2 papers from 2023 added: Text2Immersion and 2D-Guided 3DG Segmentation.
Mathematical supplement of gsplat lib.
Add years in categories.
GSM code released.

December 29, 2023:

1 paper added (apparently missed that one before): Gaussian-Head-Avatar.
Blog post head avatars added.

December 29, 2023:

3 papers added: DreamGaussian4D, 4DGen, and Spacetime Gaussian.

December 27, 2023:

3 papers added: LangSplat, Deformable 3DGS, and Human101.
Blog post added: Comprehensive Review of 3DGS.

December 25, 2023:

Efficient 3D Gaussian Representation for Monocular/Multi-view Dynamic Scenes code released.
GPS-Gaussian code released.

December 24, 2023:

2 papers added: Self-Organization Gaussian Grids and Gaussian Splitting.
Added repo for enhancing Gaussian rendering to model more complex scenes.

December 21, 2023:

3 papers added: Splatter Image, pixelSplat, and align your gaussians.
Gaussian Grouping code released.

December 19, 2023:

2 papers added: GAvatar and GauFRe.

December 18, 2023:

Added utility: SpectacularAI - Conversion scripts for different 3DGS conventions.
SuGaR code released.

December 16, 2023:

Added WebGL viewer 3: Gauzilla.

December 15, 2023:

4 papers added: DrivingGaussian, iComMa, Triplane, and 3DGS-Avatar.
Relightable Gaussians code released.

December 13, 2023:

5 papers added: Gaussian-SLAM, CoGS, ASH, CF-GS, and Photo-SLAM.

December 11, 2023:

2 papers added: Gaussian Splatting SLAM and Denoising Scores for 3D Generation.
ScaffoldGS code released.

December 8, 2023:

2 papers added: EAGLES and MonoGaussianAvatar.

December 7, 2023:

LucidDreamer code released.
9 papers added: GauHuman, HeadGaS, HiFi4G, Gaussian-Flow, Feature-3DGS, Gaussian-Avatar, FlashAvatar, Relightable, and Deblurring Gaussians.

December 5, 2023:

9 papers added: NeuSG, GaussianHead, GaussianAvatars, GPS-Gaussian, Neural Parametric Gaussians for Monocular Non-Rigid Object Reconstruction, SplaTAM, MANUS, Segment Any, and Language embedded 3D Gaussians.

December 4, 2023:

8 papers added: Gaussian Grouping, MD Splatting, DynMF, Scaffold-GS, SparseGS, FSGS, Control4D, and SC-GS.

December 1, 2023:

4 papers added: Compact3D, GaussianShader, Periodic Vibration Gaussian and Gaussian Shell Maps for Efficient 3D Human Generation.
Created Table of contents for each category and added line breaks.

November 30, 2023:

Added Unreal game engine implementation.
5 papers added: LightGaussian, FisherRF, HUGS, HumanGaussian, CG3D, and Multi Scale 3DGS.

November 29, 2023:

Added two papers: Point and Move and IR-GS.

November 28, 2023:

Added five papers: GaussianEditor, Relightable Gaussians, GART, Mip-Splatting, HumanGaussian.

November 27, 2023:

Added two papers: Gaussian Editing and Compact 3D Gaussians.

November 25, 2023:

Animatable Gaussians project added (paper not yet released).

November 22, 2023:

3 new GS papers added: Animatable, Depth-Regularized, and Monocular/Multi-view 3DGS.
Added some classic papers.
Added another GS paper also called LucidDreamer.

November 21, 2023:

3 new GS papers added: GaussianDiffusion, LucidDreamer, PhysGaussian.
2 more GS papers added: SuGaR, PhysGaussian.

November 21, 2023:

Added the paper GS-SLAM

November 17, 2023:

Added PlayCanvas implementation to Game Engines section.

November 16, 2023:

Deformable 3D Gaussians code released.
Drivable 3D Gaussian Avatars paper added.

November 8, 2023:

Some notes about the 3DGS implementation and universal format discussion.

November 4, 2023:

Added 2D gaussian splatting.
Added very detailed (technical) blog post explaining 3D gaussian splatting.

October 28, 2023:

Added Utilities Section.
Added 3DGS Converter for editing 3DGS .ply files in Cloud Compare to Utilities.
Added Kapture (for bundler to colmap model conversion) and Kapture image cropper script with conversion instructions to Utilities.

October 23, 2023:

Added python WebGL viewer 2.
Added Intro to gaussian splatting (and Unity viewer) video blog.

October 21, 2023:

Added python OpenGL viewer.
Added typescript WebGPU viewer.

October 20, 2023:

Made abstracts readable (removed hyphenations).
Added Windows tutorial.
Other minor text fixes.
Added Jupyter notebook viewer.

October 19, 2023:

Added Github page link for Real-time Photorealistic Dynamic Scene Representation.
Re-ordered headings.
Added other unofficial implementations.
Moved Nerfstudio gsplat and fast: C++/CUDA to Unofficial Implementations.
Added Nerfstudio, Blender, WebRTC, iOS & Metal viewers.

October 17, 2023:

GaussianDreamer code released.
Added Real-time Photorealistic Dynamic Scene Representation.

October 16, 2023:

Added Deformable 3D Gaussians paper.
Dynamic 3D Gaussians code released.

October 15, 2023:

Initial list with first 6 papers.

Despite recent advancements in high-fidelity human reconstruction techniques, the requirements for densely captured images or time-consuming per-instance optimization significantly hinder their applications in broader scenarios. To tackle these issues, we present HumanSplat that predicts the 3D Gaussian Splatting properties of any human from a single input image in a generalizable manner. In particular, HumanSplat comprises a 2D multi-view diffusion model and a latent reconstruction transformer with human structure priors that adeptly integrate geometric priors and semantic features within a unified framework. A hierarchical loss that incorporates human semantic information is further designed to achieve high-fidelity texture modeling and better constrain the estimated multiple views. Comprehensive experiments on standard benchmarks and in-the-wild images demonstrate that HumanSplat surpasses existing state-of-the-art methods in achieving photorealistic novel-view synthesis. Project page: https://humansplat.github.io/.

📄 Paper | 🌐 Project Page

Classic work:

1. A Generalization of Algebraic Surface Drawing

Authors: James F. Blinn

Comment: First paper rendering 3D gaussians.

Abstract

The mathematical description of three-dimensional surfaces usually falls into one of two classifications: parametric and implicit. An implicit surface is defined to be all points which satisfy some equation F (x, y, z) = 0. This form is ideally suited for image space shaded picture drawing; the pixel coordinates are substituted for x and y, and the equation is solved for z. Algorithms for drawing such objects have been developed primarily for first- and second-order polynomial functions, a subcategory known as algebraic surfaces. This paper presents a new algorithm applicable to other functional forms, in particular to the summation of several Gaussian density distributions. The algorithm was created to model electron density maps of molecular structures, but it can be used for other artistically interesting shapes.

📄 Paper
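
As a rough illustration of the idea in this abstract, the sketch below builds a field from a sum of isotropic Gaussian blobs and, for one pixel, substitutes the pixel coordinates for x and y and solves F(x, y, z) = 0 for z. The blob parameters, the threshold, and the function names are illustrative assumptions, not values from the paper, and a march-then-bisect search stands in for whatever root-finding scheme the paper actually uses.

```python
import math

# Hypothetical blobs: (center_x, center_y, center_z, strength, falloff).
BLOBS = [
    (0.0, 0.0, 5.0, 1.0, 1.0),
    (0.8, 0.0, 5.0, 1.0, 1.0),
]
THRESHOLD = 0.5  # assumed iso-level: the surface is where the density equals it


def field(x, y, z):
    """F(x, y, z) = sum_i b_i * exp(-a_i * r_i^2) - THRESHOLD."""
    total = 0.0
    for cx, cy, cz, strength, falloff in BLOBS:
        r2 = (x - cx) ** 2 + (y - cy) ** 2 + (z - cz) ** 2
        total += strength * math.exp(-falloff * r2)
    return total - THRESHOLD


def first_hit(x, y, z_near=0.0, z_far=5.0, steps=200, iters=40):
    """Return the nearest z with F(x, y, z) = 0 along the pixel ray, or None."""
    dz = (z_far - z_near) / steps
    z_prev = z_near
    f_prev = field(x, y, z_prev)
    for i in range(1, steps + 1):
        z = z_near + i * dz
        f = field(x, y, z)
        if f_prev < 0.0 <= f:  # sign change: the ray has entered the surface
            lo, hi = z_prev, z
            for _ in range(iters):  # refine the crossing by bisection
                mid = 0.5 * (lo + hi)
                if field(x, y, mid) < 0.0:
                    lo = mid
                else:
                    hi = mid
            return 0.5 * (lo + hi)
        z_prev, f_prev = z, f
    return None
```

Calling `first_hit(0.0, 0.0)` returns the depth at which the ray through that pixel first meets the blended surface, while a pixel far from both blobs, such as `first_hit(10.0, 10.0)`, returns `None`.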

2. Approximate Differentiable Rendering with Algebraic Surfaces

Authors: Leonid Keselman and Martial Hebert

Comment: First paper to do differentiable rendering optimization of 3D gaussians.

Abstract

Differentiable renderers provide a direct mathematical link between an object’s 3D representation and images of that object. In this work, we develop an approximate differentiable renderer for a compact, interpretable representation, which we call Fuzzy Metaballs. Our approximate renderer focuses on rendering shapes via depth maps and silhouettes. It sacrifices fidelity for utility, producing fast runtimes and high-quality gradient information that can be used to solve vision tasks. Compared to mesh-based differentiable renderers, our method has forward passes that are 5x faster and backwards passes that are 30x faster. The depth maps and silhouette images generated by our method are smooth and defined everywhere. In our evaluation of differentiable renderers for pose estimation, we show that our method is the only one comparable to classic techniques. In shape from silhouette, our method performs well using only gradient descent and a per-pixel loss, without any surrogate losses or regularization. These reconstructions work well even on natural video sequences with segmentation artifacts.

📄 Paper | 🌐 Project Page | 💻 Code | 🎥 Short Presentation

3. Unbiased Gradient Estimation for Differentiable Surface Splatting via Poisson Sampling

Authors: Jan U. Müller, Michael Weinmann, Reinhard Klein

Comment: Builds 2D screen-space gaussians from underlying 3D representations.

Abstract

We propose an efficient and GPU-accelerated sampling framework which enables unbiased gradient approximation for differentiable point cloud rendering based on surface splatting. Our framework models the contribution of a point to the rendered image as a probability distribution. We derive an unbiased approximative gradient for the rendering function within this model. To efficiently evaluate the proposed sample estimate, we introduce a tree-based data-structure which employs multi-pole methods to draw samples in near linear time. Our gradient estimator allows us to avoid regularization required by previous methods, leading to a more faithful shape recovery from images. Furthermore, we validate that these improvements are applicable to real-world applications by refining the camera poses and point cloud obtained from a real-time SLAM system. Finally, employing our framework in a neural rendering setting optimizes both the point cloud and network parameters, highlighting the framework’s ability to enhance data driven approaches.

📄 Paper | 💻 Code

4. Generating and Real-Time Rendering of Clouds

Authors: Petr Man

Comment: Splatting of anisotropic gaussians. Basically a non-differentiable implementation of 3DGS.

Abstract

This paper presents a method for generation and real-time rendering of static clouds. Perlin noise function generates three dimensional map of a cloud. We also present a two-pass rendering algorithm that performs physically based approximation. In the first preprocessed phase it computes multiple forward scattering. In the second phase first order anisotropic scattering at runtime is evaluated. The generated map is stored as voxels and is unsuitable for the real-time rendering. We introduce a more suitable inner representation of cloud that approximates the original map and contains much less information. The cloud is then represented by a set of metaballs (spheres) with parameters such as center positions, radii and density values. The main contribution of this paper is to propose a method, that transforms the original cloud map to the inner representation. This method uses the Radial Basis Function (RBF) neural network.

📄 Paper

Compression:

3D Gaussian Splatting has recently emerged as a highly promising technique for modeling of static 3D scenes. In contrast to Neural Radiance Fields, it utilizes efficient rasterization allowing for very fast rendering at high-quality. However, the storage size is significantly higher, which hinders practical deployment, e.g. on resource constrained devices. In this paper, we introduce a compact scene representation organizing the parameters of 3D Gaussian Splatting (3DGS) into a 2D grid with local homogeneity, ensuring a drastic reduction in storage requirements without compromising visual quality during rendering. Central to our idea is the explicit exploitation of perceptual redundancies present in natural scenes. In essence, the inherent nature of a scene allows for numerous permutations of Gaussian parameters to equivalently represent it. To this end, we propose a novel highly parallel algorithm that regularly arranges the high-dimensional Gaussian parameters into a 2D grid while preserving their neighborhood structure. During training, we further enforce local smoothness between the sorted parameters in the grid. The uncompressed Gaussians use the same structure as 3DGS, ensuring a seamless integration with established renderers. Our method achieves a reduction factor of 17x to 42x in size for complex scenes with no increase in training time, marking a substantial leap forward in the domain of 3D scene distribution and consumption.

📄 Paper | 🌐 Project Page | 💻 Code

Diffusion:

2024:

1. AGG: Amortized Generative 3D Gaussians for Single Image to 3D

Authors: Dejia Xu, Ye Yuan, Morteza Mardani, Sifei Liu, Jiaming Song, Zhangyang Wang, Arash Vahdat

Abstract

Given the growing need for automatic 3D content creation pipelines, various 3D representations have been studied to generate 3D objects from a single image. Due to its superior rendering efficiency, 3D Gaussian splatting-based models have recently excelled in both 3D reconstruction and generation. 3D Gaussian splatting approaches for image to 3D generation are often optimization-based, requiring many computationally expensive score-distillation steps. To overcome these challenges, we introduce an Amortized Generative 3D Gaussian framework (AGG) that instantly produces 3D Gaussians from a single image, eliminating the need for per-instance optimization. Utilizing an intermediate hybrid representation, AGG decomposes the generation of 3D Gaussian locations and other appearance attributes for joint optimization. Moreover, we propose a cascaded pipeline that first generates a coarse representation of the 3D data and later upsamples it with a 3D Gaussian super-resolution module. Our method is evaluated against existing optimization-based 3D Gaussian frameworks and sampling-based pipelines utilizing other 3D representations, where AGG showcases competitive generation abilities both qualitatively and quantitatively while being several orders of magnitude faster.

📄 Paper | 🌐 Project Page | 🎥 Short Presentation

2. Fast Dynamic 3D Object Generation from a Single-view Video

Authors: Zijie Pan, Zeyu Yang, Xiatian Zhu, Li Zhang

Abstract

Generating dynamic three-dimensional (3D) object from a single-view video is challenging due to the lack of 4D labeled data. Existing methods extend text-to-3D pipelines by transferring off-the-shelf image generation models such as score distillation sampling, but they are slow and expensive to scale (e.g., 150 minutes per object) due to the need for back-propagating the information-limited supervision signals through a large pretrained model. To address this limitation, we propose an efficient video-to-4D object generation framework called Efficient4D. It generates high-quality spacetime-consistent images under different camera views, and then uses them as labeled data to directly train a novel 4D Gaussian splatting model with explicit point cloud geometry, enabling real-time rendering under continuous camera trajectories. Extensive experiments on synthetic and real videos show that Efficient4D offers a remarkable 10-fold increase in speed when compared to prior art alternatives while preserving the same level of innovative view synthesis quality. For example, Efficient4D takes only 14 minutes to model a dynamic object.

📄 Paper | 🌐 Project Page | 💻 Code | 🎥 Short Presentation

3. GaussianObject: Just Taking Four Images to Get A High-Quality 3D Object with Gaussian Splatting

Authors: Chen Yang, Sikuang Li, Jiemin Fang, Ruofan Liang, Lingxi Xie, Xiaopeng Zhang, Wei Shen, Qi Tian

Abstract

Reconstructing and rendering 3D objects from highly sparse views is of critical importance for promoting applications of 3D vision techniques and improving user experience. However, images from sparse views only contain very limited 3D information, leading to two significant challenges: 1) Difficulty in building multi-view consistency as images for matching are too few; 2) Partially omitted or highly compressed object information as view coverage is insufficient. To tackle these challenges, we propose GaussianObject, a framework to represent and render the 3D object with Gaussian splatting, that achieves high rendering quality with only 4 input images. We first introduce techniques of visual hull and floater elimination which explicitly inject structure priors into the initial optimization process for helping build multi-view consistency, yielding a coarse 3D Gaussian representation. Then we construct a Gaussian repair model based on diffusion models to supplement the omitted object information, where Gaussians are further refined. We design a self-generating strategy to obtain image pairs for training the repair model. Our GaussianObject is evaluated on several challenging datasets, including MipNeRF360, OmniObject3D, and OpenIllumination, achieving strong reconstruction results from only 4 views and significantly outperforming previous state-of-the-art methods.

📄 Paper | 🌐 Project Page | 💻 Code | 🎥 Short Presentation

Authors: Heng Yu, Chaoyang Wang, Peiye Zhuang, Willi Menapace, Aliaksandr Siarohin, Junli Cao, Laszlo A Jeni, Sergey Tulyakov, Hsin-Ying Lee

Authors: Junwu Zhang, Zhenyu Tang, Yatian Pang, Xinhua Cheng, Peng Jin, Yida Wei, Munan Ning, Li Yuan

Abstract

Recent one image to 3D generation methods commonly adopt Score Distillation Sampling (SDS). Despite the impressive results, there are multiple deficiencies including multi-view inconsistency, over-saturated and over-smoothed textures, as well as the slow generation speed. To address these deficiencies, we present Repaint123 to alleviate multi-view bias as well as texture degradation and speed up the generation process. The core idea is to combine the powerful image generation capability of the 2D diffusion model and the texture alignment ability of the repainting strategy for generating high-quality multi-view images with consistency. We further propose visibility-aware adaptive repainting strength for overlap regions to enhance the generated image quality in the repainting process. The generated high-quality and multi-view consistent images enable the use of simple Mean Square Error (MSE) loss for fast 3D content generation. We conduct extensive experiments and show that our method has a superior ability to generate high-quality 3D content with multi-view consistency and fine textures in 2 minutes from scratch.

📄 Paper | 🌐 Project Page | 💻 Code (not yet)

Dynamics and Deformation:

Constructing photo-realistic Free-Viewpoint Videos (FVVs) of dynamic scenes from multi-view videos remains a challenging endeavor. Despite the remarkable advancements achieved by current neural rendering techniques, these methods generally require complete video sequences for offline training and are not capable of real-time rendering. To address these constraints, we introduce 3DGStream, a method designed for efficient FVV streaming of real-world dynamic scenes. Our method achieves fast on-the-fly per-frame reconstruction within 12 seconds and real-time rendering at 200 FPS. Specifically, we utilize 3D Gaussians (3DGs) to represent the scene. Instead of the naïve approach of directly optimizing 3DGs per-frame, we employ a compact Neural Transformation Cache (NTC) to model the translations and rotations of 3DGs, markedly reducing the training time and storage required for each FVV frame. Furthermore, we propose an adaptive 3DG addition strategy to handle emerging objects in dynamic scenes. Experiments demonstrate that 3DGStream achieves competitive performance in terms of rendering speed, image quality, training time, and model storage when compared with state-of-the-art methods.

📄 Paper | 🌐 Project Page | 💻 Code (not yet) | 🔍 3DGStream Viewer
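
To make the "translations and rotations of 3DGs" in the abstract above concrete, here is a minimal sketch of applying one per-frame translation and rotation to a single Gaussian. It does not model the Neural Transformation Cache itself: the offsets are passed in directly, and the function names, the (w, x, y, z) quaternion layout, and the left-multiplication order are assumptions made for illustration rather than details confirmed by the paper.

```python
import math


def quat_mul(a, b):
    """Hamilton product of two quaternions given as (w, x, y, z)."""
    aw, ax, ay, az = a
    bw, bx, by, bz = b
    return (
        aw * bw - ax * bx - ay * by - az * bz,
        aw * bx + ax * bw + ay * bz - az * by,
        aw * by - ax * bz + ay * bw + az * bx,
        aw * bz + ax * by - ay * bx + az * bw,
    )


def normalize(q):
    """Scale a quaternion to unit length."""
    n = math.sqrt(sum(c * c for c in q))
    return tuple(c / n for c in q)


def transform_gaussian(mean, rotation, d_mean, d_rotation):
    """Apply a translation and a rotation to one Gaussian.

    Only the mean and the orientation change; scale, opacity and colour
    are left untouched, which is what keeps the per-frame update small.
    """
    new_mean = tuple(m + d for m, d in zip(mean, d_mean))
    # Assumed convention: the update rotation is applied on the left.
    new_rotation = normalize(quat_mul(normalize(d_rotation), normalize(rotation)))
    return new_mean, new_rotation
```

In this sketch, an identity update `(1, 0, 0, 0)` leaves the orientation unchanged, and two successive 90-degree turns about z compose to a 180-degree turn.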

Editing:

Recently, 3D Gaussian, as an explicit 3D representation method, has demonstrated strong competitiveness over NeRF (Neural Radiance Fields) in terms of expressing complex scenes and training duration. These advantages signal a wide range of applications for 3D Gaussians in 3D understanding and editing. Meanwhile, the segmentation of 3D Gaussians is still in its infancy. The existing segmentation methods are not only cumbersome but also incapable of segmenting multiple objects simultaneously in a short amount of time. In response, this paper introduces a 3D Gaussian segmentation method implemented with 2D segmentation as supervision. This approach uses input 2D segmentation maps to guide the learning of the added 3D Gaussian semantic information, while nearest neighbor clustering and statistical filtering refine the segmentation results. Experiments show that our concise method can achieve comparable performances on mIOU and mAcc for multi-object segmentation as previous single-object segmentation methods.

📄 Paper
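
The abstract above mentions statistical filtering as a refinement step without giving details. One common form of it, assumed here purely for illustration and not confirmed as the authors' procedure, is statistical outlier removal: a point is dropped when its mean distance to its k nearest neighbours is unusually large compared with the rest of the set. The function name and the default parameters below are hypothetical.

```python
import math
import statistics


def statistical_filter(points, k=3, std_ratio=1.0):
    """Drop points that lie unusually far from their neighbours.

    A point is kept when its mean distance to its k nearest neighbours is
    at most `std_ratio` standard deviations above the average of that
    quantity over all points. Brute-force O(n^2) search, for clarity only.
    """
    if len(points) <= k:
        return list(points)
    mean_dists = []
    for i, p in enumerate(points):
        dists = sorted(math.dist(p, q) for j, q in enumerate(points) if j != i)
        mean_dists.append(sum(dists[:k]) / k)
    limit = statistics.fmean(mean_dists) + std_ratio * statistics.pstdev(mean_dists)
    return [p for p, d in zip(points, mean_dists) if d <= limit]
```

Applied to the centres of the Gaussians assigned to one object, a filter of this kind would remove isolated stray Gaussians while leaving the dense cluster intact.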

Language Embedding:

📄 Paper

Mesh Extraction and Physics:

1. [CVPR '24] PhysGaussian: Physics-Integrated 3D Gaussians for Generative Dynamics

Authors: Tianyi Xie, Zeshun Zong, Yuxin Qiu, Xuan Li, Yutao Feng, Yin Yang, Chenfanfu Jiang

Abstract

We introduce PhysGaussian, a new method that seamlessly integrates physically grounded Newtonian dynamics within 3D Gaussians to achieve high-quality novel motion synthesis. Employing a custom Material Point Method (MPM), our approach enriches 3D Gaussian kernels with physically meaningful kinematic deformation and mechanical stress attributes, all evolved in line with continuum mechanics principles. A defining characteristic of our method is the seamless integration between physical simulation and visual rendering: both components utilize the same 3D Gaussian kernels as their discrete representations. This negates the necessity for triangle/tetrahedron meshing, marching cubes, "cage meshes," or any other geometry embedding, highlighting the principle of "what you see is what you simulate (WS2)." Our method demonstrates exceptional versatility across a wide variety of materials--including elastic entities, metals, non-Newtonian fluids, and granular materials--showcasing its strong capabilities in creating diverse visual content with novel viewpoints and movements.

📄 Paper | 🌐 Project Page | 💻 Code | 🎥 Short Presentation

2. [CVPR '24] SuGaR: Surface-Aligned Gaussian Splatting for Efficient 3D Mesh Reconstruction and High-Quality Mesh Rendering

Authors: Antoine Guédon, Vincent Lepetit

Abstract

We propose a method to allow precise and extremely fast mesh extraction from 3D Gaussian Splatting. Gaussian Splatting has recently become very popular as it yields realistic rendering while being significantly faster to train than NeRFs. It is however challenging to extract a mesh from the millions of tiny 3D gaussians as these gaussians tend to be unorganized after optimization and no method has been proposed so far. Our first key contribution is a regularization term that encourages the gaussians to align well with the surface of the scene. We then introduce a method that exploits this alignment to sample points on the real surface of the scene and extract a mesh from the Gaussians using Poisson reconstruction, which is fast, scalable, and preserves details, in contrast to the Marching Cubes algorithm usually applied to extract meshes from Neural SDFs. Finally, we introduce an optional refinement strategy that binds gaussians to the surface of the mesh, and jointly optimizes these Gaussians and the mesh through Gaussian splatting rendering. This enables easy editing, sculpting, rigging, animating, compositing and relighting of the Gaussians using traditional softwares by manipulating the mesh instead of the gaussians themselves. Retrieving such an editable mesh for realistic rendering is done within minutes with our method, compared to hours with the state-of-the-art methods on neural SDFs, while providing a better rendering quality.

📄 Paper | 🌐 Project Page | 💻 Code | 🎥 Short Presentation

3. NeuSG: Neural Implicit Surface Reconstruction with 3D Gaussian Splatting Guidance

Authors: Hanlin Chen, Chen Li, Gim Hee Lee

Abstract

Existing neural implicit surface reconstruction methods have achieved impressive performance in multi-view 3D reconstruction by leveraging explicit geometry priors such as depth maps or point clouds as regularization. However, the reconstruction results still lack fine details because of the over-smoothed depth map or sparse point cloud. In this work, we propose a neural implicit surface reconstruction pipeline with guidance from 3D Gaussian Splatting to recover highly detailed surfaces. The advantage of 3D Gaussian Splatting is that it can generate dense point clouds with detailed structure. Nonetheless, a naive adoption of 3D Gaussian Splatting can fail since the generated points are the centers of 3D Gaussians that do not necessarily lie on the surface. We thus introduce a scale regularizer to pull the centers close to the surface by enforcing the 3D Gaussians to be extremely thin. Moreover, we propose to refine the point cloud from 3D Gaussians Splatting with the normal priors from the surface predicted by neural implicit models instead of using a fixed set of points as guidance. Consequently, the quality of surface reconstruction improves from the guidance of the more accurate 3D Gaussian splatting. By jointly optimizing the 3D Gaussian Splatting and the neural implicit model, our approach benefits from both representations and generates complete surfaces with intricate details. Experiments on Tanks and Temples verify the effectiveness of our proposed method.

📄 Paper
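
The scale regularizer described in the abstract above can be sketched as a penalty on each Gaussian's thinnest axis. This is one plausible form, assumed for illustration: the function name and the choice of a plain mean are hypothetical, and the exact loss used in the paper may differ.

```python
def scale_regularizer(scales):
    """Mean of the smallest axis scale over all Gaussians.

    `scales` is a list of positive (sx, sy, sz) triples. Driving this
    value towards zero flattens each Gaussian along its thinnest axis,
    so that its centre sits close to the surface it covers.
    """
    if not scales:
        return 0.0
    return sum(min(s) for s in scales) / len(scales)
```

Under this sketch, a flat disc such as `(1.0, 1.0, 0.01)` contributes far less to the penalty than a round blob such as `(1.0, 1.0, 1.0)`, which is the behaviour the abstract asks for.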

Misc:

Modeling dynamic, large-scale urban scenes is challenging due to their highly intricate geometric structures and unconstrained dynamics in both space and time. Prior methods often employ high-level architectural priors, separating static and dynamic elements, resulting in suboptimal capture of their synergistic interactions. To address this challenge, we present a unified representation model, called Periodic Vibration Gaussian (PVG). PVG builds upon the efficient 3D Gaussian splatting technique, originally designed for static scene representation, by introducing periodic vibration-based temporal dynamics. This innovation enables PVG to elegantly and uniformly represent the characteristics of various objects and elements in dynamic urban scenes. To enhance temporally coherent representation learning with sparse training data, we introduce a novel flow-based temporal smoothing mechanism and a position-aware adaptive control strategy. Extensive experiments on Waymo Open Dataset and KITTI benchmarks demonstrate that PVG surpasses state-of-the-art alternatives in both reconstruction and novel view synthesis for both dynamic and static scenes. Notably, PVG achieves this without relying on manually labeled object bounding boxes or expensive optical flow estimation. Moreover, PVG exhibits 50/6000-fold acceleration in training/rendering over the best alternative.

📄 Paper | 🌐 Project Page | 💻 Code (not yet)
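
To make the idea of "periodic vibration-based temporal dynamics" more concrete, here is a minimal sketch of a Gaussian center that oscillates sinusoidally around a rest position. The parameterization (`direction`, `amplitude`, `period`, `phase`) is an assumption chosen for illustration and may differ from the formulation used in PVG.

```python
import math


def vibrating_center(mean, direction, amplitude, period, phase, t):
    """Center of a Gaussian at time t under a sinusoidal displacement.

    All parameters are hypothetical: the center moves along `direction`
    by `amplitude * sin(2*pi*(t - phase) / period)`.
    """
    offset = amplitude * math.sin(2.0 * math.pi * (t - phase) / period)
    return tuple(m + offset * d for m, d in zip(mean, direction))


# At t == phase the displacement is zero, so the center sits at `mean`.
print(vibrating_center((1.0, 2.0, 3.0), (1.0, 0.0, 0.0), 0.5, 1.0, 0.0, 0.0))
```

With a zero amplitude the Gaussian stays fixed, so under this assumption a single representation can cover both static and moving content.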

Regularization and Optimization:

We present a method named iComMa to address the 6D pose estimation problem in computer vision. The conventional pose estimation methods typically rely on the target's CAD model or necessitate specific network training tailored to particular object classes. Some existing methods address mesh-free 6D pose estimation by employing the inversion of a Neural Radiance Field (NeRF), aiming to overcome the aforementioned constraints. However, it still suffers from adverse initializations. By contrast, we model the pose estimation as the problem of inverting the 3D Gaussian Splatting (3DGS) with both the comparing and matching loss. In detail, a render-and-compare strategy is adopted for the precise estimation of poses. Additionally, a matching module is designed to enhance the model's robustness against adverse initializations by minimizing the distances between 2D keypoints. This framework systematically incorporates the distinctive characteristics and inherent rationale of render-and-compare and matching-based approaches. This comprehensive consideration equips the framework to effectively address a broader range of intricate and challenging scenarios, including instances with substantial angular deviations, all while maintaining a high level of prediction accuracy. Experimental results demonstrate the superior precision and robustness of our proposed jointly optimized framework when evaluated on synthetic and complex real-world data in challenging scenarios.

📄 Paper | 💻 Code
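
The abstract describes combining a render-and-compare loss with a keypoint matching loss. The sketch below shows one simple way such a combined objective could look, assuming a mean squared pixel error plus a weighted mean 2D keypoint distance. The function `pose_loss`, its arguments, and the weighting scheme are illustrative assumptions, not iComMa's actual implementation.

```python
import math


def pose_loss(rendered, observed, kp_rendered, kp_observed, match_weight=1.0):
    """Photometric error plus weighted mean 2D keypoint distance.

    `rendered` / `observed`: flat lists of pixel intensities of equal length.
    `kp_rendered` / `kp_observed`: lists of matched (x, y) keypoints.
    """
    # Comparing term: mean squared difference between the two images.
    compare = sum((r - o) ** 2 for r, o in zip(rendered, observed)) / len(rendered)
    # Matching term: mean Euclidean distance between matched keypoints.
    match = sum(math.dist(a, b) for a, b in zip(kp_rendered, kp_observed)) / len(kp_rendered)
    return compare + match_weight * match


# Hypothetical two-pixel image and one matched keypoint pair.
print(pose_loss([0.0, 1.0], [0.0, 0.0], [(0.0, 0.0)], [(3.0, 4.0)], match_weight=0.1))
```

The matching term still gives a useful signal when the rendered and observed images barely overlap, which is the situation the abstract calls an adverse initialization.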

Rendering:

Neural Radiance Fields (NeRFs) have demonstrated the remarkable potential of neural networks to capture the intricacies of 3D objects. By encoding the shape and color information within neural network weights, NeRFs excel at producing strikingly sharp novel views of 3D objects. Recently, numerous generalizations of NeRFs utilizing generative models have emerged, expanding their versatility. In contrast, Gaussian Splatting (GS) offers similar rendering quality with faster training and inference, as it does not need neural networks to work. We encode information about the 3D objects in a set of Gaussian distributions that can be rendered in 3D similarly to classical meshes. Unfortunately, GS models are difficult to condition since they usually require around a hundred thousand Gaussian components. To mitigate the caveats of both models, we propose a hybrid model that uses a GS representation of the 3D object's shape and NeRF-based encoding of color and opacity. Our model uses Gaussian distributions with trainable positions (i.e. means of the Gaussians), shape (i.e. covariances of the Gaussians), color, and opacity, together with a neural network that takes the parameters of a Gaussian and the viewing direction and produces changes in color and opacity. Consequently, our model better describes shadows, light reflections, and transparency of 3D objects.

📄 Paper | 💻 Code

Reviews:

📄 Paper

SLAM:

The integration of neural rendering and the SLAM system recently showed promising results in joint localization and photorealistic view reconstruction. However, existing methods, fully relying on implicit representations, are so resource-hungry that they cannot run on portable devices, which deviates from the original intention of SLAM. In this paper, we present Photo-SLAM, a novel SLAM framework with a hyper primitives map. Specifically, we simultaneously exploit explicit geometric features for localization and learn implicit photometric features to represent the texture information of the observed environment. In addition to actively densifying hyper primitives based on geometric features, we further introduce a Gaussian-Pyramid-based training method to progressively learn multi-level features, enhancing photorealistic mapping performance. The extensive experiments with monocular, stereo, and RGB-D datasets prove that our proposed system Photo-SLAM significantly outperforms current state-of-the-art SLAM systems for online photorealistic mapping, e.g., PSNR is 30% higher and rendering speed is hundreds of times faster in the Replica dataset. Moreover, the Photo-SLAM can run at real-time speed using an embedded platform such as Jetson AGX Orin, showing the potential of robotics applications.

📄 Paper | 🌐 Project Page | 💻 Code

Sparse:

We introduce the Splatter Image, an ultra-fast approach for monocular 3D object reconstruction which operates at 38 FPS. Splatter Image is based on Gaussian Splatting, which has recently brought real-time rendering, fast training, and excellent scaling to multi-view reconstruction. For the first time, we apply Gaussian Splatting in a monocular reconstruction setting. Our approach is learning-based, and, at test time, reconstruction only requires the feed-forward evaluation of a neural network. The main innovation of Splatter Image is the surprisingly straightforward design: it uses a 2D image-to-image network to map the input image to one 3D Gaussian per pixel. The resulting Gaussians thus have the form of an image, the Splatter Image. We further extend the method to incorporate more than one image as input, which we do by adding cross-view attention. Owing to the speed of the renderer (588 FPS), we can use a single GPU for training while generating entire images at each iteration in order to optimize perceptual metrics like LPIPS. On standard benchmarks, we demonstrate not only fast reconstruction but also better results than recent and much more expensive baselines in terms of PSNR, LPIPS, and other metrics.

📄 Paper | 🌐 Project Page | 💻 Code | 🎥 Short Presentation
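
The "one 3D Gaussian per pixel" design can be pictured as reading a multi-channel output image pixel by pixel. The sketch below assumes a hypothetical 10-channel layout (3 position, 3 scale, 1 opacity, 3 color); the actual channel layout and parameterization in Splatter Image may differ.

```python
def image_to_gaussians(feature_map):
    """Flatten an H x W grid of per-pixel parameter vectors into Gaussians.

    `feature_map[y][x]` is a sequence of 10 numbers in the hypothetical
    order: position (3), scale (3), opacity (1), color (3).
    """
    gaussians = []
    for row in feature_map:
        for px in row:
            if len(px) != 10:
                raise ValueError("expected 10 channels per pixel")
            gaussians.append({
                "position": tuple(px[0:3]),
                "scale": tuple(px[3:6]),
                "opacity": px[6],
                "color": tuple(px[7:10]),
            })
    return gaussians


# A hypothetical 2 x 3 output image yields 6 Gaussians.
pixel = [0.0, 0.0, 1.0, 0.1, 0.1, 0.1, 0.9, 1.0, 0.5, 0.0]
splats = image_to_gaussians([[pixel] * 3 for _ in range(2)])
print(len(splats))
```

Because the Gaussians are just the pixels of an output image, a standard 2D image-to-image network can predict them in a single forward pass.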

Navigation:

📄 Paper

Poses:

2024:

1. GGRt: Towards Generalizable 3D Gaussians without Pose Priors in Real-Time

Authors: Hao Li, Yuanyuan Gao, Dingwen Zhang, Chenming Wu, Yalun Dai, Chen Zhao, Haocheng Feng, Errui Ding, Jingdong Wang, Junwei Han

Abstract

This paper presents GGRt, a novel approach to generalizable novel view synthesis that alleviates the need for real camera poses, complexity in processing high-resolution images, and lengthy optimization processes, thus facilitating stronger applicability of 3D Gaussian Splatting (3D-GS) in real-world scenarios. Specifically, we design a novel joint learning framework that consists of an Iterative Pose Optimization Network (IPO-Net) and a Generalizable 3D-Gaussians (G-3DG) model. With the joint learning mechanism, the proposed framework can inherently estimate robust relative pose information from the image observations and thus primarily alleviate the requirement of real camera poses. Moreover, we implement a deferred back-propagation mechanism that enables high-resolution training and inference, overcoming the resolution constraints of previous methods. To enhance the speed and efficiency, we further introduce a progressive Gaussian cache module that dynamically adjusts during training and inference. As the first pose-free generalizable 3D-GS framework, GGRt achieves inference at ≥ 5 FPS and real-time rendering at ≥ 100 FPS. Through extensive experimentation, we demonstrate that our method outperforms existing NeRF-based pose-free techniques in terms of inference speed and effectiveness. It can also approach the real pose-based 3D-GS methods. Our contributions provide a significant leap forward for the integration of computer vision and computer graphics into practical applications, offering state-of-the-art results on LLFF, KITTI, and Waymo Open datasets and enabling real-time rendering for immersive experiences.

📄 Paper | 🌐 Project Page

2. GS-Pose: Cascaded Framework for Generalizable Segmentation-based 6D Object Pose Estimation

Authors: Dingding Cai, Janne Heikkilä, Esa Rahtu

Abstract

This paper introduces GS-Pose, an end-to-end framework for locating and estimating the 6D pose of objects. GS-Pose begins with a set of posed RGB images of a previously unseen object and builds three distinct representations stored in a database. At inference, GS-Pose operates sequentially by locating the object in the input image, estimating its initial 6D pose using a retrieval approach, and refining the pose with a render-and-compare method. The key insight is the application of the appropriate object representation at each stage of the process. In particular, for the refinement step, we utilize 3D Gaussian splatting, a novel differentiable rendering technique that offers high rendering speed and relatively low optimization time. Off-the-shelf toolchains and commodity hardware, such as mobile phones, can be used to capture new objects to be added to the database. Extensive evaluations on the LINEMOD and OnePose-LowTexture datasets demonstrate excellent performance, establishing the new state-of-the-art.

📄 Paper | 🌐 Project Page | 💻 Code (not yet) | 🎥 Short Presentation

Large-Scale:

📄 Paper | 🌐 Project Page | 💻 Code

Data

NERDS 360 Multi-View dataset for Outdoor Scenes

Courses

MIT Inverse Rendering Lectures (Module 2)

Open Source Implementations

Reference

Gaussian Splatting

Unofficial Implementations

Implementation	Language	License
Taichi 3D Gaussian Splatting	taichi	Apache-2.0
Gaussian Splatting 3D	Python/CUDA
3D Gaussian Splatting	Python/CUDA	MIT
fast	C++/CUDA	Inria/MPII
nerfstudio	Python/CUDA	Apache-2.0
taichi-splatting	taichi/PyTorch	Apache-2.0
OpenSplat	C++/CPU or GPU	AGPL-3.0
3D Gaussian Splatting	Python/CUDA	MIT
Grendel Distributed 3DGS	Python/CUDA	Apache-2.0

2D Gaussian Splatting

jupyter notebook 2D GS splatting

Gaussian Style Transfer

Direct Gaussian Style Optimization (DGSO): Stylizing 3D Gaussian Splats - Applying style transfer during gaussian optimization to produce stylized gaussian splats of a scene.

Game Engines

Viewers

WebGL Viewer 1
WebGL Viewer 2
WebGL Viewer 3
WebGPU Viewer 1
WebGPU Viewer 2
WebGPU Viewer 3
Three.js
A-Frame
Nerfstudio Unofficial
Nerfstudio Viser
Blender (Editor)
WebRTC viewer
iOS & Metal viewer
jupyter notebook
PyOpenGL viewer (also with official CUDA backend)
PlayCanvas Viewer
gsplat.js
Splatapult - 3d gaussian splatting renderer in C++ and OpenGL, works with OpenXR for tethered VR
3DGS.cpp - cross-platform, high performance 3DGS renderer in C++ and Vulkan Compute, supporting Windows, macOS, Linux, iOS, and visionOS
vkgs - cross-platform, high performance 3DGS renderer in C++ and Vulkan Compute/Graphics
Gaussian Viewer - Also loads Compact3D .ply files.
spaTV - WebGL Viewer for 4D Gaussians (based on SpaceTime Gaussian) with demo here
Taichi Viewer
uc-vision-splat-viewer - 3D Gaussian Splatting renderer with benchmarking capability

Utilities

Kapture - A unified data format to facilitate visual localization and structure from motion e.g. for bundler to colmap model conversion
Kapture image cropper script - Undistorted image cropper script to remove black borders with included conversion instructions
camorph - A toolbox for conversion between camera parameter conventions e.g. Reality Capture to colmap model
3DGS Converter - A tool for converting 3D Gaussian Splatting .ply files into a format suitable for Cloud Compare and vice-versa
SuperSplat - Open source browser-based tool to clean/filter, reorient and compress .ply/.splat files
SpectacularAI - Conversion scripts for different 3DGS conventions
GSOPs - GSOPs (Gaussian Splat Operators) for SideFX Houdini. Import, edit, and export models, or generate synthetic training data

Tutorial

Tutorial from the authors of 3DGS

Framework

msplat - A modular differential gaussian rasterization library.
GauStudio - Unified framework with different paper implementations
gaussian-splatting-lightning - A 3D Gaussian Splatting framework with various derived algorithms and an interactive web viewer

Other

My-exp-Gaussians - Enhancing the ability of 3D Gaussians to model complex scenes
360-gaussian-splatting - Generate gaussian splatting directly from 360 images

Blog Posts

Tutorial Videos

Credits

Thanks to Leonid Keselman for informing me about the release of the paper "Real-time Photorealistic Dynamic Scene Representation and Rendering with 4D Gaussian Splatting".
Thanks to Eric Haines for suggesting the Jupyter notebook viewer and the Windows tutorial, and for fixing text hyphenation and other issues.
Thanks to Henry Pearce for maintaining contributions.

Table of contents

Seminal Paper introducing 3D Gaussian Splatting:

3D Gaussian Splatting for Real-Time Radiance Field Rendering

Autonomous Driving:

2024:

1. Street Gaussians for Modeling Dynamic Urban Scenes

2. TCLC-GS: Tightly Coupled LiDAR-Camera Gaussian Splatting for Surrounding Autonomous Driving Scenes

2023:

1. [CVPR '24] DrivingGaussian: Composite Gaussian Splatting for Surrounding Dynamic Autonomous Driving Scenes

Avatars:

2024:

1. GaussianBody: Clothed Human Reconstruction via 3d Gaussian Splatting

2. PSAvatar: A Point-based Morphable Shape Model for Real-Time Head Avatar Creation with 3D Gaussian Splatting

3. Rig3DGS: Creating Controllable Portraits from Casual Monocular Videos

4. HeadStudio: Text to Animatable Head Avatars with 3D Gaussian Splatting

5. ImplicitDeepfake: Plausible Face-Swapping through Implicit Deepfake Generation using NeRF and Gaussian Splatting

6. GaussianHair: Hair Modeling and Rendering with Light-aware Gaussians

7. GVA: Reconstructing Vivid 3D Gaussian Avatars from Monocular Videos

8. [CVPR '24] SplattingAvatar: Realistic Real-Time Human Avatars with Mesh-Embedded Gaussian Splatting

9. SplatFace: Gaussian Splat Face Reconstruction Leveraging an Optimizable Surface

10. HAHA: Highly Articulated Gaussian Human Avatars with Textured Mesh Prior

11. [CVPRW '24] Gaussian Splatting Decoder for 3D‑aware Generative Adversarial Networks

12. GoMAvatar: Efficient Animatable Human Modeling from Monocular Video Using Gaussians-on-Mesh

13. OccGaussian: 3D Gaussian Splatting for Occluded Human Rendering

14. [CVPR '24] Guess The Unseen: Dynamic 3D Scene Reconstruction from Partial 2D Glimpses

2023:

1. Drivable 3D Gaussian Avatars

2. SplatArmor: Articulated Gaussian splatting for animatable humans from monocular RGB videos

3. [CVPR '24] Animatable Gaussians: Learning Pose-dependent Gaussian Maps for High-fidelity Human Avatar Modeling

4. [CVPR '24] GART: Gaussian Articulated Template Models

5. [CVPR '24] Human Gaussian Splatting: Real-time Rendering of Animatable Avatars

6. [CVPR '24] HUGS: Human Gaussian Splats

7. [CVPR '24] Gaussian Shell Maps for Efficient 3D Human Generation

8. GaussianHead: High-fidelity Head Avatars with Learnable Gaussian Derivation

9. [CVPR '24] GaussianAvatars: Photorealistic Head Avatars with Rigged 3D Gaussians

10. [CVPR '24] GPS-Gaussian: Generalizable Pixel-wise 3D Gaussian Splatting for Real-time Human Novel View Synthesis

11. GauHuman: Articulated Gaussian Splatting from Monocular Human Videos

12. HeadGaS: Real-Time Animatable Head Avatars via 3D Gaussian Splatting

13. [CVPR '24] HiFi4G: High-Fidelity Human Performance Rendering via Compact Gaussian Splatting

14. [CVPR '24] GaussianAvatar: Towards Realistic Human Avatar Modeling from a Single Video via Animatable 3D Gaussians

15. [CVPR '24] FlashAvatar: High-fidelity Head Avatar with Efficient Gaussian Embedding

16. [CVPR '24] Relightable Gaussian Codec Avatars

17. MonoGaussianAvatar: Monocular Gaussian Point-based Head Avatar

18. [CVPR '24] ASH: Animatable Gaussian Splats for Efficient and Photoreal Human Rendering

19. [CVPR '24] 3DGS-Avatar: Animatable Avatars via Deformable 3D Gaussian Splatting

20. [CVPR '24] GAvatar: Animatable 3D Gaussian Avatars with Implicit Mesh Learning

21. Deformable 3D Gaussian Splatting for Animatable Human Avatars

22. Human101: Training 100+FPS Human Gaussians in 100s from 1 View

23. [CVPR '24] Gaussian Head Avatar: Ultra High-fidelity Head Avatar via Dynamic Gaussians

24. HumanSplat: Generalizable Single-Image Human Gaussian Splatting with Structure Priors

Classic work:

1. A Generalization of Algebraic Surface Drawing

2. Approximate Differentiable Rendering with Algebraic Surfaces

3. Unbiased Gradient Estimation for Differentiable Surface Splatting via Poisson Sampling

4. Generating and Real-Time Rendering of Clouds

Compression:

2024:

1. [I3D '24] Reducing the Memory Footprint of 3D Gaussian Splatting

2. [CVPR '24] Compressed 3D Gaussian Splatting for Accelerated Novel View Synthesis

3. HAC: Hash-grid Assisted Context for 3D Gaussian Splatting Compression

4. [ECCV '24] End-to-End Rate-Distortion Optimized 3D Gaussian Representation

2023:

1. LightGaussian: Unbounded 3D Gaussian Compression with 15x Reduction and 200+ FPS

2. Compact3D: Compressing Gaussian Splat Radiance Field Models with Vector Quantization

3. [CVPR '24] Compact 3D Gaussian Representation for Radiance Field

4. [ECCV '24] Compact 3D Scene Representation via Self-Organizing Gaussian Grids

Diffusion:

2024:

1. AGG: Amortized Generative 3D Gaussians for Single Image to 3D

2. Fast Dynamic 3D Object Generation from a Single-view Video

3. GaussianObject: Just Taking Four Images to Get A High-Quality 3D Object with Gaussian Splatting

4. LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation

5. GALA3D: Towards Text-to-3D Complex Scene Generation via Layout-guided Generative Gaussian Splatting