A new framework, BRDFusion, has been introduced that combines Physically Based Rendering (PBR) with generative models for high-quality inverse rendering of urban locations based on video.

image
image

What Happened

The BRDFusion framework has been developed, utilizing 3D Gaussian Splatting (3DGS) and diffusion models to reconstruct geometry, materials, and HDR lighting from video recordings. The system enables relighting, night-time simulation, and the insertion of dynamic objects into the scene.

Context

Traditional methods often face a gap between the physical accuracy of rendering and the visual realism characteristic of generative models. BRDFusion addresses this issue using a hybrid PBR + Generative approach, where 3DGS provides efficient geometry reconstruction at urban scales, while diffusion models eliminate artifacts and enhance image quality.

Why It Matters for the Industry

For the AI and autonomous vehicle industries, this technology opens up possibilities for creating high-quality synthetic simulations for training agents and testing sensors. This could lead to the emergence of specialized libraries and plugins for automating data preparation in robotics and the creation of digital twins for cities.

Why It Matters for Users

Content creators and researchers gain a tool for editing real-world urban video scenes: one can change the time of day or add objects so that they appear physically correct and natural within the ambient lighting.

What Is Not Yet Known / Limitations

At the current stage, the project is a demonstration prototype. The observed image blurring indicates difficulties in preserving high spatial frequencies when using diffusion models, which limits the method's application in high-precision production.

Sources

Author

Look at AI, Editorial Staff