Google Deepmind’s New Insane 4D AI Model CAT4D
CAT4D: Google Deepmind’s New Insane 4D AI Model
Imagine stepping into a film scene and walking around to see everything happening, instead of just watching from a fixed screen.
CAT4D can transform normal videos into immersive multi-view experiences.
Introducing CAT4D!
CAT4D transforms any real or generated video into dynamic 3D scenes with a multi-view video diffusion model.
The outputs are dynamic 3D models that we can freeze and look at from novel viewpoints, in real-time!
Forget about shooting a scene from multiple angles!
CAT4D lets you create dynamic 4D scenes from single videos and makes it possible to get the perfect camera angle in post-production.
CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models:
Project:https://cat-4d.github.io
arXiv:https://arxiv.org/abs/2411.18613
Most people believe that creating realistic 4D scenes requires:
- Expensive multiple camera setups
- It takes weeks of post-production
- And tons of computing power
Thanks to CAT4D, those old limitations are no longer an issue.