Google Deepmind’s New Insane 4D AI Model CAT4D

TheSkillsGrowth
1 min read5 days ago

CAT4D: Google Deepmind’s New Insane 4D AI Model

Imagine stepping into a film scene and walking around to see everything happening, instead of just watching from a fixed screen.

Google Deepmind’s New Insane 4D AI Model CAT4D
Google Deepmind’s New Insane 4D AI Model CAT4D

CAT4D can transform normal videos into immersive multi-view experiences.

Introducing CAT4D!

CAT4D transforms any real or generated video into dynamic 3D scenes with a multi-view video diffusion model.
The outputs are dynamic 3D models that we can freeze and look at from novel viewpoints, in real-time!

Forget about shooting a scene from multiple angles!

CAT4D lets you create dynamic 4D scenes from single videos and makes it possible to get the perfect camera angle in post-production.

CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models:

Projecthttps://cat-4d.github.io
arXivhttps://arxiv.org/abs/2411.18613

Most people believe that creating realistic 4D scenes requires:

- Expensive multiple camera setups
- It takes weeks of post-production
- And tons of computing power

Thanks to CAT4D, those old limitations are no longer an issue.

--

--

TheSkillsGrowth
TheSkillsGrowth

Written by TheSkillsGrowth

Crafting digital Skills For Knowledge · Crafting digital Skills For Future · Crafting digital Skills For Growth · Crafting digital Skills For Knowledge ·

Responses (4)