D4RT: Teaching AI to see the world in four dimensions

D4RT: Unified, efficient 4D reconstruction and tracking up to 300x faster than prior methods.

15 April 2026 07:36 AM IST

Meet D4RT, a unified AI model for 4D scene reconstruction and tracking.

D4RT pairs a powerful encoder, which builds a rich, global understanding of the video, with a lightweight decoder that answers thousands of queries in parallel. Each query asks one specific question: where is a given source pixel located at a target time, as seen from a target camera view? Through this single, flexible interface, the model efficiently solves diverse tasks such as tracking, depth estimation, and pose estimation.
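To make the query interface concrete, here is a minimal Python sketch of the "encode once, answer many queries" pattern described above. It is an illustration under stated assumptions, not DeepMind's implementation: the names `Query`, `encode`, `decode`, `track_queries`, and `depth_queries` are hypothetical, the decoder is a stand-in that returns dummy 3D points, and the way each task is mapped to a set of queries is one plausible reading of the description.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Query:
    """Hypothetical query: where is source pixel (u, v) of frame t_src
    at time t_tgt, expressed in the camera view of frame t_cam?"""
    u: float
    v: float
    t_src: int
    t_tgt: int
    t_cam: int


def track_queries(u: float, v: float, t_src: int, num_frames: int) -> List[Query]:
    """Tracking (assumed mapping): fix the source pixel, sweep the target time,
    and keep one reference camera (here, the source frame's camera)."""
    return [Query(u, v, t_src, t, t_src) for t in range(num_frames)]


def depth_queries(width: int, height: int, t: int) -> List[Query]:
    """Depth (assumed mapping): query every pixel of frame t with source time,
    target time, and camera all set to t."""
    return [Query(float(u), float(v), t, t, t)
            for v in range(height) for u in range(width)]


def encode(video_frames: list) -> dict:
    """Stand-in for the encoder: runs once per video and returns a
    placeholder for the global scene representation."""
    return {"num_frames": len(video_frames)}


def decode(scene: dict, queries: List[Query]) -> List[Tuple[float, float, float]]:
    """Stand-in for the decoder. Each query is answered independently of the
    others, which is what would allow a real model to batch them in parallel.
    The returned 3D points are dummy values, not real predictions."""
    return [(q.u, q.v, 1.0 + 0.1 * (q.t_tgt - q.t_src)) for q in queries]


# Encode the video once, then reuse the result for different tasks.
scene = encode(video_frames=[None] * 5)  # five placeholder frames
track = decode(scene, track_queries(10.0, 20.0, t_src=0, num_frames=5))
depth = decode(scene, depth_queries(width=4, height=3, t=2))
```

The point of the sketch is the design choice: because the expensive encoding step runs once and every query is independent, switching tasks only changes which queries are asked, not the model.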

Disclaimer: This content has been automatically aggregated from GOOGLE DEEPMIND for informational purposes. To read the original article, please visit GOOGLE DEEPMIND.