The novel aspect here seems to be 3D LiDAR output from 2D video using post-training. As far as I...

ra7 • yesterday at 5:04 PM • 2 replies • view on HN

The novel aspect here seems to be 3D LiDAR output from 2D video using post-training. As far as I'm aware, no other video world models can do this.

IMO, access to DeepMind and Google infra is a hugely understated advantage Waymo has that no other competitor can replicate.

Replies

codexb • yesterday at 6:55 PM

3d from moving 2d images has been a thing for decades.

➕ show 1 reply

moffkalast • yesterday at 10:32 PM

It's not unheard of, there are a handful [0] of metric monodepth methods that output data that's not unlike a really inaccurate 3D lidar, though theirs certainly looks SOTA.

[0] https://github.com/YvanYin/Metric3D

alt Hacker News

Replies