Hacker News

KaiserPro, today at 9:28 AM (1 reply)

I worked at a VPS competitor of Niantic.

I am conflicted on this report.

1) VPS is not new; the startup I worked at had a working public system in 2018.

2) The hard part about VPSs is not actually the navigation, it's generating and querying the map.

How does the VPS work?

You build a point cloud of features (we paid people to go and record videos in cities; Tesla/Waymo/Toyota/Google drove cars; Niantic got its players to take videos/pictures).

Align that point cloud to the 3D world and store it in a way that can be queried quickly (doing that quickly and at scale is still an area of research).

Then your client needs to extract the keypoints from an image and perform triangulation against the map to work out where the camera was when the image was taken. (There are calibration issues, but we ain't got time for that.)
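To make those three steps concrete, here is a deliberately tiny 2D sketch, not how any production VPS works. Everything in it is made up for illustration: the `MAP` contents, the 3-element "descriptors", and the names `match` and `locate`. It also assumes the client already knows its heading, so each matched feature gives a world-frame bearing; a real system solves for full 6-DoF pose from 2D-3D matches instead.

```python
import math

# Hypothetical toy map: (descriptor, 2D landmark position in world coordinates).
MAP = [
    ((1.0, 0.0, 0.0), (0.0, 10.0)),
    ((0.0, 1.0, 0.0), (10.0, 10.0)),
    ((0.0, 0.0, 1.0), (10.0, 0.0)),
]


def match(descriptor, map_entries, ratio=0.8):
    """Nearest-neighbour descriptor match with a ratio test.

    Returns the matched landmark position, or None if the best match is
    not clearly better than the second best.
    """
    ranked = sorted((math.dist(descriptor, d), pos) for d, pos in map_entries)
    best, second = ranked[0], ranked[1]
    return best[1] if best[0] < ratio * second[0] else None


def locate(observations, map_entries):
    """Estimate the camera position from (descriptor, bearing) observations.

    Each bearing is the world-frame angle (radians) from the camera to the
    feature. The camera lies on a line through each matched landmark, so we
    intersect those lines in the least-squares sense.
    """
    a11 = a12 = a22 = b1 = b2 = 0.0
    for descriptor, bearing in observations:
        pos = match(descriptor, map_entries)
        if pos is None:
            continue  # ambiguous match, skip it
        # Normal to the line of sight; the camera c satisfies n . (c - pos) = 0.
        nx, ny = -math.sin(bearing), math.cos(bearing)
        proj = nx * pos[0] + ny * pos[1]
        a11 += nx * nx
        a12 += nx * ny
        a22 += ny * ny
        b1 += nx * proj
        b2 += ny * proj
    det = a11 * a22 - a12 * a12
    if abs(det) < 1e-9:
        raise ValueError("not enough independent bearings to localise")
    # Solve the 2x2 normal equations by Cramer's rule.
    return ((a22 * b1 - a12 * b2) / det, (a11 * b2 - a12 * b1) / det)


# Usage: simulate a camera at (3, 4) seeing all three landmarks with
# slightly noisy descriptors.
true_cam = (3.0, 4.0)
noisy = [(0.9, 0.1, 0.0), (0.1, 0.9, 0.0), (0.0, 0.1, 0.9)]
obs = [
    (desc, math.atan2(pos[1] - true_cam[1], pos[0] - true_cam[0]))
    for desc, (_, pos) in zip(noisy, MAP)
]
estimate = locate(obs, MAP)
```

The linear scan in `match` is exactly the part that stops working at city scale, which is why the storage and query side is the hard bit.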

Now.

Niantic, from what I can see (and it's been a while), has a database of key landmarks, but not of the areas in between. For decent navigation I would say that this is a massive problem.

I know Niantic are pushing the whole "spatial world model", but frankly I don't think that scales. The stuff they have released is memory-bound on vGPUs, which isn't that useful for realtime querying.

I strongly suspect that they actually have a different, much more traditional system, along the lines of COLMAP or hloc, or something with a feedforward model in it.

However, for the drone use case, what you actually want is SLAM, which is a very different problem. For SLAM you need to build the map whilst you are moving, and then do loop closure or some other method to stop drift. Once you've gone there and back you can use that model for relocalisation.
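A rough illustration of drift and loop closure, under heavy simplifying assumptions: positions only (no orientation), odometry as plain (dx, dy) increments, and the loop closure already detected, i.e. we are told the last pose is the same place as the first. The function names are hypothetical, and spreading the error linearly along the path is a crude stand-in for the pose-graph optimisation a real SLAM system would run.

```python
def integrate(steps, start=(0.0, 0.0)):
    """Dead-reckon a list of poses from odometry increments (dx, dy)."""
    poses = [start]
    for dx, dy in steps:
        x, y = poses[-1]
        poses.append((x + dx, y + dy))
    return poses


def close_loop(poses):
    """Correct drift, assuming the last pose revisits the first.

    The gap between the last and first pose is the accumulated drift;
    pose i gets a share of the correction proportional to i / n.
    """
    n = len(poses) - 1
    if n <= 0:
        return list(poses)
    ex = poses[-1][0] - poses[0][0]
    ey = poses[-1][1] - poses[0][1]
    return [(x - ex * i / n, y - ey * i / n) for i, (x, y) in enumerate(poses)]


# Usage: a unit square flown with a constant +0.1 bias in x on every step.
biased_steps = [(1.1, 0.0), (0.1, 1.0), (-0.9, 0.0), (0.1, -1.0)]
drifted = integrate(biased_steps)
corrected = close_loop(drifted)
```

With a constant bias the linear correction happens to recover the square; with realistic noise it only reduces the error, and finding the loop closure in the first place is its own problem.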


Replies

fragmede, today at 10:18 AM

(Visual Positioning System)