1. Create a point cloud from a scene (either via lidar, or via photogrammetry from multiple images)
2. Replace each point of the point cloud with a fuzzy ellipsoid that has a bunch of parameters for its position + size + orientation + view-dependent color (via spherical harmonics up to some low order)
3. If you render these ellipsoids using a differentiable renderer, then you can subtract the resulting image from the ground truth (i.e. your original photos), and calculate the partial derivatives of the error with respect to each of the millions of ellipsoid parameters that you fed into the renderer.
4. Now you can run gradient descent using the differentiable renderer, which makes your fuzzy ellipsoids converge to something that closely reproduces the ground truth images (from multiple angles) — a rough sketch of this loop is below the list.
5. Since the ellipsoids started at the 3D point cloud's positions, the 3D structure of the scene will likely be preserved during gradient descent, thus the resulting scene will support novel camera angles with plausible-looking results.
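To make steps 2-4 concrete, here's a heavily simplified, self-contained PyTorch sketch of the idea. It is not the actual 3D Gaussian Splatting rasterizer: it splats isotropic Gaussian blobs instead of oriented ellipsoids, uses a single view-independent RGB color per point instead of spherical harmonics, and skips depth sorting/alpha blending entirely. The `render` and `project` functions, the random initialization, and the random `gt` image are all made up for illustration; the point is just the shape of the loop: differentiable render -> photometric loss against the ground-truth photo -> gradient step on every point's parameters.

```python
# Simplified sketch of the "differentiable render + gradient descent" loop.
# Not the real 3DGS rasterizer: isotropic blobs, no SH, no alpha compositing.
import torch

def project(points, f=100.0, cx=32.0, cy=32.0):
    # Simple pinhole projection (camera at the origin looking down +z).
    z = points[:, 2].clamp(min=1e-3)
    u = f * points[:, 0] / z + cx
    v = f * points[:, 1] / z + cy
    return torch.stack([u, v], dim=-1)

def render(positions, log_scales, colors, opacities, H=64, W=64):
    # Splat each point as an isotropic 2D Gaussian and take a weighted
    # average of the per-point colors at every pixel.
    uv = project(positions)                                        # (N, 2)
    ys, xs = torch.meshgrid(torch.arange(H, dtype=torch.float32),
                            torch.arange(W, dtype=torch.float32),
                            indexing="ij")
    pix = torch.stack([xs, ys], dim=-1).reshape(-1, 2)             # (H*W, 2)
    d2 = ((pix[None] - uv[:, None]) ** 2).sum(-1)                  # (N, H*W)
    sigma = torch.exp(log_scales)[:, None]                         # learned radius
    w = torch.sigmoid(opacities)[:, None] * torch.exp(-d2 / (2 * sigma ** 2))
    img = (w[..., None] * torch.sigmoid(colors)[:, None, :]).sum(0)  # (H*W, 3)
    return (img / (w.sum(0)[..., None] + 1e-6)).reshape(H, W, 3)

# Stand-in for initializing from a real point cloud (random here).
N = 500
positions  = (torch.randn(N, 3) * 0.3 + torch.tensor([0.0, 0.0, 2.0])).requires_grad_(True)
log_scales = torch.zeros(N, requires_grad=True)
colors     = torch.zeros(N, 3, requires_grad=True)
opacities  = torch.zeros(N, requires_grad=True)

gt = torch.rand(64, 64, 3)  # stand-in for one real ground-truth photo

opt = torch.optim.Adam([positions, log_scales, colors, opacities], lr=1e-2)
for step in range(200):
    opt.zero_grad()
    pred = render(positions, log_scales, colors, opacities)
    loss = (pred - gt).abs().mean()   # L1 photometric loss vs. ground truth
    loss.backward()                   # gradients w.r.t. every point's parameters
    opt.step()
```

In the real pipeline you would loop over many training photos with their camera poses, use anisotropic covariances and SH color, and periodically split/prune points, but the optimization structure is the same.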
Thanks.
How hard is it to handle cases where the starting positions of the ellipsoids in 3D are significantly off? How common is that scenario with the current state of the art? E.g., with only a stereoscopic image pair, the correspondences are often inaccurate.
Thanks.
Thanks for the explanation!
Great explanation/simplification. Top quality contribution.
Or: Matrix bullet time with more viewpoints and lower quality.
You... you must have been quite some 5-year-old.