logoalt Hacker News

Apple's Cubify Anything: Scaling Indoor 3D Object Detection

180 pointsby Tycho87last Monday at 8:25 AM22 commentsview on HN

Comments

Carroktoday at 6:44 PM

I really want an app I can scan my whole house with the camera/lidar combo on my phone, and export it into Blender, where I can then rearrange furniture and stuff. Apps like Scaniverse get you pretty close, but everything is one mesh, would be great to be able to slide the couch around the space without having the manually cut it out of the mesh.

pablogancharovyesterday at 11:52 PM

In case anyone is interested in rendering USDZ scans in Three.js, I created a demo: https://usdz-threejs-viewer.vercel.app/

show 1 reply
pzotoday at 9:46 AM

They overcomplicate by using 3-4 different (sub) license in one project:

in README:

Licenses - The sample code is released under Apple Sample Code License.

- The data is released under CC-by-NC-ND.

- The models are released under Apple ML Research Model Terms of Use.

Acknowledgements

- We use and acknowledge contributions from multiple open-source projects in ACKNOWLEDGEMENTS."

then having in github license button "Copyright (C) 2025 Apple Inc. All Rights Reserved."

in repo file LICENSE LICENSE_MODEL

why making it so confusing and elaborate? Its so useless to even use by 3rd party devs for making apps and releasing on their platform. So then just make it one license with the most strict restrictions you can make AGPL and/or CC-by-NC-ND .

show 2 replies
desertmonadyesterday at 11:00 PM

Looks promising but the license, Attribution-NonCommercial-NoDerivatives is pretty limiting..

callumprenticetoday at 4:50 AM

I keep meaning to get back to my suite of equirectangular image functions - viewers, editors, authoring etc. and this reminded me to resurrect the Viewer.

https://equinaut.surge.sh/?eqr=https://raw.githubusercontent...

Not quite right I think because the source image issn't 2x1 aspect ratio.

They can look really nice: both in the real world - https://equinaut.surge.sh/?eqr=https://upload.wikimedia.org/...

or

the virtual world: https://equinaut.surge.sh/?eqr=https://live.staticflickr.com...

syntaxingyesterday at 11:32 PM

Surprised this isn’t in coreML. Seems useful for the Vision Pro or something

fidotronyesterday at 11:40 PM

The accuracy of the results don't seem that great. For example, looking at the pictures on the wall in their sample, or the beams in the ceiling.

It's possible it's some artifact of the processing resolution, but I think most people that have worked with NNs for AR input will be surprised that this is not considered disappointing.

tech_jane18today at 2:39 AM

[flagged]

dev_john15today at 2:36 AM

[flagged]

show 1 reply
totetsutoday at 10:23 AM

Is this so your smart speaker can better report whats in your house back to apple?

Sviptoday at 6:12 AM

Will it work on a picture of a Power Mac G4 Cube[0]? Whenever I see "cube" and "apple" together (which, in fairness, is rare), I think of the Cube.

[0] https://en.wikipedia.org/wiki/Power_Mac_G4_Cube