Hacker News

thomastjeffery 02/20/2025

There is no such thing as "thing" here.

These models are trained so that the given conditions (the visual input and the text prompt) are followed by a desirable continuation (motor function over time).

The only dimension accuracy can apply to is desirability.


Replies

jayd16 02/21/2025

You don't think there's any segmentation going on?
