logoalt Hacker News

thomastjefferylast Thursday at 8:20 PM1 replyview on HN

There is no such thing as "thing" here.

These models are trained such that the given conditions (the visual input and the text prompt) will be continued with a desirable continuation (motor function over time).

The only dimension accuracy can apply to is desirability.


Replies

jayd16last Friday at 12:24 AM

You don't think there's any segmentation going on?

show 1 reply