logoalt Hacker News

raffael_deyesterday at 11:13 PM1 replyview on HN

"99% of humans" is a low bar. Maybe you mean "99% of people who earn money by developing software"?


Replies

WarmWashyesterday at 11:34 PM

LLMs can't really "see", so I challenge you to draw a pelican on a bike without any visual feedback, just code. Because that is how they are doing it.

Vision tokens for transformers aren't really well solved yet, which is why they can smash a phd math problem and trip over a "count the cats on the chair" problem.

show 1 reply