logoalt Hacker News

furyofantaresyesterday at 11:46 PM2 repliesview on HN

When the dust settles, for example if LLM's were to stop improving today, we would come to learn their exact capabilities, what they can do reliably and what they can't.

Once we know what they can do well and how to get them to do it well, and what they can't, you could say we "trust" them to do the first category well and just stop trying to get it to do the second category.


Replies

bandramitoday at 3:42 AM

This feeds the adoption problem, though: a lot of companies are thinking "why settle for the current models when even the vendors are saying the models in six months will be exponentially better? Let's let the early adopters work out the bugs and move when these things are more stable"

Liongatoday at 7:02 AM

LLMs are random by nature, they might something done one time but miserably fail the next