logoalt Hacker News

greenavocadoyesterday at 5:48 PM2 repliesview on HN

I typically find myself using a context of between 150-500k with GPT models so local models are simply not enough and I stopped using them.


Replies

stymaaryesterday at 5:53 PM

That's way higher than their optimal ceiling (and absolutely suboptimal from a token cost point of view), why are you doing that?

show 1 reply
c0rruptbytesyesterday at 6:01 PM

large contexts degrade the performance - attention doesn't work will for large windows like that and cloud models are kind of hacking it

local models do involve some context engineering to get it okay, but it's not that rough