logoalt Hacker News

Aurornisyesterday at 7:04 PM1 replyview on HN

The person had “minecraft.py” in their context and the session context was very long.

Having an LLM session with very long context occasionally go off on a tangent is not uncommon. The people who expect absolute perfection out of every LLM interaction see this as some total indictment of the entire technology, but the people who use these tools daily have learned to treat the output as partially stochastic and to avoid extremely long context, even if the model offers it. It’s best to compact strategically or summarize next steps to hand off to a new session. Using sub-sessions can also reduce context pollution at the cost of additional token expenditure to summarize and transfer data to and from the sub-session.


Replies

ShinyLeftPadyesterday at 11:49 PM

TLDR: the first one

> this amazing tech is so stupid it just randomly brings up Minecraft or it’s got a major security issue

You can sugarcoat it but that's what it is. It's not slightly wrong like a junior engineer or weird like a junior engineer on LSD, it becomes like "your junior engineer suffered a stroke or sudden onset dementia completely forgetting the entire point". one trigger word and that's it we're building Minecraft castles now.