And that's why my whole schtick when it comes to agent design is that agents need to learn onli...

quotemstr • yesterday at 9:48 PM • 1 reply • view on HN

And that's why my whole schtick when it comes to agent design is that agents need to learn online, continuously, and in adapter space via some PEFT mechanism (I like soft prompts and prefix tuning), because it's really hard to ascend gradients in discrete domains like tokens.

Replies

embedding-shape • yesterday at 9:56 PM

> The model knows damn well when it's written ugly code. You can just ask it.

That's not been my experience at all, what model and prompt would you use for that? Every single one I've tried is oblivious to if a design makes sense or not unless explicitly prompted for it with constraints, future ideas and so on.

➕ show 1 reply

alt Hacker News

Replies