logoalt Hacker News

girvolast Sunday at 3:38 AM1 replyview on HN

The “change capture”/straight jacket style tests LLMs like to output drive me nuts. But humans write those all the time too so I shouldn’t be that surprised either!


Replies

mulmboylast Sunday at 8:09 AM

What do these look like?

show 1 reply