Hacker News

lucasay, yesterday at 8:05 PM

This feels less like automated research and more like structured trial and error with a decent feedback loop. Still useful, but I think the real bottleneck is how good your eval metric is. If that’s weak, the whole loop just optimizes for the wrong thing faster.


Replies

Almondsetat, yesterday at 10:02 PM

Designing a good fitness function, a tale as old as time...

kridsdale1, yesterday at 8:12 PM

I mean, isn’t that “the scientific method”?
