Hacker News

joelthelion today at 6:11 AM

This looks like something that can (and should) be reimplemented open-source. It doesn't look like a particularly daunting project.


Replies

bluecoconut today at 6:18 AM

I've been working on something very similar as a tool for my own AI research, though I don't have the success they claim. Mine often plateaus on the optimization metric. I think the secret sauce is in the meta-prompting and meta-heuristics, which the paper describes only vaguely. It makes sense, though: they change the dynamics of the search space and help the LLM get out of ruts. I'm now going to try integrating some ideas based on my interpretation of their work to see how it goes.

If it goes well, I could open source it.

What would you want to optimize with such a framework? (So far I've been focusing on optimizing ML training and architecture search itself.) Hearing other ideas would help motivate me to open source it if there's real demand for something like this.

friederrr today at 6:23 AM

Yep, agree.

Had mentioned the same on X: https://x.com/friederrrr/status/1922850981181784152?t=usXpK1...