Hacker News

claudeIsDown, yesterday at 5:19 PM

I would love to see a more descriptive review from simonw instead of just SVG generations.


Replies

lossolo, yesterday at 5:40 PM

He is not an ML researcher or engineer; he is a passionate AI enthusiast and blogger. He mostly does SVGs and other low-effort checks (sometimes with major flaws, as people have pointed out a few times in the HN comments). Properly evaluating a model across all fronts requires a deep understanding of LLMs, how they work, the trade-offs behind new architectures, and the relevant research papers. It also takes a lot of time to build a proper evaluation framework, so you can't just vibe code one if you want something solid.
