Quality is increasing, but these small models have very little knowledge compared to their big brothers (Qwen Image, full-size Flux 2) when it comes to characters, artists, specific items, etc.
Agreed. Given what Tongyi-MAI Lab was able to accomplish with a 6B model, I would love to see what they could do with something larger, somewhere in the range of 15-20B: between these smaller models (ZiT, Klein) and the significantly larger ones (Flux.2 dev).
I smell the bias-variance tradeoff. By underfitting more, these small models trade variance for bias, and the degenerate limit of that is a model that only knows how to produce one perfect photo: great per-image quality, almost no knowledge.
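For anyone who wants the textbook version of that tradeoff (this is the standard decomposition, nothing specific to these models), the expected squared error of an estimator $\hat{f}$ of a target $f$ with noise variance $\sigma^2$ splits as:

$$\mathbb{E}\big[(y - \hat{f}(x))^2\big] = \underbrace{\big(f(x) - \mathbb{E}[\hat{f}(x)]\big)^2}_{\text{bias}^2} + \underbrace{\mathbb{E}\big[\big(\hat{f}(x) - \mathbb{E}[\hat{f}(x)]\big)^2\big]}_{\text{variance}} + \underbrace{\sigma^2}_{\text{noise}}$$

Shrinking the model pushes bias up and variance down; the extreme is an estimator that ignores the input entirely, which is the "one perfect photo" case above.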