khimaros • today at 7:52 PM

it's great to see this kind of progress in reproducible weights, but color me confused. this claims to be better and smaller than Devstral-Small-2-24B, while clocking in at 32B (larger) and scoring more poorly?

Replies

ethan_l_shen • today at 8:12 PM

Hey! We are able to outperform Devstral-Small-2-24B when specializing on repositories, and come well within the range of uncertainty with our best SERA-32B model. That being said, our model is a bit larger than Devstral 24B. Could you point out what in the paper gave the impression that we were smaller? If there's something unclear, we would love to revise it.
