a-t-c-g • today at 2:01 AM

The quality of custom models trained on proper reasoning datasets[0], even at small parameter counts (3-7B is the sweet spot), is incredible now.

[0]: cartesien.io or Salesforce's WebscaleRL

objektif • today at 2:45 AM

What are you basing how good they are on? Personal experience or some benchmarks?
