logoalt Hacker News

a-t-c-gtoday at 2:01 AM1 replyview on HN

The quality of custom models trained with proper reasoning datasets[0] even with small parameters (3-7B is sweet spot) is incredible now

[0]: cartesien.io or Salesforce's WebscaleRL


Replies

objektiftoday at 2:45 AM

What are you basing how good they are on? Personal experience or some benchmarks?

show 1 reply