logoalt Hacker News

A4ET8a8uTh0_v2yesterday at 4:06 AM1 replyview on HN

<< You are ready to fine-tune: You need the consistent, deterministic behavior that comes from fine-tuning on specific data, rather than the variability of zero-shot prompting. << You prioritize local-first deployment: Your application requires near-instant latency and total data privacy, running efficiently within the compute and battery limits of edge devices.

Thank you. I felt that was a very under appreciated direction ( most of the spotlight seemed to be on 'biggest' models ).


Replies

canyon289yesterday at 2:21 PM

I'm with you! Small generative models are awesome, I thought so a decade ago and I still think so now! The size of what is "small" has definitely increased though, I used to think a 100 parameter model was large back in 2016, but here I am now saying 270 million is small :)