logoalt Hacker News

tcbrahtoday at 5:40 PM1 replyview on HN

tbh i stopped caring about "can i run X locally" a while ago. for anything where quality matters (scripting, code, complex reasoning) the local models are just not there yet compared to API. where local shines is specific narrow tasks - TTS, embeddings, whisper for STT, stuff like that. trying to run a 70b model at 3 tok/s on your gaming GPU when you could just hit an API for like $0.002/req feels like a weird flex IMO


Replies

itigges22today at 6:37 PM

[flagged]

show 1 reply