mikestorrent • today at 6:13 AM • 2 replies • view on HN

Consider also that right now LLMs run slowly enough that you can watch them think. I've seen a demo of an LLM running at an absurdly high speed, and it reminds me of when I moved from a 2400 baud modem to a 14.4k one: BBS screens that I could watch draw were all of a sudden nigh-interactive. Faster-than-realtime video generation is also coming, and will also continue to require huge hardware for a long while yet.

I love local models - I have a machine at home that runs a few for me and it's a lot of fun - but for the time being they are not super trustworthy on tool calls and staying on script. Another year or so might change all that!

Replies

KoolKat23 • today at 8:34 AM

If anyone wishes to see the future, a fast LLM is quite eye-opening. I think chatjimmy uses Taalas' chips, where models are hardcoded into the silicon.

https://chatjimmy.ai/

ChickeNES • today at 7:16 AM

What does your local setup look like?
