What type of hardware do I need to run a small model like this? I don't do Apple.

vichle • today at 8:30 AM • 2 replies • view on HN

bodegajed • today at 8:44 AM

1.5B models can run on CPU inference at around 12 tokens per second if I remember correctly.

➕ show 1 reply

jychang • today at 8:33 AM

1.54GB model? You can run this on a raspberry pi.

➕ show 1 reply

alt Hacker News