unfortunately the bigger models are pretty slow in token speed. The memory is just not that fast. ...

ejpir • yesterday at 10:01 PM • 0 replies • view on HN

unfortunately the bigger models are pretty slow in token speed. The memory is just not that fast.

You can check what each model does on AMD Strix halo here:

alt Hacker News