carbocation • today at 12:13 AM • 1 reply

Was the choice of such a small model driven by a desire for high tok/sec? I ask because an M4 Pro machine with 48 GB of memory can run larger models (if model intelligence is what would make it more useful).

Replies

sourc3 • today at 12:23 AM

Yes, that was my goal. I also noticed a huge performance gain going from Ollama to MLX. Your mileage may vary.
