Hacker News

altruios today at 7:07 PM | 3 replies

What tokens/s are you getting with a 122B MoE model in this setup? I didn't see any numbers in the benchmarks section of the README.md.


Replies

aegis_camera today at 8:32 PM

https://www.sharpai.org/benchmark/ The MLX part is what we've done with SwiftLM. The local results are still being verified, and more details are on the way.

aegis_camera today at 7:48 PM

I'll add more details. We just wired up the pipeline on both Mac and iOS.

gigatexal today at 7:29 PM

Yeah, this I'd like to see added to the README.