what tokens/s are you getting with a 122B MoE model in this setup? I didn't see any benchm...

altruios • today at 7:07 PM • 3 replies • view on HN

what tokens/s are you getting with a 122B MoE model in this setup? I didn't see any benchmarks in the benchmarks section on the readme.md

aegis_camera • today at 8:32 PM

https://www.sharpai.org/benchmark/ The MLX part is what we've done with SwiftLM, the local result is still being verified more details are on-going.

aegis_camera • today at 7:48 PM

I'll add more details. We just wired up the pipeline on both MAC and IOS.

gigatexal • today at 7:29 PM

yeah this I'd like to see added to teh readme.

alt Hacker News