I just started using it on an M4 Max 128GB, and it's the first time since buying the machine a year ago that a local LLM feels like it "just works" for reasonably decent coding.
Use pi though; Claude Code has way too much bootstrap context, which slows everything way down.