With the same mindset, but without even PyTorch as a dependency, there's a straightforward CPU implementation of llama/gemma in Rust: https://github.com/samuel-vitorino/lm.rs/
It's impressive to see how little code it actually takes to run these models at all.
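To give a feel for why so little code suffices (this is a minimal illustrative sketch, not code from lm.rs): a llama-style CPU forward pass boils down to a few tight numeric loops, with `matvec` doing essentially all the work. The function names and toy numbers below are my own assumptions.

```rust
// Illustrative sketch (NOT taken from lm.rs): the handful of primitives
// that dominate a llama/gemma-style CPU forward pass. Most of the rest
// of an inference engine is plumbing around loops like these.

/// Matrix-vector product: out[i] = sum_j w[i*n + j] * x[j].
/// This single loop accounts for nearly all of the compute in inference.
fn matvec(out: &mut [f32], w: &[f32], x: &[f32]) {
    let n = x.len();
    for (i, o) in out.iter_mut().enumerate() {
        let row = &w[i * n..(i + 1) * n];
        *o = row.iter().zip(x).map(|(a, b)| a * b).sum();
    }
}

/// RMSNorm, the normalization used by llama/gemma-family models.
fn rmsnorm(out: &mut [f32], x: &[f32], weight: &[f32], eps: f32) {
    let ms = x.iter().map(|v| v * v).sum::<f32>() / x.len() as f32;
    let scale = 1.0 / (ms + eps).sqrt();
    for ((o, &v), &w) in out.iter_mut().zip(x).zip(weight) {
        *o = v * scale * w;
    }
}

/// Numerically stable softmax over attention scores / output logits.
fn softmax(x: &mut [f32]) {
    let max = x.iter().cloned().fold(f32::NEG_INFINITY, f32::max);
    let sum: f32 = x
        .iter_mut()
        .map(|v| {
            *v = (*v - max).exp();
            *v
        })
        .sum();
    x.iter_mut().for_each(|v| *v /= sum);
}

fn main() {
    // Tiny smoke test with made-up numbers.
    let w = [1.0, 2.0, 3.0, 4.0]; // 2x2 weight matrix, row-major
    let x = [1.0, 1.0];
    let mut y = [0.0f32; 2];
    matvec(&mut y, &w, &x);
    println!("matvec:  {:?}", y); // [3.0, 7.0]

    let g = [1.0f32, 1.0];
    let mut n = [0.0f32; 2];
    rmsnorm(&mut n, &x, &g, 1e-5);
    println!("rmsnorm: {:?}", n);

    let mut probs = [1.0f32, 2.0, 3.0];
    softmax(&mut probs);
    println!("softmax: {:?}", probs);
}
```

Stack attention and a feed-forward block out of these pieces, stream the weights from disk, and you have a working (if unoptimized) inference loop; that's roughly the scope a small repo like this has to cover.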