Hacker News
A complete Llama2 inference engine that fits in 1356 bytes of x86 assembly
11 points • by monax • today at 6:53 AM • 0 comments