Hacker News

A complete Llama2 inference engine that fits in 1356 bytes of x86 assembly

11 points by monax today at 6:53 AM | 0 comments
