logoalt Hacker News

yu3zhou4yesterday at 11:56 PM0 repliesview on HN

An open course on building high performance LLM inference engine! Hope to finish by the end of April

https://github.com/jmaczan/tiny-vllm