logoalt Hacker News

dvrpyesterday at 1:37 AM0 repliesview on HN

I saw some people at a company called Pruna AI got it down to 8 seconds with Cloudflare/Replicate, but I don't know if it was on consumer hardware or an A100/H100/H200, and I don't know if the inference optimization is open-source yet.