You do realize that this is likely a 10 trillion parameter model that takes something like 20 teraby...

dannypovolotski • yesterday at 10:18 PM • 0 replies • view on HN

You do realize that this is likely a 10 trillion parameter model that takes something like 20 terabytes of RAM to run inference? Calculate the price for all this VRAM .... It's not getting cheaper in the next few "months".

alt Hacker News