You do realize that this is likely a 10 trillion parameter model that takes something like 20 terabytes of RAM to run inference? Calculate the price for all this VRAM .... It's not getting cheaper in the next few "months".