Ya, i dont know of anyone wanting to run very large AI models in a windows environment. Or, frankly, on a laptop. Why not just VPN into a dedicated server?
I do. I can take my laptop anywhere I want, for example to a coffee shop and run a coding model while eating a croissant without worrying about an internet connection, as the term local model implies.
How much does a dedicated server with 128GB vram cost a month.
With BUILD happening tomorrow, I suspect Microsoft is going to have some stuff about local AI there with MS Foundry on Windows/Foundry Local. The timing of this announcement a day before BUILD is obviously intentional.
Suddenly all the Windows K2 stuff makes sense, but I doubt it'll be enough. Its too little too late for Microsoft.