This is so freaking awesome, I am working on a project trying run 10 models on two GPUs, loading...

medi_naseri • today at 2:36 AM • 0 replies • view on HN

This is so freaking awesome, I am working on a project trying run 10 models on two GPUs, loading/off loading is the only solution I have in mind.

Will try getting this deployed.

Does cold start timings advertised for a condition where there is no other model loaded on GPUs?

alt Hacker News