Yeh local models will continue to gain performance before they can IPO
I only use free Gemini Pro to plan then scrape the log in Google Drive into local Qwen/Gemma+pi set up
I can plan and architect with Gemini on my phone or wherever and a cron job + custom JSON parser at home updates context in local model setup