I wonder if it will be better at building TUI's. It has been absolutely abysmal at interacting with them and building them
That's my experience too. I've had increased luck encouraging the LLM to structure the code in "functional core, imperative shell" style, and telling it stupid things like "make sure you can test the code you're writing".
Claude can build UI but it sucks at testing it and iterating on it. Fable showed some improvements in this regard but alas.