Yeah but the investments arent aiming for churning out SaaS apps. Its to automate large swathes of intellectual labour. Of which only SWE has been cracked yet. There is a question mark as to if the others will crack. If they arent then these investments will collapse from speculation down to reality. That possibility is what is being discussed here
As to whether that will happen, I think that risk is real. Because claude code isnt made by the generalozed capabilities of the tech but by good old non-generalozable hueristics and rule based engines. I dont think that will scale to other feilds at the factor these investments assume. Its the bitter lesson again. It scales with deliberate and specific design, not data, so it wont scale
We learnt this with ibm watson. Deepblue achieved chess supremacy but the last mile wasnt data driven, it was heiristic driven, and so watson, its successor, couldnt scale/generalize.
My prediction is that this speculation on LLMs with harnesses will collapse since they wont scale. We'll have another winter where the reasearchers will be leaft alone long wnough to come up with the next breakthrough (probably game theory based data driven agency) which might then create what this hypecycle is speculating
We will find out how much of work is given to people just so that there's a person/company associated with a technical decision. I personally think this might be quite high.
you mentioned a very good point about scalability. we're seeing alot of productivity gains, but only from SWEs, which are but a very small segment of the global economy. all other economic use cases require thorough last-mile development and iteration that is not too different with current automation tools.
There’s argument to be made that SWE hasn’t been cracked either. The latest models are great at coding medium sized applications, but figuring out the requirements and consolidation of domain knowledge is something still lacking