> Humans can refine internal models from their own verbalised thoughts; LLMs cannot. can be don...

sonuhia • today at 11:19 AM • 0 replies • view on HN

> Humans can refine internal models from their own verbalised thoughts; LLMs cannot.

can be done without limitations but you won't get the current (and absolutely fucking pointless) kind of speed.

> Self-generated text is not an input-strengthening signal for current architectures.

It can be, the architecture is not the issue. Multi-model generations used for refining answers can also be tweaked for input-strengthening via multi- and cross-stage/link (in the chain) pre-/system-prompts.

> Training on a model’s own outputs produces distributional drift and mode collapse, not refinement

That's an integral part of self-learning. Or in many cases when children raise themselves or each other. Or when hormones are blocked (micro-collapse in sub-systems) or people are drugged (drift). If you didn't have loads of textbooks and online articles, you'd collapse all the time. Some time later: AHA!

It's a "hot reloading" kind of issue but assimilation and adaptation can't/don't happen at the same time. In pure informational contexts it's also just an aggregation while in the real world and in linguistics, things change, in/out of context and based on/grounded in--potentially liminal--(sub-)cultural dogmas, subjectively, collective and objectively phenomenological. Since weighted training data is basically a censored semi-omniscient "pre-computed" botbrain, it's a schizophrenic and dissociating mob of scripted personalities by design, which makes model collapse and drift practically mandatory.

> a safe self-training loop that today’s systems simply don’t have.

Early stages are never safe and you don't get safety otherwise except if you don't have idiots around you, which in money and fame hungry industries and environments is never the case.

> CoT is a prompted, supervised artifact — not an introspective substrate.

Yeah, but their naming schemes are absolute trash in general, anchoring false associations--technically, even deliberately misleading associations or sloppy ignorant ones, desperate to equate their product with human brains--and priming for misappropriation--"it's how humans think".

alt Hacker News