Hacker News

egorfine yesterday at 3:14 PM

They are indeed impractical in agentic coding.

However, in deep-research-style products you can add an LLM pass that compresses web page text into caveman speak, hugely reducing the token count.
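A minimal sketch of what such a pass might look like. The prompt wording, the `call_llm` placeholder, and the whitespace-based token proxy are all assumptions for illustration, not a real product's pipeline:

```python
# Sketch: a "caveman speak" compression pass for a deep-research pipeline.
# call_llm is a placeholder -- wire it to whatever completion API you use.

COMPRESS_PROMPT = (
    "Rewrite the following text in maximally terse, telegraphic English. "
    "Drop articles, auxiliaries, and filler; keep every fact, name, and number.\n\n"
    "TEXT:\n{text}"
)

def call_llm(prompt: str) -> str:
    """Placeholder for a real model call."""
    raise NotImplementedError

def compress_page(text: str) -> str:
    """One LLM pass that shrinks a scraped page before it enters context."""
    return call_llm(COMPRESS_PROMPT.format(text=text))

def rough_token_ratio(original: str, compressed: str) -> float:
    """Crude savings estimate; whitespace words stand in for real tokens."""
    return len(compressed.split()) / max(1, len(original.split()))
```

For example, "The quick brown fox jumps over the lazy dog near the river" compressed to "quick brown fox jumps over lazy dog near river" gives a ratio of 0.75; real tokenizer counts would differ, but articles and filler are where much of the savings comes from.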


Replies

claytongulick yesterday at 3:29 PM

I don't understand how this would work without a huge loss in resolution or "cognitive" ability.

Prediction works via the attention mechanism, and modern humans don't speak like cavemen - so how could you expect a useful token chain from a model that wasn't trained on speech like that?

I get the concept of transformers, but this isn't a 1:1 transform from English to French or whatever; you're fundamentally unable to represent certain concepts effectively in caveman speak. Or am I missing something?
