I do sometimes wonder if we will get "detailed enough" vector embeddings in LLMs to bring the grain of resolution down below human perception - like having enough bits to fully capture what's on tape in the audio world. Maybe this is never possible, and (I hope) some details are unresolvable, but it will be interesting to see.
LLMs are already used in signal processing, so the idea is being explored.
Simply put, anything that can be encoded is a language, so you just need sensors to capture and classify the incoming data and build that into a model. The real question is post-training the model to behave correctly, as these domains are far less explored than things at the human scale. RLHF may be a poor choice because the models may see actual patterns that humans can't perceive, and humans will discount them as incorrect.
I suspect the curse of dimensionality makes this an optimization dead end. You hit prohibitive latency limits on retrieval long before the resolution approaches human perception. Even with current dimensions, the trade-off between index size and query speed is already the main constraint for production systems.
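To make the curse-of-dimensionality point concrete, here's a quick sketch (using random Gaussian vectors as synthetic stand-ins for embeddings, not any real model's output) of the distance-concentration effect: as dimension grows, the nearest and farthest neighbors of a query become nearly equidistant, which is what degrades retrieval quality and forces the index-size/query-speed trade-offs mentioned above.

```python
import math
import random

def concentration_ratio(dim, n_points=2000):
    """Spread of distances from a random query to random points,
    relative to the nearest distance: (max - min) / min.

    A large ratio means neighbors are well separated (low dim);
    a ratio near 0 means everything is roughly equidistant (high dim),
    i.e. the curse of dimensionality.
    """
    query = [random.gauss(0, 1) for _ in range(dim)]
    dists = [
        math.dist(query, [random.gauss(0, 1) for _ in range(dim)])
        for _ in range(n_points)
    ]
    return (max(dists) - min(dists)) / min(dists)

random.seed(0)  # reproducibility for this demo
for dim in (2, 32, 512, 4096):
    print(dim, round(concentration_ratio(dim), 3))
```

Running this, the ratio collapses by orders of magnitude between 2 and 4096 dimensions, which is one intuition for why pushing embedding resolution ever higher runs into retrieval limits rather than smoothly approaching "tape quality."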