That's why LLM will eventually be used only for initial interaction between the user in their l...

ValdikSS • today at 2:27 AM • 2 replies • view on HN

That's why LLM will eventually be used only for initial interaction between the user in their language, to prepare the data to a specialized model.

Imagine face recognition to work like a text chat, where the PC gets the frame from the camera and writes in the chat: "Who's that? Here's the RGB888 image in hex: ...".

Replies

stingraycharles • today at 6:15 AM

Do you know that MoE is a thing?

➕ show 1 reply

FeepingCreature • today at 6:09 AM

That's actually how vision language models already work, pretty much.

➕ show 1 reply

alt Hacker News

Replies