I am not sure we are at the "efficiency" phase of this. Even if you just wire this outpu...

empath75 • yesterday at 8:20 PM • 0 replies • view on HN

I am not sure we are at the "efficiency" phase of this.

Even if you just wire this output (or probably multiples running different counterfactuals) into a multimodal LLM that interprets the video and uses it to make decisions, you have something new.

alt Hacker News