Pretty terribly expensive way to watch a video with Claude. Use Gemini or some local VLM to do thi...

fzysingularity • yesterday at 11:43 PM • 2 replies • view on HN

Pretty terribly expensive way to watch a video with Claude.

Use Gemini or some local VLM to do this way more efficiently. We spent quite a bit of time on video understanding, and Claude will just burn tokens.

Check out this library: https://vlm-run.github.io/mm/

You can swap models and try out different encoding methods for videos (https://vlm-run.github.io/mm/encoders/#video)

Replies

Tenoke • today at 1:34 AM

Do you mean that Gemini is most token-efficent at watching videos? Is that the case for e.g. just giving it a video in the browser? I admit, I dont give LLMs videos as I just assume it'll burn too many tokens.

➕ show 1 reply

mh- • today at 1:15 AM

Assuming that's your project, the GitHub link from the PyPi page is a 404.

alt Hacker News

Replies