logoalt Hacker News

fzysingularityyesterday at 11:43 PM2 repliesview on HN

Pretty terribly expensive way to watch a video with Claude.

Use Gemini or some local VLM to do this way more efficiently. We spent quite a bit of time on video understanding, and Claude will just burn tokens.

Check out this library: https://vlm-run.github.io/mm/

You can swap models and try out different encoding methods for videos (https://vlm-run.github.io/mm/encoders/#video)


Replies

Tenoketoday at 1:34 AM

Do you mean that Gemini is most token-efficent at watching videos? Is that the case for e.g. just giving it a video in the browser? I admit, I dont give LLMs videos as I just assume it'll burn too many tokens.

show 1 reply
mh-today at 1:15 AM

Assuming that's your project, the GitHub link from the PyPi page is a 404.