This is not local but Gemini models can process very long videos and provide description with timest...

GaggiX • today at 8:02 PM • 1 reply • view on HN

This is not local but Gemini models can process very long videos and provide description with timestamps if asked for.

embedding-shape • today at 9:30 PM

Nor would it be describing things as they happen, but instead needing pre-processing, so in the end, very different :)

alt Hacker News