I think you guys are on the right track here. I’d love to learn more about the math behind the FDM. ...

nextzck • today at 12:42 AM • 1 reply • view on HN

I think you guys are on the right track here. I’d love to learn more about the math behind the FDM. I don’t think folks realize how behind we are on vision, thank you for your work here.

Replies

nee1r • today at 1:18 AM

thanks! the math and architecture of the FDM (no video encoder) is pretty simple, its a regular transformer with next-token predictions but with frames interleaved.

alt Hacker News

Replies