logoalt Hacker News

Havocyesterday at 10:25 PM0 repliesview on HN

Are world models from the perspective of an observer in the world or zoomed out?

Or in gaming terms do these models think FPS or RTS?

Text models and pixel grid vision models is easy but struggling to wrap my head around what world model "sees" so to speak.