logoalt Hacker News

culilast Sunday at 8:47 PM2 repliesview on HN

Kimi K2 is the model that most consistently passes the clock test. I agree it's definitely got something unique going on

https://clocks.brianmoore.com/


Replies

davejlast Sunday at 9:03 PM

Nice! I'm curious, what does this service cost to run? I notice that you don't have more expensive models like Opus but querying the models every minute must add up over time (excuse pun)?

show 1 reply
eunoslast Sunday at 10:05 PM

Lol why's GPT 5 broken on that test. DeepSeek surprisingly crisp and robust