Hacker News

Benjammer | yesterday at 10:26 PM | 1 reply

So the idea is what? What does a successful outcome look like for this test, in your mind? What should good software do? Respond and say there are 5 legs? Or question what kind of dog this even is? Or get confused by a nonsensical picture that doesn't quite match the prompt? Should it understand the concept of a dog and be able to tell you that this isn't a real dog?


Replies

biophysboy | yesterday at 11:04 PM

No, it’s just a test case to demonstrate flexibility when faced with unusual circumstances.