logoalt Hacker News

jtrntoday at 5:15 PM3 repliesview on HN

This is the first flash/mini model that doesn't make a complete ass of itself when I prompt for the following: "Tell me as much as possible about Skatval in Norway. Not general information. Only what is uniquely true for Skatval."

Skatval is a small local area I live in, so I know when it's bullshitting. Usually, I get a long-winded answer that is PURE Barnum-statement, like "Skatval is a rural area known for its beautiful fields and mountains" and bla bla bla.

Even with minimal thinking (it seems to do none), it gives an extremely good answer. I am really happy about this.

I also noticed it had VERY good scores on tool-use, terminal, and agentic stuff. If that is TRUE, it might be awesome for coding.

I'm tentatively optimistic about this.


Replies

amunozotoday at 5:30 PM

I tried the same with my father's little village (Zarza Capilla, in Spain), and it gave a surprisingly good answer in a couple of seconds. Amazing.

peterldownstoday at 7:26 PM

That's a really cool prompt idea, I just tried it with my neighborhood and it nailed it. Very impressive.

kingstnaptoday at 5:38 PM

You are effectively describing SimpleQA but with a single question instead of a comprehensive benchmark and you can note the dramatic increase in performance there.