logoalt Hacker News

concindsyesterday at 2:13 PM1 replyview on HN

And it's a 4B model. I worry that nontechnical users will dramatically overestimate its accuracy and underestimate hallucinations, which makes me wonder how it could really be useful for academic research.


Replies

DGoettlichyesterday at 9:53 PM

valid point. its more of a stepping stone towards larger models. we're figuring out what the best way to do this is before scaling up.