TurdF3rguson • yesterday at 10:15 PM

It's not. OCR is not what the vision model is doing here. We're used to using OCR as a verb but it's more accurate to say the model "visioned" it.

Also, some models still do OCR and it's usually way more expensive that way.
