logoalt Hacker News

helterskeltertoday at 3:59 PM1 replyview on HN

[flagged]


Replies

embedding-shapetoday at 4:00 PM

Haven't seen anything particular about that, but lots of the documents with names that were half-redacted contain OCRd text that is completely garbled, but olmocr-2-7b seems to handle it just fine. Unsure if they just had sucky processes or if there is something else going on.

show 1 reply