logoalt Hacker News

UltraSanetoday at 5:17 PM3 repliesview on HN

Which is silly because you can easily just use OCR and screenshots to create DRM free versions of Kindle books.


Replies

jm4today at 5:28 PM

Not to mention it’s as easy to download books from Anna’s Archive as it is to buy them from Amazon. It’s weird going through so much effort to lock down books people already paid for.

I wonder how much this is about making it difficult for people to migrate to another platform. I recently switched to Kobo and the reader is far superior to Kindle. I had a hell of a time moving my library though.

show 2 replies
asveikautoday at 5:39 PM

What OCR do you guys use? I have only seen OCR that makes a lot of errors. Having it be usable requires tons of manual review. I probably wouldn't trust an LLM to do that review because it may introduce its own errors.

Edit: downvoters, would you like to answer my question? I would genuinely like to know. I thought based on the confidence of the comment above there must be a super accurate OCR I've never heard of, but after seeing the sibling comment I'm going to guess there isn't.

show 1 reply
estimator7292today at 6:12 PM

OCR'd ebooks are universally trash. For one, all formatting is gone. Anything in the book other than ASCII characters will vanish. You lose links within the book and all other advanced features.

And OCR is generally just not accurate enough and still makes very visible mistakes throughout the text.

Have you read many OCR'd ebooks? I have, and every single one was massively inferior. Most I would consider barely readable.

show 1 reply