logoalt Hacker News

smallstepformanyesterday at 9:06 PM8 repliesview on HN

Today I asked Gemini to extract a table from an PDF appendix and create C++ data table with its contents. After 15 or so iterations with corrections and new mistakes, it eventually gave up. I was floored when it said “I’m sorry, I cannot do this simple task, I’ve exceeded my error threshold and cannot do this task for you. My LLM prediction engine invents data instead of doing a simple data copy/reformat”.

Stunned to see that Gemini threw its digital arms in the air and gave up.


Replies

base698yesterday at 9:41 PM

That's better than the loop grok got stuck in trying to use git and push the work it did leading to a $15 api credit deduction.

show 1 reply
hashtayesterday at 10:39 PM

That's interesting because my experience has been almost the opposite. A few months ago I tested Gemini on converting screenshots of tables from PDF files into CSV. I tried it on several different tables and it got every one right. It consistently outperformed ChatGPT.

show 1 reply
staticman2yesterday at 10:17 PM

You didn't say whether you were using the App but the App's performance seems to be severely throttled compared to API.

fsmvyesterday at 10:33 PM

You should just have it OCR a screenshot of the PDF that would probably work better

staindkyesterday at 10:06 PM

We've been quite impressed with GCP Document AI. Not sure if it has a free tier but perhaps that's where Google is putting all the good OCR.

jjiceyesterday at 9:13 PM

I haven't heard any accounts of it doing that since Gemini 2.5, but it was pretty easy to get it to do it with a programming task back then after a few failed attempts. Very interesting to hear it'll still do it.

suuuuuuuuyesterday at 10:53 PM

I envy you that it admitted that rather than simply making up data and lying about it.

nimchimpskyyesterday at 10:14 PM

[dead]