Has anyone else found Google's AI overview to be oddly error prone?

24 points • by ckemere • 04/24/2025 • 15 comments • view on HN

I've been quite impressed by Google's AI overviews. This past week, though, I was interested in what I thought was a fairly simple question - to calculate compound interest.

Specifically, I was curious about how Harvard's endowment has grown from its initial £780 in 1638, so I asked Google to calculate compound interest for me. A variety of searches all yield a reasonable formula which is then calculated to be quite wrong. For example: {calculate the present value of $100 compounded annually for 386 years at 3% interest} yields $0.736. {how much would a 100 dollar investment in 1638 be worth in 2025 if invested} yields $3,903.46. {100 dollars compounded annually for 386 years at 3 percent} yields "The future value of the investment after 386 years is approximately $70,389." And my favorite: {100 dollars compounded since 1638} tells me a variety of outcomes for different interest rates: "A = 100 * (1 + 0.06)^387 A ≈ 8,090,950.14 A = 100 * (1 + 0.05)^387 A ≈ 10,822,768.28 A = 100 * (1 + 0.04)^387 A ≈ 14,422,758.11"

How can we be so reasonable and yet so bad!?

Comments

nitwit005 • 04/24/2025

It seems expectedly error prone.

Aside from the general limitations of this technology, Google needs this to be quite cheap if it runs for every request.

There is not a lot of revenue for a single search, and right now the AI results are actually pushing the links people are paying Google to display further down the page.

joegibbs • 04/24/2025

It's terrible. Gemini 2.5 Pro is great, but the AI overviews must be using a smaller model. I hate it when I look up something niche and it smugly tells me that I must be mistaken because there is no such thing. Also it gives annoyingly family-friendly responses to questions that it would be better off not responding to. The other day I was trying to find a Sopranos quote about two kinds of businesses being recession-proof, one of which being "certain aspects of entertainment" (i.e. prostitution) and it was telling me the certain aspects were filmmaking and music because they make people happy.

➕ show 1 reply

namaria • 04/24/2025

Why would you use an LLM for this? A simple spreadsheet can do this sort of calculation easily and deterministically.

Also, the assumption of '3% interest' is wrong. There are records of stretches achieving 15% returns for several years and reaching 23% in 2007, for example.

https://www.bloomberg.com/news/articles/2005-01-11/harvard-l...

https://www.wsj.com/articles/SB118771455093604172

This was 2 minutes of old school search, no LLM needed.

➕ show 2 replies

ZeroGravitas • 04/24/2025

I saw a report via Simon Willison that if you make up a phrase and add "meaning" to the end of your Google search, it'll invent a meaning for it.

His example was "A swan won't prevent a hurricane meaning"

https://simonwillison.net/2025/Apr/23/meaning-slop/

mergy • 04/24/2025

They are awful often for me. Examples - recommending installation of packages and software that doesn't exist, or settings changes that don't exist I In applications, etc. They fill the page but it's sadly noise so it cheapens the whole experience when I would have just preferred a link to a page from a person that knows what the hell they are talking about.

➕ show 1 reply

elicksaur • 04/24/2025

I think I’d call these examples “predictable” failures instead of “odd”.

cratermoon • 04/24/2025

LLMs can't do math.

➕ show 2 replies

zacksiri • 04/24/2025

I recently used Gemini and Google search (with overview) to confirm whether a snack i bought from japan has expired. Used gemini to take a picture of the label written in japanese

One item said 25/7/25 the other one said 25/7/24 as you can imagine I was sure the first one was safe but the second one was confusing.

It told me that it's safe to eat because japanese date format is Year / Month / Date.

I looked up japanese date format in google (with overview) just to confirm. I guess we'll find out. Will report back soon.

alt Hacker News

Has anyone else found Google's AI overview to be oddly error prone?

Comments