Ars Technica Apr 7, 2026 at 16:53 Big Tech

Testing suggests Google's AI Overviews tell millions of lies per hour

Is 90 percent accuracy good enough for a search robot?

By Ryan Whitwam Original source

Testing suggests Google's AI Overviews tell millions of lies per hour

Looking up information on Google today means confronting AI Overviews, the Gemini-powered search robot that appears at the top of the results page. AI Overviews has had a rough time since its 2024 launch, attracting user ire over its scattershot accuracy, but it's getting better and usually provides the right answer. That's a low bar, though. A new analysis from The New York Times attempted to assess the accuracy of AI Overviews, finding it's right 90 percent of the time. The flip side is that 1 in 10 AI answers is wrong, and for Google, that means hundreds of thousands of lies going out every minute of the day. The Times conducted this analysis with the help of a startup called Oumi, which itself is deeply involved in developing AI models. The company used AI tools to probe AI Overviews with the SimpleQA evaluation, a common test to rank the factuality of generative models like Gemini. Released by OpenAI in 2024, SimpleQA is essentially a list of more than 4,000 questions with verifiable answers that can be fed into an AI. Oumi began running its test last year when Gemini 2.5 was still the company's best model. At the time, the benchmark showed an 85 percent accuracy rate. When the test was rerun following the Gemini 3 update, AI Overviews answered 91 percent of the questions correctly. If you extrapolate this miss rate out to all Google searches, AI Overviews is generating tens of millions of incorrect answers per day.Read full article Comments

Read the full article

Related tags

AI Overviews Ars Technica Google Millions Overviews Percent Accuracy Suggests Tell Millions Testing Testing Suggests

Companies and people

AI Overviews Ars Technica Google Millions Overviews Percent Accuracy Suggests Tell Millions

Story threads

AI Overviews

Последние материалы и связанный контекст по теме AI Overviews.

AI Overviews

Latest coverage and related links about AI Overviews.

Ars Technica

Последние материалы и связанный контекст по теме Ars Technica.

Ars Technica

Latest coverage and related links about Ars Technica.

Google

Latest coverage and related links about Google.

Google

Последние материалы и связанный контекст по теме Google.

Continue with this story

Follow the same topic through connected articles, entity pages, and active story threads.

Ars Technica Apr 7, 2026 at 20:02 Big Tech

What the heck is wrong with our AI overlords?

New profile of Sam Altman shines a light on a whole industry.

Altman Altman Shines Ars Technica Industry

Read article arstechnica.com

Ars Technica Apr 7, 2026 at 19:09 Big Tech

Bluesky users are mastering the fine art of blaming everything on "vibe coding"

Use of AI coding tools has become a convenient boogeyman for any tech issues.

Ars Technica Blaming Everything Bluesky Bluesky Users

Read article arstechnica.com

Ars Technica Apr 7, 2026 at 18:00 Big Tech

SCOTUS overturns 5th Circuit ruling that told ISP to kick pirates off Internet

Supreme Court's precedent-setting Cox ruling helps Grande beat music piracy claims.

Ars Technica Circuit Circuit Ruling Cox

Read article arstechnica.com

Ars Technica Apr 7, 2026 at 16:39 Big Tech

Linux kernel maintainers are following through on removing Intel 486 support

Linux devs think even one second spent on 486 support is a second too many.

Ars Technica Following Following Through Intel

Read article arstechnica.com

TechCrunch Apr 7, 2026 at 16:05 Startups

Anthropic ups compute deal with Google and Broadcom amid skyrocketing demand

Anthropic bulked up its compute deal with Google and Broadcom as the company has seen its run-rate revenue surge to $30 billion.

Anthropic Anthropic Ups Broadcom Compute

Read article techcrunch.com

Ars Technica Apr 7, 2026 at 15:54 Big Tech

Finally, Artemis delivers some exceptional, high-quality photos of the Moon

The Moon, the Earth, and the Sun—oh what fun!

Ars Technica Artemis Artemis Delivers Earth

Read article arstechnica.com

Entity pages

AI Overviews Ars Technica Google Millions Overviews Percent Accuracy

Story threads

AI Overviews

Последние материалы и связанный контекст по теме AI Overviews.

AI Overviews

Latest coverage and related links about AI Overviews.

Ars Technica

Latest coverage and related links about Ars Technica.

Ars Technica

Последние материалы и связанный контекст по теме Ars Technica.

Ad slot

Article inline monetization block

A reserved partner slot for relevant tools, services, and contextual editorial integrations.

Partner slot

More stories that share tags, source, or category context.

Ars Technica Apr 7, 2026 at 20:02 Big Tech

What the heck is wrong with our AI overlords?

New profile of Sam Altman shines a light on a whole industry.

Altman Altman Shines Ars Technica Industry

Read article arstechnica.com

Ars Technica Apr 7, 2026 at 19:09 Big Tech

Bluesky users are mastering the fine art of blaming everything on "vibe coding"

Use of AI coding tools has become a convenient boogeyman for any tech issues.

Ars Technica Blaming Everything Bluesky Bluesky Users

Read article arstechnica.com

Ars Technica Apr 7, 2026 at 18:00 Big Tech

SCOTUS overturns 5th Circuit ruling that told ISP to kick pirates off Internet

Supreme Court's precedent-setting Cox ruling helps Grande beat music piracy claims.

Ars Technica Circuit Circuit Ruling Cox

Read article arstechnica.com

Ars Technica Apr 7, 2026 at 16:39 Big Tech

Linux kernel maintainers are following through on removing Intel 486 support

Linux devs think even one second spent on 486 support is a second too many.

Ars Technica Following Following Through Intel

Read article arstechnica.com

More from Ars Technica

Fresh reporting and follow-up coverage from the same newsroom.

Open source page

Ars Technica Apr 7, 2026 at 20:02 Big Tech

What the heck is wrong with our AI overlords?

New profile of Sam Altman shines a light on a whole industry.

Altman Altman Shines Ars Technica Industry

Read article arstechnica.com

Ars Technica Apr 7, 2026 at 19:09 Big Tech

Bluesky users are mastering the fine art of blaming everything on "vibe coding"

Use of AI coding tools has become a convenient boogeyman for any tech issues.

Ars Technica Blaming Everything Bluesky Bluesky Users

Read article arstechnica.com

Ars Technica Apr 7, 2026 at 18:00 Big Tech

SCOTUS overturns 5th Circuit ruling that told ISP to kick pirates off Internet

Supreme Court's precedent-setting Cox ruling helps Grande beat music piracy claims.

Ars Technica Circuit Circuit Ruling Cox

Read article arstechnica.com

Ars Technica Apr 7, 2026 at 16:39 Big Tech

Linux kernel maintainers are following through on removing Intel 486 support

Linux devs think even one second spent on 486 support is a second too many.

Ars Technica Following Following Through Intel

Read article arstechnica.com

Testing suggests Google's AI Overviews tell millions of lies per hour

Related tags

Companies and people

Story threads

Continue with this story

Entity pages

Story threads

Article inline monetization block

Related articles

More from Ars Technica