Google Finds Top AI Still Wrong About Facts 31% of Time

Google DeepMind's FACTS benchmark finds Gemini 3 Pro achieves only 69% factual accuracy, exposing risks for businesses that rely on AI and underscoring the need for human oversight, citation checks, and better validation.

Emma Collins Emma Collins . Comments
Google Finds Top AI Still Wrong About Facts 31% of Time

2 Minutes

Google DeepMind's new FACTS benchmark paints an unsettling picture: the most advanced AI tested still gets roughly three out of ten factual claims wrong. The study shows that fluency and speed no longer equal trustworthiness.

Benchmarking truth: what FACTS tests

FACTS evaluates models across four tough tasks: answering real-world questions from their internal knowledge, using web search effectively, citing long documents accurately, and interpreting images. In those trials, Gemini 3 Pro led the pack but reached only 69% accuracy, while other leading models trailed by a wide margin.

The practical takeaway is stark. AI can write confidently, but confidence isn't the same as correctness. In industries like finance, healthcare or law, even small errors can be costly — a law firm reportedly dismissed an employee after a brief use of AI produced fictitious case citations in a legal draft.

Why this matters for businesses and users

For companies that have staked operations on AI, FACTS is a wake-up call. It doesn't mean giving up on the technology; rather it underlines the need for guardrails: human review, stricter sourcing, and task-specific validation. Google frames the benchmark as both a warning and a roadmap, aiming to surface where models fail so researchers can fix systemic issues.

Bottom line: AI is improving quickly, but when it comes to factual reliability, there's still significant work to do. Expect better accuracy over time, but treat current models as assistants that need oversight — not infallible sources of truth.

“I cover emerging technologies, digital innovation, and the intersection of tech and everyday life. My goal is to make complex trends accessible and inspiring.”

Leave a Comment

Comments