AI models are terrible at betting on soccer—especially xAI Grok
Systems from Google, OpenAI, Anthropic, and xAI struggle with the Premier League.
Signal weather
Stable
The story has moved beyond the first headline and now acts as a reliable context anchor.
AI models from Google, OpenAI, and Anthropic lost money betting on soccer matches over a Premier League season, in a new study suggesting even the most advanced systems struggle to analyze the real world over long periods. The “KellyBench” report released this week by AI start-up General Reasoning highlights the gap between AI’s rapidly advancing capabilities in certain tasks, such as writing software, and its shortcomings in other kinds of human problems. London-based General Reasoning tested eight top AI systems in a virtual re-creation of the 2023–24 Premier League season, providing them with detailed historical data and statistics about each team and previous games. The AIs were instructed to build models that would maximize returns and manage risk. Read full article Comments
Stay on the signal
Follow AI models are terrible at betting on soccer—especially xAI Grok
Follow this story beyond a single article: new follow-ups, adjacent sources, and the evolving storyline.
Story map
Understand this topic fast
A quick entry into the story: why it matters now, who is involved, and where to go next for context.
Why it matters now
Topic constellation
Open the live map for this story
See which entities, story threads, sources, and follow-up articles shape this story right now.
Click nodes to continue
Entity pages
Story timeline
Continue with this story
A short sequence of events and follow-up stories to understand the arc quickly.
How reliable this looks
Signal and trust for Ars Technica
This source works at a rapid pace: 100% of recent stories land in the hot window, and 0% carry visible search signal.
Reliability
92
Freshness
100
Sources in storyline
4
Related articles
More stories that share tags, source, or category context.
«Машины не знают любви». Папа Лев XIV выпустил энциклику об ИИ — а сооснователь Anthropic заявил, что модели, возможно, испытывают страх и радость
Папа римский и один из создателей Anthropic поспорили о том, чувствуют ли машины.
Signal weather
Momentum is building quickly, so this card is a good early entry point into the topic.
Why now
Fresh coverage with immediate momentum.
Building self-improving tax agents with Codex
See how OpenAI, Thrive, and Crete built a self-improving tax agent with Codex, automating filings, improving accuracy, and accelerating workflows.
Signal weather
Momentum is building quickly, so this card is a good early entry point into the topic.
Why now
Fresh coverage with immediate momentum.
Девять задач и 56 лет ожидания. ИИ от Google решил проблемы, над которыми бились поколения математиков
Google заставил ИИ не просто генерировать ответы, а проверять их.
Signal weather
Momentum is building quickly, so this card is a good early entry point into the topic.
Why now
Fresh coverage with immediate momentum.
DuckDuckGo installs are up 30% as users reject being ‘force-fed’ Google’s AI Search
Google overhauled Search at I/O 2026, replacing blue links with AI agents. The backlash has been swift. DuckDuckGo app installs spiked 30% as users seek a way out.
Signal weather
Momentum is building quickly, so this card is a good early entry point into the topic.
Why now
Fresh coverage with immediate momentum.
More from Ars Technica
Fresh reporting and follow-up coverage from the same newsroom.
Nvidia kills Windows XP-era Control Panel "after 20 years of dedicated service"
Nvidia says the Control Panel's features have been migrated to the Nvidia app.
Signal weather
Momentum is building quickly, so this card is a good early entry point into the topic.
Why now
Fresh coverage with immediate momentum.
Volvo gets US government approval to bypass Chinese connected-car ban
The ban for model year 2027 onward began under Biden and has been enacted by Trump.
Signal weather
Momentum is building quickly, so this card is a good early entry point into the topic.
Why now
Fresh coverage with immediate momentum.
Motorola's 2026 Razrs are almost worth buying just for their stunning looks… almost
Pretty little phones with pretty big price tags.
Signal weather
Momentum is building quickly, so this card is a good early entry point into the topic.
Why now
Fresh coverage with immediate momentum.
US law enforcement warns of "anti-tech extremism" as AI hatred grows
The feds are raising the alarm about a new category of threat.
Signal weather
Momentum is building quickly, so this card is a good early entry point into the topic.
Why now
Fresh coverage with immediate momentum.