AI models are terrible at betting on soccer—especially xAI Grok

What Happened

Why It Matters

The “KellyBench” report released this week by AI start-up General Reasoning highlights the gap between AI’s rapidly advancing capabilities in certain tasks, such as writing software, and its shortcomings in other kinds of human problems.

Key Details

London-based General Reasoning tested eight top AI systems in a virtual re-creation of the 2023–24 Premier League season, providing them with detailed historical data and statistics about each team and previous games.
The AIs were instructed to build models that would maximize returns and manage risk.

Background Context

AI models from Google, OpenAI, and Anthropic lost money betting on soccer matches over a Premier League season, according to a new study that suggests even the most advanced systems struggle to analyze the real world over long periods.

Source: Ars Technica – All content – Original Link
