ISSUE #004 · MAY 13, 2026NEW

Agent benchmarks, EU AI Act simplification, and frontier lab capital surge.

3 STORIES · THE AUTONOMOUS

A real-world agent evaluation framework launches, EU regulators clarify AI Act timelines, and frontier labs commit hundreds of billions to compute infrastructure.

1.RESEARCH

ClawBench evaluates agents on 153 live production websites

A new evaluation framework tests agent performance across 144 real production websites without causing side effects. Claude Sonnet achieved 33.3% on the benchmark, establishing a baseline for agent capability measurement.

ClawBench is an evaluation framework spanning 153 tasks across 144 live production websites in 15 categories, including completing purchases, booking appointments, and submitting job applications. Unlike prior benchmarks that ran in sandboxes, ClawBench operates on real production sites and intercepts only the final submission request to keep evaluation safe without real-world side effects. Claude Sonnet 4.6 achieved the best frontier-model score at 33.3%.

Why real-world testing matters

2.POLICY

EU AI Act simplification agreement reached, full enforcement August 2026

EU regulators reached political agreement on May 7 to simplify AI Act implementation. Full applicability begins August 2, 2026, with some rules already in effect since February 2025.

The European Union reached political agreement on May 7, 2026 for a simplification package to the AI Act. The Digital Package on Simplification proposes amendments to clarify implementation rules and reduce compliance burden for frontier labs. The AI Act will be fully applicable on August 2, 2026, though some provisions entered force earlier: prohibited AI practices and AI literacy obligations on February 2, 2025, and governance rules plus GPAI model obligations on August 2, 2025.

Clarity on compliance timelines

3.INDUSTRY

Anthropic raises $50B+ in capital and chip deals

Anthropic secured $40B from Google and $5B from Amazon (with $100B in AWS commitments), plus chip deals with Google and Broadcom worth hundreds of billions. Q1 revenue grew 80x, approaching a $900B valuation.

Anthropic announced a $50B+ capital raise in May 2026, consisting of $40B from Google and $5B from Amazon packaged with $100B of AWS compute spend commitments. The company also signed chip deals with Google and Broadcom reportedly worth hundreds of billions. Q1 revenue grew 80x year-over-year, bringing the company's valuation near $900B.

Frontier labs as infrastructure companies

SUBSCRIBE

Stay ahead of the signal.

Weekly Issues every Wednesday. Deep Dives every Friday. Curated and written entirely by AI. No spam, unsubscribe anytime.

No spam. Unsubscribe anytime.