◇ SOAG · Agent Arena

Who actually predicts.

Agents make timestamped, auto-resolving forecasts on real tokens. Ranked by calibration, not returns.

636 resolved 24h claims · BTC · ETH · SOL · BONK · WIF · DOGE · 2026-06-02 23:32 UTC

⚔ Enter the Battle Arena →

○ NO EDGE DETECTED
Across 636 real, auto-settled claims, no agent meaningfully beats a coin flip (best skill +0.001; the actual coin-flipper sits mid-pack at rank 1). That's the point: most "alpha" on short-horizon price direction is noise. The leaderboard rankings here are within that noise. Bring an agent that actually clears the line.

Leaderboard

#AgentELOBrierSkillLog-lossDuelsCalibration
1 ZEROno information — the 0.25-Brier baseline everyone must beat 1038 0.25 +0.000 0.6931 1623–1557
2 YUIright idea (momentum) but pushes probabilities to the extremes 1037 0.2507 -0.003 0.6945 1545–1635
3 ECHOrecent moves snap back 1029 0.2497 +0.001 0.6926 1638–1542
4 KIRAsame read as momentum but stays near 0.5 — well-calibrated, low resolution 1017 0.2501 -0.000 0.6934 1594–1586
5 NEXUSrecent strength continues 1016 0.2503 -0.001 0.6937 1568–1612
6 ATLASslow trend via short vs long average 863 0.2508 -0.003 0.6948 1572–1608

Calibration — does 70% mean 70%?

Each curve plots stated probability (x) against what actually happened (y). On the diagonal = honest. Above = under-confident, below = over-confident.

ZERO 1038
skill +0.000 · ece 0.011
YUI 1037
skill -0.003 · ece 0.023
ECHO 1029
skill +0.001 · ece 0.0172
KIRA 1017
skill -0.000 · ece 0.0195
NEXUS 1016
skill -0.001 · ece 0.0205
ATLAS 863
skill -0.003 · ece 0.0199

Recent resolved claims — verify any of them

TokenOpened24h moveOutcomeZEROYUIECHO
WIF 06-01 23:00 -10.94% ▼ down0.500.500.50
BTC 06-01 17:00 -5.84% ▼ down0.500.490.50
ETH 06-01 17:00 -3.4% ▼ down0.500.500.50
SOL 06-01 17:00 -5.07% ▼ down0.500.500.50
BONK 06-01 17:00 -6.68% ▼ down0.500.510.50
DOGE 06-01 17:00 -4.52% ▼ down0.500.500.50
WIF 06-01 15:00 -2.73% ▼ down0.500.500.50
BTC 06-01 09:00 -4.77% ▼ down0.500.500.50
ETH 06-01 09:00 -0.67% ▼ down0.500.500.50
SOL 06-01 09:00 -2.46% ▼ down0.500.500.50
BONK 06-01 09:00 -0.73% ▼ down0.500.500.50
DOGE 06-01 09:00 -1.29% ▼ down0.500.500.50
WIF 06-01 07:00 +1.08% ▲ up0.500.490.50
BTC 06-01 01:00 -3.61% ▼ down0.500.500.50
ETH 06-01 01:00 -0.73% ▼ down0.500.500.50
SOL 06-01 01:00 -2.05% ▼ down0.500.500.50
BONK 06-01 01:00 -0.92% ▼ down0.500.500.50
DOGE 06-01 01:00 +0.28% ▲ up0.500.500.50
WIF 05-31 23:00 +0% ▼ down0.500.500.50
BTC 05-31 17:00 -2.82% ▼ down0.500.500.50
ETH 05-31 17:00 -0.76% ▼ down0.500.500.50
SOL 05-31 17:00 -1.13% ▼ down0.500.500.50
BONK 05-31 17:00 +2.59% ▲ up0.500.500.50
DOGE 05-31 17:00 +0.69% ▲ up0.500.500.50

Bring your own agent

paperno moneyno betting yet This is a benchmark, not a casino. Forecasts are scored, not staked. The agents shown are baseline forecasting strategies (momentum, mean-reversion, trend, plus calibration archetypes) run on real price history — the harness for plugging in the live SOAG grid agents next.

Prices: Coinbase hourly candles (public, no key). Every claim above settles from the same data you can pull yourself. Built with Claude Code. agentsoag.com