gotnerfed

Forecasts & accuracy

Probabilistic 30/60/90-day risk forecasts per AI vendor. We publish them here. We Brier-score them against subsequent reality. We publish that too.

A model that forecasts and never grades itself is a guru. A model that forecasts and grades itself in public is a forecaster. We are the latter.

Live forecasts (sorted by 30-day risk)

VendorBand30d60d90dMost likelySeverity
GitHub Copiloton watch73%93%98%billing-model-shiftmajor
ChatGPTon watch67%89%96%billing-model-shiftmajor
Cursorclean59%83%93%billing-model-shiftcritical
Claude (Anthropic)on watch31%52%67%feature-gatedmajor
Claude Codeshrinking25%44%58%tier-removedcritical

Public scoreboard

20
forecasts published
0
forecasts resolved
awaiting outcomes
brier score
needs ≥10 resolutions
calibration
needs ≥30 resolutions

We've just published our first 20 forecasts (one per tracked vendor). As prediction windows close, we'll grade each forecast against what actually happened. The board updates automatically. No cherry-picking. We will publish the wrong calls in red.

How forecasts work