1.2M tokens
105 minutes

vs

200k tokens
20 minutes

Spec Theater in Action
02
A Practical Test

Chase Levin compared GSD, Superpowers, and vanilla Claude Code.

Heavy orchestration layers versus the lightest baseline system.

03
The Outcome

The lightest setup was faster, cheaper, and much closer in quality than expected.

The gap in output was smaller than the gap in time and token cost.

More structure does not automatically mean better results.

05
Spec Theater

When agents disappoint, teams often react with more of everything.

More specs. More phases. More meta-prompts. More orchestration.

06
The Hidden Cost

Structure costs time, tokens, attention, and feedback speed.

If it does not clearly earn that cost, it stops being system design and starts becoming theater.

07
Anthropic's Hint

Poor agent results often come from too much steering, not too little.

Too many instructions. Too many constraints. Too much early railroading.

The agent gets less orientation and less room to solve.

German has a word for the right kind of structure

Leitplanke

/ˈlaɪtˌplaŋkə/

Guardrail — structure that guides without dictating every move.

10
The Deeper Point

AI does not decide. It opens the space.

The real leverage is seeing guidance quickly, validating outputs quickly, and steering quickly.

Adaptive systems > permanent ceremony

Save this for your next agent workflow discussion.

Engineering, AI, and Technical Leadership. Fresh perspectives, shipped on a schedule.