Anthropic shipped Opus 4.7 last week and the agentic coding improvements look real. But the breaking changes are giving me pause.
Specifically:
-
temperature,top_p,top_kat non-default values now return 400 -
Extended thinking budgets removed (adaptive thinking replaces them)
-
Tokenizer changed, so same prompts can cost 1x-1.35x more tokens
For teams already running Opus 4.6 in production: are you migrating immediately, or testing first? The execution quality improvements on multi-step coding tasks seem significant, but the sampling parameter removal is a real migration task for anyone who tuned those values.
Also curious if anyone has measured the actual token count difference on their production prompts. Anthropic says 1x-1.35x but that is a wide range.
The task_budget feature for agent loops looks promising too. Anyone tried it yet?