Tagged: LLM-Infrastructure

1 post

Claude Extended Thinking: How budget_tokens Is Billed (Real Costs)

March 27, 2026 · 14 min read · blog
Extended thinking is billed as output tokens. Covers what budget_tokens controls, the real cost per request, and a Node.js setup with a copyable cost table.