What is the difference between model tiers and effort levels in Claude Code?

Model tiers determine the underlying capability of the model being used. Haiku is the fastest and least capable, Sonnet is the general-purpose default, and Opus is the most capable for complex reasoning. Effort levels are a separate control that governs how many internal reasoning tokens the model spends before producing a response. Low effort is fast and minimal; max effort removes the reasoning token ceiling entirely. The two settings are independent, so you can run Opus at low effort or Sonnet at high effort depending on the task.
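To make the independence of the two settings concrete, here is a minimal sketch that treats a session as a pair of separately validated choices. The names `SessionChoice`, `MODEL_TIERS`, `EFFORT_LEVELS`, and `all_choices` are hypothetical and exist only for illustration; they are not part of Claude Code.

```python
from dataclasses import dataclass
from itertools import product

# The three tiers and four effort levels named in this article.
MODEL_TIERS = ("haiku", "sonnet", "opus")
EFFORT_LEVELS = ("low", "medium", "high", "max")


@dataclass(frozen=True)
class SessionChoice:
    """Hypothetical container: one model tier plus one effort level."""
    model: str
    effort: str

    def __post_init__(self):
        # Each field is validated on its own; neither constrains the other.
        if self.model not in MODEL_TIERS:
            raise ValueError(f"unknown model tier: {self.model!r}")
        if self.effort not in EFFORT_LEVELS:
            raise ValueError(f"unknown effort level: {self.effort!r}")


def all_choices():
    """Every tier can be paired with every effort level."""
    return [SessionChoice(m, e) for m, e in product(MODEL_TIERS, EFFORT_LEVELS)]


if __name__ == "__main__":
    for choice in all_choices():
        print(choice.model, choice.effort)
```

Because the two fields are validated separately, pairings such as Opus at low effort or Sonnet at high effort are as valid as any other.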

Does the opusplan alias give you the 1 million token context window in Claude Code?

No. According to open bug reports on the Claude Code GitHub repository, selecting opusplan shows a 200K context window even on accounts where selecting Opus directly provides the full 1 million token window. The opusplan alias uses Opus during planning and switches to Sonnet for execution, which is convenient, but per those reports it does not provide the expanded context. If you need the full 1M window for a large codebase, select Opus explicitly rather than using the opusplan alias.
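The decision described above can be sketched as a small helper. This is only an illustration under stated assumptions: the function name `choose_model_alias` and its two boolean parameters are hypothetical, the fallback to Sonnet reflects its description here as the general-purpose default, and the opusplan behavior is the one described in the bug reports, which may change.

```python
def choose_model_alias(needs_1m_context: bool, wants_plan_split: bool) -> str:
    """Pick a model alias following the guidance in this article.

    Hypothetical helper: it returns an alias string and launches nothing.
    """
    if needs_1m_context:
        # Per the reported behavior, opusplan shows only a 200K window,
        # so the large-context case selects Opus explicitly.
        return "opus"
    if wants_plan_split:
        # Opus for planning, Sonnet for execution.
        return "opusplan"
    # Assumed fallback: Sonnet as the general-purpose default.
    return "sonnet"


if __name__ == "__main__":
    print(choose_model_alias(needs_1m_context=True, wants_plan_split=True))
```

The context requirement is checked first, so a request for both the 1M window and the plan/execute split resolves to Opus.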

Does adding ultrathink to CLAUDE.md make Claude always use high effort?

No. The ultrathink keyword raises effort to high only for the single response in which it appears; the session then reverts to its configured effort level. Placing ultrathink in CLAUDE.md means the model applies high effort once, when it processes that file at session start, not persistently throughout the session. To set a persistent effort level, pass the --effort flag at launch, set it in settings.json (for low through high), or set the CLAUDE_CODE_EFFORT_LEVEL environment variable (for max).
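As a rough sketch of the persistent options above, the helper below builds a command line and environment for a given effort level. Several things are assumptions rather than documented behavior: the function name `launch_plan` is hypothetical, the flag is assumed to take its value as a separate argument (`--effort high`), and the environment variable is assumed to accept the literal value `max`. The settings.json route is left out because the key name is not given here.

```python
EFFORT_LEVELS = ("low", "medium", "high", "max")


def launch_plan(effort, base_env=None):
    """Return (argv, env) for starting a session at a persistent effort level.

    Hypothetical helper: it only builds the values and runs nothing.
    """
    if effort not in EFFORT_LEVELS:
        raise ValueError(f"unknown effort level: {effort!r}")

    env = dict(base_env or {})  # copy so the caller's mapping is untouched
    if effort == "max":
        # Assumption: max is requested through the environment variable.
        env["CLAUDE_CODE_EFFORT_LEVEL"] = "max"
        return ["claude"], env

    # Assumption: low through high are passed with the --effort flag.
    return ["claude", "--effort", effort], env


if __name__ == "__main__":
    argv, env = launch_plan("high")
    print(argv, env)
```

The returned pair could be handed to `subprocess.run(argv, env={**os.environ, **env})`, though that step is omitted so the sketch stays free of side effects.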

Engineering · 7 min read · April 19, 2026

Claude Code model tiers and effort levels, explained plainly

Choosing the wrong model or effort level in Claude Code wastes tokens silently. Here is what each setting actually controls.

Source: hackernoon · Oleg Efimov

Claude Code offers three model tiers and four effort levels that independently control reasoning depth and cost, but most developers never adjust either.

— Haiku suits fast lookups and mechanical tasks; Sonnet handles most daily coding work.
— Opus unlocks 1M token context on Max, Team, and Enterprise plans automatically.
— Plan mode enforces read-only access at the tool level, not just as a soft guideline.
— The opusplan alias switches from Opus to Sonnet at execution time but loses the 1M context window.
— Effort levels (low, medium, high, max) control internal reasoning token budget before output.
— Anthropic changed Opus 4.6 default effort from high to medium in v2.1.68, causing community friction.
— Subagents inherit the parent session model and effort unless explicitly overridden in frontmatter.
— Higher effort can cause overthinking on simple tasks, producing verbose output where directness was needed.

#claude #llm #coding #tokens #agents #productivity
