Question 1

How does the 200,000-token context window work in Claude Code?

Accepted Answer

The context window is the working memory that Claude Code uses for each conversation. It contains all messages, files read, and tool results accumulated during your session. Think of this window as a fixed budget of 200,000 tokens. Each action consumes part of this budget: a 500-line file repres...

Question 2

How do I know how many tokens are left in my session?

Accepted Answer

Run the /cost command in Claude Code to display your consumption in real time. This command shows the number of tokens used and the estimated session cost.

    # Display token consumption and cost
    $ claude
    > /cost

The context indicator also appears in the Claude Code status bar. When it exceeds 80%, a ...

Question 3

What strategies help optimize context daily?

Accepted Answer

Adopt three key practices: limit files read, break down your tasks, and use targeted instructions in CLAUDE.md. These three levers reduce token consumption by 40% on average. Specify which files to read rather than letting the agent explore the entire project. A prompt like "Read only src/auth/lo...

Question 4

How does Plan mode save tokens?

Accepted Answer

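Plan mode saves tokens by keeping Claude Code in read-only analysis: it proposes a plan without editing files or running commands, so no tokens are spent on actions you have not approved. The plan itself is still generated output, so Plan mode is not free of output tokens. Activate it with the Shift+Tab shortcut or the /plan command so that Claude Code analyzes without acting. In Plan mode, the agent reads your codebase, proposes a strategy, and waits for your validation before executing an...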

Question 5

How does automatic compaction work in Claude Code?

Accepted Answer

Automatic compaction triggers when context reaches approximately 95% of the 200,000-token window. Claude Code then summarizes previous exchanges to free up space while preserving essential information. Compaction is not a complete loss of memory: the agent retains a structured summary of deci...
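As a rough illustration of the threshold arithmetic, the sketch below computes how many tokens remain before automatic compaction would trigger, assuming the 200,000-token window and the approximate 95% threshold quoted above. The function name and constants are illustrative and are not part of Claude Code.

```python
CONTEXT_WINDOW = 200_000   # window size quoted in this FAQ
COMPACTION_PERCENT = 95    # approximate threshold quoted in this FAQ


def tokens_before_compaction(tokens_used: int) -> int:
    """Return the tokens left before the ~95% threshold (0 if already past it)."""
    threshold = CONTEXT_WINDOW * COMPACTION_PERCENT // 100  # 190,000 tokens
    return max(threshold - tokens_used, 0)


# Hypothetical session that has already used 150,000 tokens.
print(tokens_before_compaction(150_000))  # prints 40000
```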

Question 6

What are PreCompact hooks and how do you configure them?

Accepted Answer

PreCompact hooks are shell scripts executed automatically just before each compaction. Configure them in your .claude/settings.json file to save the critical state of your session. A typical PreCompact hook saves the current diff, git state, or a custom summary in a temporary file that you can re...

Question 7

How do you use multiple sessions to scale horizontally?

Accepted Answer

Launch multiple Claude Code instances in parallel, each dedicated to a specific task, to multiply your processing capacity without saturating a single context window. Horizontal scaling involves distributing work across multiple independent sessions rather than loading everything into a single 20...

Question 8

Can you recover context lost after compaction?

Accepted Answer

No, tokens removed during compaction are not directly recoverable. Use the /resume command and PreCompact hooks to mitigate this limitation. Compaction is a destructive process: the summary replaces the original exchanges. However, three mechanisms protect you. The CLAUDE.md file persists across ...

Question 9

Which files consume the most tokens, and how do you identify them?

Accepted Answer

Generated files (lock files, bundles, maps) often consume more than 50,000 tokens each. Add them to your .claudeignore to automatically exclude them from context. A typical package-lock.json file represents 30,000 to 80,000 tokens. A compiled bundle.js file can exceed 100,000 tokens - half of you...

Question 10

How do you combine CLAUDE.md and context management for productive sessions?

Accepted Answer

Write a concise CLAUDE.md file (under 200 lines) that automatically loads critical conventions without wasting tokens. This file is Claude Code's persistent memory across sessions. CLAUDE.md is loaded at the start of every session and consumes between 500 and 2,000 tokens depending on its size. A...

Question 11

Should you prefer one long session or several short sessions?

Accepted Answer

Prefer short, focused sessions of 30 to 45 minutes to maintain a clean context and precise responses. Long sessions accumulate noise in the context. A 2-hour session often goes through 2 to 3 compactions, each of which further dilutes summary quality. Conversely, 30-minute sessions generally stay under ...

Question 12

How does MCP interact with the context window?

Accepted Answer

Each call to an MCP (Model Context Protocol) server injects its response into the context window. Monitor MCP tools that return large volumes of data to avoid saturating your token budget. MCP allows Claude Code to communicate with external services: databases, APIs, remote file systems. Each MCP...

Question 13

Is there a cost difference between input and output tokens?

Accepted Answer

Yes, output tokens cost 5 times as much as input tokens with Claude Opus 4 (2026 version). Optimize your prompts to reduce the length of generated responses. Anthropic pricing for Claude Opus 4 is $15 per million input tokens and $75 per million output tokens. For Claude Sonnet 4.6, pricing is $3 ...
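To make the 5x ratio concrete, here is a minimal sketch that estimates a session's cost from its token counts, using only the Claude Opus 4 prices quoted above. The function name and the example token counts are hypothetical.

```python
# Prices quoted above for Claude Opus 4, in USD per million tokens.
INPUT_PRICE_PER_MTOK = 15.0
OUTPUT_PRICE_PER_MTOK = 75.0


def session_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost in USD of a session from its token counts."""
    input_cost = input_tokens / 1_000_000 * INPUT_PRICE_PER_MTOK
    output_cost = output_tokens / 1_000_000 * OUTPUT_PRICE_PER_MTOK
    return input_cost + output_cost


# Hypothetical session: 100,000 input tokens and 10,000 output tokens.
print(f"${session_cost(100_000, 10_000):.2f}")  # prints $2.25
```

In this example the 10,000 output tokens cost $0.75, half as much as the 100,000 input tokens, which is why shorter generated responses have a visible effect on the bill.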

Question 14

How do you debug a session where the context seems "polluted"?

Accepted Answer

Start a clean session when agent responses lose relevance. Note that claude --resume reopens an earlier conversation rather than starting fresh, so on its own it does not clear a polluted context. A polluted context manifests as hallucinations or repetitions. Three signals indicate a polluted context: the agent repeats already-given information, mixes files from different modules, or app...

Element	Average consumption	Share of context
Source file (500 lines)	4,000 tokens	2%
Bash command result	1,500 tokens	0.75%
Average user message	200 tokens	0.1%
Agent response	1,500 tokens	0.75%
CLAUDE.md file loaded	800 tokens	0.4%
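
The "Share of context" column above is simply each average consumption divided by the 200,000-token window. A minimal sketch of that arithmetic, with the figures copied from the table and an illustrative function name:

```python
CONTEXT_WINDOW = 200_000  # window size quoted in this FAQ

# Average consumption figures taken from the table above.
AVERAGE_TOKENS = {
    "Source file (500 lines)": 4_000,
    "Bash command result": 1_500,
    "Average user message": 200,
    "Agent response": 1_500,
    "CLAUDE.md file loaded": 800,
}


def share_of_context(tokens: int) -> float:
    """Return the percentage of the window that `tokens` occupies."""
    return tokens / CONTEXT_WINDOW * 100


for element, tokens in AVERAGE_TOKENS.items():
    print(f"{element}: {share_of_context(tokens):.2f}%")
```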

Strategy	Token savings	Difficulty
Target files explicitly	30-50%	Easy
Break into sub-tasks	20-40%	Medium
Use Plan mode	40-60%	Easy
Configure CLAUDE.md	10-20%	Easy
Use `/compact` manually	50-70%	Easy

Hook	Objective	Recommended timeout
`git diff --stat`	Save change summary	5,000 ms
`git stash list`	List active stashes	3,000 ms
Custom summary script	Export critical context	10,000 ms
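
A hook in the spirit of the first row could be sketched as a small script that runs a command and writes its output to a file for later review. This is only a sketch under assumptions: the `save_command_output` helper and the output file name are hypothetical, and how the script is registered as a PreCompact hook in .claude/settings.json is not shown. The timeout is expressed in seconds because that is what Python's `subprocess.run` expects (5,000 ms is 5 seconds).

```python
import subprocess
import tempfile
from pathlib import Path


def save_command_output(
    command: list[str], output_path: Path, timeout_s: float = 5.0
) -> bool:
    """Run `command` and write its stdout to `output_path`.

    Returns True on success, False if the command is missing,
    exits with an error, or exceeds the timeout.
    """
    try:
        result = subprocess.run(
            command, capture_output=True, text=True, timeout=timeout_s, check=True
        )
    except (subprocess.SubprocessError, OSError):
        return False
    output_path.write_text(result.stdout, encoding="utf-8")
    return True


if __name__ == "__main__":
    # Hypothetical location for the saved state.
    target = Path(tempfile.gettempdir()) / "precompact-diff.txt"
    saved = save_command_output(["git", "diff", "--stat"], target, timeout_s=5.0)
    print("saved" if saved else "nothing saved")
```

Returning a boolean instead of raising keeps a failing command (for example, running outside a git repository) from interrupting whatever invoked the script.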

File type	Average tokens	Recommended action
package-lock.json	40,000	.claudeignore
bundle.js	80,000+	.claudeignore
Source file (200 lines)	1,600	Read if needed
Test file (150 lines)	1,200	Read if needed
.env file	100	Never expose
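
The source-file figures in these tables work out to about 8 tokens per line (500 lines for 4,000 tokens, 200 lines for 1,600, 150 lines for 1,200). Assuming that ratio as a rough heuristic, the sketch below estimates a file's token weight from its line count and flags files heavy enough to be worth excluding. The function names and the 30,000-token cut-off are hypothetical, and real token counts depend on the file's content.

```python
from pathlib import Path

TOKENS_PER_LINE = 8           # ratio implied by the tables; a rough heuristic
EXCLUSION_THRESHOLD = 30_000  # hypothetical cut-off, in estimated tokens


def estimate_tokens(line_count: int) -> int:
    """Estimate the token count of a file from its number of lines."""
    return line_count * TOKENS_PER_LINE


def count_lines(path: Path) -> int:
    """Count the lines of a text file without loading it all into memory."""
    with path.open("r", encoding="utf-8", errors="replace") as handle:
        return sum(1 for _ in handle)


def should_exclude(line_count: int) -> bool:
    """Return True when the estimate reaches the exclusion threshold."""
    return estimate_tokens(line_count) >= EXCLUSION_THRESHOLD


print(estimate_tokens(200))   # prints 1600
print(should_exclude(5_000))  # prints True
```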

Session duration	Tokens consumed	Compactions	Context quality
15 min	15,000-30,000	0	Excellent
30 min	40,000-80,000	0-1	Good
1 h	80,000-150,000	1-2	Average
2 h+	150,000+	2-3+	Degraded

Context Management - FAQ

TL;DR