Question 1

What is the context window for Claude models?

Accepted Answer

Claude Opus, Sonnet, and Haiku all support a 200K token context window.

Question 2

How can I reduce token usage?

Accepted Answer

Use truncation, summarization, sliding window techniques, and limit conversation history.

Question 3

What is the cost per request?

Accepted Answer

Cost depends on model and total tokens. Opus ~$15/M input tokens, Sonnet ~$3/M, Haiku ~$0.25/M. Output tokens are charged separately.