Which model is cheaper: Claude Sonnet 4.6 or Grok 4.20?

Anthropic Claude Sonnet 4.6 vs xAI Grok 4.20

Analysis by: the whichllmmodel Editorial Team | Updated: June 2026

Our Take

The SWE-bench Pro coding score for Claude Sonnet 4.6 is still pending verification, so the two models cannot yet be compared head to head on coding. We recommend testing Claude Sonnet 4.6 and Grok 4.20 directly in their respective provider playgrounds to see which one best fits your prompt styles.

Model Specs

Claude Sonnet 4.6

Benchmarks & Scores

Coding (swe-bench-pro)

N/A

complex codebases, multi-file repositories, and architectural planning

Reasoning (gpqa-diamond)

79.9%

graduate-level science QA

Cost & Context

Cost (per 1M tokens)

$6.00 blended | Input: $3.00 | Output: $15.00

Context Window

1.05M tokens

Model Specs

Grok 4.20

Benchmarks & Scores

Coding (swe-bench-pro)

51.8%

complex codebases, multi-file repositories, and architectural planning

Reasoning (gpqa-diamond) | Winner (+10.1 percentage points)

90%

graduate-level science QA

Cost & Context

Cost (per 1M tokens) | 2.0x cheaper

$3.00 blended | Input: $2.00 | Output: $6.00

Context Window

1.05M tokens

Frequently Asked Questions about Claude Sonnet 4.6 vs Grok 4.20

Which model is cheaper: Claude Sonnet 4.6 or Grok 4.20?

Grok 4.20 is cheaper. Its blended cost is $3.00 per 1M tokens (input $2.00, output $6.00), half of Claude Sonnet 4.6's $6.00 per 1M tokens (input $3.00, output $15.00).
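
The page does not state how the blended cost is derived. The listed figures are consistent with weighting input and output prices 3:1, so the sketch below assumes that ratio. The function name `blended_cost` and the weight parameters are illustrative, not taken from the site's methodology.

```python
def blended_cost(input_price: float, output_price: float,
                 input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average price per 1M tokens.

    The default 3:1 input-to-output weighting is an assumption that
    happens to reproduce the blended figures listed on this page.
    """
    total_weight = input_weight + output_weight
    return (input_price * input_weight + output_price * output_weight) / total_weight


# (input, output) prices in USD per 1M tokens, as listed in the spec cards.
PRICES = {
    "Claude Sonnet 4.6": (3.00, 15.00),
    "Grok 4.20": (2.00, 6.00),
}

blended = {name: blended_cost(inp, out) for name, (inp, out) in PRICES.items()}
ratio = blended["Claude Sonnet 4.6"] / blended["Grok 4.20"]

for name, cost in blended.items():
    print(f"{name}: ${cost:.2f} per 1M tokens (blended)")
print(f"Claude Sonnet 4.6 costs {ratio:.1f}x as much as Grok 4.20")
```

The ratio depends on the workload mix. For an output-heavy workload with equal input and output volume, calling `blended_cost(inp, out, 1.0, 1.0)` gives $9.00 versus $4.00, so the gap widens to 2.25x.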
