💬 Text ModelOpen Source

Llama-3.1 8B

by Meta

✍️ Analysis by:the whichllmmodel Editorial Team|📅 Updated: June 2026

API Reference

Benchmark Evaluations

Codingswe-bench-pro

N/A

Reasoninggpqa-diamond

30.4%

Pricing & Speed Details

Input Cost

N/A

per 1M tokens

Output Cost

N/A

per 1M tokens

Blended Cost

N/A

based on 3:1 ratio

Speed

193 tps

Context Window

131.07k tokens

Want to test this model?

Compare Llama-3.1 8B dynamically against other top AI models on our comparison dashboard to optimize your choice.

Go to Dashboard

Quick Info

ProviderMeta
CategoryText / Chat Generation
LicenseOpen Source

Frequently Asked Questions

Pricing data for Llama-3.1 8B is currently unavailable or not officially provided.

Llama-3.1 8B supports a context window of 131.07k tokens. This allows it to process large documents, codebase segments, or long chat histories in a single query session.

Llama-3.1 8B scores N/A on coding evaluations (SWE-bench-pro) and 30.4% on reasoning tasks (GPQA-diamond).

Llama-3.1 8B is an open source model developed by Meta.

Popular Head-to-Head Comparisons

vs OpenAILlama-3.1 8B vs GPT-5.5

vs OpenAILlama-3.1 8B vs GPT-5.5 Pro

vs OpenAILlama-3.1 8B vs GPT-5.4

vs OpenAILlama-3.1 8B vs GPT-5.4 nano

vs OpenAILlama-3.1 8B vs GPT-5 mini

vs OpenAILlama-3.1 8B vs GPT-5.4 Pro