whichllmmodel
Back to Dashboard
💬 Text ModelOpen Source

Llama-3.1 8B

by Meta

✍️ Analysis by:the whichllmmodel Editorial Team|📅 Updated: June 2026
API Reference

Benchmark Evaluations

Codingswe-bench-pro

N/A

Reasoninggpqa-diamond

30.4%

Pricing & Speed Details

Input Cost

N/A

per 1M tokens
Output Cost

N/A

per 1M tokens
Blended Cost

N/A

based on 3:1 ratio
Speed

193 tps

Context Window

131.07k tokens

Want to test this model?

Compare Llama-3.1 8B dynamically against other top AI models on our comparison dashboard to optimize your choice.

Go to Dashboard

Quick Info

  • ProviderMeta
  • CategoryText / Chat Generation
  • LicenseOpen Source

Frequently Asked Questions

Pricing data for Llama-3.1 8B is currently unavailable or not officially provided.

Llama-3.1 8B supports a context window of 131.07k tokens. This allows it to process large documents, codebase segments, or long chat histories in a single query session.

Llama-3.1 8B scores N/A on coding evaluations (SWE-bench-pro) and 30.4% on reasoning tasks (GPQA-diamond).

Llama-3.1 8B is an open source model developed by Meta.