whichllmmodel
Back to Dashboard
💬 Text ModelProprietary

Gemini 3.1 Flash-Lite

by Google

✍️ Analysis by:the whichllmmodel Editorial Team|📅 Updated: June 2026
API Reference

Benchmark Evaluations

Codingswe-bench-pro

N/A

Reasoninggpqa-diamond

86.9%

Pricing & Speed Details

Input Cost

$0.25

per 1M tokens
Output Cost

$1.50

per 1M tokens
Blended Cost

$0.56

based on 3:1 ratio
Speed

205 tps

Context Window

1M tokens

Want to test this model?

Compare Gemini 3.1 Flash-Lite dynamically against other top AI models on our comparison dashboard to optimize your choice.

Go to Dashboard

Quick Info

  • ProviderGoogle
  • CategoryText / Chat Generation
  • LicenseProprietary

Frequently Asked Questions

Gemini 3.1 Flash-Lite has a blended cost of $0.56 per 1 million tokens (calculated using a standard 3:1 input-to-output token ratio). Its specific pricing is $0.25 per 1M input tokens and $1.50 per 1M output tokens.

Gemini 3.1 Flash-Lite supports a context window of 1M tokens. This allows it to process large documents, codebase segments, or long chat histories in a single query session.

Gemini 3.1 Flash-Lite scores N/A on coding evaluations (SWE-bench-pro) and 86.9% on reasoning tasks (GPQA-diamond).

Gemini 3.1 Flash-Lite is an proprietary model developed by Google.