# How to use modelparameters.dev

[modelparameters.dev](https://modelparams.dev) is an open, community-maintained catalog of LLM model
parameters. Each entry shows the knobs you can turn — type, default, range, and the
conditions that gate it.

The same model accessed via an **API key** and via a **subscription** usually exposes a
different set of parameters. We list both as separate entries so the data stays honest.

## Catalog API

The full catalog is static JSON, CORS-enabled, served from the edge:

```bash
curl https://modelparams.dev/api/v1/models.json
```

Each entry is keyed by `provider/model` for API-key variants; subscription variants
append `-subscription`.

## Single model

```bash
curl https://modelparams.dev/api/v1/models/anthropic/claude-opus-4-7.json
curl https://modelparams.dev/api/v1/models/anthropic/claude-opus-4-7-subscription.json
```

## JSON Schema

Every entry validates against a JSON Schema you can use in your editor or pipeline:

```bash
curl https://modelparams.dev/api/v1/schema.json
```

Add this header to any YAML you author for autocomplete in VS Code:

```yaml
# yaml-language-server: $schema=https://modelparams.dev/api/v1/schema.json
```

## Logos

Provider logos are at `/assets/logos/{provider}.svg`. They use `currentColor`, so they
inherit your text color:

```bash
curl https://modelparams.dev/assets/logos/anthropic.svg
```

## Contribute

The data lives in YAML under `models/{provider}/{model}-{auth}.yaml` in the
[GitHub repo](https://github.com/mnfst/modelparameters.dev). Open a PR; CI validates against the schema and rebuilds.

## For agents

- Machine-readable site overview: https://modelparams.dev/llms.txt
- Full usage guide plus every parameter inline: https://modelparams.dev/llms-full.txt
- When your browser supports it, this page registers in-browser **WebMCP** tools on
  `navigator.modelContext`: `search_models`, `get_model_parameters`, `list_providers`,
  `list_parameters`, and `get_usage_guide`.

---

# Full catalog

78 models, grouped by provider. Each line reads: `path` (type,
group) — description, then defaults, ranges, allowed values, and applicability conditions
in brackets.

## OpenAI

### openai/chatgpt-4o-latest

OpenAI Chatgpt 4o Latest (API key) · JSON: https://modelparams.dev/api/v1/models/openai/chatgpt-4o-latest.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 2, step 0.1]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01]

### openai/gpt-3.5-turbo

OpenAI Gpt 3.5 Turbo (API key) · JSON: https://modelparams.dev/api/v1/models/openai/gpt-3.5-turbo.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 2, step 0.1]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01]

### openai/gpt-4-turbo

OpenAI Gpt 4 Turbo (API key) · JSON: https://modelparams.dev/api/v1/models/openai/gpt-4-turbo.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 2, step 0.1]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01]

### openai/gpt-4-turbo-2024-04-09

OpenAI Gpt 4 Turbo 2024.04 09 (API key) · JSON: https://modelparams.dev/api/v1/models/openai/gpt-4-turbo-2024-04-09.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 2, step 0.1]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01]

### openai/gpt-4.1

OpenAI Gpt 4.1 (API key) · JSON: https://modelparams.dev/api/v1/models/openai/gpt-4.1.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 2, step 0.1]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01]

### openai/gpt-4.1-mini

OpenAI Gpt 4.1 Mini (API key) · JSON: https://modelparams.dev/api/v1/models/openai/gpt-4.1-mini.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 2, step 0.1]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01]

### openai/gpt-4.1-nano

OpenAI Gpt 4.1 Nano (API key) · JSON: https://modelparams.dev/api/v1/models/openai/gpt-4.1-nano.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 2, step 0.1]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01]

### openai/gpt-4o

OpenAI GPT-4o (API key) · JSON: https://modelparams.dev/api/v1/models/openai/gpt-4o.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 2, step 0.1]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01]

### openai/gpt-4o-2024-11-20

OpenAI Gpt 4o 2024.11 20 (API key) · JSON: https://modelparams.dev/api/v1/models/openai/gpt-4o-2024-11-20.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 2, step 0.1]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01]

### openai/gpt-4o-mini

OpenAI GPT-4o mini (API key) · JSON: https://modelparams.dev/api/v1/models/openai/gpt-4o-mini.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 2, step 0.1]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01]

### openai/gpt-5

OpenAI Gpt 5 (API key) · JSON: https://modelparams.dev/api/v1/models/openai/gpt-5.json

- `max_completion_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 16]
- `reasoning_effort` (enum, Reasoning) — Controls how much reasoning the model should perform before producing an answer. [default: "medium"] [values: "minimal", "low", "medium", "high"]

### openai/gpt-5-chat-latest

OpenAI Gpt 5 Chat Latest (API key) · JSON: https://modelparams.dev/api/v1/models/openai/gpt-5-chat-latest.json

- `max_completion_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 16]

### openai/gpt-5-mini

OpenAI Gpt 5 Mini (API key) · JSON: https://modelparams.dev/api/v1/models/openai/gpt-5-mini.json

- `max_completion_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 16]
- `reasoning_effort` (enum, Reasoning) — Controls how much reasoning the model should perform before producing an answer. [default: "medium"] [values: "minimal", "low", "medium", "high"]

### openai/gpt-5-nano

OpenAI Gpt 5 Nano (API key) · JSON: https://modelparams.dev/api/v1/models/openai/gpt-5-nano.json

- `max_completion_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 16]
- `reasoning_effort` (enum, Reasoning) — Controls how much reasoning the model should perform before producing an answer. [default: "medium"] [values: "minimal", "low", "medium", "high"]

### openai/gpt-5.1

OpenAI Gpt 5.1 (API key) · JSON: https://modelparams.dev/api/v1/models/openai/gpt-5.1.json

- `max_completion_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 16]
- `reasoning_effort` (enum, Reasoning) — Controls how much reasoning the model should perform before producing an answer. [default: "none"] [values: "none", "low", "medium", "high"]

### openai/gpt-5.1-codex-max-subscription

OpenAI Gpt 5.1 Codex Max (Subscription) · JSON: https://modelparams.dev/api/v1/models/openai/gpt-5.1-codex-max-subscription.json

- `reasoning.effort` (enum, Reasoning) — Controls how much reasoning the model should perform before producing an answer. [default: "medium"] [values: "minimal", "low", "medium", "high", "xhigh"]
- `reasoning.summary` (enum, Reasoning) — Controls the level of reasoning summary returned with the response. [default: "auto"] [values: "auto", "concise", "detailed", "none"]
- `text.verbosity` (enum, Output) — Controls how concise or detailed the model's final text response should be. [default: "medium"] [values: "low", "medium", "high"]

### openai/gpt-5.1-codex-subscription

OpenAI Gpt 5.1 Codex (Subscription) · JSON: https://modelparams.dev/api/v1/models/openai/gpt-5.1-codex-subscription.json

- `reasoning.effort` (enum, Reasoning) — Controls how much reasoning the model should perform before producing an answer. [default: "medium"] [values: "minimal", "low", "medium", "high"]
- `reasoning.summary` (enum, Reasoning) — Controls the level of reasoning summary returned with the response. [default: "auto"] [values: "auto", "concise", "detailed", "none"]
- `text.verbosity` (enum, Output) — Controls how concise or detailed the model's final text response should be. [default: "medium"] [values: "low", "medium", "high"]

### openai/gpt-5.2

OpenAI Gpt 5.2 (API key) · JSON: https://modelparams.dev/api/v1/models/openai/gpt-5.2.json

- `max_completion_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 16]
- `reasoning_effort` (enum, Reasoning) — Controls how much reasoning the model should perform before producing an answer. [default: "medium"] [values: "none", "low", "medium", "high", "xhigh"]

### openai/gpt-5.2-codex-subscription

OpenAI Gpt 5.2 Codex (Subscription) · JSON: https://modelparams.dev/api/v1/models/openai/gpt-5.2-codex-subscription.json

- `reasoning.effort` (enum, Reasoning) — Controls how much reasoning the model should perform before producing an answer. [default: "medium"] [values: "minimal", "low", "medium", "high", "xhigh"]
- `reasoning.summary` (enum, Reasoning) — Controls the level of reasoning summary returned with the response. [default: "auto"] [values: "auto", "concise", "detailed", "none"]
- `text.verbosity` (enum, Output) — Controls how concise or detailed the model's final text response should be. [default: "medium"] [values: "low", "medium", "high"]

### openai/gpt-5.2-subscription

OpenAI Gpt 5.2 (Subscription) · JSON: https://modelparams.dev/api/v1/models/openai/gpt-5.2-subscription.json

- `reasoning.effort` (enum, Reasoning) — Controls how much reasoning the model should perform before producing an answer. [default: "medium"] [values: "minimal", "low", "medium", "high", "xhigh"]
- `reasoning.summary` (enum, Reasoning) — Controls the level of reasoning summary returned with the response. [default: "auto"] [values: "auto", "concise", "detailed", "none"]
- `text.verbosity` (enum, Output) — Controls how concise or detailed the model's final text response should be. [default: "medium"] [values: "low", "medium", "high"]

### openai/gpt-5.3-codex-spark-subscription

OpenAI Gpt 5.3 Codex Spark (Subscription) · JSON: https://modelparams.dev/api/v1/models/openai/gpt-5.3-codex-spark-subscription.json

- `reasoning.effort` (enum, Reasoning) — Controls how much reasoning the model should perform before producing an answer. [default: "medium"] [values: "minimal", "low", "medium", "high", "xhigh"]
- `reasoning.summary` (enum, Reasoning) — Controls the level of reasoning summary returned with the response. [default: "auto"] [values: "auto", "concise", "detailed", "none"]
- `text.verbosity` (enum, Output) — Controls how concise or detailed the model's final text response should be. [default: "medium"] [values: "low", "medium", "high"]

### openai/gpt-5.3-codex-subscription

OpenAI Gpt 5.3 Codex (Subscription) · JSON: https://modelparams.dev/api/v1/models/openai/gpt-5.3-codex-subscription.json

- `reasoning.effort` (enum, Reasoning) — Controls how much reasoning the model should perform before producing an answer. [default: "medium"] [values: "minimal", "low", "medium", "high", "xhigh"]
- `reasoning.summary` (enum, Reasoning) — Controls the level of reasoning summary returned with the response. [default: "auto"] [values: "auto", "concise", "detailed", "none"]
- `text.verbosity` (enum, Output) — Controls how concise or detailed the model's final text response should be. [default: "medium"] [values: "low", "medium", "high"]

### openai/gpt-5.4

OpenAI Gpt 5.4 (API key) · JSON: https://modelparams.dev/api/v1/models/openai/gpt-5.4.json

- `max_completion_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 16]
- `reasoning_effort` (enum, Reasoning) — Controls how much reasoning the model should perform before producing an answer. [default: "medium"] [values: "none", "low", "medium", "high", "xhigh"]

### openai/gpt-5.4-mini

OpenAI Gpt 5.4 Mini (API key) · JSON: https://modelparams.dev/api/v1/models/openai/gpt-5.4-mini.json

- `max_completion_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 16]
- `reasoning_effort` (enum, Reasoning) — Controls how much reasoning the model should perform before producing an answer. [default: "medium"] [values: "none", "low", "medium", "high", "xhigh"]

### openai/gpt-5.4-mini-subscription

OpenAI Gpt 5.4 Mini (Subscription) · JSON: https://modelparams.dev/api/v1/models/openai/gpt-5.4-mini-subscription.json

- `reasoning.effort` (enum, Reasoning) — Controls how much reasoning the model should perform before producing an answer. [default: "medium"] [values: "minimal", "low", "medium", "high", "xhigh"]
- `reasoning.summary` (enum, Reasoning) — Controls the level of reasoning summary returned with the response. [default: "auto"] [values: "auto", "concise", "detailed", "none"]
- `text.verbosity` (enum, Output) — Controls how concise or detailed the model's final text response should be. [default: "medium"] [values: "low", "medium", "high"]

### openai/gpt-5.4-subscription

OpenAI Gpt 5.4 (Subscription) · JSON: https://modelparams.dev/api/v1/models/openai/gpt-5.4-subscription.json

- `reasoning.effort` (enum, Reasoning) — Controls how much reasoning the model should perform before producing an answer. [default: "medium"] [values: "minimal", "low", "medium", "high", "xhigh"]
- `reasoning.summary` (enum, Reasoning) — Controls the level of reasoning summary returned with the response. [default: "auto"] [values: "auto", "concise", "detailed", "none"]
- `text.verbosity` (enum, Output) — Controls how concise or detailed the model's final text response should be. [default: "medium"] [values: "low", "medium", "high"]

### openai/gpt-5.5

OpenAI Gpt 5.5 (API key) · JSON: https://modelparams.dev/api/v1/models/openai/gpt-5.5.json

- `max_completion_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 16]
- `reasoning_effort` (enum, Reasoning) — Controls how much reasoning the model should perform before producing an answer. [default: "medium"] [values: "none", "low", "medium", "high", "xhigh"]

### openai/gpt-5.5-subscription

OpenAI Gpt 5.5 (Subscription) · JSON: https://modelparams.dev/api/v1/models/openai/gpt-5.5-subscription.json

- `reasoning.effort` (enum, Reasoning) — Controls how much reasoning the model should perform before producing an answer. [default: "medium"] [values: "minimal", "low", "medium", "high", "xhigh"]
- `reasoning.summary` (enum, Reasoning) — Controls the level of reasoning summary returned with the response. [default: "auto"] [values: "auto", "concise", "detailed", "none"]
- `text.verbosity` (enum, Output) — Controls how concise or detailed the model's final text response should be. [default: "medium"] [values: "low", "medium", "high"]

### openai/o1

OpenAI o1 (API key) · JSON: https://modelparams.dev/api/v1/models/openai/o1.json

- `max_completion_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 16]
- `reasoning_effort` (enum, Reasoning) — Controls how much reasoning the model should perform before producing an answer. [default: "medium"] [values: "low", "medium", "high", "xhigh"]

### openai/o1-mini

OpenAI o1-mini (API key) · JSON: https://modelparams.dev/api/v1/models/openai/o1-mini.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `reasoning_effort` (enum, Reasoning) — Controls how much reasoning the model should perform before producing an answer. [default: "medium"] [values: "minimal", "low", "medium", "high"]

### openai/o1-preview

OpenAI O1 Preview (API key) · JSON: https://modelparams.dev/api/v1/models/openai/o1-preview.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `reasoning_effort` (enum, Reasoning) — Controls how much reasoning the model should perform before producing an answer. [default: "medium"] [values: "minimal", "low", "medium", "high"]

### openai/o3

OpenAI o3 (API key) · JSON: https://modelparams.dev/api/v1/models/openai/o3.json

- `max_completion_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 16]
- `reasoning_effort` (enum, Reasoning) — Controls how much reasoning the model should perform before producing an answer. [default: "medium"] [values: "low", "medium", "high", "xhigh"]

### openai/o3-mini

OpenAI o3-mini (API key) · JSON: https://modelparams.dev/api/v1/models/openai/o3-mini.json

- `max_completion_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 16]
- `reasoning_effort` (enum, Reasoning) — Controls how much reasoning the model should perform before producing an answer. [default: "medium"] [values: "low", "medium", "high", "xhigh"]

### openai/o4-mini

OpenAI o4-mini (API key) · JSON: https://modelparams.dev/api/v1/models/openai/o4-mini.json

- `max_completion_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 16]
- `reasoning_effort` (enum, Reasoning) — Controls how much reasoning the model should perform before producing an answer. [default: "medium"] [values: "low", "medium", "high", "xhigh"]

## Anthropic

### anthropic/claude-3-5-haiku-20241022

Anthropic Claude 3.5 Haiku 20241022 (API key) · JSON: https://modelparams.dev/api/v1/models/anthropic/claude-3-5-haiku-20241022.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 1, step 0.1] [except when thinking.type ∈ {"adaptive", "enabled"}]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01] [except when thinking.type ∈ {"adaptive", "enabled"}; temperature ≠ 1]
- `top_k` (integer, Sampling) — Limits token sampling to the top K most likely next tokens. [default: 0] [range: min 0] [except when thinking.type ∈ {"adaptive", "enabled"}]

### anthropic/claude-3-5-haiku-latest

Anthropic Claude 3.5 Haiku Latest (API key) · JSON: https://modelparams.dev/api/v1/models/anthropic/claude-3-5-haiku-latest.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 1, step 0.1] [except when thinking.type ∈ {"adaptive", "enabled"}]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01] [except when thinking.type ∈ {"adaptive", "enabled"}; temperature ≠ 1]
- `top_k` (integer, Sampling) — Limits token sampling to the top K most likely next tokens. [default: 0] [range: min 0] [except when thinking.type ∈ {"adaptive", "enabled"}]

### anthropic/claude-3-5-sonnet-20241022

Anthropic Claude 3.5 Sonnet 20241022 (API key) · JSON: https://modelparams.dev/api/v1/models/anthropic/claude-3-5-sonnet-20241022.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 1, step 0.1] [except when thinking.type ∈ {"adaptive", "enabled"}]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01] [except when thinking.type ∈ {"adaptive", "enabled"}; temperature ≠ 1]
- `top_k` (integer, Sampling) — Limits token sampling to the top K most likely next tokens. [default: 0] [range: min 0] [except when thinking.type ∈ {"adaptive", "enabled"}]

### anthropic/claude-3-5-sonnet-latest

Anthropic Claude 3.5 Sonnet Latest (API key) · JSON: https://modelparams.dev/api/v1/models/anthropic/claude-3-5-sonnet-latest.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 1, step 0.1] [except when thinking.type ∈ {"adaptive", "enabled"}]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01] [except when thinking.type ∈ {"adaptive", "enabled"}; temperature ≠ 1]
- `top_k` (integer, Sampling) — Limits token sampling to the top K most likely next tokens. [default: 0] [range: min 0] [except when thinking.type ∈ {"adaptive", "enabled"}]

### anthropic/claude-3-7-sonnet-20250219

Anthropic Claude 3.7 Sonnet 20250219 (API key) · JSON: https://modelparams.dev/api/v1/models/anthropic/claude-3-7-sonnet-20250219.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 1, step 0.1] [except when thinking.type ∈ {"adaptive", "enabled"}]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01] [except when thinking.type ∈ {"adaptive", "enabled"}; temperature ≠ 1]
- `top_k` (integer, Sampling) — Limits token sampling to the top K most likely next tokens. [default: 0] [range: min 0] [except when thinking.type ∈ {"adaptive", "enabled"}]
- `thinking.type` (enum, Reasoning) — Controls the Anthropic thinking mode values supported by this model. [default: "disabled"] [values: "disabled", "enabled"]
- `thinking.budget_tokens` (integer, Reasoning) — Maximum token budget Anthropic may use for extended thinking before producing the final answer. [default: 4096] [range: min 1024] [only when thinking.type = "enabled"]

### anthropic/claude-3-7-sonnet-latest

Anthropic Claude 3.7 Sonnet Latest (API key) · JSON: https://modelparams.dev/api/v1/models/anthropic/claude-3-7-sonnet-latest.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 1, step 0.1] [except when thinking.type ∈ {"adaptive", "enabled"}]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01] [except when thinking.type ∈ {"adaptive", "enabled"}; temperature ≠ 1]
- `top_k` (integer, Sampling) — Limits token sampling to the top K most likely next tokens. [default: 0] [range: min 0] [except when thinking.type ∈ {"adaptive", "enabled"}]
- `thinking.type` (enum, Reasoning) — Controls the Anthropic thinking mode values supported by this model. [default: "disabled"] [values: "disabled", "enabled"]
- `thinking.budget_tokens` (integer, Reasoning) — Maximum token budget Anthropic may use for extended thinking before producing the final answer. [default: 4096] [range: min 1024] [only when thinking.type = "enabled"]

### anthropic/claude-3-opus-20240229

Anthropic Claude 3 Opus 20240229 (API key) · JSON: https://modelparams.dev/api/v1/models/anthropic/claude-3-opus-20240229.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 1, step 0.1] [except when thinking.type ∈ {"adaptive", "enabled"}]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01] [except when thinking.type ∈ {"adaptive", "enabled"}; temperature ≠ 1]
- `top_k` (integer, Sampling) — Limits token sampling to the top K most likely next tokens. [default: 0] [range: min 0] [except when thinking.type ∈ {"adaptive", "enabled"}]

### anthropic/claude-3-opus-latest

Anthropic Claude 3 Opus Latest (API key) · JSON: https://modelparams.dev/api/v1/models/anthropic/claude-3-opus-latest.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 1, step 0.1] [except when thinking.type ∈ {"adaptive", "enabled"}]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01] [except when thinking.type ∈ {"adaptive", "enabled"}; temperature ≠ 1]
- `top_k` (integer, Sampling) — Limits token sampling to the top K most likely next tokens. [default: 0] [range: min 0] [except when thinking.type ∈ {"adaptive", "enabled"}]

### anthropic/claude-haiku-4

Anthropic Claude Haiku 4 (API key) · JSON: https://modelparams.dev/api/v1/models/anthropic/claude-haiku-4.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 1, step 0.1] [except when thinking.type ∈ {"adaptive", "enabled"}]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01] [except when thinking.type ∈ {"adaptive", "enabled"}; temperature ≠ 1]
- `top_k` (integer, Sampling) — Limits token sampling to the top K most likely next tokens. [default: 0] [range: min 0] [except when thinking.type ∈ {"adaptive", "enabled"}]
- `thinking.type` (enum, Reasoning) — Controls the Anthropic thinking mode values supported by this model. [default: "disabled"] [values: "disabled", "enabled"]
- `thinking.budget_tokens` (integer, Reasoning) — Maximum token budget Anthropic may use for extended thinking before producing the final answer. [default: 4096] [range: min 1024] [only when thinking.type = "enabled"]

### anthropic/claude-haiku-4-5

Anthropic Claude Haiku 4.5 (API key) · JSON: https://modelparams.dev/api/v1/models/anthropic/claude-haiku-4-5.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 1, step 0.1] [except when thinking.type ∈ {"adaptive", "enabled"}]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01] [except when thinking.type ∈ {"adaptive", "enabled"}; temperature ≠ 1]
- `top_k` (integer, Sampling) — Limits token sampling to the top K most likely next tokens. [default: 0] [range: min 0] [except when thinking.type ∈ {"adaptive", "enabled"}]
- `thinking.type` (enum, Reasoning) — Controls the Anthropic thinking mode values supported by this model. [default: "disabled"] [values: "disabled", "enabled"]
- `thinking.budget_tokens` (integer, Reasoning) — Maximum token budget Anthropic may use for extended thinking before producing the final answer. [default: 4096] [range: min 1024] [only when thinking.type = "enabled"]

### anthropic/claude-haiku-4-5-20251001

Anthropic Claude Haiku 4.5 20251001 (API key) · JSON: https://modelparams.dev/api/v1/models/anthropic/claude-haiku-4-5-20251001.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 1, step 0.1] [except when thinking.type = "enabled"; top_p ≠ null]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01] [except when thinking.type = "enabled"; temperature ≠ null]
- `top_k` (integer, Sampling) — Limits token sampling to the top K most likely next tokens. [default: 0] [range: min 0] [except when thinking.type = "enabled"]
- `thinking.type` (enum, Reasoning) — Controls the Anthropic thinking mode values supported by this model. [default: "disabled"] [values: "disabled", "enabled"]
- `thinking.budget_tokens` (integer, Reasoning) — Maximum token budget Anthropic may use for extended thinking before producing the final answer. [default: 4096] [range: min 1024] [only when thinking.type = "enabled"]

### anthropic/claude-haiku-4-subscription

Anthropic Claude Haiku 4 (Subscription) · JSON: https://modelparams.dev/api/v1/models/anthropic/claude-haiku-4-subscription.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 1, step 0.1] [except when thinking.type ∈ {"adaptive", "enabled"}]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01] [except when thinking.type ∈ {"adaptive", "enabled"}; temperature ≠ 1]
- `top_k` (integer, Sampling) — Limits token sampling to the top K most likely next tokens. [default: 0] [range: min 0] [except when thinking.type ∈ {"adaptive", "enabled"}]
- `thinking.type` (enum, Reasoning) — Controls the Anthropic thinking mode values supported by this model. [default: "disabled"] [values: "disabled", "enabled"]
- `thinking.budget_tokens` (integer, Reasoning) — Maximum token budget Anthropic may use for extended thinking before producing the final answer. [default: 4096] [range: min 1024] [only when thinking.type = "enabled"]

### anthropic/claude-opus-4-1-20250805

Anthropic Claude Opus 4.1 20250805 (API key) · JSON: https://modelparams.dev/api/v1/models/anthropic/claude-opus-4-1-20250805.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 1, step 0.1] [except when thinking.type = "enabled"; top_p ≠ null]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01] [except when thinking.type = "enabled"; temperature ≠ null]
- `top_k` (integer, Sampling) — Limits token sampling to the top K most likely next tokens. [default: 0] [range: min 0] [except when thinking.type = "enabled"]
- `thinking.type` (enum, Reasoning) — Controls the Anthropic thinking mode values supported by this model. [default: "disabled"] [values: "disabled", "enabled"]
- `thinking.budget_tokens` (integer, Reasoning) — Maximum token budget Anthropic may use for extended thinking before producing the final answer. [default: 4096] [range: min 1024] [only when thinking.type = "enabled"]
- `thinking.display` (enum, Reasoning) — Controls whether Anthropic returns summarized or omitted thinking content. [default: "summarized"] [values: "summarized", "omitted"] [only when thinking.type = "enabled"]

### anthropic/claude-opus-4-20250514

Anthropic Claude Opus 4.20250514 (API key) · JSON: https://modelparams.dev/api/v1/models/anthropic/claude-opus-4-20250514.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 1, step 0.1] [except when thinking.type = "enabled"]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01] [except when thinking.type = "enabled"]
- `top_k` (integer, Sampling) — Limits token sampling to the top K most likely next tokens. [default: 0] [range: min 0] [except when thinking.type = "enabled"]
- `thinking.type` (enum, Reasoning) — Controls the Anthropic thinking mode values supported by this model. [default: "disabled"] [values: "disabled", "enabled"]
- `thinking.budget_tokens` (integer, Reasoning) — Maximum token budget Anthropic may use for extended thinking before producing the final answer. [default: 4096] [range: min 1024] [only when thinking.type = "enabled"]
- `thinking.display` (enum, Reasoning) — Controls whether Anthropic returns summarized or omitted thinking content. [default: "summarized"] [values: "summarized", "omitted"] [only when thinking.type = "enabled"]

### anthropic/claude-opus-4-5-20251101

Anthropic Claude Opus 4.5 20251101 (API key) · JSON: https://modelparams.dev/api/v1/models/anthropic/claude-opus-4-5-20251101.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 1, step 0.1] [except when thinking.type = "enabled"; top_p ≠ null]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01] [except when thinking.type = "enabled"; temperature ≠ null]
- `top_k` (integer, Sampling) — Limits token sampling to the top K most likely next tokens. [default: 0] [range: min 0] [except when thinking.type = "enabled"]
- `thinking.type` (enum, Reasoning) — Controls the Anthropic thinking mode values supported by this model. [default: "disabled"] [values: "disabled", "enabled"]
- `thinking.budget_tokens` (integer, Reasoning) — Maximum token budget Anthropic may use for extended thinking before producing the final answer. [default: 4096] [range: min 1024] [only when thinking.type = "enabled"]
- `thinking.display` (enum, Reasoning) — Controls whether Anthropic returns summarized or omitted thinking content. [default: "summarized"] [values: "summarized", "omitted"] [only when thinking.type = "enabled"]
- `output_config.effort` (enum, Reasoning) — Controls Anthropic response thoroughness and token spend. [default: "high"] [values: "low", "medium", "high"]

### anthropic/claude-opus-4-6

Anthropic Claude Opus 4.6 (API key) · JSON: https://modelparams.dev/api/v1/models/anthropic/claude-opus-4-6.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 1, step 0.1] [except when thinking.type ∈ {"enabled", "adaptive"}; top_p ≠ null]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01] [except when thinking.type ∈ {"enabled", "adaptive"}; temperature ≠ null]
- `top_k` (integer, Sampling) — Limits token sampling to the top K most likely next tokens. [default: 0] [range: min 0] [except when thinking.type ∈ {"enabled", "adaptive"}]
- `thinking.type` (enum, Reasoning) — Controls the Anthropic thinking mode values supported by this model. [default: "disabled"] [values: "disabled", "adaptive", "enabled"]
- `thinking.budget_tokens` (integer, Reasoning) — Maximum token budget Anthropic may use for extended thinking before producing the final answer. [default: 4096] [range: min 1024] [only when thinking.type = "enabled"]
- `thinking.display` (enum, Reasoning) — Controls whether Anthropic returns summarized or omitted thinking content. [default: "summarized"] [values: "summarized", "omitted"] [only when thinking.type ∈ {"adaptive", "enabled"}]
- `output_config.effort` (enum, Reasoning) — Controls Anthropic response thoroughness and token spend. [default: "high"] [values: "low", "medium", "high", "max"]

### anthropic/claude-opus-4-7

Anthropic Claude Opus 4.7 (API key) · JSON: https://modelparams.dev/api/v1/models/anthropic/claude-opus-4-7.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `thinking.type` (enum, Reasoning) — Controls the Anthropic thinking mode values supported by this model. [default: "disabled"] [values: "disabled", "adaptive"]
- `thinking.display` (enum, Reasoning) — Controls whether Anthropic returns summarized or omitted thinking content. [default: "omitted"] [values: "summarized", "omitted"] [only when thinking.type = "adaptive"]
- `output_config.effort` (enum, Reasoning) — Controls Anthropic response thoroughness and token spend. [default: "high"] [values: "low", "medium", "high", "xhigh", "max"]

### anthropic/claude-opus-4-subscription

Anthropic Claude Opus 4 (Subscription) · JSON: https://modelparams.dev/api/v1/models/anthropic/claude-opus-4-subscription.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 1, step 0.1] [except when thinking.type ∈ {"adaptive", "enabled"}]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01] [except when thinking.type ∈ {"adaptive", "enabled"}; temperature ≠ 1]
- `top_k` (integer, Sampling) — Limits token sampling to the top K most likely next tokens. [default: 0] [range: min 0] [except when thinking.type ∈ {"adaptive", "enabled"}]
- `thinking.type` (enum, Reasoning) — Controls the Anthropic thinking mode values supported by this model. [default: "disabled"] [values: "disabled", "adaptive", "enabled"]
- `thinking.budget_tokens` (integer, Reasoning) — Maximum token budget Anthropic may use for extended thinking before producing the final answer. [default: 4096] [range: min 1024] [only when thinking.type = "enabled"]

### anthropic/claude-sonnet-4-20250514

Anthropic Claude Sonnet 4.20250514 (API key) · JSON: https://modelparams.dev/api/v1/models/anthropic/claude-sonnet-4-20250514.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 1, step 0.1] [except when thinking.type = "enabled"]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01] [except when thinking.type = "enabled"]
- `top_k` (integer, Sampling) — Limits token sampling to the top K most likely next tokens. [default: 0] [range: min 0] [except when thinking.type = "enabled"]
- `thinking.type` (enum, Reasoning) — Controls the Anthropic thinking mode values supported by this model. [default: "disabled"] [values: "disabled", "enabled"]
- `thinking.budget_tokens` (integer, Reasoning) — Maximum token budget Anthropic may use for extended thinking before producing the final answer. [default: 4096] [range: min 1024] [only when thinking.type = "enabled"]
- `thinking.display` (enum, Reasoning) — Controls whether Anthropic returns summarized or omitted thinking content. [default: "summarized"] [values: "summarized", "omitted"] [only when thinking.type = "enabled"]

### anthropic/claude-sonnet-4-5

Anthropic Claude Sonnet 4.5 (API key) · JSON: https://modelparams.dev/api/v1/models/anthropic/claude-sonnet-4-5.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 1, step 0.1] [except when thinking.type ∈ {"adaptive", "enabled"}]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01] [except when thinking.type ∈ {"adaptive", "enabled"}; temperature ≠ 1]
- `top_k` (integer, Sampling) — Limits token sampling to the top K most likely next tokens. [default: 0] [range: min 0] [except when thinking.type ∈ {"adaptive", "enabled"}]
- `thinking.type` (enum, Reasoning) — Controls the Anthropic thinking mode values supported by this model. [default: "disabled"] [values: "disabled", "adaptive", "enabled"]
- `thinking.budget_tokens` (integer, Reasoning) — Maximum token budget Anthropic may use for extended thinking before producing the final answer. [default: 4096] [range: min 1024] [only when thinking.type = "enabled"]

### anthropic/claude-sonnet-4-5-20250929

Anthropic Claude Sonnet 4.5 20250929 (API key) · JSON: https://modelparams.dev/api/v1/models/anthropic/claude-sonnet-4-5-20250929.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 1, step 0.1] [except when thinking.type = "enabled"; top_p ≠ null]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01] [except when thinking.type = "enabled"; temperature ≠ null]
- `top_k` (integer, Sampling) — Limits token sampling to the top K most likely next tokens. [default: 0] [range: min 0] [except when thinking.type = "enabled"]
- `thinking.type` (enum, Reasoning) — Controls the Anthropic thinking mode values supported by this model. [default: "disabled"] [values: "disabled", "enabled"]
- `thinking.budget_tokens` (integer, Reasoning) — Maximum token budget Anthropic may use for extended thinking before producing the final answer. [default: 4096] [range: min 1024] [only when thinking.type = "enabled"]

### anthropic/claude-sonnet-4-6

Anthropic Claude Sonnet 4.6 (API key) · JSON: https://modelparams.dev/api/v1/models/anthropic/claude-sonnet-4-6.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 1, step 0.1] [except when thinking.type ∈ {"enabled", "adaptive"}; top_p ≠ null]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01] [except when thinking.type ∈ {"enabled", "adaptive"}; temperature ≠ null]
- `top_k` (integer, Sampling) — Limits token sampling to the top K most likely next tokens. [default: 0] [range: min 0] [except when thinking.type ∈ {"enabled", "adaptive"}]
- `thinking.type` (enum, Reasoning) — Controls the Anthropic thinking mode values supported by this model. [default: "disabled"] [values: "disabled", "adaptive", "enabled"]
- `thinking.budget_tokens` (integer, Reasoning) — Maximum token budget Anthropic may use for extended thinking before producing the final answer. [default: 4096] [range: min 1024] [only when thinking.type = "enabled"]
- `thinking.display` (enum, Reasoning) — Controls whether Anthropic returns summarized or omitted thinking content. [default: "summarized"] [values: "summarized", "omitted"] [only when thinking.type ∈ {"adaptive", "enabled"}]
- `output_config.effort` (enum, Reasoning) — Controls Anthropic response thoroughness and token spend. [default: "high"] [values: "low", "medium", "high", "max"]

### anthropic/claude-sonnet-4-subscription

Anthropic Claude Sonnet 4 (Subscription) · JSON: https://modelparams.dev/api/v1/models/anthropic/claude-sonnet-4-subscription.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 1, step 0.1] [except when thinking.type ∈ {"adaptive", "enabled"}]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01] [except when thinking.type ∈ {"adaptive", "enabled"}; temperature ≠ 1]
- `top_k` (integer, Sampling) — Limits token sampling to the top K most likely next tokens. [default: 0] [range: min 0] [except when thinking.type ∈ {"adaptive", "enabled"}]
- `thinking.type` (enum, Reasoning) — Controls the Anthropic thinking mode values supported by this model. [default: "disabled"] [values: "disabled", "adaptive", "enabled"]
- `thinking.budget_tokens` (integer, Reasoning) — Maximum token budget Anthropic may use for extended thinking before producing the final answer. [default: 4096] [range: min 1024] [only when thinking.type = "enabled"]

## Mistral

### mistral/codestral-latest

Mistral Codestral Latest (API key) · JSON: https://modelparams.dev/api/v1/models/mistral/codestral-latest.json

- `max_tokens` (integer, Length) — Maximum number of tokens to generate in the completion. [range: min 1]
- `stop` (string, Length) — Stops generation when this string is detected.
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [range: min 0, max 1.5, step 0.1]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens within the selected cumulative probability. [default: 1] [range: min 0, max 1, step 0.01]
- `random_seed` (integer, Sampling) — Seed used for deterministic sampling when reproducible outputs are desired. [range: min 0]
- `presence_penalty` (number, Sampling) — Penalizes repeated words or phrases to encourage a wider variety of generated content. [default: 0] [range: min -2, max 2, step 0.1]
- `frequency_penalty` (number, Sampling) — Penalizes words based on how often they already appear in the generated text. [default: 0] [range: min -2, max 2, step 0.1]
- `response_format.type` (enum, Output) — Controls whether the model returns normal text or JSON mode output. [default: "text"] [values: "text", "json_object"]
- `safe_prompt` (boolean, Metadata) — Controls whether Mistral injects its safety prompt before the conversation. [default: false]

### mistral/devstral-2512

Mistral Devstral 2512 (API key) · JSON: https://modelparams.dev/api/v1/models/mistral/devstral-2512.json

- `max_tokens` (integer, Length) — Maximum number of tokens to generate in the completion. [range: min 1]
- `stop` (string, Length) — Stops generation when this string is detected.
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [range: min 0, max 1.5, step 0.1]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens within the selected cumulative probability. [default: 1] [range: min 0, max 1, step 0.01]
- `random_seed` (integer, Sampling) — Seed used for deterministic sampling when reproducible outputs are desired. [range: min 0]
- `presence_penalty` (number, Sampling) — Penalizes repeated words or phrases to encourage a wider variety of generated content. [default: 0] [range: min -2, max 2, step 0.1]
- `frequency_penalty` (number, Sampling) — Penalizes words based on how often they already appear in the generated text. [default: 0] [range: min -2, max 2, step 0.1]
- `response_format.type` (enum, Output) — Controls whether the model returns normal text or JSON mode output. [default: "text"] [values: "text", "json_object"]
- `safe_prompt` (boolean, Metadata) — Controls whether Mistral injects its safety prompt before the conversation. [default: false]

### mistral/devstral-latest

Mistral Devstral Latest (API key) · JSON: https://modelparams.dev/api/v1/models/mistral/devstral-latest.json

- `max_tokens` (integer, Length) — Maximum number of tokens to generate in the completion. [range: min 1]
- `stop` (string, Length) — Stops generation when this string is detected.
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [range: min 0, max 1.5, step 0.1]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens within the selected cumulative probability. [default: 1] [range: min 0, max 1, step 0.01]
- `random_seed` (integer, Sampling) — Seed used for deterministic sampling when reproducible outputs are desired. [range: min 0]
- `presence_penalty` (number, Sampling) — Penalizes repeated words or phrases to encourage a wider variety of generated content. [default: 0] [range: min -2, max 2, step 0.1]
- `frequency_penalty` (number, Sampling) — Penalizes words based on how often they already appear in the generated text. [default: 0] [range: min -2, max 2, step 0.1]
- `response_format.type` (enum, Output) — Controls whether the model returns normal text or JSON mode output. [default: "text"] [values: "text", "json_object"]
- `safe_prompt` (boolean, Metadata) — Controls whether Mistral injects its safety prompt before the conversation. [default: false]

### mistral/magistral-medium-latest

Mistral Magistral Medium Latest (API key) · JSON: https://modelparams.dev/api/v1/models/mistral/magistral-medium-latest.json

- `max_tokens` (integer, Length) — Maximum number of tokens to generate in the completion. [range: min 1]
- `stop` (string, Length) — Stops generation when this string is detected.
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [range: min 0, max 1.5, step 0.1]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens within the selected cumulative probability. [default: 1] [range: min 0, max 1, step 0.01]
- `random_seed` (integer, Sampling) — Seed used for deterministic sampling when reproducible outputs are desired. [range: min 0]
- `presence_penalty` (number, Sampling) — Penalizes repeated words or phrases to encourage a wider variety of generated content. [default: 0] [range: min -2, max 2, step 0.1]
- `frequency_penalty` (number, Sampling) — Penalizes words based on how often they already appear in the generated text. [default: 0] [range: min -2, max 2, step 0.1]
- `response_format.type` (enum, Output) — Controls whether the model returns normal text or JSON mode output. [default: "text"] [values: "text", "json_object"]
- `safe_prompt` (boolean, Metadata) — Controls whether Mistral injects its safety prompt before the conversation. [default: false]

### mistral/magistral-small-latest

Mistral Magistral Small Latest (API key) · JSON: https://modelparams.dev/api/v1/models/mistral/magistral-small-latest.json

- `max_tokens` (integer, Length) — Maximum number of tokens to generate in the completion. [range: min 1]
- `stop` (string, Length) — Stops generation when this string is detected.
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [range: min 0, max 1.5, step 0.1]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens within the selected cumulative probability. [default: 1] [range: min 0, max 1, step 0.01]
- `random_seed` (integer, Sampling) — Seed used for deterministic sampling when reproducible outputs are desired. [range: min 0]
- `presence_penalty` (number, Sampling) — Penalizes repeated words or phrases to encourage a wider variety of generated content. [default: 0] [range: min -2, max 2, step 0.1]
- `frequency_penalty` (number, Sampling) — Penalizes words based on how often they already appear in the generated text. [default: 0] [range: min -2, max 2, step 0.1]
- `response_format.type` (enum, Output) — Controls whether the model returns normal text or JSON mode output. [default: "text"] [values: "text", "json_object"]
- `safe_prompt` (boolean, Metadata) — Controls whether Mistral injects its safety prompt before the conversation. [default: false]

### mistral/ministral-14b-latest

Mistral Ministral 14b Latest (API key) · JSON: https://modelparams.dev/api/v1/models/mistral/ministral-14b-latest.json

- `max_tokens` (integer, Length) — Maximum number of tokens to generate in the completion. [range: min 1]
- `stop` (string, Length) — Stops generation when this string is detected.
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [range: min 0, max 1.5, step 0.1]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens within the selected cumulative probability. [default: 1] [range: min 0, max 1, step 0.01]
- `random_seed` (integer, Sampling) — Seed used for deterministic sampling when reproducible outputs are desired. [range: min 0]
- `presence_penalty` (number, Sampling) — Penalizes repeated words or phrases to encourage a wider variety of generated content. [default: 0] [range: min -2, max 2, step 0.1]
- `frequency_penalty` (number, Sampling) — Penalizes words based on how often they already appear in the generated text. [default: 0] [range: min -2, max 2, step 0.1]
- `response_format.type` (enum, Output) — Controls whether the model returns normal text or JSON mode output. [default: "text"] [values: "text", "json_object"]
- `safe_prompt` (boolean, Metadata) — Controls whether Mistral injects its safety prompt before the conversation. [default: false]

### mistral/ministral-3b-latest

Mistral Ministral 3b Latest (API key) · JSON: https://modelparams.dev/api/v1/models/mistral/ministral-3b-latest.json

- `max_tokens` (integer, Length) — Maximum number of tokens to generate in the completion. [range: min 1]
- `stop` (string, Length) — Stops generation when this string is detected.
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [range: min 0, max 1.5, step 0.1]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens within the selected cumulative probability. [default: 1] [range: min 0, max 1, step 0.01]
- `random_seed` (integer, Sampling) — Seed used for deterministic sampling when reproducible outputs are desired. [range: min 0]
- `presence_penalty` (number, Sampling) — Penalizes repeated words or phrases to encourage a wider variety of generated content. [default: 0] [range: min -2, max 2, step 0.1]
- `frequency_penalty` (number, Sampling) — Penalizes words based on how often they already appear in the generated text. [default: 0] [range: min -2, max 2, step 0.1]
- `response_format.type` (enum, Output) — Controls whether the model returns normal text or JSON mode output. [default: "text"] [values: "text", "json_object"]
- `safe_prompt` (boolean, Metadata) — Controls whether Mistral injects its safety prompt before the conversation. [default: false]

### mistral/ministral-8b-latest

Mistral Ministral 8b Latest (API key) · JSON: https://modelparams.dev/api/v1/models/mistral/ministral-8b-latest.json

- `max_tokens` (integer, Length) — Maximum number of tokens to generate in the completion. [range: min 1]
- `stop` (string, Length) — Stops generation when this string is detected.
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [range: min 0, max 1.5, step 0.1]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens within the selected cumulative probability. [default: 1] [range: min 0, max 1, step 0.01]
- `random_seed` (integer, Sampling) — Seed used for deterministic sampling when reproducible outputs are desired. [range: min 0]
- `presence_penalty` (number, Sampling) — Penalizes repeated words or phrases to encourage a wider variety of generated content. [default: 0] [range: min -2, max 2, step 0.1]
- `frequency_penalty` (number, Sampling) — Penalizes words based on how often they already appear in the generated text. [default: 0] [range: min -2, max 2, step 0.1]
- `response_format.type` (enum, Output) — Controls whether the model returns normal text or JSON mode output. [default: "text"] [values: "text", "json_object"]
- `safe_prompt` (boolean, Metadata) — Controls whether Mistral injects its safety prompt before the conversation. [default: false]

### mistral/mistral-large-latest

Mistral Mistral Large Latest (API key) · JSON: https://modelparams.dev/api/v1/models/mistral/mistral-large-latest.json

- `max_tokens` (integer, Length) — Maximum number of tokens to generate in the completion. [range: min 1]
- `stop` (string, Length) — Stops generation when this string is detected.
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [range: min 0, max 1.5, step 0.1]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens within the selected cumulative probability. [default: 1] [range: min 0, max 1, step 0.01]
- `random_seed` (integer, Sampling) — Seed used for deterministic sampling when reproducible outputs are desired. [range: min 0]
- `presence_penalty` (number, Sampling) — Penalizes repeated words or phrases to encourage a wider variety of generated content. [default: 0] [range: min -2, max 2, step 0.1]
- `frequency_penalty` (number, Sampling) — Penalizes words based on how often they already appear in the generated text. [default: 0] [range: min -2, max 2, step 0.1]
- `response_format.type` (enum, Output) — Controls whether the model returns normal text or JSON mode output. [default: "text"] [values: "text", "json_object"]
- `safe_prompt` (boolean, Metadata) — Controls whether Mistral injects its safety prompt before the conversation. [default: false]

### mistral/mistral-medium-3.5

Mistral Mistral Medium 3.5 (API key) · JSON: https://modelparams.dev/api/v1/models/mistral/mistral-medium-3.5.json

- `max_tokens` (integer, Length) — Maximum number of tokens to generate in the completion. [range: min 1]
- `stop` (string, Length) — Stops generation when this string is detected.
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [range: min 0, max 1.5, step 0.1]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens within the selected cumulative probability. [default: 1] [range: min 0, max 1, step 0.01]
- `random_seed` (integer, Sampling) — Seed used for deterministic sampling when reproducible outputs are desired. [range: min 0]
- `presence_penalty` (number, Sampling) — Penalizes repeated words or phrases to encourage a wider variety of generated content. [default: 0] [range: min -2, max 2, step 0.1]
- `frequency_penalty` (number, Sampling) — Penalizes words based on how often they already appear in the generated text. [default: 0] [range: min -2, max 2, step 0.1]
- `response_format.type` (enum, Output) — Controls whether the model returns normal text or JSON mode output. [default: "text"] [values: "text", "json_object"]
- `safe_prompt` (boolean, Metadata) — Controls whether Mistral injects its safety prompt before the conversation. [default: false]

### mistral/mistral-medium-latest

Mistral Mistral Medium Latest (API key) · JSON: https://modelparams.dev/api/v1/models/mistral/mistral-medium-latest.json

- `max_tokens` (integer, Length) — Maximum number of tokens to generate in the completion. [range: min 1]
- `stop` (string, Length) — Stops generation when this string is detected.
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [range: min 0, max 1.5, step 0.1]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens within the selected cumulative probability. [default: 1] [range: min 0, max 1, step 0.01]
- `random_seed` (integer, Sampling) — Seed used for deterministic sampling when reproducible outputs are desired. [range: min 0]
- `presence_penalty` (number, Sampling) — Penalizes repeated words or phrases to encourage a wider variety of generated content. [default: 0] [range: min -2, max 2, step 0.1]
- `frequency_penalty` (number, Sampling) — Penalizes words based on how often they already appear in the generated text. [default: 0] [range: min -2, max 2, step 0.1]
- `response_format.type` (enum, Output) — Controls whether the model returns normal text or JSON mode output. [default: "text"] [values: "text", "json_object"]
- `safe_prompt` (boolean, Metadata) — Controls whether Mistral injects its safety prompt before the conversation. [default: false]

### mistral/mistral-small-latest

Mistral Mistral Small Latest (API key) · JSON: https://modelparams.dev/api/v1/models/mistral/mistral-small-latest.json

- `max_tokens` (integer, Length) — Maximum number of tokens to generate in the completion. [range: min 1]
- `stop` (string, Length) — Stops generation when this string is detected.
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [range: min 0, max 1.5, step 0.1]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens within the selected cumulative probability. [default: 1] [range: min 0, max 1, step 0.01]
- `random_seed` (integer, Sampling) — Seed used for deterministic sampling when reproducible outputs are desired. [range: min 0]
- `presence_penalty` (number, Sampling) — Penalizes repeated words or phrases to encourage a wider variety of generated content. [default: 0] [range: min -2, max 2, step 0.1]
- `frequency_penalty` (number, Sampling) — Penalizes words based on how often they already appear in the generated text. [default: 0] [range: min -2, max 2, step 0.1]
- `response_format.type` (enum, Output) — Controls whether the model returns normal text or JSON mode output. [default: "text"] [values: "text", "json_object"]
- `safe_prompt` (boolean, Metadata) — Controls whether Mistral injects its safety prompt before the conversation. [default: false]

### mistral/open-mistral-nemo

Mistral Open Mistral Nemo (API key) · JSON: https://modelparams.dev/api/v1/models/mistral/open-mistral-nemo.json

- `max_tokens` (integer, Length) — Maximum number of tokens to generate in the completion. [range: min 1]
- `stop` (string, Length) — Stops generation when this string is detected.
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [range: min 0, max 1.5, step 0.1]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens within the selected cumulative probability. [default: 1] [range: min 0, max 1, step 0.01]
- `random_seed` (integer, Sampling) — Seed used for deterministic sampling when reproducible outputs are desired. [range: min 0]
- `presence_penalty` (number, Sampling) — Penalizes repeated words or phrases to encourage a wider variety of generated content. [default: 0] [range: min -2, max 2, step 0.1]
- `frequency_penalty` (number, Sampling) — Penalizes words based on how often they already appear in the generated text. [default: 0] [range: min -2, max 2, step 0.1]
- `response_format.type` (enum, Output) — Controls whether the model returns normal text or JSON mode output. [default: "text"] [values: "text", "json_object"]
- `safe_prompt` (boolean, Metadata) — Controls whether Mistral injects its safety prompt before the conversation. [default: false]

## DeepSeek

### deepseek/deepseek-chat

DeepSeek Deepseek Chat (API key) · JSON: https://modelparams.dev/api/v1/models/deepseek/deepseek-chat.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. In DeepSeek thinking mode this parameter is accepted for compatibility but has no effect. [default: 1] [range: min 0, max 2, step 0.1] [except when thinking.type = "enabled"]
- `top_p` (number, Sampling) — Controls nucleus sampling. In DeepSeek thinking mode this parameter is accepted for compatibility but has no effect. [default: 1] [range: min 0, max 1, step 0.01] [except when thinking.type = "enabled"]
- `thinking.type` (enum, Reasoning) — Controls whether DeepSeek uses thinking mode before producing the final answer. [default: "disabled"] [values: "disabled", "enabled"]

### deepseek/deepseek-reasoner

DeepSeek Deepseek Reasoner (API key) · JSON: https://modelparams.dev/api/v1/models/deepseek/deepseek-reasoner.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. In DeepSeek thinking mode this parameter is accepted for compatibility but has no effect. [default: 1] [range: min 0, max 2, step 0.1] [except when thinking.type = "enabled"]
- `top_p` (number, Sampling) — Controls nucleus sampling. In DeepSeek thinking mode this parameter is accepted for compatibility but has no effect. [default: 1] [range: min 0, max 1, step 0.01] [except when thinking.type = "enabled"]
- `thinking.type` (enum, Reasoning) — Controls whether DeepSeek uses thinking mode before producing the final answer. [default: "enabled"] [values: "enabled", "disabled"]
- `reasoning_effort` (enum, Reasoning) — Controls DeepSeek thinking effort when thinking mode is enabled. [default: "high"] [values: "high", "max"] [only when thinking.type = "enabled"]

### deepseek/deepseek-v4-flash

DeepSeek Deepseek V4 Flash (API key) · JSON: https://modelparams.dev/api/v1/models/deepseek/deepseek-v4-flash.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. In DeepSeek thinking mode this parameter is accepted for compatibility but has no effect. [default: 1] [range: min 0, max 2, step 0.1] [except when thinking.type = "enabled"]
- `top_p` (number, Sampling) — Controls nucleus sampling. In DeepSeek thinking mode this parameter is accepted for compatibility but has no effect. [default: 1] [range: min 0, max 1, step 0.01] [except when thinking.type = "enabled"]
- `thinking.type` (enum, Reasoning) — Controls whether DeepSeek uses thinking mode before producing the final answer. [default: "enabled"] [values: "enabled", "disabled"]
- `reasoning_effort` (enum, Reasoning) — Controls DeepSeek thinking effort when thinking mode is enabled. [default: "high"] [values: "high", "max"] [only when thinking.type = "enabled"]

### deepseek/deepseek-v4-pro

DeepSeek Deepseek V4 Pro (API key) · JSON: https://modelparams.dev/api/v1/models/deepseek/deepseek-v4-pro.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. In DeepSeek thinking mode this parameter is accepted for compatibility but has no effect. [default: 1] [range: min 0, max 2, step 0.1] [except when thinking.type = "enabled"]
- `top_p` (number, Sampling) — Controls nucleus sampling. In DeepSeek thinking mode this parameter is accepted for compatibility but has no effect. [default: 1] [range: min 0, max 1, step 0.01] [except when thinking.type = "enabled"]
- `thinking.type` (enum, Reasoning) — Controls whether DeepSeek uses thinking mode before producing the final answer. [default: "enabled"] [values: "enabled", "disabled"]
- `reasoning_effort` (enum, Reasoning) — Controls DeepSeek thinking effort when thinking mode is enabled. [default: "high"] [values: "high", "max"] [only when thinking.type = "enabled"]

## Google

### google/gemini-2.5-flash

Google Gemini 2.5 Flash (API key) · JSON: https://modelparams.dev/api/v1/models/google/gemini-2.5-flash.json

- `generationConfig.maxOutputTokens` (integer, Length) — Maximum number of tokens to include in a response candidate. [range: min 1, max 65536]
- `generationConfig.temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 2, step 0.1]
- `generationConfig.topP` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens within the selected cumulative probability. [default: 0.95] [range: min 0, max 1, step 0.01]
- `generationConfig.topK` (integer, Sampling) — Limits token sampling to the top K most likely next tokens. [default: 64] [range: min 0]
- `generationConfig.seed` (integer, Sampling) — Optional seed used for decoding when reproducible sampling is desired.
- `generationConfig.thinkingConfig.thinkingBudget` (integer, Reasoning) — Number of thinking tokens Gemini should use; 0 disables thinking and -1 uses dynamic thinking. [default: -1] [range: min -1, max 24576]
- `generationConfig.thinkingConfig.includeThoughts` (boolean, Reasoning) — Controls whether Gemini returns available thought summaries in the response parts. [default: false]
- `generationConfig.responseMimeType` (enum, Output) — MIME type for generated text candidates. [default: "text/plain"] [values: "text/plain", "application/json"]

### google/gemini-2.5-flash-lite

Google Gemini 2.5 Flash Lite (API key) · JSON: https://modelparams.dev/api/v1/models/google/gemini-2.5-flash-lite.json

- `generationConfig.maxOutputTokens` (integer, Length) — Maximum number of tokens to include in a response candidate. [range: min 1, max 65536]
- `generationConfig.temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 2, step 0.1]
- `generationConfig.topP` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens within the selected cumulative probability. [default: 0.95] [range: min 0, max 1, step 0.01]
- `generationConfig.topK` (integer, Sampling) — Limits token sampling to the top K most likely next tokens. [default: 64] [range: min 0]
- `generationConfig.seed` (integer, Sampling) — Optional seed used for decoding when reproducible sampling is desired.
- `generationConfig.thinkingConfig.thinkingBudget` (integer, Reasoning) — Number of thinking tokens Gemini should use; -1 uses dynamic thinking, 0 disables thinking, and fixed budgets start at 512 tokens. [default: 0]
- `generationConfig.thinkingConfig.includeThoughts` (boolean, Reasoning) — Controls whether Gemini returns available thought summaries in the response parts. [default: false]
- `generationConfig.responseMimeType` (enum, Output) — MIME type for generated text candidates. [default: "text/plain"] [values: "text/plain", "application/json"]

### google/gemini-2.5-pro

Google Gemini 2.5 Pro (API key) · JSON: https://modelparams.dev/api/v1/models/google/gemini-2.5-pro.json

- `generationConfig.maxOutputTokens` (integer, Length) — Maximum number of tokens to include in a response candidate. [range: min 1, max 65536]
- `generationConfig.temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 2, step 0.1]
- `generationConfig.topP` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens within the selected cumulative probability. [default: 0.95] [range: min 0, max 1, step 0.01]
- `generationConfig.topK` (integer, Sampling) — Limits token sampling to the top K most likely next tokens. [default: 64] [range: min 0]
- `generationConfig.seed` (integer, Sampling) — Optional seed used for decoding when reproducible sampling is desired.
- `generationConfig.thinkingConfig.thinkingBudget` (integer, Reasoning) — Maximum number of thinking tokens Gemini should use before producing the final answer. [range: min 128, max 32768]
- `generationConfig.thinkingConfig.includeThoughts` (boolean, Reasoning) — Controls whether Gemini returns available thought summaries in the response parts. [default: false]
- `generationConfig.responseMimeType` (enum, Output) — MIME type for generated text candidates. [default: "text/plain"] [values: "text/plain", "application/json"]

### google/gemini-3.5-flash

Google Gemini 3.5 Flash (API key) · JSON: https://modelparams.dev/api/v1/models/google/gemini-3.5-flash.json

- `generationConfig.maxOutputTokens` (integer, Length) — Maximum number of tokens to include in a response candidate. [range: min 1, max 65536]
- `generationConfig.temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 2, step 0.1]
- `generationConfig.topP` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens within the selected cumulative probability. [default: 0.95] [range: min 0, max 1, step 0.01]
- `generationConfig.topK` (integer, Sampling) — Limits token sampling to the top K most likely next tokens. [default: 64] [range: min 0]
- `generationConfig.seed` (integer, Sampling) — Optional seed used for decoding when reproducible sampling is desired.
- `generationConfig.thinkingConfig.thinkingLevel` (enum, Reasoning) — Controls Gemini 3.5 Flash reasoning effort. [default: "medium"] [values: "minimal", "low", "medium", "high"]
- `generationConfig.thinkingConfig.includeThoughts` (boolean, Reasoning) — Controls whether Gemini returns available thought summaries in the response parts. [default: false]
- `generationConfig.responseMimeType` (enum, Output) — MIME type for generated text candidates. [default: "text/plain"] [values: "text/plain", "application/json"]