# How to use modelparameters.dev

[modelparameters.dev](https://modelparams.dev) is an open, community-maintained catalog of LLM model parameters. Each entry shows the knobs you can turn: type, default, range, and the conditions that gate it.

The same model accessed via an **API key** and via a **subscription** usually exposes a different set of parameters. We list both as separate entries so the data stays honest.

## Catalog API

The full catalog is static JSON, CORS-enabled, served from the edge:

```bash
curl https://modelparams.dev/api/v1/models.json
```

Each entry is keyed by `provider/model` for API-key variants; subscription variants append `-subscription`.

## Single model

```bash
curl https://modelparams.dev/api/v1/models/anthropic/claude-opus-4-7.json
curl https://modelparams.dev/api/v1/models/anthropic/claude-opus-4-7-subscription.json
```

## JSON Schema

Every entry validates against a JSON Schema you can use in your editor or pipeline:

```bash
curl https://modelparams.dev/api/v1/schema.json
```

Add this header to any YAML you author for autocomplete in VS Code:

```yaml
# yaml-language-server: $schema=https://modelparams.dev/api/v1/schema.json
```

## Logos

Provider logos are at `/assets/logos/{provider}.svg`. They use `currentColor`, so they inherit your text color:

```bash
curl https://modelparams.dev/assets/logos/anthropic.svg
```

## Contribute

The data lives in YAML under `models/{provider}/{model}-{auth}.yaml` in the [GitHub repo](https://github.com/mnfst/modelparameters.dev). Open a PR; CI validates against the schema and rebuilds.

## For agents

- Machine-readable site overview: https://modelparams.dev/llms.txt
- Full usage guide plus every parameter inline: https://modelparams.dev/llms-full.txt
- When your browser supports it, this page registers in-browser **WebMCP** tools on `navigator.modelContext`: `search_models`, `get_model_parameters`, `list_providers`, `list_parameters`, and `get_usage_guide`.

---

# Full catalog

78 models, grouped by provider.
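
The provider grouping and the `-subscription` suffix convention can be reproduced from the entry keys alone. The following is a minimal sketch, not part of the site's tooling: the helper names (`split_key`, `entry_url`, `group_by_provider`) and the auth labels `"api-key"` and `"subscription"` are hypothetical, chosen for illustration. Only the `provider/model` key shape, the suffix, and the per-model URL pattern come from this guide.

```python
from collections import defaultdict

# Taken from this guide: subscription variants append this suffix to the key.
SUBSCRIPTION_SUFFIX = "-subscription"
# Taken from this guide: per-model JSON lives under this path.
BASE_URL = "https://modelparams.dev/api/v1/models"


def split_key(key: str) -> tuple[str, str, str]:
    """Split 'provider/model[-subscription]' into (provider, model, auth).

    The auth labels returned here are illustrative, not catalog fields.
    """
    provider, _, model = key.partition("/")
    if not provider or not model:
        raise ValueError(f"expected 'provider/model', got {key!r}")
    if model.endswith(SUBSCRIPTION_SUFFIX):
        return provider, model[: -len(SUBSCRIPTION_SUFFIX)], "subscription"
    return provider, model, "api-key"


def entry_url(key: str) -> str:
    """Build the per-model JSON URL for a catalog key."""
    return f"{BASE_URL}/{key}.json"


def group_by_provider(keys: list[str]) -> dict[str, list[str]]:
    """Group catalog keys by their provider prefix, preserving input order."""
    groups: dict[str, list[str]] = defaultdict(list)
    for key in keys:
        provider, _, _ = split_key(key)
        groups[provider].append(key)
    return dict(groups)


if __name__ == "__main__":
    keys = ["openai/gpt-5.5", "openai/gpt-5.5-subscription", "anthropic/claude-haiku-4-5"]
    for provider, members in group_by_provider(keys).items():
        print(provider, [entry_url(k) for k in members])
```

Keeping these helpers free of network calls means they can be checked offline; fetching `models.json` itself is left out because its top-level shape is not described in this guide.
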
Each line reads: `path` (type, group) — description, then defaults, ranges, allowed values, and applicability conditions in brackets.

## OpenAI

### openai/chatgpt-4o-latest

OpenAI Chatgpt 4o Latest (API key) · JSON: https://modelparams.dev/api/v1/models/openai/chatgpt-4o-latest.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 2, step 0.1]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01]

### openai/gpt-3.5-turbo

OpenAI Gpt 3.5 Turbo (API key) · JSON: https://modelparams.dev/api/v1/models/openai/gpt-3.5-turbo.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 2, step 0.1]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01]

### openai/gpt-4-turbo

OpenAI Gpt 4 Turbo (API key) · JSON: https://modelparams.dev/api/v1/models/openai/gpt-4-turbo.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 2, step 0.1]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value.
[default: 1] [range: min 0, max 1, step 0.01]

### openai/gpt-4-turbo-2024-04-09

OpenAI Gpt 4 Turbo 2024.04 09 (API key) · JSON: https://modelparams.dev/api/v1/models/openai/gpt-4-turbo-2024-04-09.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 2, step 0.1]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01]

### openai/gpt-4.1

OpenAI Gpt 4.1 (API key) · JSON: https://modelparams.dev/api/v1/models/openai/gpt-4.1.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 2, step 0.1]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01]

### openai/gpt-4.1-mini

OpenAI Gpt 4.1 Mini (API key) · JSON: https://modelparams.dev/api/v1/models/openai/gpt-4.1-mini.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 2, step 0.1]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value.
[default: 1] [range: min 0, max 1, step 0.01]

### openai/gpt-4.1-nano

OpenAI Gpt 4.1 Nano (API key) · JSON: https://modelparams.dev/api/v1/models/openai/gpt-4.1-nano.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 2, step 0.1]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01]

### openai/gpt-4o

OpenAI GPT-4o (API key) · JSON: https://modelparams.dev/api/v1/models/openai/gpt-4o.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 2, step 0.1]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01]

### openai/gpt-4o-2024-11-20

OpenAI Gpt 4o 2024.11 20 (API key) · JSON: https://modelparams.dev/api/v1/models/openai/gpt-4o-2024-11-20.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 2, step 0.1]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value.
[default: 1] [range: min 0, max 1, step 0.01]

### openai/gpt-4o-mini

OpenAI GPT-4o mini (API key) · JSON: https://modelparams.dev/api/v1/models/openai/gpt-4o-mini.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 2, step 0.1]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01]

### openai/gpt-5

OpenAI Gpt 5 (API key) · JSON: https://modelparams.dev/api/v1/models/openai/gpt-5.json

- `max_completion_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 16]
- `reasoning_effort` (enum, Reasoning) — Controls how much reasoning the model should perform before producing an answer. [default: "medium"] [values: "minimal", "low", "medium", "high"]

### openai/gpt-5-chat-latest

OpenAI Gpt 5 Chat Latest (API key) · JSON: https://modelparams.dev/api/v1/models/openai/gpt-5-chat-latest.json

- `max_completion_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 16]

### openai/gpt-5-mini

OpenAI Gpt 5 Mini (API key) · JSON: https://modelparams.dev/api/v1/models/openai/gpt-5-mini.json

- `max_completion_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 16]
- `reasoning_effort` (enum, Reasoning) — Controls how much reasoning the model should perform before producing an answer.
[default: "medium"] [values: "minimal", "low", "medium", "high"]

### openai/gpt-5-nano

OpenAI Gpt 5 Nano (API key) · JSON: https://modelparams.dev/api/v1/models/openai/gpt-5-nano.json

- `max_completion_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 16]
- `reasoning_effort` (enum, Reasoning) — Controls how much reasoning the model should perform before producing an answer. [default: "medium"] [values: "minimal", "low", "medium", "high"]

### openai/gpt-5.1

OpenAI Gpt 5.1 (API key) · JSON: https://modelparams.dev/api/v1/models/openai/gpt-5.1.json

- `max_completion_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 16]
- `reasoning_effort` (enum, Reasoning) — Controls how much reasoning the model should perform before producing an answer. [default: "none"] [values: "none", "low", "medium", "high"]

### openai/gpt-5.1-codex-max-subscription

OpenAI Gpt 5.1 Codex Max (Subscription) · JSON: https://modelparams.dev/api/v1/models/openai/gpt-5.1-codex-max-subscription.json

- `reasoning.effort` (enum, Reasoning) — Controls how much reasoning the model should perform before producing an answer. [default: "medium"] [values: "minimal", "low", "medium", "high", "xhigh"]
- `reasoning.summary` (enum, Reasoning) — Controls the level of reasoning summary returned with the response. [default: "auto"] [values: "auto", "concise", "detailed", "none"]
- `text.verbosity` (enum, Output) — Controls how concise or detailed the model's final text response should be. [default: "medium"] [values: "low", "medium", "high"]

### openai/gpt-5.1-codex-subscription

OpenAI Gpt 5.1 Codex (Subscription) · JSON: https://modelparams.dev/api/v1/models/openai/gpt-5.1-codex-subscription.json

- `reasoning.effort` (enum, Reasoning) — Controls how much reasoning the model should perform before producing an answer.
[default: "medium"] [values: "minimal", "low", "medium", "high"]
- `reasoning.summary` (enum, Reasoning) — Controls the level of reasoning summary returned with the response. [default: "auto"] [values: "auto", "concise", "detailed", "none"]
- `text.verbosity` (enum, Output) — Controls how concise or detailed the model's final text response should be. [default: "medium"] [values: "low", "medium", "high"]

### openai/gpt-5.2

OpenAI Gpt 5.2 (API key) · JSON: https://modelparams.dev/api/v1/models/openai/gpt-5.2.json

- `max_completion_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 16]
- `reasoning_effort` (enum, Reasoning) — Controls how much reasoning the model should perform before producing an answer. [default: "medium"] [values: "none", "low", "medium", "high", "xhigh"]

### openai/gpt-5.2-codex-subscription

OpenAI Gpt 5.2 Codex (Subscription) · JSON: https://modelparams.dev/api/v1/models/openai/gpt-5.2-codex-subscription.json

- `reasoning.effort` (enum, Reasoning) — Controls how much reasoning the model should perform before producing an answer. [default: "medium"] [values: "minimal", "low", "medium", "high", "xhigh"]
- `reasoning.summary` (enum, Reasoning) — Controls the level of reasoning summary returned with the response. [default: "auto"] [values: "auto", "concise", "detailed", "none"]
- `text.verbosity` (enum, Output) — Controls how concise or detailed the model's final text response should be. [default: "medium"] [values: "low", "medium", "high"]

### openai/gpt-5.2-subscription

OpenAI Gpt 5.2 (Subscription) · JSON: https://modelparams.dev/api/v1/models/openai/gpt-5.2-subscription.json

- `reasoning.effort` (enum, Reasoning) — Controls how much reasoning the model should perform before producing an answer. [default: "medium"] [values: "minimal", "low", "medium", "high", "xhigh"]
- `reasoning.summary` (enum, Reasoning) — Controls the level of reasoning summary returned with the response.
[default: "auto"] [values: "auto", "concise", "detailed", "none"]
- `text.verbosity` (enum, Output) — Controls how concise or detailed the model's final text response should be. [default: "medium"] [values: "low", "medium", "high"]

### openai/gpt-5.3-codex-spark-subscription

OpenAI Gpt 5.3 Codex Spark (Subscription) · JSON: https://modelparams.dev/api/v1/models/openai/gpt-5.3-codex-spark-subscription.json

- `reasoning.effort` (enum, Reasoning) — Controls how much reasoning the model should perform before producing an answer. [default: "medium"] [values: "minimal", "low", "medium", "high", "xhigh"]
- `reasoning.summary` (enum, Reasoning) — Controls the level of reasoning summary returned with the response. [default: "auto"] [values: "auto", "concise", "detailed", "none"]
- `text.verbosity` (enum, Output) — Controls how concise or detailed the model's final text response should be. [default: "medium"] [values: "low", "medium", "high"]

### openai/gpt-5.3-codex-subscription

OpenAI Gpt 5.3 Codex (Subscription) · JSON: https://modelparams.dev/api/v1/models/openai/gpt-5.3-codex-subscription.json

- `reasoning.effort` (enum, Reasoning) — Controls how much reasoning the model should perform before producing an answer. [default: "medium"] [values: "minimal", "low", "medium", "high", "xhigh"]
- `reasoning.summary` (enum, Reasoning) — Controls the level of reasoning summary returned with the response. [default: "auto"] [values: "auto", "concise", "detailed", "none"]
- `text.verbosity` (enum, Output) — Controls how concise or detailed the model's final text response should be. [default: "medium"] [values: "low", "medium", "high"]

### openai/gpt-5.4

OpenAI Gpt 5.4 (API key) · JSON: https://modelparams.dev/api/v1/models/openai/gpt-5.4.json

- `max_completion_tokens` (integer, Length) — Maximum number of output tokens the model may generate.
[default: 4096] [range: min 16]
- `reasoning_effort` (enum, Reasoning) — Controls how much reasoning the model should perform before producing an answer. [default: "medium"] [values: "none", "low", "medium", "high", "xhigh"]

### openai/gpt-5.4-mini

OpenAI Gpt 5.4 Mini (API key) · JSON: https://modelparams.dev/api/v1/models/openai/gpt-5.4-mini.json

- `max_completion_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 16]
- `reasoning_effort` (enum, Reasoning) — Controls how much reasoning the model should perform before producing an answer. [default: "medium"] [values: "none", "low", "medium", "high", "xhigh"]

### openai/gpt-5.4-mini-subscription

OpenAI Gpt 5.4 Mini (Subscription) · JSON: https://modelparams.dev/api/v1/models/openai/gpt-5.4-mini-subscription.json

- `reasoning.effort` (enum, Reasoning) — Controls how much reasoning the model should perform before producing an answer. [default: "medium"] [values: "minimal", "low", "medium", "high", "xhigh"]
- `reasoning.summary` (enum, Reasoning) — Controls the level of reasoning summary returned with the response. [default: "auto"] [values: "auto", "concise", "detailed", "none"]
- `text.verbosity` (enum, Output) — Controls how concise or detailed the model's final text response should be. [default: "medium"] [values: "low", "medium", "high"]

### openai/gpt-5.4-subscription

OpenAI Gpt 5.4 (Subscription) · JSON: https://modelparams.dev/api/v1/models/openai/gpt-5.4-subscription.json

- `reasoning.effort` (enum, Reasoning) — Controls how much reasoning the model should perform before producing an answer. [default: "medium"] [values: "minimal", "low", "medium", "high", "xhigh"]
- `reasoning.summary` (enum, Reasoning) — Controls the level of reasoning summary returned with the response. [default: "auto"] [values: "auto", "concise", "detailed", "none"]
- `text.verbosity` (enum, Output) — Controls how concise or detailed the model's final text response should be.
[default: "medium"] [values: "low", "medium", "high"]

### openai/gpt-5.5

OpenAI Gpt 5.5 (API key) · JSON: https://modelparams.dev/api/v1/models/openai/gpt-5.5.json

- `max_completion_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 16]
- `reasoning_effort` (enum, Reasoning) — Controls how much reasoning the model should perform before producing an answer. [default: "medium"] [values: "none", "low", "medium", "high", "xhigh"]

### openai/gpt-5.5-subscription

OpenAI Gpt 5.5 (Subscription) · JSON: https://modelparams.dev/api/v1/models/openai/gpt-5.5-subscription.json

- `reasoning.effort` (enum, Reasoning) — Controls how much reasoning the model should perform before producing an answer. [default: "medium"] [values: "minimal", "low", "medium", "high", "xhigh"]
- `reasoning.summary` (enum, Reasoning) — Controls the level of reasoning summary returned with the response. [default: "auto"] [values: "auto", "concise", "detailed", "none"]
- `text.verbosity` (enum, Output) — Controls how concise or detailed the model's final text response should be. [default: "medium"] [values: "low", "medium", "high"]

### openai/o1

OpenAI o1 (API key) · JSON: https://modelparams.dev/api/v1/models/openai/o1.json

- `max_completion_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 16]
- `reasoning_effort` (enum, Reasoning) — Controls how much reasoning the model should perform before producing an answer. [default: "medium"] [values: "low", "medium", "high", "xhigh"]

### openai/o1-mini

OpenAI o1-mini (API key) · JSON: https://modelparams.dev/api/v1/models/openai/o1-mini.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `reasoning_effort` (enum, Reasoning) — Controls how much reasoning the model should perform before producing an answer.
[default: "medium"] [values: "minimal", "low", "medium", "high"]

### openai/o1-preview

OpenAI O1 Preview (API key) · JSON: https://modelparams.dev/api/v1/models/openai/o1-preview.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `reasoning_effort` (enum, Reasoning) — Controls how much reasoning the model should perform before producing an answer. [default: "medium"] [values: "minimal", "low", "medium", "high"]

### openai/o3

OpenAI o3 (API key) · JSON: https://modelparams.dev/api/v1/models/openai/o3.json

- `max_completion_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 16]
- `reasoning_effort` (enum, Reasoning) — Controls how much reasoning the model should perform before producing an answer. [default: "medium"] [values: "low", "medium", "high", "xhigh"]

### openai/o3-mini

OpenAI o3-mini (API key) · JSON: https://modelparams.dev/api/v1/models/openai/o3-mini.json

- `max_completion_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 16]
- `reasoning_effort` (enum, Reasoning) — Controls how much reasoning the model should perform before producing an answer. [default: "medium"] [values: "low", "medium", "high", "xhigh"]

### openai/o4-mini

OpenAI o4-mini (API key) · JSON: https://modelparams.dev/api/v1/models/openai/o4-mini.json

- `max_completion_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 16]
- `reasoning_effort` (enum, Reasoning) — Controls how much reasoning the model should perform before producing an answer.
[default: "medium"] [values: "low", "medium", "high", "xhigh"]

## Anthropic

### anthropic/claude-3-5-haiku-20241022

Anthropic Claude 3.5 Haiku 20241022 (API key) · JSON: https://modelparams.dev/api/v1/models/anthropic/claude-3-5-haiku-20241022.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 1, step 0.1] [except when thinking.type ∈ {"adaptive", "enabled"}]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01] [except when thinking.type ∈ {"adaptive", "enabled"}; temperature ≠ 1]
- `top_k` (integer, Sampling) — Limits token sampling to the top K most likely next tokens. [default: 0] [range: min 0] [except when thinking.type ∈ {"adaptive", "enabled"}]

### anthropic/claude-3-5-haiku-latest

Anthropic Claude 3.5 Haiku Latest (API key) · JSON: https://modelparams.dev/api/v1/models/anthropic/claude-3-5-haiku-latest.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 1, step 0.1] [except when thinking.type ∈ {"adaptive", "enabled"}]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01] [except when thinking.type ∈ {"adaptive", "enabled"}; temperature ≠ 1]
- `top_k` (integer, Sampling) — Limits token sampling to the top K most likely next tokens.
[default: 0] [range: min 0] [except when thinking.type ∈ {"adaptive", "enabled"}]

### anthropic/claude-3-5-sonnet-20241022

Anthropic Claude 3.5 Sonnet 20241022 (API key) · JSON: https://modelparams.dev/api/v1/models/anthropic/claude-3-5-sonnet-20241022.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 1, step 0.1] [except when thinking.type ∈ {"adaptive", "enabled"}]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01] [except when thinking.type ∈ {"adaptive", "enabled"}; temperature ≠ 1]
- `top_k` (integer, Sampling) — Limits token sampling to the top K most likely next tokens. [default: 0] [range: min 0] [except when thinking.type ∈ {"adaptive", "enabled"}]

### anthropic/claude-3-5-sonnet-latest

Anthropic Claude 3.5 Sonnet Latest (API key) · JSON: https://modelparams.dev/api/v1/models/anthropic/claude-3-5-sonnet-latest.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 1, step 0.1] [except when thinking.type ∈ {"adaptive", "enabled"}]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01] [except when thinking.type ∈ {"adaptive", "enabled"}; temperature ≠ 1]
- `top_k` (integer, Sampling) — Limits token sampling to the top K most likely next tokens.
[default: 0] [range: min 0] [except when thinking.type ∈ {"adaptive", "enabled"}]

### anthropic/claude-3-7-sonnet-20250219

Anthropic Claude 3.7 Sonnet 20250219 (API key) · JSON: https://modelparams.dev/api/v1/models/anthropic/claude-3-7-sonnet-20250219.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 1, step 0.1] [except when thinking.type ∈ {"adaptive", "enabled"}]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01] [except when thinking.type ∈ {"adaptive", "enabled"}; temperature ≠ 1]
- `top_k` (integer, Sampling) — Limits token sampling to the top K most likely next tokens. [default: 0] [range: min 0] [except when thinking.type ∈ {"adaptive", "enabled"}]
- `thinking.type` (enum, Reasoning) — Controls the Anthropic thinking mode values supported by this model. [default: "disabled"] [values: "disabled", "enabled"]
- `thinking.budget_tokens` (integer, Reasoning) — Maximum token budget Anthropic may use for extended thinking before producing the final answer. [default: 4096] [range: min 1024] [only when thinking.type = "enabled"]

### anthropic/claude-3-7-sonnet-latest

Anthropic Claude 3.7 Sonnet Latest (API key) · JSON: https://modelparams.dev/api/v1/models/anthropic/claude-3-7-sonnet-latest.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied.
[default: 1] [range: min 0, max 1, step 0.1] [except when thinking.type ∈ {"adaptive", "enabled"}]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01] [except when thinking.type ∈ {"adaptive", "enabled"}; temperature ≠ 1]
- `top_k` (integer, Sampling) — Limits token sampling to the top K most likely next tokens. [default: 0] [range: min 0] [except when thinking.type ∈ {"adaptive", "enabled"}]
- `thinking.type` (enum, Reasoning) — Controls the Anthropic thinking mode values supported by this model. [default: "disabled"] [values: "disabled", "enabled"]
- `thinking.budget_tokens` (integer, Reasoning) — Maximum token budget Anthropic may use for extended thinking before producing the final answer. [default: 4096] [range: min 1024] [only when thinking.type = "enabled"]

### anthropic/claude-3-opus-20240229

Anthropic Claude 3 Opus 20240229 (API key) · JSON: https://modelparams.dev/api/v1/models/anthropic/claude-3-opus-20240229.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 1, step 0.1] [except when thinking.type ∈ {"adaptive", "enabled"}]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01] [except when thinking.type ∈ {"adaptive", "enabled"}; temperature ≠ 1]
- `top_k` (integer, Sampling) — Limits token sampling to the top K most likely next tokens.
[default: 0] [range: min 0] [except when thinking.type ∈ {"adaptive", "enabled"}]

### anthropic/claude-3-opus-latest

Anthropic Claude 3 Opus Latest (API key) · JSON: https://modelparams.dev/api/v1/models/anthropic/claude-3-opus-latest.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 1, step 0.1] [except when thinking.type ∈ {"adaptive", "enabled"}]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01] [except when thinking.type ∈ {"adaptive", "enabled"}; temperature ≠ 1]
- `top_k` (integer, Sampling) — Limits token sampling to the top K most likely next tokens. [default: 0] [range: min 0] [except when thinking.type ∈ {"adaptive", "enabled"}]

### anthropic/claude-haiku-4

Anthropic Claude Haiku 4 (API key) · JSON: https://modelparams.dev/api/v1/models/anthropic/claude-haiku-4.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 1, step 0.1] [except when thinking.type ∈ {"adaptive", "enabled"}]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01] [except when thinking.type ∈ {"adaptive", "enabled"}; temperature ≠ 1]
- `top_k` (integer, Sampling) — Limits token sampling to the top K most likely next tokens.
[default: 0] [range: min 0] [except when thinking.type ∈ {"adaptive", "enabled"}]
- `thinking.type` (enum, Reasoning) — Controls the Anthropic thinking mode values supported by this model. [default: "disabled"] [values: "disabled", "enabled"]
- `thinking.budget_tokens` (integer, Reasoning) — Maximum token budget Anthropic may use for extended thinking before producing the final answer. [default: 4096] [range: min 1024] [only when thinking.type = "enabled"]

### anthropic/claude-haiku-4-5

Anthropic Claude Haiku 4.5 (API key) · JSON: https://modelparams.dev/api/v1/models/anthropic/claude-haiku-4-5.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 1, step 0.1] [except when thinking.type ∈ {"adaptive", "enabled"}]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01] [except when thinking.type ∈ {"adaptive", "enabled"}; temperature ≠ 1]
- `top_k` (integer, Sampling) — Limits token sampling to the top K most likely next tokens. [default: 0] [range: min 0] [except when thinking.type ∈ {"adaptive", "enabled"}]
- `thinking.type` (enum, Reasoning) — Controls the Anthropic thinking mode values supported by this model. [default: "disabled"] [values: "disabled", "enabled"]
- `thinking.budget_tokens` (integer, Reasoning) — Maximum token budget Anthropic may use for extended thinking before producing the final answer.
[default: 4096] [range: min 1024] [only when thinking.type = "enabled"]

### anthropic/claude-haiku-4-5-20251001

Anthropic Claude Haiku 4.5 20251001 (API key) · JSON: https://modelparams.dev/api/v1/models/anthropic/claude-haiku-4-5-20251001.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 1, step 0.1] [except when thinking.type = "enabled"; top_p ≠ null]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01] [except when thinking.type = "enabled"; temperature ≠ null]
- `top_k` (integer, Sampling) — Limits token sampling to the top K most likely next tokens. [default: 0] [range: min 0] [except when thinking.type = "enabled"]
- `thinking.type` (enum, Reasoning) — Controls the Anthropic thinking mode values supported by this model. [default: "disabled"] [values: "disabled", "enabled"]
- `thinking.budget_tokens` (integer, Reasoning) — Maximum token budget Anthropic may use for extended thinking before producing the final answer. [default: 4096] [range: min 1024] [only when thinking.type = "enabled"]

### anthropic/claude-haiku-4-subscription

Anthropic Claude Haiku 4 (Subscription) · JSON: https://modelparams.dev/api/v1/models/anthropic/claude-haiku-4-subscription.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied.
[default: 1] [range: min 0, max 1, step 0.1] [except when thinking.type ∈ {"adaptive", "enabled"}] - `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01] [except when thinking.type ∈ {"adaptive", "enabled"}; temperature ≠ 1] - `top_k` (integer, Sampling) — Limits token sampling to the top K most likely next tokens. [default: 0] [range: min 0] [except when thinking.type ∈ {"adaptive", "enabled"}] - `thinking.type` (enum, Reasoning) — Controls the Anthropic thinking mode values supported by this model. [default: "disabled"] [values: "disabled", "enabled"] - `thinking.budget_tokens` (integer, Reasoning) — Maximum token budget Anthropic may use for extended thinking before producing the final answer. [default: 4096] [range: min 1024] [only when thinking.type = "enabled"] ### anthropic/claude-opus-4-1-20250805 Anthropic Claude Opus 4.1 20250805 (API key) · JSON: https://modelparams.dev/api/v1/models/anthropic/claude-opus-4-1-20250805.json - `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1] - `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 1, step 0.1] [except when thinking.type = "enabled"; top_p ≠ null] - `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01] [except when thinking.type = "enabled"; temperature ≠ null] - `top_k` (integer, Sampling) — Limits token sampling to the top K most likely next tokens. [default: 0] [range: min 0] [except when thinking.type = "enabled"] - `thinking.type` (enum, Reasoning) — Controls the Anthropic thinking mode values supported by this model. 
[default: "disabled"] [values: "disabled", "enabled"] - `thinking.budget_tokens` (integer, Reasoning) — Maximum token budget Anthropic may use for extended thinking before producing the final answer. [default: 4096] [range: min 1024] [only when thinking.type = "enabled"] - `thinking.display` (enum, Reasoning) — Controls whether Anthropic returns summarized or omitted thinking content. [default: "summarized"] [values: "summarized", "omitted"] [only when thinking.type = "enabled"] ### anthropic/claude-opus-4-20250514 Anthropic Claude Opus 4 20250514 (API key) · JSON: https://modelparams.dev/api/v1/models/anthropic/claude-opus-4-20250514.json - `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1] - `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 1, step 0.1] [except when thinking.type = "enabled"] - `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01] [except when thinking.type = "enabled"] - `top_k` (integer, Sampling) — Limits token sampling to the top K most likely next tokens. [default: 0] [range: min 0] [except when thinking.type = "enabled"] - `thinking.type` (enum, Reasoning) — Controls the Anthropic thinking mode values supported by this model. [default: "disabled"] [values: "disabled", "enabled"] - `thinking.budget_tokens` (integer, Reasoning) — Maximum token budget Anthropic may use for extended thinking before producing the final answer. [default: 4096] [range: min 1024] [only when thinking.type = "enabled"] - `thinking.display` (enum, Reasoning) — Controls whether Anthropic returns summarized or omitted thinking content.
[default: "summarized"] [values: "summarized", "omitted"] [only when thinking.type = "enabled"] ### anthropic/claude-opus-4-5-20251101 Anthropic Claude Opus 4.5 20251101 (API key) · JSON: https://modelparams.dev/api/v1/models/anthropic/claude-opus-4-5-20251101.json - `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1] - `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 1, step 0.1] [except when thinking.type = "enabled"; top_p ≠ null] - `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01] [except when thinking.type = "enabled"; temperature ≠ null] - `top_k` (integer, Sampling) — Limits token sampling to the top K most likely next tokens. [default: 0] [range: min 0] [except when thinking.type = "enabled"] - `thinking.type` (enum, Reasoning) — Controls the Anthropic thinking mode values supported by this model. [default: "disabled"] [values: "disabled", "enabled"] - `thinking.budget_tokens` (integer, Reasoning) — Maximum token budget Anthropic may use for extended thinking before producing the final answer. [default: 4096] [range: min 1024] [only when thinking.type = "enabled"] - `thinking.display` (enum, Reasoning) — Controls whether Anthropic returns summarized or omitted thinking content. [default: "summarized"] [values: "summarized", "omitted"] [only when thinking.type = "enabled"] - `output_config.effort` (enum, Reasoning) — Controls Anthropic response thoroughness and token spend. 
[default: "high"] [values: "low", "medium", "high"] ### anthropic/claude-opus-4-6 Anthropic Claude Opus 4.6 (API key) · JSON: https://modelparams.dev/api/v1/models/anthropic/claude-opus-4-6.json - `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1] - `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 1, step 0.1] [except when thinking.type ∈ {"enabled", "adaptive"}; top_p ≠ null] - `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01] [except when thinking.type ∈ {"enabled", "adaptive"}; temperature ≠ null] - `top_k` (integer, Sampling) — Limits token sampling to the top K most likely next tokens. [default: 0] [range: min 0] [except when thinking.type ∈ {"enabled", "adaptive"}] - `thinking.type` (enum, Reasoning) — Controls the Anthropic thinking mode values supported by this model. [default: "disabled"] [values: "disabled", "adaptive", "enabled"] - `thinking.budget_tokens` (integer, Reasoning) — Maximum token budget Anthropic may use for extended thinking before producing the final answer. [default: 4096] [range: min 1024] [only when thinking.type = "enabled"] - `thinking.display` (enum, Reasoning) — Controls whether Anthropic returns summarized or omitted thinking content. [default: "summarized"] [values: "summarized", "omitted"] [only when thinking.type ∈ {"adaptive", "enabled"}] - `output_config.effort` (enum, Reasoning) — Controls Anthropic response thoroughness and token spend. 
[default: "high"] [values: "low", "medium", "high", "max"] ### anthropic/claude-opus-4-7 Anthropic Claude Opus 4.7 (API key) · JSON: https://modelparams.dev/api/v1/models/anthropic/claude-opus-4-7.json - `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1] - `thinking.type` (enum, Reasoning) — Controls the Anthropic thinking mode values supported by this model. [default: "disabled"] [values: "disabled", "adaptive"] - `thinking.display` (enum, Reasoning) — Controls whether Anthropic returns summarized or omitted thinking content. [default: "omitted"] [values: "summarized", "omitted"] [only when thinking.type = "adaptive"] - `output_config.effort` (enum, Reasoning) — Controls Anthropic response thoroughness and token spend. [default: "high"] [values: "low", "medium", "high", "xhigh", "max"] ### anthropic/claude-opus-4-subscription Anthropic Claude Opus 4 (Subscription) · JSON: https://modelparams.dev/api/v1/models/anthropic/claude-opus-4-subscription.json - `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1] - `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 1, step 0.1] [except when thinking.type ∈ {"adaptive", "enabled"}] - `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01] [except when thinking.type ∈ {"adaptive", "enabled"}; temperature ≠ 1] - `top_k` (integer, Sampling) — Limits token sampling to the top K most likely next tokens. [default: 0] [range: min 0] [except when thinking.type ∈ {"adaptive", "enabled"}] - `thinking.type` (enum, Reasoning) — Controls the Anthropic thinking mode values supported by this model. 
[default: "disabled"] [values: "disabled", "adaptive", "enabled"] - `thinking.budget_tokens` (integer, Reasoning) — Maximum token budget Anthropic may use for extended thinking before producing the final answer. [default: 4096] [range: min 1024] [only when thinking.type = "enabled"] ### anthropic/claude-sonnet-4-20250514 Anthropic Claude Sonnet 4 20250514 (API key) · JSON: https://modelparams.dev/api/v1/models/anthropic/claude-sonnet-4-20250514.json - `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1] - `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 1, step 0.1] [except when thinking.type = "enabled"] - `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01] [except when thinking.type = "enabled"] - `top_k` (integer, Sampling) — Limits token sampling to the top K most likely next tokens. [default: 0] [range: min 0] [except when thinking.type = "enabled"] - `thinking.type` (enum, Reasoning) — Controls the Anthropic thinking mode values supported by this model. [default: "disabled"] [values: "disabled", "enabled"] - `thinking.budget_tokens` (integer, Reasoning) — Maximum token budget Anthropic may use for extended thinking before producing the final answer. [default: 4096] [range: min 1024] [only when thinking.type = "enabled"] - `thinking.display` (enum, Reasoning) — Controls whether Anthropic returns summarized or omitted thinking content.
[default: "summarized"] [values: "summarized", "omitted"] [only when thinking.type = "enabled"] ### anthropic/claude-sonnet-4-5 Anthropic Claude Sonnet 4.5 (API key) · JSON: https://modelparams.dev/api/v1/models/anthropic/claude-sonnet-4-5.json - `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1] - `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 1, step 0.1] [except when thinking.type ∈ {"adaptive", "enabled"}] - `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01] [except when thinking.type ∈ {"adaptive", "enabled"}; temperature ≠ 1] - `top_k` (integer, Sampling) — Limits token sampling to the top K most likely next tokens. [default: 0] [range: min 0] [except when thinking.type ∈ {"adaptive", "enabled"}] - `thinking.type` (enum, Reasoning) — Controls the Anthropic thinking mode values supported by this model. [default: "disabled"] [values: "disabled", "adaptive", "enabled"] - `thinking.budget_tokens` (integer, Reasoning) — Maximum token budget Anthropic may use for extended thinking before producing the final answer. [default: 4096] [range: min 1024] [only when thinking.type = "enabled"] ### anthropic/claude-sonnet-4-5-20250929 Anthropic Claude Sonnet 4.5 20250929 (API key) · JSON: https://modelparams.dev/api/v1/models/anthropic/claude-sonnet-4-5-20250929.json - `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1] - `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. 
[default: 1] [range: min 0, max 1, step 0.1] [except when thinking.type = "enabled"; top_p ≠ null] - `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01] [except when thinking.type = "enabled"; temperature ≠ null] - `top_k` (integer, Sampling) — Limits token sampling to the top K most likely next tokens. [default: 0] [range: min 0] [except when thinking.type = "enabled"] - `thinking.type` (enum, Reasoning) — Controls the Anthropic thinking mode values supported by this model. [default: "disabled"] [values: "disabled", "enabled"] - `thinking.budget_tokens` (integer, Reasoning) — Maximum token budget Anthropic may use for extended thinking before producing the final answer. [default: 4096] [range: min 1024] [only when thinking.type = "enabled"] ### anthropic/claude-sonnet-4-6 Anthropic Claude Sonnet 4.6 (API key) · JSON: https://modelparams.dev/api/v1/models/anthropic/claude-sonnet-4-6.json - `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1] - `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 1, step 0.1] [except when thinking.type ∈ {"enabled", "adaptive"}; top_p ≠ null] - `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01] [except when thinking.type ∈ {"enabled", "adaptive"}; temperature ≠ null] - `top_k` (integer, Sampling) — Limits token sampling to the top K most likely next tokens. [default: 0] [range: min 0] [except when thinking.type ∈ {"enabled", "adaptive"}] - `thinking.type` (enum, Reasoning) — Controls the Anthropic thinking mode values supported by this model. 
[default: "disabled"] [values: "disabled", "adaptive", "enabled"] - `thinking.budget_tokens` (integer, Reasoning) — Maximum token budget Anthropic may use for extended thinking before producing the final answer. [default: 4096] [range: min 1024] [only when thinking.type = "enabled"] - `thinking.display` (enum, Reasoning) — Controls whether Anthropic returns summarized or omitted thinking content. [default: "summarized"] [values: "summarized", "omitted"] [only when thinking.type ∈ {"adaptive", "enabled"}] - `output_config.effort` (enum, Reasoning) — Controls Anthropic response thoroughness and token spend. [default: "high"] [values: "low", "medium", "high", "max"] ### anthropic/claude-sonnet-4-subscription Anthropic Claude Sonnet 4 (Subscription) · JSON: https://modelparams.dev/api/v1/models/anthropic/claude-sonnet-4-subscription.json - `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1] - `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 1, step 0.1] [except when thinking.type ∈ {"adaptive", "enabled"}] - `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value. [default: 1] [range: min 0, max 1, step 0.01] [except when thinking.type ∈ {"adaptive", "enabled"}; temperature ≠ 1] - `top_k` (integer, Sampling) — Limits token sampling to the top K most likely next tokens. [default: 0] [range: min 0] [except when thinking.type ∈ {"adaptive", "enabled"}] - `thinking.type` (enum, Reasoning) — Controls the Anthropic thinking mode values supported by this model. [default: "disabled"] [values: "disabled", "adaptive", "enabled"] - `thinking.budget_tokens` (integer, Reasoning) — Maximum token budget Anthropic may use for extended thinking before producing the final answer. 
[default: 4096] [range: min 1024] [only when thinking.type = "enabled"] ## Mistral ### mistral/codestral-latest Mistral Codestral Latest (API key) · JSON: https://modelparams.dev/api/v1/models/mistral/codestral-latest.json - `max_tokens` (integer, Length) — Maximum number of tokens to generate in the completion. [range: min 1] - `stop` (string, Length) — Stops generation when this string is detected. - `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [range: min 0, max 1.5, step 0.1] - `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens within the selected cumulative probability. [default: 1] [range: min 0, max 1, step 0.01] - `random_seed` (integer, Sampling) — Seed used for deterministic sampling when reproducible outputs are desired. [range: min 0] - `presence_penalty` (number, Sampling) — Penalizes repeated words or phrases to encourage a wider variety of generated content. [default: 0] [range: min -2, max 2, step 0.1] - `frequency_penalty` (number, Sampling) — Penalizes words based on how often they already appear in the generated text. [default: 0] [range: min -2, max 2, step 0.1] - `response_format.type` (enum, Output) — Controls whether the model returns normal text or JSON mode output. [default: "text"] [values: "text", "json_object"] - `safe_prompt` (boolean, Metadata) — Controls whether Mistral injects its safety prompt before the conversation. [default: false] ### mistral/devstral-2512 Mistral Devstral 2512 (API key) · JSON: https://modelparams.dev/api/v1/models/mistral/devstral-2512.json - `max_tokens` (integer, Length) — Maximum number of tokens to generate in the completion. [range: min 1] - `stop` (string, Length) — Stops generation when this string is detected. - `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. 
[range: min 0, max 1.5, step 0.1] - `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens within the selected cumulative probability. [default: 1] [range: min 0, max 1, step 0.01] - `random_seed` (integer, Sampling) — Seed used for deterministic sampling when reproducible outputs are desired. [range: min 0] - `presence_penalty` (number, Sampling) — Penalizes repeated words or phrases to encourage a wider variety of generated content. [default: 0] [range: min -2, max 2, step 0.1] - `frequency_penalty` (number, Sampling) — Penalizes words based on how often they already appear in the generated text. [default: 0] [range: min -2, max 2, step 0.1] - `response_format.type` (enum, Output) — Controls whether the model returns normal text or JSON mode output. [default: "text"] [values: "text", "json_object"] - `safe_prompt` (boolean, Metadata) — Controls whether Mistral injects its safety prompt before the conversation. [default: false] ### mistral/devstral-latest Mistral Devstral Latest (API key) · JSON: https://modelparams.dev/api/v1/models/mistral/devstral-latest.json - `max_tokens` (integer, Length) — Maximum number of tokens to generate in the completion. [range: min 1] - `stop` (string, Length) — Stops generation when this string is detected. - `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [range: min 0, max 1.5, step 0.1] - `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens within the selected cumulative probability. [default: 1] [range: min 0, max 1, step 0.01] - `random_seed` (integer, Sampling) — Seed used for deterministic sampling when reproducible outputs are desired. [range: min 0] - `presence_penalty` (number, Sampling) — Penalizes repeated words or phrases to encourage a wider variety of generated content. 
[default: 0] [range: min -2, max 2, step 0.1] - `frequency_penalty` (number, Sampling) — Penalizes words based on how often they already appear in the generated text. [default: 0] [range: min -2, max 2, step 0.1] - `response_format.type` (enum, Output) — Controls whether the model returns normal text or JSON mode output. [default: "text"] [values: "text", "json_object"] - `safe_prompt` (boolean, Metadata) — Controls whether Mistral injects its safety prompt before the conversation. [default: false] ### mistral/magistral-medium-latest Mistral Magistral Medium Latest (API key) · JSON: https://modelparams.dev/api/v1/models/mistral/magistral-medium-latest.json - `max_tokens` (integer, Length) — Maximum number of tokens to generate in the completion. [range: min 1] - `stop` (string, Length) — Stops generation when this string is detected. - `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [range: min 0, max 1.5, step 0.1] - `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens within the selected cumulative probability. [default: 1] [range: min 0, max 1, step 0.01] - `random_seed` (integer, Sampling) — Seed used for deterministic sampling when reproducible outputs are desired. [range: min 0] - `presence_penalty` (number, Sampling) — Penalizes repeated words or phrases to encourage a wider variety of generated content. [default: 0] [range: min -2, max 2, step 0.1] - `frequency_penalty` (number, Sampling) — Penalizes words based on how often they already appear in the generated text. [default: 0] [range: min -2, max 2, step 0.1] - `response_format.type` (enum, Output) — Controls whether the model returns normal text or JSON mode output. [default: "text"] [values: "text", "json_object"] - `safe_prompt` (boolean, Metadata) — Controls whether Mistral injects its safety prompt before the conversation. 
[default: false] ### mistral/magistral-small-latest Mistral Magistral Small Latest (API key) · JSON: https://modelparams.dev/api/v1/models/mistral/magistral-small-latest.json - `max_tokens` (integer, Length) — Maximum number of tokens to generate in the completion. [range: min 1] - `stop` (string, Length) — Stops generation when this string is detected. - `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [range: min 0, max 1.5, step 0.1] - `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens within the selected cumulative probability. [default: 1] [range: min 0, max 1, step 0.01] - `random_seed` (integer, Sampling) — Seed used for deterministic sampling when reproducible outputs are desired. [range: min 0] - `presence_penalty` (number, Sampling) — Penalizes repeated words or phrases to encourage a wider variety of generated content. [default: 0] [range: min -2, max 2, step 0.1] - `frequency_penalty` (number, Sampling) — Penalizes words based on how often they already appear in the generated text. [default: 0] [range: min -2, max 2, step 0.1] - `response_format.type` (enum, Output) — Controls whether the model returns normal text or JSON mode output. [default: "text"] [values: "text", "json_object"] - `safe_prompt` (boolean, Metadata) — Controls whether Mistral injects its safety prompt before the conversation. [default: false] ### mistral/ministral-14b-latest Mistral Ministral 14b Latest (API key) · JSON: https://modelparams.dev/api/v1/models/mistral/ministral-14b-latest.json - `max_tokens` (integer, Length) — Maximum number of tokens to generate in the completion. [range: min 1] - `stop` (string, Length) — Stops generation when this string is detected. - `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. 
[range: min 0, max 1.5, step 0.1] - `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens within the selected cumulative probability. [default: 1] [range: min 0, max 1, step 0.01] - `random_seed` (integer, Sampling) — Seed used for deterministic sampling when reproducible outputs are desired. [range: min 0] - `presence_penalty` (number, Sampling) — Penalizes repeated words or phrases to encourage a wider variety of generated content. [default: 0] [range: min -2, max 2, step 0.1] - `frequency_penalty` (number, Sampling) — Penalizes words based on how often they already appear in the generated text. [default: 0] [range: min -2, max 2, step 0.1] - `response_format.type` (enum, Output) — Controls whether the model returns normal text or JSON mode output. [default: "text"] [values: "text", "json_object"] - `safe_prompt` (boolean, Metadata) — Controls whether Mistral injects its safety prompt before the conversation. [default: false] ### mistral/ministral-3b-latest Mistral Ministral 3b Latest (API key) · JSON: https://modelparams.dev/api/v1/models/mistral/ministral-3b-latest.json - `max_tokens` (integer, Length) — Maximum number of tokens to generate in the completion. [range: min 1] - `stop` (string, Length) — Stops generation when this string is detected. - `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [range: min 0, max 1.5, step 0.1] - `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens within the selected cumulative probability. [default: 1] [range: min 0, max 1, step 0.01] - `random_seed` (integer, Sampling) — Seed used for deterministic sampling when reproducible outputs are desired. [range: min 0] - `presence_penalty` (number, Sampling) — Penalizes repeated words or phrases to encourage a wider variety of generated content. 
[default: 0] [range: min -2, max 2, step 0.1] - `frequency_penalty` (number, Sampling) — Penalizes words based on how often they already appear in the generated text. [default: 0] [range: min -2, max 2, step 0.1] - `response_format.type` (enum, Output) — Controls whether the model returns normal text or JSON mode output. [default: "text"] [values: "text", "json_object"] - `safe_prompt` (boolean, Metadata) — Controls whether Mistral injects its safety prompt before the conversation. [default: false] ### mistral/ministral-8b-latest Mistral Ministral 8b Latest (API key) · JSON: https://modelparams.dev/api/v1/models/mistral/ministral-8b-latest.json - `max_tokens` (integer, Length) — Maximum number of tokens to generate in the completion. [range: min 1] - `stop` (string, Length) — Stops generation when this string is detected. - `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [range: min 0, max 1.5, step 0.1] - `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens within the selected cumulative probability. [default: 1] [range: min 0, max 1, step 0.01] - `random_seed` (integer, Sampling) — Seed used for deterministic sampling when reproducible outputs are desired. [range: min 0] - `presence_penalty` (number, Sampling) — Penalizes repeated words or phrases to encourage a wider variety of generated content. [default: 0] [range: min -2, max 2, step 0.1] - `frequency_penalty` (number, Sampling) — Penalizes words based on how often they already appear in the generated text. [default: 0] [range: min -2, max 2, step 0.1] - `response_format.type` (enum, Output) — Controls whether the model returns normal text or JSON mode output. [default: "text"] [values: "text", "json_object"] - `safe_prompt` (boolean, Metadata) — Controls whether Mistral injects its safety prompt before the conversation. 
[default: false] ### mistral/mistral-large-latest Mistral Mistral Large Latest (API key) · JSON: https://modelparams.dev/api/v1/models/mistral/mistral-large-latest.json - `max_tokens` (integer, Length) — Maximum number of tokens to generate in the completion. [range: min 1] - `stop` (string, Length) — Stops generation when this string is detected. - `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [range: min 0, max 1.5, step 0.1] - `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens within the selected cumulative probability. [default: 1] [range: min 0, max 1, step 0.01] - `random_seed` (integer, Sampling) — Seed used for deterministic sampling when reproducible outputs are desired. [range: min 0] - `presence_penalty` (number, Sampling) — Penalizes repeated words or phrases to encourage a wider variety of generated content. [default: 0] [range: min -2, max 2, step 0.1] - `frequency_penalty` (number, Sampling) — Penalizes words based on how often they already appear in the generated text. [default: 0] [range: min -2, max 2, step 0.1] - `response_format.type` (enum, Output) — Controls whether the model returns normal text or JSON mode output. [default: "text"] [values: "text", "json_object"] - `safe_prompt` (boolean, Metadata) — Controls whether Mistral injects its safety prompt before the conversation. [default: false] ### mistral/mistral-medium-3.5 Mistral Mistral Medium 3.5 (API key) · JSON: https://modelparams.dev/api/v1/models/mistral/mistral-medium-3.5.json - `max_tokens` (integer, Length) — Maximum number of tokens to generate in the completion. [range: min 1] - `stop` (string, Length) — Stops generation when this string is detected. - `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. 
[range: min 0, max 1.5, step 0.1] - `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens within the selected cumulative probability. [default: 1] [range: min 0, max 1, step 0.01] - `random_seed` (integer, Sampling) — Seed used for deterministic sampling when reproducible outputs are desired. [range: min 0] - `presence_penalty` (number, Sampling) — Penalizes repeated words or phrases to encourage a wider variety of generated content. [default: 0] [range: min -2, max 2, step 0.1] - `frequency_penalty` (number, Sampling) — Penalizes words based on how often they already appear in the generated text. [default: 0] [range: min -2, max 2, step 0.1] - `response_format.type` (enum, Output) — Controls whether the model returns normal text or JSON mode output. [default: "text"] [values: "text", "json_object"] - `safe_prompt` (boolean, Metadata) — Controls whether Mistral injects its safety prompt before the conversation. [default: false] ### mistral/mistral-medium-latest Mistral Mistral Medium Latest (API key) · JSON: https://modelparams.dev/api/v1/models/mistral/mistral-medium-latest.json - `max_tokens` (integer, Length) — Maximum number of tokens to generate in the completion. [range: min 1] - `stop` (string, Length) — Stops generation when this string is detected. - `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [range: min 0, max 1.5, step 0.1] - `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens within the selected cumulative probability. [default: 1] [range: min 0, max 1, step 0.01] - `random_seed` (integer, Sampling) — Seed used for deterministic sampling when reproducible outputs are desired. [range: min 0] - `presence_penalty` (number, Sampling) — Penalizes repeated words or phrases to encourage a wider variety of generated content. 
[default: 0] [range: min -2, max 2, step 0.1]
- `frequency_penalty` (number, Sampling) — Penalizes words based on how often they already appear in the generated text. [default: 0] [range: min -2, max 2, step 0.1]
- `response_format.type` (enum, Output) — Controls whether the model returns normal text or JSON mode output. [default: "text"] [values: "text", "json_object"]
- `safe_prompt` (boolean, Metadata) — Controls whether Mistral injects its safety prompt before the conversation. [default: false]

### mistral/mistral-small-latest

Mistral Mistral Small Latest (API key) · JSON: https://modelparams.dev/api/v1/models/mistral/mistral-small-latest.json

- `max_tokens` (integer, Length) — Maximum number of tokens to generate in the completion. [range: min 1]
- `stop` (string, Length) — Stops generation when this string is detected.
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [range: min 0, max 1.5, step 0.1]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens within the selected cumulative probability. [default: 1] [range: min 0, max 1, step 0.01]
- `random_seed` (integer, Sampling) — Seed used for deterministic sampling when reproducible outputs are desired. [range: min 0]
- `presence_penalty` (number, Sampling) — Penalizes repeated words or phrases to encourage a wider variety of generated content. [default: 0] [range: min -2, max 2, step 0.1]
- `frequency_penalty` (number, Sampling) — Penalizes words based on how often they already appear in the generated text. [default: 0] [range: min -2, max 2, step 0.1]
- `response_format.type` (enum, Output) — Controls whether the model returns normal text or JSON mode output. [default: "text"] [values: "text", "json_object"]
- `safe_prompt` (boolean, Metadata) — Controls whether Mistral injects its safety prompt before the conversation.
[default: false]

### mistral/open-mistral-nemo

Mistral Open Mistral Nemo (API key) · JSON: https://modelparams.dev/api/v1/models/mistral/open-mistral-nemo.json

- `max_tokens` (integer, Length) — Maximum number of tokens to generate in the completion. [range: min 1]
- `stop` (string, Length) — Stops generation when this string is detected.
- `temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [range: min 0, max 1.5, step 0.1]
- `top_p` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens within the selected cumulative probability. [default: 1] [range: min 0, max 1, step 0.01]
- `random_seed` (integer, Sampling) — Seed used for deterministic sampling when reproducible outputs are desired. [range: min 0]
- `presence_penalty` (number, Sampling) — Penalizes repeated words or phrases to encourage a wider variety of generated content. [default: 0] [range: min -2, max 2, step 0.1]
- `frequency_penalty` (number, Sampling) — Penalizes words based on how often they already appear in the generated text. [default: 0] [range: min -2, max 2, step 0.1]
- `response_format.type` (enum, Output) — Controls whether the model returns normal text or JSON mode output. [default: "text"] [values: "text", "json_object"]
- `safe_prompt` (boolean, Metadata) — Controls whether Mistral injects its safety prompt before the conversation. [default: false]

## DeepSeek

### deepseek/deepseek-chat

DeepSeek Deepseek Chat (API key) · JSON: https://modelparams.dev/api/v1/models/deepseek/deepseek-chat.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. In DeepSeek thinking mode this parameter is accepted for compatibility but has no effect.
[default: 1] [range: min 0, max 2, step 0.1] [except when thinking.type = "enabled"]
- `top_p` (number, Sampling) — Controls nucleus sampling. In DeepSeek thinking mode this parameter is accepted for compatibility but has no effect. [default: 1] [range: min 0, max 1, step 0.01] [except when thinking.type = "enabled"]
- `thinking.type` (enum, Reasoning) — Controls whether DeepSeek uses thinking mode before producing the final answer. [default: "disabled"] [values: "disabled", "enabled"]

### deepseek/deepseek-reasoner

DeepSeek Deepseek Reasoner (API key) · JSON: https://modelparams.dev/api/v1/models/deepseek/deepseek-reasoner.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. In DeepSeek thinking mode this parameter is accepted for compatibility but has no effect. [default: 1] [range: min 0, max 2, step 0.1] [except when thinking.type = "enabled"]
- `top_p` (number, Sampling) — Controls nucleus sampling. In DeepSeek thinking mode this parameter is accepted for compatibility but has no effect. [default: 1] [range: min 0, max 1, step 0.01] [except when thinking.type = "enabled"]
- `thinking.type` (enum, Reasoning) — Controls whether DeepSeek uses thinking mode before producing the final answer. [default: "enabled"] [values: "enabled", "disabled"]
- `reasoning_effort` (enum, Reasoning) — Controls DeepSeek thinking effort when thinking mode is enabled. [default: "high"] [values: "high", "max"] [only when thinking.type = "enabled"]

### deepseek/deepseek-v4-flash

DeepSeek Deepseek V4 Flash (API key) · JSON: https://modelparams.dev/api/v1/models/deepseek/deepseek-v4-flash.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. In DeepSeek thinking mode this parameter is accepted for compatibility but has no effect.
[default: 1] [range: min 0, max 2, step 0.1] [except when thinking.type = "enabled"]
- `top_p` (number, Sampling) — Controls nucleus sampling. In DeepSeek thinking mode this parameter is accepted for compatibility but has no effect. [default: 1] [range: min 0, max 1, step 0.01] [except when thinking.type = "enabled"]
- `thinking.type` (enum, Reasoning) — Controls whether DeepSeek uses thinking mode before producing the final answer. [default: "enabled"] [values: "enabled", "disabled"]
- `reasoning_effort` (enum, Reasoning) — Controls DeepSeek thinking effort when thinking mode is enabled. [default: "high"] [values: "high", "max"] [only when thinking.type = "enabled"]

### deepseek/deepseek-v4-pro

DeepSeek Deepseek V4 Pro (API key) · JSON: https://modelparams.dev/api/v1/models/deepseek/deepseek-v4-pro.json

- `max_tokens` (integer, Length) — Maximum number of output tokens the model may generate. [default: 4096] [range: min 1]
- `temperature` (number, Sampling) — Controls randomness. In DeepSeek thinking mode this parameter is accepted for compatibility but has no effect. [default: 1] [range: min 0, max 2, step 0.1] [except when thinking.type = "enabled"]
- `top_p` (number, Sampling) — Controls nucleus sampling. In DeepSeek thinking mode this parameter is accepted for compatibility but has no effect. [default: 1] [range: min 0, max 1, step 0.01] [except when thinking.type = "enabled"]
- `thinking.type` (enum, Reasoning) — Controls whether DeepSeek uses thinking mode before producing the final answer. [default: "enabled"] [values: "enabled", "disabled"]
- `reasoning_effort` (enum, Reasoning) — Controls DeepSeek thinking effort when thinking mode is enabled.
[default: "high"] [values: "high", "max"] [only when thinking.type = "enabled"]

## Google

### google/gemini-2.5-flash

Google Gemini 2.5 Flash (API key) · JSON: https://modelparams.dev/api/v1/models/google/gemini-2.5-flash.json

- `generationConfig.maxOutputTokens` (integer, Length) — Maximum number of tokens to include in a response candidate. [range: min 1, max 65536]
- `generationConfig.temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 2, step 0.1]
- `generationConfig.topP` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens within the selected cumulative probability. [default: 0.95] [range: min 0, max 1, step 0.01]
- `generationConfig.topK` (integer, Sampling) — Limits token sampling to the top K most likely next tokens. [default: 64] [range: min 0]
- `generationConfig.seed` (integer, Sampling) — Optional seed used for decoding when reproducible sampling is desired.
- `generationConfig.thinkingConfig.thinkingBudget` (integer, Reasoning) — Number of thinking tokens Gemini should use; 0 disables thinking and -1 uses dynamic thinking. [default: -1] [range: min -1, max 24576]
- `generationConfig.thinkingConfig.includeThoughts` (boolean, Reasoning) — Controls whether Gemini returns available thought summaries in the response parts. [default: false]
- `generationConfig.responseMimeType` (enum, Output) — MIME type for generated text candidates. [default: "text/plain"] [values: "text/plain", "application/json"]

### google/gemini-2.5-flash-lite

Google Gemini 2.5 Flash Lite (API key) · JSON: https://modelparams.dev/api/v1/models/google/gemini-2.5-flash-lite.json

- `generationConfig.maxOutputTokens` (integer, Length) — Maximum number of tokens to include in a response candidate. [range: min 1, max 65536]
- `generationConfig.temperature` (number, Sampling) — Controls randomness.
Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 2, step 0.1]
- `generationConfig.topP` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens within the selected cumulative probability. [default: 0.95] [range: min 0, max 1, step 0.01]
- `generationConfig.topK` (integer, Sampling) — Limits token sampling to the top K most likely next tokens. [default: 64] [range: min 0]
- `generationConfig.seed` (integer, Sampling) — Optional seed used for decoding when reproducible sampling is desired.
- `generationConfig.thinkingConfig.thinkingBudget` (integer, Reasoning) — Number of thinking tokens Gemini should use; -1 uses dynamic thinking, 0 disables thinking, and fixed budgets start at 512 tokens. [default: 0]
- `generationConfig.thinkingConfig.includeThoughts` (boolean, Reasoning) — Controls whether Gemini returns available thought summaries in the response parts. [default: false]
- `generationConfig.responseMimeType` (enum, Output) — MIME type for generated text candidates. [default: "text/plain"] [values: "text/plain", "application/json"]

### google/gemini-2.5-pro

Google Gemini 2.5 Pro (API key) · JSON: https://modelparams.dev/api/v1/models/google/gemini-2.5-pro.json

- `generationConfig.maxOutputTokens` (integer, Length) — Maximum number of tokens to include in a response candidate. [range: min 1, max 65536]
- `generationConfig.temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 2, step 0.1]
- `generationConfig.topP` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens within the selected cumulative probability. [default: 0.95] [range: min 0, max 1, step 0.01]
- `generationConfig.topK` (integer, Sampling) — Limits token sampling to the top K most likely next tokens.
[default: 64] [range: min 0]
- `generationConfig.seed` (integer, Sampling) — Optional seed used for decoding when reproducible sampling is desired.
- `generationConfig.thinkingConfig.thinkingBudget` (integer, Reasoning) — Maximum number of thinking tokens Gemini should use before producing the final answer. [range: min 128, max 32768]
- `generationConfig.thinkingConfig.includeThoughts` (boolean, Reasoning) — Controls whether Gemini returns available thought summaries in the response parts. [default: false]
- `generationConfig.responseMimeType` (enum, Output) — MIME type for generated text candidates. [default: "text/plain"] [values: "text/plain", "application/json"]

### google/gemini-3.5-flash

Google Gemini 3.5 Flash (API key) · JSON: https://modelparams.dev/api/v1/models/google/gemini-3.5-flash.json

- `generationConfig.maxOutputTokens` (integer, Length) — Maximum number of tokens to include in a response candidate. [range: min 1, max 65536]
- `generationConfig.temperature` (number, Sampling) — Controls randomness. Lower values make outputs more focused; higher values make them more varied. [default: 1] [range: min 0, max 2, step 0.1]
- `generationConfig.topP` (number, Sampling) — Controls nucleus sampling by limiting generation to tokens within the selected cumulative probability. [default: 0.95] [range: min 0, max 1, step 0.01]
- `generationConfig.topK` (integer, Sampling) — Limits token sampling to the top K most likely next tokens. [default: 64] [range: min 0]
- `generationConfig.seed` (integer, Sampling) — Optional seed used for decoding when reproducible sampling is desired.
- `generationConfig.thinkingConfig.thinkingLevel` (enum, Reasoning) — Controls Gemini 3.5 Flash reasoning effort. [default: "medium"] [values: "minimal", "low", "medium", "high"]
- `generationConfig.thinkingConfig.includeThoughts` (boolean, Reasoning) — Controls whether Gemini returns available thought summaries in the response parts.
[default: false]
- `generationConfig.responseMimeType` (enum, Output) — MIME type for generated text candidates. [default: "text/plain"] [values: "text/plain", "application/json"]
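
The bracketed annotations in the entries above (`[range: ...]`, `[values: ...]`, `[only when ...]`, `[except when ...]`) can be checked before a request is sent. The following is a minimal sketch, not the catalog's own tooling: the `DEEPSEEK_REASONER` dictionary is a hand-copied subset of the `deepseek/deepseek-reasoner` entry, and its field names (`min`, `max`, `values`, `default`, `only_when`, `except_when`) as well as the `check_params` helper are hypothetical. The real JSON served by the catalog may be shaped differently.

```python
from typing import Any, Dict, List

# Hypothetical, hand-copied subset of the deepseek/deepseek-reasoner entry.
# Field names are assumptions for illustration, not the catalog's schema.
DEEPSEEK_REASONER: Dict[str, Dict[str, Any]] = {
    "max_tokens": {"min": 1, "default": 4096},
    "temperature": {"min": 0, "max": 2, "default": 1,
                    "except_when": ("thinking.type", "enabled")},
    "top_p": {"min": 0, "max": 1, "default": 1,
              "except_when": ("thinking.type", "enabled")},
    "thinking.type": {"values": ["enabled", "disabled"], "default": "enabled"},
    "reasoning_effort": {"values": ["high", "max"], "default": "high",
                         "only_when": ("thinking.type", "enabled")},
}


def effective(spec: Dict[str, Dict[str, Any]], params: Dict[str, Any], key: str) -> Any:
    """Value the model would see for `key`: the caller's value, else the listed default."""
    return params.get(key, spec.get(key, {}).get("default"))


def check_params(spec: Dict[str, Dict[str, Any]], params: Dict[str, Any]) -> List[str]:
    """Return a list of problems; an empty list means every value fits the spec."""
    problems: List[str] = []
    for name, value in params.items():
        rule = spec.get(name)
        if rule is None:
            problems.append(f"{name}: not listed for this model")
            continue
        if "values" in rule and value not in rule["values"]:
            problems.append(f"{name}: {value!r} not in {rule['values']}")
        if "min" in rule and value < rule["min"]:
            problems.append(f"{name}: {value} is below min {rule['min']}")
        if "max" in rule and value > rule["max"]:
            problems.append(f"{name}: {value} is above max {rule['max']}")
        if "only_when" in rule:
            key, wanted = rule["only_when"]
            if effective(spec, params, key) != wanted:
                problems.append(f"{name}: only applies when {key} = {wanted!r}")
        if "except_when" in rule:
            key, blocked = rule["except_when"]
            if effective(spec, params, key) == blocked:
                problems.append(f"{name}: has no effect when {key} = {blocked!r}")
    return problems


if __name__ == "__main__":
    # Dotted names stay flat here, exactly as the catalog lists them.
    for problem in check_params(DEEPSEEK_REASONER, {"temperature": 0.7}):
        print(problem)
```

Gate conditions are resolved against the listed default when the caller omits the gating parameter, so `temperature` is flagged for this model even if `thinking.type` is never set explicitly. Mapping the flat dotted names onto a provider's nested request body is left out of the sketch.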