Cloud Providers
This page covers setup for cloud-based API providers. For GitHub Copilot, see GitHub Copilot. For ChatGPT OAuth, see ChatGPT OAuth Login.
Anthropic
Section titled “Anthropic”Models: Claude Haiku 4.5, Sonnet 4.6, Sonnet 4.6 (1M context), Opus 4.6, Opus 4.6 (1M context), Opus 4.7
- Get an API key from console.anthropic.com.
- Run
fermi initand select Anthropic (Claude). - Paste your API key when prompted.
The key is stored as ANTHROPIC_API_KEY in ~/.fermi/.env.
1M Context Beta
Section titled “1M Context Beta”The 1M context variants of Sonnet 4.6 and Opus 4.6 appear as separate model options. These use Anthropic’s extended context beta and allow up to 1,000,000 tokens of context.
Thinking levels: Opus 4.7: off, low, medium, high, xhigh, max. Opus 4.6: off, low, medium, high, max. Sonnet/Haiku: off, low, medium, high.
OpenAI
Section titled “OpenAI”Models: GPT-5.2, GPT-5.2 Codex, GPT-5.3 Codex, GPT-5.4, GPT-5.4 Mini, GPT-5.4 Nano, GPT-5.5
- Get an API key from platform.openai.com.
- Run
fermi initand select OpenAI. - Paste your API key when prompted.
The key is stored as OPENAI_API_KEY in ~/.fermi/.env.
Alternatively, use your ChatGPT account via OAuth. See ChatGPT OAuth Login.
Thinking levels: none, low, medium, high, xhigh.
DeepSeek
Section titled “DeepSeek”Models: DeepSeek V4 Flash, DeepSeek V4 Pro
- Get an API key from DeepSeek’s developer portal.
- Run
fermi initand select DeepSeek. - Paste your API key when prompted.
The key is stored as a managed slot in ~/.fermi/.env.
Kimi / Moonshot
Section titled “Kimi / Moonshot”Models: Kimi K2.6, K2.5, K2 Instruct
Available through three endpoint variants:
| Variant | Endpoint | Fermi Slot | Detected External Env |
|---|---|---|---|
| Kimi-Global | api.moonshot.ai | FERMI_KIMI_API_KEY | MOONSHOT_API_KEY, KIMI_API_KEY |
| Kimi-China | api.moonshot.cn | FERMI_KIMI_CN_API_KEY | MOONSHOT_API_KEY, KIMI_CN_API_KEY |
| Kimi-Code | api.kimi.com/coding | FERMI_KIMI_CODE_API_KEY | KIMI_CODE_API_KEY |
::: warning
The Kimi-Code endpoint is restricted to whitelisted agents. You may receive a 403 error. Use kimi or kimi-cn (standard API) instead.
:::
- Get an API key from Moonshot’s developer portal.
- Run
fermi initand select Moonshot (Kimi), then pick your variant. - Import a detected env var or paste your API key.
GLM / Zhipu
Section titled “GLM / Zhipu”Models: GLM-5.1, GLM-5, GLM-5 Turbo, GLM-5V Turbo, GLM-4.7
Available through four endpoint variants:
| Variant | Endpoint | Fermi Slot | Detected External Env |
|---|---|---|---|
| GLM-China | open.bigmodel.cn | FERMI_GLM_API_KEY | GLM_API_KEY |
| GLM-Global | api.z.ai | FERMI_GLM_INTL_API_KEY | GLM_INTL_API_KEY |
| GLM-China-Code | open.bigmodel.cn/api/coding | FERMI_GLM_CODE_API_KEY | GLM_CODE_API_KEY |
| GLM-Global-Code | api.z.ai/api/coding | FERMI_GLM_INTL_CODE_API_KEY | GLM_INTL_CODE_API_KEY |
::: warning
The GLM coding endpoints are restricted to whitelisted agents. Use glm or glm-intl (standard API) instead.
:::
- Get an API key from Zhipu’s developer portal.
- Run
fermi initand select z.ai (GLM/Zhipu), then pick your variant. - Import a detected env var or paste your API key.
MiniMax
Section titled “MiniMax”Models: MiniMax M2.5, M2.5 Highspeed, M2.7, M2.7 Highspeed
Available through two endpoint variants:
| Variant | Endpoint | Fermi Slot | Detected External Env |
|---|---|---|---|
| MiniMax-Global | api.minimax.io | FERMI_MINIMAX_API_KEY | MINIMAX_API_KEY |
| MiniMax-China | api.minimaxi.com | FERMI_MINIMAX_CN_API_KEY | MINIMAX_CN_API_KEY |
- Get an API key from MiniMax’s developer portal.
- Run
fermi initand select MiniMax, then pick your variant. - Import a detected env var or paste your API key.
Xiaomi (MiMo)
Section titled “Xiaomi (MiMo)”Models: MiMo V2.5, MiMo V2.5 Pro
- Get an API key from Xiaomi’s developer portal.
- Run
fermi initand select Xiaomi (MiMo). - Paste your API key when prompted.
The key is stored as a managed slot in ~/.fermi/.env.
Qwen / DashScope
Section titled “Qwen / DashScope”Models: Qwen3.6 Plus, Qwen3.7 Max
Available through three regional endpoint variants:
| Variant | Region | Fermi Slot | Detected External Env |
|---|---|---|---|
| Qwen | China (Beijing) | FERMI_QWEN_API_KEY | DASHSCOPE_API_KEY, QWEN_API_KEY |
| Qwen-Intl | Singapore | FERMI_QWEN_INTL_API_KEY | DASHSCOPE_INTL_API_KEY, QWEN_INTL_API_KEY |
| Qwen-US | United States | FERMI_QWEN_US_API_KEY | DASHSCOPE_US_API_KEY, QWEN_US_API_KEY |
- Get an API key from Alibaba Cloud’s DashScope console.
- Run
fermi initand select Qwen / DashScope, then pick your region. - Import a detected env var or paste your API key.
OpenRouter
Section titled “OpenRouter”Models: Multi-vendor curated presets (Claude, GPT, Kimi, MiniMax, GLM, DeepSeek, Qwen, Xiaomi) plus any custom model.
OpenRouter acts as a unified API gateway to multiple model providers.
- Get an API key from openrouter.ai.
- Run
fermi initand select OpenRouter. - Paste your API key.
- Pick from the curated model presets.
The key is stored as OPENROUTER_API_KEY in ~/.fermi/.env.