Models & providers
Headmaster connects to model providers you already have API keys for. It does not host models itself. You add a provider, paste a key, and Headmaster can use any model that provider offers. Headmaster supports 28+ LLM platforms — official, cloud, Chinese, international, and custom. All configured from Settings → Headmaster’s Library → Connections → Add provider.Provider catalog
| Category | Platform | Auth method | Multi-key | Notes |
|---|---|---|---|---|
| Official | Gemini | API key | Yes | Image gen, tool calling |
| Gemini (Vertex AI) | API key + Project ID | Yes | Enterprise Google Cloud | |
| OpenAI | API key | Yes | Function calling, image gen | |
| Anthropic | API key | Yes | Claude series | |
| Cloud | AWS Bedrock | Access key / Profile | No | AWS enterprise AI |
| New API | API key | Yes | Unified multi-model gateway | |
| Chinese | Dashscope (Qwen) | API key | Yes | Qwen series |
| Dashscope Coding Plan | API key | Yes | Coding plan tier | |
| Zhipu | API key | Yes | Zhipu AI | |
| Moonshot (China/Global) | API key | Yes | Two variants | |
| Qianfan | API key | Yes | Baidu | |
| Hunyuan | API key | Yes | Tencent | |
| Lingyi | API key | Yes | Lingyi Wanwu | |
| ModelScope | API key | Yes | Community platform | |
| InfiniAI | API key | Yes | ||
| Ctyun | API key | Yes | China Telecom Cloud | |
| StepFun | API key | Yes | ||
| International | DeepSeek | API key | Yes | |
| MiniMax | API key | Yes | ||
| Novita | API key | Yes | AI gateway | |
| OpenRouter | API key | Yes | Multi-model aggregation (300+ models) | |
| PPIO | API key | Yes | Inference gateway | |
| SiliconFlow (CN/Global) | API key | Yes | Two variants | |
| xAI | API key | Yes | Grok models | |
| Ark | API key | Yes | Volcengine | |
| Poe | API key | Yes | Poe platform | |
| Custom | Custom | API key | Yes | OpenAI-compatible (Ollama, LM Studio, vLLM, llama.cpp) |
Adding a provider
- Open Settings → Headmaster’s Library → Connections → Add provider.
- Pick the provider type from the dropdown.
- Enter your API key. You can paste multiple keys (comma-separated or one per line) — Headmaster rotates across them automatically.
- If the provider has a custom base URL (for OpenAI-compatible endpoints or self-hosted models), enter it.
- Click Test connection to verify. If it passes, click Save.
Platform-specific setup
Gemini
- Get a key at Google AI Studio — sign in, create a new API key.
- Settings → Connections → Add provider → Gemini → paste key → save.
- Model list is auto-fetched. You’ll see Gemini Pro, Ultra, Flash, and image-capable variants.
Gemini (Vertex AI)
- Create a project at Google Cloud Console, enable the Vertex AI API.
- Note the Project ID (format:
my-project-123456). - Settings → Connections → Add provider → Gemini (Vertex AI) → enter Project ID + API key → save.
OpenAI
- Get a key at platform.openai.com.
- Settings → Connections → Add provider → OpenAI → paste key → save.
- Models auto-fetched: GPT-4o, GPT-4 Turbo, GPT-3.5 Turbo, o-series, DALL-E (for image gen).
Anthropic
- Get a key at console.anthropic.com.
- Settings → Connections → Add provider → Anthropic → paste key → save.
- Models auto-fetched: Claude Sonnet, Opus, Haiku.
AWS Bedrock
Two auth methods:- Access key: Enter AWS Region, Access Key ID, Secret Access Key.
- Profile: Enter AWS Region + locally configured AWS CLI Profile name.
us-east-1, us-west-2, eu-west-1, eu-central-1, ap-southeast-1, ap-northeast-1, ap-southeast-2, ca-central-1.
Multi-key rotation is not supported for AWS Bedrock (it uses AWS credential auth, not API keys).
New API (unified gateway)
Select New API, enter Base URL + API key. Supports multiple protocols: OpenAI, Gemini, Anthropic. OptionalmodelProtocols override per model (e.g., Gemini models use gemini protocol, Claude uses anthropic).
OpenRouter
- Sign in at openrouter.ai, create a key at openrouter.ai/keys — copy it immediately (shown only once).
- Settings → Connections → Add provider → OpenRouter → paste key → save.
- 300+ models from many providers appear in the selector under a single key.
DeepSeek
- Get a key at platform.deepseek.com.
- Settings → Connections → Add provider → DeepSeek → paste key → save.
Dashscope (Qwen)
- Get a key at dashscope.console.aliyun.com.
- Settings → Connections → Add provider → Dashscope → paste key → save.
Custom (OpenAI-compatible)
For local models or any OpenAI-compatible endpoint:- Enter Base URL + API key + model names.
- Use any non-empty placeholder for local models without auth.
Preset platforms (quick config)
For preset platforms, the Base URL is auto-filled — just paste your API key. Console links for getting keys:| Platform | Console URL |
|---|---|
| Dashscope | dashscope.console.aliyun.com |
| Zhipu | open.bigmodel.cn |
| Moonshot | platform.moonshot.cn |
| ModelScope | modelscope.cn/my/myaccesstoken |
| OpenRouter | openrouter.ai/keys |
| DeepSeek | platform.deepseek.com |
| SiliconFlow | siliconflow.cn / siliconflow.com |
| xAI | console.x.ai |
| MiniMax | platform.minimaxi.com |
Multi-key rotation
Headmaster supports multi-key rotation for each provider. Paste multiple API keys (comma-separated or one per line) in the provider configuration.Under the hood
- Automatic rotation on errors (401, 429, 503) — the failed key is skipped and the next one is tried.
- Smart blacklist: Failed keys are blocked for 90 seconds, then auto-recover.
- Load balancing: Random starting key selection spreads load evenly.
- Per-key logging: Key rotation events are logged so you can see which key was used for which request.
Format
Comma-separated:Platform support
- Supported: Gemini, Vertex AI, OpenAI, Anthropic, Custom, New API, and all OpenAI-compatible platforms (ModelScope, OpenRouter, Dashscope, DeepSeek, MiniMax, SiliconFlow, Zhipu, Moonshot, xAI, Ark, Qianfan, Hunyuan, Lingyi, Poe, InfiniAI, Ctyun, StepFun, etc.)
- Not supported: AWS Bedrock (uses AWS credential auth).
Model defaults
Open Settings → My Headmaster → Model defaults to set which model new chats start on. This does not change the model in existing conversations. Pick a model based on the kind of work:- Fast and cheap — for short questions, edits, and casual conversation. Examples: Claude Haiku, GPT-4o-mini, Gemini Flash.
- Large context — for long documents, code review across a whole repo, or research with many sources. Examples: Claude Sonnet, GPT-4o, Gemini Pro.
- Tool-heavy — for workflows that call external services a lot. Look for models with strong function-calling support.
- Reasoning — for complex multi-step problems. Examples: o1, o3, DeepSeek R1.
Per-conversation model override
You can pick a different model for a single chat by clicking the model name at the top of the conversation view. A dropdown shows every model from every connected provider. The choice sticks for that conversation only — new chats still use your default. Each agent (built-in, Claude Code, Codex, etc.) exposes its own model list. Switching the agent also switches the model list.Profile model fallback
Each profile can specify a fallback model. If the primary model is unavailable (provider outage, rate limit across all keys), the runtime falls back to the secondary model automatically. Configure this in Settings → My Headmaster → Profile → Model fallback.Local models
To use a local model (Ollama, LM Studio, vLLM, llama.cpp):- Start your local model server. Make sure it’s listening on a known port.
- In Headmaster: Settings → Connections → Add provider → Custom OpenAI-compatible.
- Set the base URL to your local server’s address (e.g.,
http://localhost:11434/v1for Ollama). - Leave the API key blank (or use a dummy key if the field is required).
- Save. The local models appear in the selector.
Model discovery
When you add a provider, Headmaster fetches the provider’s model list automatically. The model selector shows:- Model name
- Context window size (if known)
- Whether the model supports tool use
- Whether the model supports image input