Providers
llms.py supports multiple LLM providers through OpenAI-compatible APIs. Any provider can be added by configuring them in llms.json.
Provider Overview
Section titled “Provider Overview”Free Tier Providers
Section titled “Free Tier Providers”Providers with free models or free tiers:
- OpenRouter Free - Free tier models via OpenRouter
- Groq - Fast inference with free tier
- Google Free - Gemini models with free tier
- Codestral - Free code generation models
- Ollama - Local models (completely free)
Premium Providers
Section titled “Premium Providers”Paid providers with advanced models:
- OpenAI - GPT-4o, GPT-5, o3, etc.
- Anthropic - Claude Opus, Sonnet, Haiku
- Google - Gemini Pro and advanced models
- Grok - X.AI’s Grok models
- OpenRouter - Access to 100+ models
- Qwen - Alibaba’s Qwen models
- Z.ai - GLM models
- Mistral - Mistral AI models
OpenAI
Section titled “OpenAI”Type: OpenAiProvider
API Key: OPENAI_API_KEY
Popular Models:
gpt-5- Latest flagship modelgpt-5-mini- Smaller, faster GPT-5gpt-4o- Multimodal GPT-4gpt-4o-mini- Efficient GPT-4gpt-4o-audio-preview- Audio processingo3- Advanced reasoning model
Features:
- Text generation
- Vision (images)
- Audio processing
- File attachments (PDFs)
- Function calling
- Streaming
Enable:
export OPENAI_API_KEY="sk-..."llms --enable openaiAnthropic (Claude)
Section titled “Anthropic (Claude)”Type: OpenAiProvider
API Key: ANTHROPIC_API_KEY
Popular Models:
claude-opus-4-1- Most capable modelclaude-sonnet-4-0- Balanced performanceclaude-haiku-3-5- Fast and efficient
Features:
- Text generation
- Vision (images)
- Large context windows (200K+ tokens)
- Strong reasoning capabilities
Enable:
export ANTHROPIC_API_KEY="sk-ant-..."llms --enable anthropicGoogle Gemini
Section titled “Google Gemini”Type: GoogleProvider or GoogleOpenAiProvider
API Key: GOOGLE_API_KEY or GOOGLE_FREE_API_KEY
Popular Models:
gemini-2.5-pro- Most capablegemini-2.5-flash- Fast and efficientgemini-2.5-flash-lite- Lightweightgemini-flash-latest- Latest flash model
Features:
- Text generation
- Vision (images)
- Audio processing
- File attachments
- Safety settings
- Free tier available
Enable:
export GOOGLE_FREE_API_KEY="AIza..."llms --enable google_freeOpenRouter
Section titled “OpenRouter”Type: OpenAiProvider
API Key: OPENROUTER_API_KEY
Popular Models:
grok-4- X.AI’s Grokkimi-k2- Moonshot AIdeepseek-v3.1:671b- DeepSeekllama4:400b- Meta’s Llama 4glm-4.5-air- Z.AI’s GLM- 100+ more models
Features:
- Access to latest models
- Free tier available
- Multiple providers in one API
- Competitive pricing
Enable:
export OPENROUTER_API_KEY="sk-or-..."llms --enable openrouter_free # Free tierllms --enable openrouter # Paid tierGrok (X.AI)
Section titled “Grok (X.AI)”Type: OpenAiProvider
API Key: GROK_API_KEY
Popular Models:
grok-4- Latest modelgrok-3- Previous generationgrok-3-mini- Smaller, fastergrok-code-fast-1- Code generation
Features:
- Real-time information
- Humor and personality
- Uncensored responses
- Strong reasoning
Enable:
export GROK_API_KEY="xai-..."llms --enable grokType: OpenAiProvider
API Key: GROQ_API_KEY
Popular Models:
llama3.3:70b- Meta Llama 3.3llama4:400b- Meta Llama 4kimi-k2- Moonshot AIgpt-oss:120b- OpenAI OSSqwen3:32b- Qwen 3
Features:
- Extremely fast inference
- Free tier
- Competitive pricing
- Multiple model providers
Enable:
export GROQ_API_KEY="gsk_..."llms --enable groqOllama (Local)
Section titled “Ollama (Local)”Type: OllamaProvider
API Key: Not required
Features:
- Local inference
- Complete privacy
- No API costs
- Auto-discovery of installed models
- Custom models
Setup:
- Install Ollama from ollama.ai
- Pull models:
ollama pull llama3.3 - Enable in llms.py:
llms --enable ollamaConfiguration:
{ "ollama": { "enabled": true, "type": "OllamaProvider", "base_url": "http://localhost:11434", "models": {}, "all_models": true // Auto-discover all models }}Qwen (Alibaba Cloud)
Section titled “Qwen (Alibaba Cloud)”Type: OpenAiProvider
API Key: DASHSCOPE_API_KEY
Popular Models:
qwen3-max- Most capableqwen2.5vl- Vision modelqwen3-vl:235b- Large vision modelqwen3-coder- Code generationqwq-plus- Reasoning model
Features:
- Multilingual support
- Vision models
- Audio processing
- Code generation
- Reasoning capabilities
Enable:
export DASHSCOPE_API_KEY="sk-..."llms --enable qwenType: OpenAiProvider
API Key: ZAI_API_KEY
Popular Models:
glm-4.6- Latest modelglm-4.5- Previous generationglm-4.5-air- Lightweightglm-4.5-flash- Fast inference
Features:
- Strong reasoning
- Multilingual
- Competitive pricing
Enable:
export ZAI_API_KEY="sk-..."llms --enable z.aiMistral
Section titled “Mistral”Type: OpenAiProvider
API Key: MISTRAL_API_KEY
Popular Models:
mistral-large- Most capablepixtral- Vision modelcodestral- Code generation
Features:
- Code generation
- Multilingual
- European provider
Enable:
export MISTRAL_API_KEY="..."llms --enable mistralCodestral
Section titled “Codestral”Type: OpenAiProvider
API Key: CODESTRAL_API_KEY
Models:
codestral- Code generation
Features:
- Specialized for code
- Free tier available
Enable:
export CODESTRAL_API_KEY="..."llms --enable codestralNext Steps
Section titled “Next Steps”- Configuration - Configure providers
- API Keys - Set up API keys
- CLI Usage - Use providers from CLI