Back to Docs Hub

AI Providers

Connect to 10 different AI providers — cloud-hosted models or fully local.

10
Total Providers
7
Cloud Providers
3
Local Providers

Aura Work supports 10 AI providers covering both cloud-hosted and local (Ollama, LM Studio, Custom Endpoint) models. Each provider has its own adapter that handles authentication, model discovery, and chat completions. Providers marked as local never send data off your machine.

What are Providers?

Providers are the bridge between Aura Work and AI models. Each provider implements a standard interface that handles:

  • Authentication — API keys, OAuth tokens, or local connections
  • Model discovery — automatically detecting available models
  • Chat completions — sending prompts and receiving responses
  • Usage tracking — counting tokens and estimating costs

☁️ Cloud Providers

Cloud providers host models on their infrastructure. You need an API key to use them:

ProviderBest ForPricing
OpenAIGeneral purpose, code generationPay per token
AnthropicComplex reasoning, long contextPay per token
Google GeminiMultimodal, large context windowsPay per token
DeepSeekCode-focused, cost-effectivePay per token
MinimaxChinese language tasksPay per token
QwenChinese language, reasoningPay per token
Aura CloudManaged service with E2EE syncSubscription

🏠 Local Providers

Local providers run models on your own hardware. No API keys, no internet required, complete privacy:

ProviderSetupBest For
Ollamaollama pull llama3Easy local model management
LM StudioDownload from lmstudio.aiGUI for model management
Custom EndpointAny OpenAI-compatible APISelf-hosted models, proxies

Local providers are ideal for privacy-sensitive work, offline environments, and cost savings (no per-token charges).

🔧 Setting Up Providers

To add a provider:

  1. 1. Go to Settings → Providers
  2. 2. Click on the provider you want to add
  3. 3. Enter your API key (for cloud providers)
  4. 4. Click "Validate" to test the connection
  5. 5. Select which models you want to use
  6. 6. Configure optional settings (base URL, max tokens, etc.)

For local providers, just install the software (Ollama/LM Studio) and Aura Work will auto-discover available models.

Aura Cloud Models

3 models: aura-fast, aura-coder, aura-premium Cloud

Hosted models via aura.work API. Includes Aura Fast (text+tools), Aura Coder (reasoning), and Aura Premium (vision+reasoning). Requires Aura Cloud sign-in.

Anthropic

2 models: claude-sonnet-4-20250514, claude-3-5-haiku-20241022 Cloud

Claude Sonnet 4 and Claude 3.5 Haiku. Best-in-class reasoning, code generation, and vision capabilities. API key required.

OpenAI

2 models: gpt-4o, gpt-4o-mini Cloud

GPT-4o and GPT-4o mini. Industry-standard language models with broad tool-calling support. Also supports GitHub Copilot Codex accounts.

Google Gemini

1 model: gemini-2.0-flash Cloud

Gemini 2.0 Flash. Google's fast, multimodal model with native vision capabilities. Free tier available via API key.

DeepSeek

1 model: deepseek-chat Cloud

DeepSeek V3. Cost-effective open-weight model with strong reasoning and code generation. API key required.

Ollama

1 model: llama3.2+ Local

Fully local model runner. Run Llama 3.2, Mistral, CodeLlama, and hundreds of other models on your own hardware. Zero cloud dependency.

Custom Endpoint

1 model: Dynamic Local

Any OpenAI-compatible API endpoint. Connect to local inference servers, self-hosted proxies, or any provider with an OpenAI-compatible chat completions API.

Minimax

1 model: abab6.5s-chat Cloud

Minimax abab6.5s. Chinese AI provider with competitive language model performance. API key required.

Qwen

1 model: qwen-plus Cloud

Qwen Plus (DashScope). Alibaba's flagship LLM with strong multilingual capabilities including Chinese and English.

LM Studio

1 model: Dynamic Local

Local model server (http://127.0.0.1:1234). Run any GGUF model from Hugging Face with OpenAI-compatible API. Zero configuration needed.

How providers work

Each provider implements a ProviderAdapter interface with listModels(), validateCredentials(), and chat() methods. The system auto-discovers available models on connection and caches them. OpenAI-compatible providers share a single adapter implementation.

🔑 Credential Security

API keys are encrypted using the device-bound vault before storage. The vault uses:

  • Windows — DPAPI (Data Protection API)
  • macOS — Keychain
  • Linux — Secret Service (GNOME Keyring / KWallet)

Credentials are never logged, never exposed to the agent, and never included in audit entries. The vault supports biometric unlock on supported platforms.

📊 Usage Tracking

Every task records detailed usage information:

  • Input tokens — tokens sent to the model
  • Output tokens — tokens received from the model
  • Estimated cost — calculated from the pricing cache
  • Model used — which provider and model handled the task

View usage statistics in the Dashboard or export them for billing. The audit log maintains a permanent record of all provider interactions.

🔄 Fallback & Retry

If a provider fails, Aura Work can automatically retry with an alternative:

  1. 1. Primary provider fails (rate limit, timeout, error)
  2. 2. System checks for fallback providers in your configuration
  3. 3. If fallback exists, retries with the alternative provider
  4. 4. If no fallback, notifies you and asks for manual intervention

Configure fallback providers in Settings → Providers → Fallback Chain.

💡 Cost Optimization Tips

  • Use the cost-first routing policy for routine tasks
  • Set up Ollama for development and testing (free)
  • Use DeepSeek for code tasks (cheaper than OpenAI/Anthropic)
  • Monitor usage in the Dashboard to identify expensive patterns
  • Set token limits per task to prevent runaway costs

Local vs. Cloud

CriteriaLocal ProvidersCloud Providers
PrivacyComplete — data never leaves your machineRequests are sent to the provider's servers
CostFree after setupPay per token used
QualityGood, depends on your hardwareUsually best-in-class (large models)
SpeedDepends on your GPU/CPUUsually fast with strong infrastructure
Internet requiredNoYes
Best forSensitive data, offline work, cost savingsComplex tasks, best possible quality

Choosing the Right Model

🎯 Model Selection Guide

  • For complex coding — Claude Sonnet 4, GPT-4o, or DeepSeek R1 (advanced reasoning)
  • For fast/simple tasks — Claude 3.5 Haiku, GPT-4o mini, or Gemini 2.0 Flash
  • For very long context — Gemini 2.5 Pro (massive context window)
  • For maximum privacy — any model via Ollama or LM Studio
  • For lowest possible cost — DeepSeek V3 or a local model via Ollama

Supported Providers

Aura Work supports 10 providers to maximize flexibility. Each provider has its own strengths:

Cloud

  • OpenAI — GPT-4o, GPT-4.5, o1, o3. Excellent at coding and analysis
  • Anthropic — Claude Opus, Sonnet, Haiku. Long context and computer vision
  • Google Gemini — Gemini 2.5 Pro, Flash. Multimodal and lightweight models
  • DeepSeek — DeepSeek-V3, R1. Strong performance at low cost
  • xAI Grok — Grok-3. Cutting-edge models from xAI

Local

  • Ollama — Llama, Mistral, Gemma, Qwen. Complete privacy
  • LM Studio — a graphical interface for running models locally
  • LocalAI — a local OpenAI-compatible API

How to Configure Providers

Use the following command to configure a provider:

aura config set providers.openai.apiKey sk-...
aura config set providers.anthropic.apiKey sk-ant-...

For local providers:

aura config set providers.ollama.baseUrl http://localhost:11434
aura config set providers.ollama.model llama3.2

All keys are stored encrypted using OS-level encryption (Keytar on macOS, DPAPI on Windows, libsecret on Linux).