
Private inference for your AI agents
Your agents process sensitive data: customer records, proprietary code, financial transactions. Other providers log those API calls and train on them. Venice gives your agents a privacy-first, OpenAI-compatible API with privacy modes for every level of sensitivity.
An inference API that works for your agents, not on them
Venice gives your agents access to 200+ models through an OpenAI-compatible API. Venice never logs or stores your prompts on its servers, and you choose the privacy mode for each call.


How Venice Works
Step 1
Swap your base URL
Point your agent at https://api.venice.ai/api/v1. Venice is OpenAI-compatible, so your existing SDKs, frameworks, and tool chains work without code changes.
Step 2
Build with 200+ models
Your agent makes tool calls, streams responses, and runs function calling exactly as before, now across 200+ models from OpenAI, Anthropic, xAI, Google, and leading open-source projects.
Step 3
Pick your privacy mode
Venice never logs or stores your prompts on its servers. Match privacy to each call: anonymized frontier access, zero-data-retention private models, hardware-verified TEE, or end-to-end encryption.

Other providers log every API call. Venice doesn't.
When your agents call OpenAI or Anthropic directly, every prompt is logged, stored, and potentially used for training. Through Venice, frontier models are accessed anonymously with your identity stripped, and Venice never logs or stores your prompts on its servers.

OpenAI-compatible. Swap one line and you're private.
Venice's API is fully OpenAI-compatible. Change your base URL and your existing agents, frameworks, and tool chains work instantly. Function calling, streaming, and structured outputs all supported.

Four privacy modes, one API
Match privacy to the sensitivity of each call. Anonymous strips your identity from frontier providers. Private runs with zero data retention on Venice-controlled infrastructure. TEE adds hardware-verified isolation. E2EE encrypts prompts client-side so only the enclave can decrypt them.

200+ models through one endpoint
Access frontier models from OpenAI, Anthropic, and Google anonymously, plus xAI's Grok suite and leading open-source models privately. Switch models with a single parameter, no new integration.
Get Started Free, Upgrade When Ready
Free
$0/mo
Explore Venice with base AI models
Pro
$18/mo
Your private AI studio - every model, every medium, no limits.
Most Popular
Pro Plus
$68/mo
Everything in Pro, scaled for serious creators and users
Max
$200/mo
Everything in Plus - for the ultimate access to Venice power users
API Pricing
Venice offers an OpenAI-compatible API for developers. Pro subscribers get free-tier API access, while Plus and Max subscribers enjoy higher rate limits and credit-based access to premium models.
API Pricing Details