What is an AI gateway?

An AI gateway is a control point that sits between your applications or agents and the AI models they call. Every request passes through it, so you can enforce identity, budgets, rate limits, data-loss prevention, and logging in one place instead of trusting each app to behave.

Is the Unseen AI Gateway OpenAI-compatible?

Yes. It is a drop-in replacement for the OpenAI API. Change the base URL and use a virtual key; your existing SDKs and request format keep working, now with governance applied to every call.

How do virtual keys work?

You issue a virtual key per app, team, or agent, each bound to a real identity. Keys can carry their own budget and rate limit, and any key can be revoked centrally in a click, without rotating the underlying model provider credentials.

Can an AI gateway control AI costs?

Yes. Each key can have a hard spend cap and rate limit, so a runaway loop hits its ceiling instead of your invoice. Response caching serves identical calls without paying the model twice, and per-identity tracking makes chargeback and forecasting straightforward.

Secure AI Gateway

One gateway for every AI call your apps and agents make

Point any app or agent at a single OpenAI-compatible endpoint and get governance for free: virtual keys bound to identity, budgets that cannot be blown, caching that cuts spend, and a guardrail on every request.

Book a demo Free assessment

app.py

# point any app or agent at Unseen
from openai import OpenAI
 
client = OpenAI(
  base_url="https://api.unseen.ai/v1",  # one line
  api_key=UNSEEN_VIRTUAL_KEY,   # bound to identity
)
 
# every call after this: budgeted,
# cached, guarded, and logged.

An agent is shadow AI with credentials

It authenticates with a long-lived key, runs on a schedule nobody is watching, and chains actions across systems. Multiply that by every script, side project, and copilot your developers wired up, and you have a large, ungoverned surface making AI calls on the company's behalf.

The failure modes are boring and expensive: an API key hardcoded into a repo, a runaway loop that burns a month's budget overnight, a prompt-injection payload that reaches the model, and no record of any of it afterwards. None of it shows up in the tools built for human users. That is the gap the gateway closes.

Change one line. Govern everything after it.

Unseen sits in front of your models as one OpenAI-compatible endpoint. There is no SDK to adopt and no rewrite. Swap the base URL, swap the key, and every call inherits the controls.

Point to the gateway

Set your base URL to the Unseen endpoint. Your existing OpenAI-compatible SDK and request format stay exactly the same.

Use a virtual key

Replace the provider key with a virtual key bound to an app, team, or agent identity, with its own budget and limits.

Ship, with a record

Every call is now inspected, budgeted, cached, and logged. Security gets the trail; developers keep their velocity.

Governance on every request

The controls a human-facing tool cannot give you, applied automatically to programmatic traffic.

Virtual keys, bound to identity

Issue a key per app, team, or agent, each tied to a real identity. Revoke one in a click. No more shared secrets living in a dozen .env files.

Budgets and rate limits

Set a hard spend cap and request rate per key. An agent stuck in a loop hits its ceiling instead of your monthly invoice.

Response caching

Identical calls are served from cache, so you stop paying twice for the same answer. Lower token spend, lower latency.

Guardrails and full logging

Every request is checked for prompt injection and jailbreaks and written to a tamper-evident log. You can always show what a model was asked and what it returned.

OpenAI-compatible API

A drop-in replacement for the OpenAI API. Change the base URL, keep your code. Existing SDKs just work.

One endpoint, every model

Route to GPT, Claude, Gemini, Llama, Mistral, or a private deployment from the same endpoint. Swap models without shipping code.

Built-in DLP

Secrets, PII, and source code are detected and redacted before a prompt ever leaves your perimeter.

Webhooks and SIEM

Real-time events for policy violations, budget alerts, and usage anomalies, piped to Slack, your SIEM, or ticketing.

Built for developers, approved by security

Ship fast, without the security roadblock

Developers want to move. Security wants control. The gateway is how both win. Your team keeps the SDK and request format they already use; the only change is a base URL and a key.

Security gets a per-identity view of every call, a budget it can actually enforce, and an audit trail it can stand behind. Nobody files a ticket to use AI, and nobody loses sight of what is running.

The other two surfaces

Put a control point in front of your AI

Book a demo Free Shadow AI Assessment