OxiGateOxiGate

Zero-overhead LLM Gateway.
FinOps Infrastructure for AI.

Engineered in Rust. Hard and soft budget caps. Per-identity, team, and tag spend tracking. Fallbacks and load balancing across 100+ models. Extensible via Rust and Python plugins. All at sub-100µs overhead — because governance shouldn't slow you down.

View on GitHub

Enterprise-Grade FinOps Infrastructure

Everything you need to manage LLM costs and reliability at scale.

Built in Rust

Sub-100µs overhead, memory-safe, no GC pauses. Performance that never sleeps.

Financial Correctness

Integer nano-USD accounting, zero drift over billions of requests. Every cent tracked.

Budget & Rate Limits

Per-user, per-team, per-tag enforcement with soft warnings and hard caps.

OpenAI Compatible

Drop-in replacement supporting OpenAI, Anthropic, Gemini, AWS Bedrock.

Smart Fallbacks

Automatic retry, weighted load balancing, provider failover. Zero downtime.

Deep Observability

Structured JSON logs, Prometheus metrics, OpenTelemetry export. See everything.

Powerful Plugin System

Extend with Rust or Python plugins. Add custom providers, middleware, or FinOps logic without forking.

Built in Rust for speed. Built for control.

How It Works

Simple integration, powerful results. One gateway to rule all your LLM providers.

Your App
OxiGate
Rust
OpenAI
Anthropic
Gemini
Open AI Compatible