CORS handled
Configure your origins. Call the API from the browser.
Behest is the AI backend you don't have to build. CORS handled, auth included, PII scrubbed, prompts defended — call from your frontend, ship in minutes.
Add AI to Your App is Behest's developer surface: an OpenAI-compatible /v1/chat/completions endpoint you call from the browser with CORS allowlists, JWT auth, Presidio PII redaction, Sentinel prompt-injection checks, Redis conversation memory, per-session cost capture, and token budgets — no bespoke Node proxy required.
✓ Includes employee token usage tracking
One API call. Behest handles the rest — so you focus on the features your users see.
Multi-tenant JWT + per-project API keys, RBAC built in.
Persistent context across sessions — Redis sorted sets, per user.
Microsoft Presidio scrubs sensitive data before the LLM sees it.
Inline prompt-injection defense on every request.
Includes employee token usage tracking and cost attribution out of the box.
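Multi-tenant auth rides on standard JWTs. As a conceptual sketch (the claim names `tenant_id` and `sub` below are hypothetical examples, not Behest's documented token schema), a frontend can decode the payload of a token it already holds to see which tenant a request will be attributed to:

```javascript
// Decode the payload segment of a JWT (no signature check here; the
// gateway verifies signatures server-side, this is for display only).
function decodeJwtPayload(token) {
  const payload = token.split(".")[1];
  // Convert base64url to base64, then decode.
  const b64 = payload.replace(/-/g, "+").replace(/_/g, "/");
  return JSON.parse(atob(b64));
}

// Example token with hypothetical multi-tenant claims.
const header = btoa(JSON.stringify({ alg: "HS256", typ: "JWT" }));
const body = btoa(JSON.stringify({ sub: "user-42", tenant_id: "acme" }));
const claims = decodeJwtPayload(`${header}.${body}.sig`);
console.log(claims.tenant_id); // "acme"
```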
The same gateway developers use on this page powers enterprise programs: finance-grade attribution and caps on Token FinOps, plus policy and audit controls on AI Governance.
Enforce hard budgets, attribute costs per session, and get complete visibility into your enterprise AI spend.
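Because the endpoint is OpenAI-compatible, each completion response carries a `usage` object in the standard chat-completions shape. Behest captures attribution server-side; this is only a minimal client-side sketch of what that per-session tallying looks like, assuming the standard `prompt_tokens`/`completion_tokens` fields:

```javascript
// Tally token usage per session from OpenAI-compatible responses.
const sessionUsage = new Map();

function recordUsage(sessionId, response) {
  const { prompt_tokens = 0, completion_tokens = 0 } = response.usage ?? {};
  const prev = sessionUsage.get(sessionId) ?? { prompt: 0, completion: 0 };
  sessionUsage.set(sessionId, {
    prompt: prev.prompt + prompt_tokens,
    completion: prev.completion + completion_tokens,
  });
}

recordUsage("sess-1", { usage: { prompt_tokens: 120, completion_tokens: 48 } });
recordUsage("sess-1", { usage: { prompt_tokens: 200, completion_tokens: 75 } });
console.log(sessionUsage.get("sess-1")); // { prompt: 320, completion: 123 }
```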
No backend proxy. Same OpenAI-compatible request shape your code already speaks.
// No backend proxy needed — CORS handled
// Call Behest directly from your browser
const response = await fetch(
  "https://your-project.behest.app/v1/chat/completions",
  {
    method: "POST",
    headers: {
      "Authorization": `Bearer ${userToken}`,
      "Content-Type": "application/json",
      "X-End-User-Id": userId,
      "X-Session-Id": sessionId,
    },
    body: JSON.stringify({
      model: "gemini-2.5-flash",
      messages: [{ role: "user", content: message }],
    }),
  }
);
// Conversation Memory persists across sessions
// PII scrubbed before reaching the LLM
// Per-user rate limits + token budgets enforced
// Per-session cost attribution captured automatically

Stop building auth, rate limiting, CORS proxies, and memory management. Behest deploys the complete AI backend in your cloud. You write the frontend.
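The request above can be wrapped in a small helper so every call attaches the same auth and attribution headers. The `buildChatRequest` name and its option names are ours, purely illustrative; there is no SDK to install:

```javascript
// Assemble an OpenAI-compatible chat request for the gateway.
// Helper name and option names are illustrative, not an SDK API.
function buildChatRequest({ userToken, userId, sessionId, message }) {
  return {
    method: "POST",
    headers: {
      "Authorization": `Bearer ${userToken}`,
      "Content-Type": "application/json",
      "X-End-User-Id": userId,
      "X-Session-Id": sessionId,
    },
    body: JSON.stringify({
      model: "gemini-2.5-flash",
      messages: [{ role: "user", content: message }],
    }),
  };
}

const req = buildChatRequest({
  userToken: "jwt", userId: "u1", sessionId: "s1", message: "hi",
});
console.log(JSON.parse(req.body).messages[0].content); // "hi"
```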
Call from your browser — no backend proxy needed
Multi-tenant JWT + API keys with tenant isolation
Persistent conversation context across sessions
Automatic PII detection and scrubbing before it reaches the LLM
Sentinel blocks injection attacks with pattern detection
Three-tier limits per IP, project, and user
Traces, metrics, logs — correlated in Grafana
Deploy in your cloud — data never leaves your infra
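When a request trips one of these limits, the conventional gateway response is HTTP 429. A sketch of client-side backoff under that assumption (the 429 status and `Retry-After` header are standard HTTP conventions, not Behest-specific documentation):

```javascript
// Retry a request on HTTP 429 with exponential backoff.
// doFetch is injected so the logic is testable without a network.
async function fetchWithBackoff(doFetch, retries = 3, baseDelayMs = 250) {
  for (let attempt = 0; ; attempt++) {
    const res = await doFetch();
    if (res.status !== 429 || attempt >= retries) return res;
    // Honor Retry-After if present, else back off exponentially.
    const retryAfter = Number(res.headers?.get?.("Retry-After"));
    const delay = retryAfter > 0 ? retryAfter * 1000 : baseDelayMs * 2 ** attempt;
    await new Promise((r) => setTimeout(r, delay));
  }
}

// Fake fetch for illustration: rate-limited twice, then succeeds.
let calls = 0;
const fake = async () =>
  ++calls < 3 ? { status: 429, headers: { get: () => null } } : { status: 200 };

fetchWithBackoff(fake, 3, 1).then((res) => console.log(res.status, calls)); // 200 3
```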
Free Developer Edition. No credit card. No SDK required — standard fetch from any frontend.
Enterprise Token FinOps: Enforce hard budgets and attribute costs per session.