NEW

Zylon in a Box: Plug & Play Private AI. Get a pre-configured on-prem server ready to run locally, with zero cloud dependency.

Learn More ->

Zylon API Gateway

The governed extensibility layer for private enterprise AI

Zylon's AI core is the self-contained infrastructure that enables private AI for regulated industries. Local LLMs, GPU orchestration, document processing, agentic retrieval, all running on your infrastructure with zero external dependencies.

Get a Private Walkthrough

Developers need flexibility. Security teams need control. Zylon’s API Gateway delivers both: secure, standards-compatible endpoints for private enterprise AI in regulated industries, with token-scoped governance, enforced model and data controls, and compliance-ready audit trails built into every request.

API GATEWAY

What the API Gateway Enables

Secure API infrastructure for private, on-premise enterprise AI

Zylon is an enterprise AI platform delivering private generative AI and on-premise AI software for regulated industries, enabling secure deployment inside enterprise infrastructure without external cloud dependencies. This is everything that's inside of Zylon's API Gateway:

Custom AI Agents

Build domain-specific AI agents powered by your private LLMs, internal knowledge bases, and governed tools. Deploy production-grade agents that operate fully on-premise with controlled model access, secure data retrieval, and multi-step reasoning. Designed for regulated industries where accuracy, attribution, and auditability are mandatory, not optional. Compliant with SOC 2, GLBA, FINRA, and NCUA requirements.

Workflow Automation

Trigger AI from internal systems and run multi-step automation with enforced policies, async processing, and full observability — securely inside your infrastructure.

Internal Integrations

Connect CRMs, databases, and enterprise systems without moving data to the cloud. Governed access ensures AI operates securely within your existing infrastructure.

Governed API Access

Every request inherits authentication, authorization, model access controls, guardrails, rate limits, and full audit logging. Token-scoped permissions ensure developers can build freely within defined boundaries, while security teams maintain complete visibility and compliance alignment across all AI activity.

AI GOVERNANCE

Enterprise AI Governance, Enforced Automatically

Unlike direct integrations with cloud APIs like OpenAI or local inference tools like Ollama, every request through Zylon’s API Gateway is policy-controlled.

Authentication & role-based authorization
Model access controls
Guardrails and input/output inspection
Rate limiting and usage controls
Knowledge base access restrictions
Full audit logging with attribution

Result: Developers move fast. Security and compliance move with them.

HOW IT WORKS

How Token-Scoped Access Works

Private API Gateways for Private AI. Secure by default. No exceptions layer required.

Step 1: Admin Configures a Gateway

Define: allowed models, accessible knowledge bases, guardrails and rate limits

Step 1: Admin Configures a Gateway

Define: allowed models, accessible knowledge bases, guardrails and rate limits

Step 2: Developer Generates a Token

Each token inherits the gateway’s permissions.

Step 2: Developer Generates a Token

Each token inherits the gateway’s permissions.

Step 3: Developer Builds

Agents and applications operate inside defined boundaries.

Step 3: Developer Builds

Agents and applications operate inside defined boundaries.

Step 4: Everything Is Logged

Requests, tools, data access, and usage metrics are recorded automatically.

Step 4: Everything Is Logged

Requests, tools, data access, and usage metrics are recorded automatically.

API LEVELS

Two API Levels

Two layers of governed API access for private, on-premise enterprise AI — from low-level AI capabilities to high-level workspace and administrative automation.

Low-Level API

High-Level API

Low-Level API

Direct AI capabilities. OpenAI- and Anthropic-compatible endpoints. Includes:

LLM inference (chat, streaming)
RAG operations (search + generation)
Embeddings
Data ingestion
Tool use & agent orchestration
MCP integrations

Use this to build:

Custom AI agents
AI-powered internal apps
Intelligent automation
Developer integrations

High-Level API

Workspace & administrative automation

Includes:

Organization and team management
Project automation
Knowledge base configuration
User permissions
Usage analytics

Use this to:

Automate onboarding
Build internal dashboards
Streamline administrative operations

FROM CLOUD AI TO ON-PREMISE AI

OpenAI & Anthropic Compatible

Zylon follows API standards used by OpenAI and Anthropic

Switching from cloud to on-premise requires minimal code changes.

Change base URL + API key → your tools continue working.

What this means:

Existing tools and integrations work immediately
Developers use familiar API patterns
No learning curve for teams already using cloud AI
Easy migration from cloud to on-premise

Example: Change the base URL and API key, and tools like Continue, Cursor, LangChain, and custom applications work with Zylon.

SUPPORTED AI TOOLS

Built-In AI Tools

Zylon supports the leading open LLM ecosystems out of the box.

Web Search

Real-time internet search for current information. Opt-in capability for non-air-gapped deployments.

CSV Numeric Analysis

Parse and analyze structured data. Generate insights from spreadsheets and tabular data automatically.

Chart Generation

Create visualizations from data. Automatically generate charts and graphs in responses.

Database Querying

Semantic query-to-SQL translation. Ask questions in natural language, execute SQL queries, and get results, all through the API.

Bring Your Own Tools

Extend Zylon with custom tools via MCP. Build specialized capabilities for your domain and make them available to all Zylon users.

COMPARISON

Unlimited usage. No Per-Token Pricing

You optimize for capability, not token consumption.

Unlike cloud AI, where costs rise with reasoning depth and innovation is constrained by token pricing, Zylon enables complex, multi-step agents with predictable infrastructure-based costs, so teams optimize for capability, not token consumption.

ChatGPT/Copilot

Zylon

Features

Optimize for capability (not cost)

Complex agents economically viable

Air-gapped deployment

Multi-step reasoning without cost anxiety

Full audit & governance

Limited

Predictable costs

Pay per token

Use any model

Only vendor provided

Connectors to on-prem data

ChatGPT/Copilot

Zylon

Features

Air-gapped deployment

Complex agents economically viable

Optimize for capability (not cost)

Multi-step reasoning without cost anxiety

Full audit & governance

Limited

Predictable costs

Pay per token

Use any model

lock in

Connectors to on-prem data

AUTOMATIONS

n8n: Visual AI Automation

n8n turns Zylon into a private AI automation engine.

Visual no-code workflows
Hundreds of integrations
Self-hosted like Zylon
MCP tool creation

Example automations:

HR onboarding → Create project → Ingest docs → Assign access
Contract analysis → Extract clauses → Flag anomalies → Route for review
Support tickets → Generate response → Human review → Send

AI GOVERNANCE

Monitoring & Governance

Enterprise-grade monitoring, audit logging, and policy enforcement for private, on-premise AI deployments in regulated industries.

Built-In Observability

Track:

API volume and latency
Model usage distribution
Tool invocation metrics
Error rates
Gateway-level analytics

Sys Admin Dashboard

Centralized web interface for:

Gateway configuration
Token creation & revocation
Policy enforcement
Usage tracking
Audit export

Complete Audit Trails

Every API request captures:

Request & response content
User attribution
Model used
Tools invoked
Data accessed
Timestamps & latency

Compliance-ready logging for regulated industries.

SECURITY AND CONTROLS

Enterprise Security Controls

Private API Gateways for Private AI. Secure by default. No exceptions layer required.

Authentication & Authorization

Token-based access with granular permission scoping. Each token inherits the governance policies of its gateway.

Authentication & Authorization

Token-based access with granular permission scoping. Each token inherits the governance policies of its gateway.

Rate Limiting

Prevent runaway usage with configurable limits per token, user, or team.

Rate Limiting

Prevent runaway usage with configurable limits per token, user, or team.

Data Access Controls

Restrict which knowledge bases and data sources can be queried through specific gateways.

Data Access Controls

Restrict which knowledge bases and data sources can be queried through specific gateways.

Air-Gap Compatible

API Gateway works in fully disconnected environments. Tools requiring external access (like web search) can be disabled for air-gapped deployments.

Air-Gap Compatible

API Gateway works in fully disconnected environments. Tools requiring external access (like web search) can be disabled for air-gapped deployments.

INTEGRATIONS ECOSYSTEM

Connect to Your Existing Systems

Integrate directly with:

File systems (S3, NFS, SMB)
SharePoint, Confluence, Google Drive
PostgreSQL, MySQL, SQL Server
Salesforce, SAP, Microsoft Dynamics
Industry systems (banking cores, healthcare EMRs, ERP systems)

Data stays in place. No cloud copies. No ETL duplication.