Zylon API Gateway

The governed extensibility layer for private enterprise AI

Zylon's AI core is the self-contained infrastructure that enables private AI for regulated industries. Local LLMs, GPU orchestration, document processing, agentic retrieval, all running on your infrastructure with zero external dependencies.

Zylon's AI core is the self-contained infrastructure that enables private AI for regulated industries. Local LLMs, GPU orchestration, document processing, agentic retrieval, all running on your infrastructure with zero external dependencies.

Zylon's AI core is the self-contained infrastructure that enables private AI for regulated industries. Local LLMs, GPU orchestration, document processing, agentic retrieval, all running on your infrastructure with zero external dependencies.

Developers need flexibility. Security teams need control. Zylon’s API Gateway delivers both: secure, standards-compatible endpoints for private enterprise AI in regulated industries, with token-scoped governance, enforced model and data controls, and compliance-ready audit trails built into every request.
Developers need flexibility. Security teams need control. Zylon’s API Gateway delivers both: secure, standards-compatible endpoints for private enterprise AI in regulated industries, with token-scoped governance, enforced model and data controls, and compliance-ready audit trails built into every request.
Developers need flexibility. Security teams need control. Zylon’s API Gateway delivers both: secure, standards-compatible endpoints for private enterprise AI in regulated industries, with token-scoped governance, enforced model and data controls, and compliance-ready audit trails built into every request.

API GATEWAY

API GATEWAY

API GATEWAY

What the API Gateway Enables

Secure API infrastructure for private, on-premise enterprise AI

Zylon is an enterprise AI platform delivering private generative AI and on-premise AI software for regulated industries, enabling secure deployment inside enterprise infrastructure without external cloud dependencies. This is everything that's inside of Zylon's API Gateway:

Custom AI Agents

Build domain-specific AI agents powered by your private LLMs, internal knowledge bases, and governed tools. Deploy production-grade agents that operate fully on-premise with controlled model access, secure data retrieval, and multi-step reasoning. Designed for regulated industries where accuracy, attribution, and auditability are mandatory, not optional. Compliant with SOC 2, GLBA, FINRA, and NCUA requirements.





Workflow Automation

Trigger AI from internal systems and run multi-step automation with enforced policies, async processing, and full observability — securely inside your infrastructure.

Internal Integrations

Connect CRMs, databases, and enterprise systems without moving data to the cloud. Governed access ensures AI operates securely within your existing infrastructure.

Governed API Access

Every request inherits authentication, authorization, model access controls, guardrails, rate limits, and full audit logging. Token-scoped permissions ensure developers can build freely within defined boundaries, while security teams maintain complete visibility and compliance alignment across all AI activity.

AI GOVERNANCE

AI GOVERNANCE

AI GOVERNANCE

Enterprise AI Governance, Enforced Automatically

Unlike direct integrations with cloud APIs like OpenAI or local inference tools like Ollama, every request through Zylon’s API Gateway is policy-controlled.

  • Authentication & role-based authorization

  • Model access controls

  • Guardrails and input/output inspection

  • Rate limiting and usage controls

  • Knowledge base access restrictions

  • Full audit logging with attribution

Result: Developers move fast. Security and compliance move with them.

HOW IT WORKS

HOW IT WORKS

HOW IT WORKS

How Token-Scoped Access Works

Private API Gateways for Private AI. Secure by default. No exceptions layer required.

Step 1: Admin Configures a Gateway

Define: allowed models, accessible knowledge bases, guardrails and rate limits

Step 1: Admin Configures a Gateway

Define: allowed models, accessible knowledge bases, guardrails and rate limits

Step 1: Admin Configures a Gateway

Define: allowed models, accessible knowledge bases, guardrails and rate limits

Step 2: Developer Generates a Token

Each token inherits the gateway’s permissions.

Step 2: Developer Generates a Token

Each token inherits the gateway’s permissions.

Step 2: Developer Generates a Token

Each token inherits the gateway’s permissions.

Step 3: Developer Builds

Agents and applications operate inside defined boundaries.

Step 3: Developer Builds

Agents and applications operate inside defined boundaries.

Step 3: Developer Builds

Agents and applications operate inside defined boundaries.

Step 4: Everything Is Logged

Requests, tools, data access, and usage metrics are recorded automatically.

Step 4: Everything Is Logged

Requests, tools, data access, and usage metrics are recorded automatically.

Step 4: Everything Is Logged

Requests, tools, data access, and usage metrics are recorded automatically.

API LEVELS

API LEVELS

API LEVELS

Two API Levels

Two layers of governed API access for private, on-premise enterprise AI — from low-level AI capabilities to high-level workspace and administrative automation.

Low-Level API

High-Level API

Low-Level API

High-Level API

Low-Level API

Direct AI capabilities. OpenAI- and Anthropic-compatible endpoints. Includes:

  • LLM inference (chat, streaming)

  • RAG operations (search + generation)

  • Embeddings

  • Data ingestion

  • Tool use & agent orchestration

  • MCP integrations

Use this to build:

  • Custom AI agents

  • AI-powered internal apps

  • Intelligent automation

  • Developer integrations

High-Level API

Workspace & administrative automation

Includes:

  • Organization and team management

  • Project automation

  • Knowledge base configuration

  • User permissions

  • Usage analytics

Use this to:

  • Automate onboarding

  • Build internal dashboards

  • Streamline administrative operations

FROM CLOUD AI TO ON-PREMISE AI

FROM CLOUD AI TO ON-PREMISE AI

FROM CLOUD AI TO ON-PREMISE AI

OpenAI & Anthropic Compatible

Zylon follows API standards used by OpenAI and Anthropic

Switching from cloud to on-premise requires minimal code changes.


Change base URL + API key → your tools continue working.


What this means:

  • Existing tools and integrations work immediately

  • Developers use familiar API patterns

  • No learning curve for teams already using cloud AI

  • Easy migration from cloud to on-premise

Example: Change the base URL and API key, and tools like Continue, Cursor, LangChain, and custom applications work with Zylon.


SUPPORTED AI TOOLS

SUPPORTED AI TOOLS

SUPPORTED AI TOOLS

Built-In AI Tools

Zylon supports the leading open LLM ecosystems out of the box.

Web Search

Real-time internet search for current information. Opt-in capability for non-air-gapped deployments.

CSV Numeric Analysis

Parse and analyze structured data. Generate insights from spreadsheets and tabular data automatically.

Chart Generation

Create visualizations from data. Automatically generate charts and graphs in responses.

Database Querying

Semantic query-to-SQL translation. Ask questions in natural language, execute SQL queries, and get results, all through the API.

Bring Your Own Tools

Extend Zylon with custom tools via MCP. Build specialized capabilities for your domain and make them available to all Zylon users.

COMPARISON

COMPARISON

COMPARISON

Unlimited usage. No Per-Token Pricing

You optimize for capability, not token consumption.

Unlike cloud AI, where costs rise with reasoning depth and innovation is constrained by token pricing, Zylon enables complex, multi-step agents with predictable infrastructure-based costs, so teams optimize for capability, not token consumption.

ChatGPT/Copilot

Zylon

Features

Optimize for capability (not cost)

Complex agents economically viable

Air-gapped deployment

Multi-step reasoning without cost anxiety

Full audit & governance

Limited

Predictable costs

Pay per token

Use any model

Only vendor provided

Connectors to on-prem data

ChatGPT/Copilot

Zylon

Features

Air-gapped deployment

Complex agents economically viable

Optimize for capability (not cost)

Multi-step reasoning without cost anxiety

Full audit & governance

Limited

Predictable costs

Pay per token

Use any model

lock in

Connectors to on-prem data

ChatGPT/Copilot

Zylon

Features

Optimize for capability (not cost)

Complex agents economically viable

Air-gapped deployment

Multi-step reasoning without cost anxiety

Full audit & governance

Limited

Predictable costs

Pay per token

Use any model

Only vendor provided

Connectors to on-prem data

AUTOMATIONS

AUTOMATIONS

AUTOMATIONS

n8n: Visual AI Automation

n8n turns Zylon into a private AI automation engine.

  • Visual no-code workflows

  • Hundreds of integrations

  • Self-hosted like Zylon

  • MCP tool creation

Example automations:

  • HR onboarding → Create project → Ingest docs → Assign access

  • Contract analysis → Extract clauses → Flag anomalies → Route for review

  • Support tickets → Generate response → Human review → Send

AI GOVERNANCE

AI GOVERNANCE

AI GOVERNANCE

Monitoring & Governance

Enterprise-grade monitoring, audit logging, and policy enforcement for private, on-premise AI deployments in regulated industries.

Built-In Observability

Track:

  • API volume and latency

  • Model usage distribution

  • Tool invocation metrics

  • Error rates

  • Gateway-level analytics

Sys Admin Dashboard

Centralized web interface for:

  • Gateway configuration

  • Token creation & revocation

  • Policy enforcement

  • Usage tracking

  • Audit export

Complete Audit Trails

Every API request captures:

  • Request & response content

  • User attribution

  • Model used

  • Tools invoked

  • Data accessed

  • Timestamps & latency

Compliance-ready logging for regulated industries.

SECURITY AND CONTROLS

SECURITY AND CONTROLS

SECURITY AND CONTROLS

Enterprise Security Controls

Private API Gateways for Private AI. Secure by default. No exceptions layer required.

Authentication & Authorization

Token-based access with granular permission scoping. Each token inherits the governance policies of its gateway.

Authentication & Authorization

Token-based access with granular permission scoping. Each token inherits the governance policies of its gateway.

Authentication & Authorization

Token-based access with granular permission scoping. Each token inherits the governance policies of its gateway.

Rate Limiting

Prevent runaway usage with configurable limits per token, user, or team.

Rate Limiting

Prevent runaway usage with configurable limits per token, user, or team.

Rate Limiting

Prevent runaway usage with configurable limits per token, user, or team.

Data Access Controls

Restrict which knowledge bases and data sources can be queried through specific gateways.

Data Access Controls

Restrict which knowledge bases and data sources can be queried through specific gateways.

Data Access Controls

Restrict which knowledge bases and data sources can be queried through specific gateways.

Air-Gap Compatible

API Gateway works in fully disconnected environments. Tools requiring external access (like web search) can be disabled for air-gapped deployments.

Air-Gap Compatible

API Gateway works in fully disconnected environments. Tools requiring external access (like web search) can be disabled for air-gapped deployments.

Air-Gap Compatible

API Gateway works in fully disconnected environments. Tools requiring external access (like web search) can be disabled for air-gapped deployments.

INTEGRATIONS ECOSYSTEM

INTEGRATIONS ECOSYSTEM

INTEGRATIONS ECOSYSTEM

Connect to Your Existing Systems

Integrate directly with:

  • File systems (S3, NFS, SMB)

  • SharePoint, Confluence, Google Drive

  • PostgreSQL, MySQL, SQL Server

  • Salesforce, SAP, Microsoft Dynamics

  • Industry systems (banking cores, healthcare EMRs, ERP systems)

Data stays in place. No cloud copies. No ETL duplication.