DeepToken Inference Gateway
    Billing & Quota

    Usage-based AI gateway pricing

    Pay for model usage from a prepaid balance. Subscribe only when you need team controls, routing governance, observability, and enterprise procurement.

    Usage

    Model calls are metered at model-specific prices, and the cost is deducted from your balance.

    Balance

    Add funds manually or enable auto recharge before production traffic spikes.
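To make the metering model concrete, here is a minimal sketch of how a per-request cost might be computed and deducted from a prepaid balance. The model names, the per-million-token prices, and the `request_cost` and `deduct` helpers are hypothetical and are not taken from the gateway's actual price list or API. Rejecting a request when the balance is too low is also an assumption.

```python
from decimal import Decimal

# Hypothetical per-million-token prices in USD (illustrative only).
PRICES_PER_MILLION = {
    "example-small": {"input": Decimal("0.50"), "output": Decimal("1.50")},
    "example-large": {"input": Decimal("5.00"), "output": Decimal("15.00")},
}


def request_cost(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Cost of one request: tokens times the model's per-million-token price."""
    price = PRICES_PER_MILLION[model]
    total = price["input"] * input_tokens + price["output"] * output_tokens
    return total / Decimal(1_000_000)


def deduct(balance: Decimal, model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Return the balance after one request (assumed to fail if funds are short)."""
    cost = request_cost(model, input_tokens, output_tokens)
    if cost > balance:
        raise ValueError("insufficient balance")
    return balance - cost
```

Using `Decimal` rather than floats avoids rounding drift when many small charges accumulate against the same balance.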

    Pay-as-you-go balance

    Add funds for metered model usage

    Recharge balance for API calls across routed providers. Small top-ups stay simple; production spend can move into invoices, commitments, and volume terms.

    Custom recharge amounts: $10 minimum, self-serve up to $5,000; larger amounts go through enterprise review.

    Metered usage
    Every request is metered by model, provider, and usage unit, and its cost is deducted from your balance.
    Auto recharge
    Set a balance threshold and a refill amount before traffic spikes.
    Budget controls
    Control spend by org, project, member, and API key.
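The auto recharge behavior described above (a threshold plus a refill amount) can be sketched as follows. This is an illustration under stated assumptions, not the gateway's implementation: the `AutoRecharge` class, its field names, and the choice to trigger when the balance falls strictly below the threshold are all hypothetical.

```python
from dataclasses import dataclass
from decimal import Decimal
from typing import Tuple


@dataclass(frozen=True)
class AutoRecharge:
    threshold: Decimal      # refill when the balance drops below this value
    refill_amount: Decimal  # amount added to the balance per refill


def apply_auto_recharge(balance: Decimal, rule: AutoRecharge) -> Tuple[Decimal, Decimal]:
    """Return (new_balance, amount_charged) after at most one refill."""
    if balance < rule.threshold:
        return balance + rule.refill_amount, rule.refill_amount
    return balance, Decimal("0")
```

For example, with a $5 threshold and a $50 refill, a balance of $4 would be topped up to $54, while a balance of $20 would be left unchanged.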
    Starter

    API testing and small experiments.

    $10
    prepaid balance
    Builder

    Individual development and early prototypes.

    $50
    prepaid balance
    Default
    Scale

    Early production traffic with predictable balance.

    $200
    prepaid balance
    Growth

    Team production workloads and finance review.

    $1,000
    prepaid balance
    Business

    Invoice, discount, and customer-success path.

    $5,000+
    prepaid balance
    Commit

    Annual commitment, SLA, and dedicated capacity.

    Custom
    prepaid balance

    Everything included in your account

    All platform features come standard — no subscription fees. Just add quota and go.

    Unified Inference API

    One OpenAI-compatible endpoint for every model and provider.

    Routing & Fallback

    Multi-provider routing with health-aware automatic fallback.

    Usage & Billing

    Token-accurate metering with clean cost attribution.

    Budget Controls

    Hard caps and soft alerts by key, member, and project.

    Team & Organizations

    Shared balance pools, member roles, and org policies.

    Webhooks & Automation

    Order, subscription, and payout events with HMAC signing.

    Audit Logs

    Per-request logs with append-only audit trail.

    Support

    Email support included. Priority and SLA for committed spend.
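For the HMAC-signed webhook events mentioned above, a receiver would typically recompute the signature over the raw request body and compare it in constant time. The sketch below assumes HMAC-SHA256 with a hex-encoded digest; the actual algorithm, encoding, and header name used by the gateway are not stated on this page, and the `sign` and `verify` helpers are hypothetical.

```python
import hashlib
import hmac


def sign(secret: bytes, body: bytes) -> str:
    """Hex HMAC-SHA256 of the raw body (assumed scheme, for illustration)."""
    return hmac.new(secret, body, hashlib.sha256).hexdigest()


def verify(secret: bytes, body: bytes, received_signature: str) -> bool:
    """Check a received signature using a constant-time comparison."""
    expected = sign(secret, body)
    # Comparing bytes avoids a TypeError on non-ASCII input.
    return hmac.compare_digest(expected.encode(), received_signature.encode())
```

Verification should run against the exact bytes received, before any JSON parsing, because re-serializing the payload can change the bytes and invalidate the signature.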

    DeepToken Enterprise

    Need a single contract for every model provider?

    Consolidate vendors, routing, billing, and compliance into one governed gateway. Purpose-built for platform, security, and procurement teams.

    SSO & SCIM
    99.99% SLA
    SOC 2 / GDPR
    Unified invoicing
    Compliance-ready

    Zero-retention routing with audit-ready logs.

    Massive scale

    Billions of requests and trillions of tokens weekly.

    Vendor consolidation

    One agreement, one key, one invoice for 40+ providers.

    Policy routing

    Failover, guardrails, and region pinning as code.
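As a rough illustration of failover and region pinning expressed as code, the sketch below walks an ordered provider list and returns the first healthy provider that satisfies an optional region pin. The `Provider` type, its fields, and the `pick_provider` function are hypothetical; the gateway's real policy format is not described on this page.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class Provider:
    name: str
    region: str
    healthy: bool


def pick_provider(
    providers: Sequence[Provider],
    pinned_region: Optional[str] = None,
) -> Optional[Provider]:
    """Return the first healthy provider in priority order, honoring a region pin."""
    for provider in providers:
        if not provider.healthy:
            continue  # fail over to the next candidate
        if pinned_region is not None and provider.region != pinned_region:
            continue  # region pinning excludes this provider
        return provider
    return None  # no eligible provider
```

Returning `None` instead of silently ignoring the pin keeps the region constraint strict, which is usually what a compliance-driven policy needs.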

    FAQ

    Billing questions

    Answers about balance, model usage, auto recharge, budgets, and enterprise commitments.