Simple, predictable pricing

Request-based plans designed for developers and small teams. Start free, scale when you need to, and never worry about token calculations.

Free

For developers exploring Oxlo and testing ideas.

$0/month
Limit:
  • 60 requests / day
  • Requests may be queued behind paid plans
Get started for free

What you get:
  • Access to smaller OSS models
  • Clear usage limits
  • No credit card required

Pro (Hot)

(Early bird discount)

For builders running early apps and prototypes.

$14.90/month (regular price $35/month)
Limit:
  • 300 requests / day
  • Optimized models under 8B parameters
Subscribe Now: get 15 extra days free

Billed today. Includes 15 extra free days added to your first billing cycle.

Everything in Free, plus
  • Faster request handling
  • Access to optimized inference models
  • Higher throughput for development workloads

Premium

For teams running production workloads.

$49.90/month (regular price $80/month)
Limit:
  • 2,000 requests / day
  • Production-grade performance
Subscribe Now: get 30 extra days free

Billed today. Includes 30 extra free days added to your first billing cycle.

Everything in Pro, plus
  • Priority execution
  • Higher and consistent throughput
  • Designed for production environments

Enterprise

For high-volume or custom requirements.

Custom pricing
Limit:
  • Custom or unlimited request volumes
  • Dedicated capacity options
Email Us!

Everything in Premium, plus
  • Custom usage limits
  • Dedicated support
  • Tailored deployment options

Compare the plans

All plans use request-based pricing. No token calculations.
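To make request-based pricing concrete, here is a rough sketch of the arithmetic for the two paid plans. It uses the discounted prices and daily limits from the plan cards above. The 30-day month, the `Plan` class, and the function names are assumptions made for illustration; they are not part of any Oxlo API.

```python
from dataclasses import dataclass

# Assumption: a 30-day billing month. The page does not define the cycle length.
DAYS_PER_MONTH = 30


@dataclass(frozen=True)
class Plan:
    name: str
    monthly_price_usd: float  # discounted price shown on the plan card
    requests_per_day: int     # daily request limit shown on the plan card


PLANS = [
    Plan("Pro", 14.90, 300),
    Plan("Premium", 49.90, 2000),
]


def max_monthly_requests(plan: Plan) -> int:
    """Requests available in one month if the daily limit is used every day."""
    return plan.requests_per_day * DAYS_PER_MONTH


def cost_per_request(plan: Plan) -> float:
    """Effective price of one request at full usage, regardless of prompt length."""
    return plan.monthly_price_usd / max_monthly_requests(plan)


for plan in PLANS:
    print(f"{plan.name}: {max_monthly_requests(plan):,} requests/month, "
          f"${cost_per_request(plan):.5f} per request at full usage")
# Pro: 9,000 requests/month, $0.00166 per request at full usage
# Premium: 60,000 requests/month, $0.00083 per request at full usage
```

Because billing is per request, these figures do not change with prompt or response size, as long as each request stays within the plan's token caps.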

Feature                               Free         Pro                   Premium              Enterprise

Usage & Limits
Requests included                     100 / day    300 / day             High request limits  Custom
Burst rate limit                      5 / min      30 / min              120 / min (tunable)  Custom
Monthly request cap                   Yes          Yes                   No small daily cap   Custom
Queued behind paid traffic            Yes          Yes (behind Premium)  No                   No (dedicated GPUs)

Models & Performance
Optimized models over 8B              No           Limited               Yes                  Yes
Production-grade inference            No           No                    Yes                  Yes
Priority execution                    Lowest       Medium                Highest              Optional
Average response latency              ≤ 7 seconds  ≤ 1 second            ≤ 100 ms             Tunable

Request & Context Limits
(Caps are for safety and performance, not billing)
Input tokens / request                Up to 2k     Up to 4k              8k-16k               Custom
Output tokens / request               Up to 512    Up to 1k              Up to 4k             Custom

Pricing & Billing
Request-based pricing                 Yes          Yes                   Yes                  Yes
Token-based billing                   No           No                    No                   No
Fixed monthly limits                  Yes          Yes                   Yes                  Custom
Usage limits visible upfront          Yes          Yes                   Yes                  Yes

Developer Experience
Open-source models                    Yes          Yes                   Yes                  Yes
Simple API integration                Yes          Yes                   Yes                  Yes
Model-agnostic pricing                Yes          Yes                   Yes                  Yes
Support level                         Community    Community             Priority             Dedicated

Infrastructure & Technical Differentiation
Gateway-level request metering        Yes          Yes                   Yes                  Yes
Pricing independent of prompt length  Yes          Yes                   Yes                  Yes
Traffic prioritization by plan        No           Yes                   Yes                  Yes
Async and batch-friendly workloads    Yes          Yes                   Yes                  Yes
Based on current platform design and publicly available competitor offerings. Features may evolve over time.
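
If you want your client to stay under the burst rate limits listed in the comparison, a small client-side throttle can help. The sketch below is a minimal sliding-window limiter using the per-minute figures for Free, Pro, and Premium. How the gateway actually counts its window is not documented here, so the 60-second sliding window, the `SlidingWindowLimiter` class, and the `BURST_LIMITS` mapping are all assumptions for illustration, not Oxlo's implementation.

```python
import time
from collections import deque

# Burst limits (requests per minute) taken from the comparison table.
BURST_LIMITS = {"free": 5, "pro": 30, "premium": 120}


class SlidingWindowLimiter:
    """Client-side throttle: allow at most max_requests per window_seconds."""

    def __init__(self, max_requests, window_seconds=60.0, clock=time.monotonic):
        self.max_requests = max_requests
        self.window_seconds = window_seconds
        self.clock = clock    # injectable so the limiter can be tested without sleeping
        self._sent = deque()  # timestamps of requests still inside the window

    def _evict_expired(self, now):
        # Drop timestamps that have aged out of the window.
        while self._sent and now - self._sent[0] >= self.window_seconds:
            self._sent.popleft()

    def try_acquire(self):
        """Return True and record the request if it fits in the window."""
        now = self.clock()
        self._evict_expired(now)
        if len(self._sent) < self.max_requests:
            self._sent.append(now)
            return True
        return False

    def seconds_until_free(self):
        """Seconds to wait before the next request would be allowed."""
        now = self.clock()
        self._evict_expired(now)
        if len(self._sent) < self.max_requests:
            return 0.0
        return self.window_seconds - (now - self._sent[0])
```

A typical loop would call `try_acquire()` before each request and, when it returns False, sleep for `seconds_until_free()` before retrying. This only smooths traffic on your side; the daily request limit and any server-side queueing still apply.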