Request-based plans designed for developers and small teams. Start free, scale when you need to, and never worry about token calculations.
For developers exploring Oxlo and testing ideas.
(Early bird discount)
For builders running early apps and prototypes.
Billed today. Includes 15 extra free days added to your first billing cycle.
For teams running production workloads.
Billed today. Includes 30 extra free days added to your first billing cycle.
For high-volume or custom requirements.
All plans use request-based pricing. No token calculations.
| Usage & Limits | ||||
|---|---|---|---|---|
| Requests included | 100 / day | 300 / day | High request limits | Custom |
| Burst rate limit | 5 / min | 30 / min | 120 / min (tunable) | Custom |
| Monthly request cap | Yes | Yes | No small daily cap | Custom |
| Queued behind paid traffic | Yes | Yes (Behind Premium) | No | No (Dedicated GPUs) |
| Models & Performance | ||||
| Optimized models over 8B | No | Limited | Yes | Yes |
| Production-grade inference | No | No | Yes | Yes |
| Priority execution | Lowest | Medium | Highest | Optional |
| Average response latency | ≤ 7 seconds | ≤ 1 second | ≤ 100 ms | Tunable |
| Request & Context Limits (Caps are for safety and performance, not billing) | ||||
| Input tokens / request | Up to 2k | Up to 4k | 8k-16k | Custom |
| Output tokens / request | Up to 512 | Up to 1k | Up to 4k | Custom |
| Pricing & Billing | ||||
| Request-based pricing | Yes | Yes | Yes | Yes |
| Token-based billing | No | No | No | No |
| Fixed monthly limits | Yes | Yes | Yes | Custom |
| Usage limits visible upfront | Yes | Yes | Yes | Yes |
| Developer Experience | ||||
| Open-source models | Yes | Yes | Yes | Yes |
| Simple API integration | Yes | Yes | Yes | Yes |
| Model-agnostic pricing | Yes | Yes | Yes | Yes |
| Support level | Community | Community | Priority | Dedicated |
| Infrastructure & Technical Differentiation | ||||
| Gateway-level request metering | Yes | Yes | Yes | Yes |
| Pricing independent of prompt length | Yes | Yes | Yes | Yes |
| Traffic prioritization by plan | No | Yes | Yes | Yes |
| Async and batch-friendly workloads | Yes | Yes | Yes | Yes |
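
Because the limits above are counted in requests rather than tokens, a client can stay within its plan by counting its own calls. The following is a minimal sketch of such a client-side guard, not part of any official Oxlo SDK. The tier keys `free` and `starter` are placeholder labels for the first two columns of the table, and the `RequestBudget` class is hypothetical. It also assumes the daily allowance behaves like a rolling 24-hour window, which the table does not state.

```python
from collections import deque
import time

# Limits copied from the first two columns of the table above.
# The keys "free" and "starter" are placeholder labels, not official plan names.
PLAN_LIMITS = {
    "free": {"per_minute": 5, "per_day": 100},
    "starter": {"per_minute": 30, "per_day": 300},
}


class RequestBudget:
    """Client-side guard that tracks burst and daily request limits."""

    def __init__(self, per_minute, per_day, clock=time.monotonic):
        self.per_minute = per_minute
        self.per_day = per_day
        self.clock = clock            # injectable so the guard can be tested
        self.minute_window = deque()  # timestamps of requests in the last 60 s
        self.day_window = deque()     # timestamps of requests in the last 24 h

    def _evict(self, now):
        # Drop timestamps that have aged out of each window.
        while self.minute_window and now - self.minute_window[0] >= 60:
            self.minute_window.popleft()
        while self.day_window and now - self.day_window[0] >= 86400:
            self.day_window.popleft()

    def wait_time(self):
        """Seconds until the next request fits both windows (0.0 if it fits now)."""
        now = self.clock()
        self._evict(now)
        wait = 0.0
        if len(self.minute_window) >= self.per_minute:
            wait = max(wait, 60 - (now - self.minute_window[0]))
        if len(self.day_window) >= self.per_day:
            wait = max(wait, 86400 - (now - self.day_window[0]))
        return wait

    def try_acquire(self):
        """Record a request and return True if it fits, otherwise return False."""
        if self.wait_time() > 0:
            return False
        now = self.clock()
        self.minute_window.append(now)
        self.day_window.append(now)
        return True
```

A caller would build the guard with `RequestBudget(**PLAN_LIMITS["free"])`, call `try_acquire()` before each API request, and sleep for `wait_time()` seconds whenever it returns `False`. The server-side limits remain authoritative, so a guard like this only reduces rejected requests and does not replace handling of rate-limit responses.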