Guaranteed 15% off your current AI inference bill for team spending up to $20000 / month.
Book a call →Choose an open-source model and deploy it in seconds.

Full-scale MoE reasoning model with top-tier coding, math, and long-context performance across all major benchmarks.

Latest high-capacity reasoning model built for long context understanding, research workflows, and complex problem solving.

744B parameter MoE model built for complex systems engineering, long-horizon agentic tasks, and advanced reasoning.

Efficient MoE model with 1M context and near state-of-the-art open-source reasoning performance.

Mixture-of-Experts model optimized for coding, agentic tool use, complex workflows, and office productivity tasks.

Reasoning focused language model specialized for analytical tasks, coding workflows, and complex multi-step reasoning.

Powerful general-purpose language model delivering strong reasoning, coding, and natural language performance.

High-capacity reasoning model built for long context understanding, research workflows, and multi-step problem solving.

Premium reasoning-optimized language model designed for deep thinking, complex problem solving, and advanced multi-step AI tasks.

Premium open-source GPT-style large language model with top-tier reasoning, coding, and natural language capabilities for high-performance AI applications

Large-scale model designed for complex reasoning and production-grade workloads.

Flagship reasoning model offering superior accuracy and deep problem-solving capabilities for demanding AI workloads.

Advanced language model optimized for complex reasoning, structured responses, and high-performance AI assistants.

Specialized model for code generation, refactoring, and programming assistance.

Large multilingual model built for strong reasoning and generation tasks.

Code-focused language model for software development and technical reasoning.

Large language model focused on high-quality text generation and reasoning.

Open-source GPT-style model supporting both natural language and coding tasks.

Efficient large language model designed for strong reasoning, conversational AI, and scalable production deployments.

Efficient instruction tuned language model designed for conversational AI, structured responses, and scalable production applications.

Strong open-source model with excellent instruction following and reasoning quality.

Efficient reasoning-focused language model optimized for fast, cost-effective problem solving and structured AI tasks.

Multilingual language model optimized for general reasoning and conversational tasks.

Fast, efficient general-purpose language model for chat, summarization, and reasoning.

Compact model designed for efficient inference with solid generation quality.

Lightweight model optimized for low-latency and cost-efficient workloads.

Premium image generation model delivering exceptional visual quality, precise prompt adherence, and reliable production-grade performance

High-speed diffusion model optimized for rapid image generation.

Widely used text-to-image model for fast and flexible image generation.

Latest YOLO model offering improved object detection accuracy and performance.

Real-time object detection model for images and video streams.

State-of-the-art multilingual speech-to-text model with improved accuracy, robustness, and performance for production transcription.

High-accuracy transcription model suitable for production use cases.

Reliable speech recognition model for transcription and audio analysis.

High-quality embedding model optimized for semantic search and RAG pipelines.

Embedding model optimized for similarity search and information retrieval.

Lightweight speech synthesis model for generating natural-sounding audio.

Flagship image generation model optimized for ultra-realistic visuals and delivering exceptional photorealism.

High-resolution image generation model focused on output realism and detail.

High-speed code generation model designed for rapid completions, responsive coding workflows, and efficient developer productivity
Hi there! Try our cost calculator to see what you'd save with Oxlo.ai.