Guaranteed 15% off your current AI inference bill for team spending up to $20000 / month.

ModelRegistry

Choose an open-source model and deploy it in seconds.

Models

DeepSeek V4 Pro

Full-scale MoE reasoning model with top-tier coding, math, and long-context performance across all major benchmarks.

LLMReasoningCode

Chat now

Kimi K2.6

Latest high-capacity reasoning model built for long context understanding, research workflows, and complex problem solving.

LLMTextLong Context

Chat now

GLM 5

744B parameter MoE model built for complex systems engineering, long-horizon agentic tasks, and advanced reasoning.

LLMReasoningCode

Chat now

DeepSeek V4 Flash

Efficient MoE model with 1M context and near state-of-the-art open-source reasoning performance.

LLMReasoningCode

Chat now

Minimax M2.5

Mixture-of-Experts model optimized for coding, agentic tool use, complex workflows, and office productivity tasks.

LLMCodeReasoning

Chat now

DeepSeek R1 0528

Reasoning focused language model specialized for analytical tasks, coding workflows, and complex multi-step reasoning.

LLMReasoningCode

Chat now

DeepSeek V3.2

Powerful general-purpose language model delivering strong reasoning, coding, and natural language performance.

LLMReasoningCode

Chat now

Kimi K2.5

High-capacity reasoning model built for long context understanding, research workflows, and multi-step problem solving.

LLMTextLong Context

Chat now

Kimi-K2-Thinking

Premium reasoning-optimized language model designed for deep thinking, complex problem solving, and advanced multi-step AI tasks.

LLMText

Chat now

GPT-OSS-120B

Premium open-source GPT-style large language model with top-tier reasoning, coding, and natural language capabilities for high-performance AI applications

LLMTextCode

Chat now

Llama-3.3-70B

Large-scale model designed for complex reasoning and production-grade workloads.

LLMText

Chat now

DeepSeek R1 70B

Flagship reasoning model offering superior accuracy and deep problem-solving capabilities for demanding AI workloads.

LLMReasoning

Chat now

DeepSeek V3 0324

Advanced language model optimized for complex reasoning, structured responses, and high-performance AI assistants.

LLMReasoningChat

Chat now

DeepSeek Coder - 33B

Specialized model for code generation, refactoring, and programming assistance.

LLMCode

Chat now

Qwen-3 32B

Large multilingual model built for strong reasoning and generation tasks.

LLMText

Chat now

Qwen-3 Coder 30B

Code-focused language model for software development and technical reasoning.

LLMCode

Chat now

Gemma-3-27B

Large language model focused on high-quality text generation and reasoning.

LLMText

Chat now

GPT-OSS-20B

Open-source GPT-style model supporting both natural language and coding tasks.

LLMTextCode

Chat now

Llama 4 Maverick 17B

Efficient large language model designed for strong reasoning, conversational AI, and scalable production deployments.

LLMReasoningChat

Chat now

Ministral 3 14B Instruct

Efficient instruction tuned language model designed for conversational AI, structured responses, and scalable production applications.

LLMReasoningChat

Chat now

Llama-3.1-8B

Strong open-source model with excellent instruction following and reasoning quality.

LLMText

Chat now

DeepSeek R1 8B

Efficient reasoning-focused language model optimized for fast, cost-effective problem solving and structured AI tasks.

LLMReasoning

Chat now

Qwen 2.5-7B

Multilingual language model optimized for general reasoning and conversational tasks.

LLMText

Chat now

Mistral-7B

Fast, efficient general-purpose language model for chat, summarization, and reasoning.

LLMText

Chat now

Gemma-3-4B

Compact model designed for efficient inference with solid generation quality.

LLMText

Chat now

Llama-3.2-3B

Lightweight model optimized for low-latency and cost-efficient workloads.

LLMText

Chat now

Oxlo Image Pro

Premium image generation model delivering exceptional visual quality, precise prompt adherence, and reliable production-grade performance

Image

Generate

Flux.1 Schnell

High-speed diffusion model optimized for rapid image generation.

Image

Generate

Stable Diffusion v1.5

Widely used text-to-image model for fast and flexible image generation.

Image

Generate

YOLOv11

Latest YOLO model offering improved object detection accuracy and performance.

Computer Vision

Connect

YOLOv9

Real-time object detection model for images and video streams.

Computer Vision

Connect

Whisper Large v3

State-of-the-art multilingual speech-to-text model with improved accuracy, robustness, and performance for production transcription.

AudioSpeech to Text

Generate

Whisper Large

High-accuracy transcription model suitable for production use cases.

AudioSpeech to Text

Generate

Whisper-Medium

Reliable speech recognition model for transcription and audio analysis.

AudioSpeech to Text

Generate

BGE-Large

High-quality embedding model optimized for semantic search and RAG pipelines.

Embeddings

Deploy

E5-Large

Embedding model optimized for similarity search and information retrieval.

Embeddings

Deploy

Kokoro-82M

Lightweight speech synthesis model for generating natural-sounding audio.

AudioText to Speech

Generate

Oxlo Image Ultra

Flagship image generation model optimized for ultra-realistic visuals and delivering exceptional photorealism.

Image

Coming Soon

Stable Diffusion 3.5 Large

High-resolution image generation model focused on output realism and detail.

Image

Coming Soon

Oxlo Coder Fast

High-speed code generation model designed for rapid completions, responsive coding workflows, and efficient developer productivity

LLMCode

Coming Soon

Ox Assistant

Online