Serverless RL currently supports training on the following foundation models. To express interest in another model, contact support.

Model catalog

| Model | Model ID (for API usage) | Type | Context Window | Parameters | Description |
| --- | --- | --- | --- | --- | --- |
| OpenPipe Qwen3 14B Instruct | OpenPipe/Qwen3-14B-Instruct | Text | 32.8K | 14.8B (Total) | An efficient multilingual, dense, instruction-tuned model, optimized by OpenPipe for building agents with finetuning. |
| Qwen3 30B A3B | Qwen/Qwen3-30B-A3B-Instruct-2507 | Text | 262K | 3.3B-30.5B (Active-Total) | Qwen3-30B-A3B-Instruct-2507 is a 30.5B MoE instruction-tuned model with enhanced reasoning, coding, and long-context understanding. |
| Qwen3.5 35B A3B | Qwen/Qwen3.5-35B-A3B | Text | 262K | 3B-36B (Active-Total) | A multimodal MoE model with 256 experts (8 routed + 1 shared active) combining vision-language understanding, agentic tool use, and 201-language support. |
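
When a training script accepts a model ID from configuration, it can help to check the ID against the catalog before submitting a job. The following is a minimal sketch, not part of any official SDK: `ModelInfo`, `CATALOG`, `is_supported`, and `models_with_context` are hypothetical names introduced for illustration, and the context-window values are the rounded figures shown in the catalog above rather than exact token limits.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelInfo:
    """One catalog row (hypothetical helper type, not an official API)."""
    name: str
    model_id: str
    context_window: int  # tokens, rounded as displayed in the catalog


# Values copied from the catalog; 32.8K and 262K are kept as rounded figures.
CATALOG = (
    ModelInfo("OpenPipe Qwen3 14B Instruct", "OpenPipe/Qwen3-14B-Instruct", 32_800),
    ModelInfo("Qwen3 30B A3B", "Qwen/Qwen3-30B-A3B-Instruct-2507", 262_000),
    ModelInfo("Qwen3.5 35B A3B", "Qwen/Qwen3.5-35B-A3B", 262_000),
)


def is_supported(model_id: str) -> bool:
    """Return True if the exact model ID appears in the catalog."""
    return any(m.model_id == model_id for m in CATALOG)


def models_with_context(min_tokens: int) -> list[str]:
    """Return the IDs of models whose context window is at least min_tokens."""
    return [m.model_id for m in CATALOG if m.context_window >= min_tokens]


if __name__ == "__main__":
    print(is_supported("OpenPipe/Qwen3-14B-Instruct"))
    print(models_with_context(100_000))
```

Matching on the exact ID string is deliberate: the IDs are case-sensitive identifiers with an organization prefix, so a near match such as `Qwen/Qwen3-14B` should be rejected rather than silently corrected.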