Available models

Serverless RL currently supports the following foundation models for training. To request support for a model that is not listed, contact support.

Model catalog

Model	Model ID (for API usage)	Type	Context Window	Parameters	Description
OpenPipe Qwen3 14B Instruct	`OpenPipe/Qwen3-14B-Instruct`	Text	32.8K	14.8B (Total)	An efficient, multilingual, dense, instruction-tuned model, optimized by OpenPipe for building agents with fine-tuning.
Qwen3 30B A3B	`Qwen/Qwen3-30B-A3B-Instruct-2507`	Text	262K	3.3B-30.5B (Active-Total)	Qwen3-30B-A3B-Instruct-2507 is a 30.5B MoE instruction-tuned model with enhanced reasoning, coding, and long-context understanding.
Qwen3.5 35B A3B	`Qwen/Qwen3.5-35B-A3B`	Text	262K	3B-36B (Active-Total)	A multimodal MoE model with 256 experts (8 routed + 1 shared active) combining vision-language understanding, agentic tool use, and 201-language support.
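
As a rough illustration of how the Model ID column might be used in client code, the sketch below mirrors the catalog in a small local lookup table and checks an ID before it is passed to a training job. The names `ModelInfo`, `CATALOG`, `is_supported`, and `models_with_context` are hypothetical helpers, not part of any Serverless RL API, and the context-window values are the rounded figures from the table rather than exact token counts.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelInfo:
    """One row of the model catalog (hypothetical local mirror)."""
    model_id: str
    context_window: int      # approximate tokens, rounded as in the table
    total_params_b: float    # total parameters, in billions


# Values copied from the catalog table above; rounded, not exact.
CATALOG = [
    ModelInfo("OpenPipe/Qwen3-14B-Instruct", 32_800, 14.8),
    ModelInfo("Qwen/Qwen3-30B-A3B-Instruct-2507", 262_000, 30.5),
    ModelInfo("Qwen/Qwen3.5-35B-A3B", 262_000, 36.0),
]


def is_supported(model_id: str) -> bool:
    """Return True if model_id exactly matches a catalog entry (case-sensitive)."""
    return any(m.model_id == model_id for m in CATALOG)


def models_with_context(min_tokens: int) -> list[str]:
    """Return the IDs of models whose context window is at least min_tokens."""
    return [m.model_id for m in CATALOG if m.context_window >= min_tokens]


if __name__ == "__main__":
    print(is_supported("OpenPipe/Qwen3-14B-Instruct"))
    print(models_with_context(100_000))
```

Validating the ID locally before starting a job is a convenience only; the table above remains the source of truth for which models are supported.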
