Using Cerebras With Roo Code
Cerebras AI specializes in extremely fast inference speeds (up to 2600 tokens/second) with competitive pricing, including a free tier. Their models are optimized for coding, general intelligence, and reasoning tasks.
Website: https://cloud.cerebras.ai/
Getting an API Key
- Sign Up/Sign In: Go to Cerebras Cloud. Create an account or sign in.
- Navigate to API Keys: Access the API keys section in your dashboard.
- Create a Key: Generate a new API key. Give it a descriptive name (e.g., "Roo Code").
- Copy the Key: Important: Copy the API key immediately. Store it securely.
Supported Models
Roo Code supports the following Cerebras models:
qwen-3-coder-480b-free
(Default)qwen-3-coder-480b
qwen-3-235b-a22b-instruct-2507
llama-3.3-70b
qwen-3-32b
qwen-3-235b-a22b-thinking-2507
Configuration in Roo Code
- Open Roo Code Settings: Click the gear icon () in the Roo Code panel.
- Select Provider: Choose "Cerebras" from the "API Provider" dropdown.
- Enter API Key: Paste your Cerebras API key into the "Cerebras API Key" field.
- Select Model: Choose your desired model from the "Model" dropdown.
Tips and Notes
- Performance: Cerebras specializes in extremely fast inference speeds, making it ideal for real-time coding assistance.
- Free Tier: The
qwen-3-coder-480b-free
model provides access to high-performance inference at no cost with rate limits. - Context Windows: Models support context windows ranging from 64K to 128K tokens.
- Pricing: Refer to the Cerebras Cloud dashboard for the latest pricing information.