Skip to main content

Using Cerebras With Roo Code

Cerebras AI specializes in extremely fast inference speeds (up to 2600 tokens/second) with competitive pricing, including a free tier. Their models are optimized for coding, general intelligence, and reasoning tasks.

Website: https://cloud.cerebras.ai/


Getting an API Key

  1. Sign Up/Sign In: Go to Cerebras Cloud. Create an account or sign in.
  2. Navigate to API Keys: Access the API keys section in your dashboard.
  3. Create a Key: Generate a new API key. Give it a descriptive name (e.g., "Roo Code").
  4. Copy the Key: Important: Copy the API key immediately. Store it securely.

Supported Models

Roo Code supports the following Cerebras models:

  • qwen-3-coder-480b-free (Default)
  • qwen-3-coder-480b
  • qwen-3-235b-a22b-instruct-2507
  • llama-3.3-70b
  • qwen-3-32b
  • qwen-3-235b-a22b-thinking-2507

Configuration in Roo Code

  1. Open Roo Code Settings: Click the gear icon () in the Roo Code panel.
  2. Select Provider: Choose "Cerebras" from the "API Provider" dropdown.
  3. Enter API Key: Paste your Cerebras API key into the "Cerebras API Key" field.
  4. Select Model: Choose your desired model from the "Model" dropdown.

Tips and Notes

  • Performance: Cerebras specializes in extremely fast inference speeds, making it ideal for real-time coding assistance.
  • Free Tier: The qwen-3-coder-480b-free model provides access to high-performance inference at no cost with rate limits.
  • Context Windows: Models support context windows ranging from 64K to 128K tokens.
  • Pricing: Refer to the Cerebras Cloud dashboard for the latest pricing information.