Codebase Indexing

⚠️ Experimental Feature: This feature is under active development and may change significantly in future releases.

Codebase Indexing enables semantic code search across your entire project using AI embeddings. Instead of searching for exact text matches, it understands the meaning of your queries, helping Roo Code find relevant code even when you don't know specific function names or file locations.

[Screenshot: Codebase Indexing settings]

What It Does

When enabled, the indexing system:

  1. Parses your code using Tree-sitter to identify semantic blocks (functions, classes, methods)
  2. Creates embeddings of each code block using AI models
  3. Stores vectors in a Qdrant database for fast similarity search
  4. Provides the codebase_search tool to Roo for intelligent code discovery

This enables natural language queries like "user authentication logic" or "database connection handling" to find relevant code across your entire project.
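
The data flowing through this pipeline can be pictured with a couple of illustrative TypeScript types (the names are assumptions for explanation, not Roo Code's internal types); concrete request sketches follow in the setup sections below:

// Illustrative shapes only; not Roo Code's actual internal types.
interface CodeBlock {
  filePath: string;
  startLine: number;
  endLine: number;
  text: string;      // semantic block from Tree-sitter (or fallback chunking)
}

interface IndexedPoint {
  id: string;
  vector: number[];  // embedding computed from CodeBlock.text
  payload: Omit<CodeBlock, "text">; // metadata returned with search hits
}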

Key Benefits

  • Semantic Search: Find code by meaning, not just keywords
  • Enhanced AI Understanding: Roo can better comprehend and work with your codebase
  • Cross-Project Discovery: Search across all files, not just what's open
  • Pattern Recognition: Locate similar implementations and code patterns

Setup Requirements

Embedding Provider

Choose one of these options for generating embeddings:

OpenAI (Recommended)

  • Requires OpenAI API key
  • Supports all OpenAI embedding models
  • Default: text-embedding-3-small
  • Processes up to 100,000 tokens per batch
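
For reference, a single embedding request against OpenAI's REST API looks like the sketch below (TypeScript, Node 18+; Roo Code batches many blocks per request and handles retries internally):

// Minimal sketch of one OpenAI embedding request.
// Assumes OPENAI_API_KEY is set in the environment.
const res = await fetch("https://api.openai.com/v1/embeddings", {
  method: "POST",
  headers: {
    "Authorization": `Bearer ${process.env.OPENAI_API_KEY}`,
    "Content-Type": "application/json",
  },
  body: JSON.stringify({
    model: "text-embedding-3-small",                 // the default model
    input: ["function add(a, b) { return a + b; }"], // code block text
  }),
});
const { data } = await res.json();
console.log(data[0].embedding.length); // 1536 dimensions for this model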

Ollama (Local)

  • Requires local Ollama installation
  • No API costs or internet dependency
  • Supports any Ollama-compatible embedding model
  • Requires Ollama base URL configuration
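
The equivalent request against a local Ollama server is sketched below (assumes Ollama is running on its default port and the model has already been pulled):

// Minimal sketch of one Ollama embedding request.
// Run `ollama pull nomic-embed-text` first.
const res = await fetch("http://localhost:11434/api/embeddings", {
  method: "POST",
  headers: { "Content-Type": "application/json" },
  body: JSON.stringify({
    model: "nomic-embed-text",
    prompt: "function add(a, b) { return a + b; }", // code block text
  }),
});
const { embedding } = await res.json();
console.log(embedding.length); // 768 dimensions for nomic-embed-text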

Vector Database

Qdrant is required for storing and searching embeddings:

  • Local: http://localhost:6333 (recommended for testing)
  • Cloud: Qdrant Cloud or self-hosted instance
  • Authentication: Optional API key for secured deployments
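
Behind the scenes, two Qdrant REST calls do most of the work: creating a collection sized to your embedding model, and running a similarity search. The sketch below illustrates both (the collection name and 1536-dimension size are assumptions matching text-embedding-3-small; Roo Code manages its own collections):

// Sketch of the Qdrant REST calls involved (TypeScript, Node 18+).

// 1. Create a collection configured for cosine similarity
await fetch("http://localhost:6333/collections/codebase", {
  method: "PUT",
  headers: { "Content-Type": "application/json" },
  body: JSON.stringify({ vectors: { size: 1536, distance: "Cosine" } }),
});

// 2. Find the 5 nearest neighbors of a query embedding
const queryEmbedding = new Array(1536).fill(0); // stand-in for a real embedding
const res = await fetch("http://localhost:6333/collections/codebase/points/search", {
  method: "POST",
  headers: { "Content-Type": "application/json" },
  body: JSON.stringify({ vector: queryEmbedding, limit: 5, with_payload: true }),
});
const { result } = await res.json(); // hits with payload and similarity score
console.log(result);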

Setting Up Qdrant

Quick Local Setup

Using Docker:

docker run -p 6333:6333 qdrant/qdrant

Using Docker Compose:

version: '3.8'
services:
  qdrant:
    image: qdrant/qdrant
    ports:
      - "6333:6333"
    volumes:
      - qdrant_storage:/qdrant/storage
volumes:
  qdrant_storage:

Production Deployment

For team or production use:

  • Qdrant Cloud - Managed service
  • Self-hosted on AWS, GCP, or Azure
  • Local server with network access for team sharing

Configuration

  1. Open Roo Code settings (⚙️ icon)
  2. Navigate to Experimental section
  3. Enable "Enable Codebase Indexing"
  4. Configure your embedding provider:
    • OpenAI: Enter API key and select model
    • Ollama: Enter base URL and select model
  5. Set Qdrant URL and optional API key
  6. Click Save to start initial indexing

Understanding Index Status

The interface shows real-time status with color indicators:

  • Standby (Gray): Not running, awaiting configuration
  • Indexing (Yellow): Currently processing files
  • Indexed (Green): Up-to-date and ready for searches
  • Error (Red): Failed state requiring attention

How Files Are Processed

Smart Code Parsing

  • Tree-sitter Integration: Uses AST parsing to identify semantic code blocks
  • Language Support: All languages supported by Tree-sitter
  • Fallback: Line-based chunking for unsupported file types (sketched after this list)
  • Block Sizing:
    • Minimum: 100 characters
    • Maximum: 1,000 characters
    • Splits large functions intelligently
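
For files without Tree-sitter support, line-based chunking within those size bounds might look like the following sketch (the extension's actual splitting heuristics are an implementation detail):

// Sketch of line-based fallback chunking within the stated bounds.
const MIN_CHARS = 100;
const MAX_CHARS = 1000;

function chunkByLines(source: string): string[] {
  const chunks: string[] = [];
  let current = "";
  for (const line of source.split("\n")) {
    // Flush before the chunk would exceed the maximum size
    if (current.length + line.length + 1 > MAX_CHARS && current.length >= MIN_CHARS) {
      chunks.push(current);
      current = "";
    }
    current += (current ? "\n" : "") + line;
  }
  // Keep the remainder only if it meets the minimum size
  if (current.length >= MIN_CHARS) chunks.push(current);
  return chunks;
}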

Automatic File Filtering

The indexer automatically excludes:

  • Binary files and images
  • Large files (>1MB)
  • Git repositories (.git folders)
  • Dependencies (node_modules, vendor, etc.)
  • Files matching .gitignore and .rooignore patterns
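
A simplified version of that filter is sketched below (the real logic also honors .gitignore and .rooignore patterns; the lists here are samples, not the exhaustive rules):

// Simplified sketch of the exclusion rules.
const EXCLUDED_DIRS = [".git", "node_modules", "vendor"];   // sample list
const BINARY_EXTENSIONS = [".png", ".jpg", ".gif", ".zip"]; // sample list
const MAX_FILE_BYTES = 1024 * 1024;                         // 1MB limit

function shouldIndex(filePath: string, sizeBytes: number): boolean {
  if (sizeBytes > MAX_FILE_BYTES) return false;             // too large
  if (BINARY_EXTENSIONS.some((ext) => filePath.endsWith(ext))) return false;
  const parts = filePath.split(/[\\/]/);
  return !parts.some((dir) => EXCLUDED_DIRS.includes(dir)); // skip dependencies
}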

Incremental Updates

  • File Watching: Monitors workspace for changes
  • Smart Updates: Only reprocesses modified files
  • Hash-based Caching: Avoids reprocessing unchanged content
  • Branch Switching: Automatically handles Git branch changes
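
Hash-based change detection can be sketched in a few lines (assuming a SHA-256 content hash; the actual scheme is an implementation detail):

// Sketch of hash-based change detection using Node's crypto module.
import { createHash } from "node:crypto";

const hashCache = new Map<string, string>(); // filePath -> last content hash

function needsReindex(filePath: string, content: string): boolean {
  const hash = createHash("sha256").update(content).digest("hex");
  if (hashCache.get(filePath) === hash) return false; // unchanged: skip
  hashCache.set(filePath, hash);                      // new or modified
  return true;
}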

Best Practices

Model Selection

For OpenAI:

  • text-embedding-3-small: Best balance of performance and cost
  • text-embedding-3-large: Higher accuracy, 5x more expensive
  • text-embedding-ada-002: Legacy model, lower cost

For Ollama:

  • mxbai-embed-large: Large model with high embedding quality
  • nomic-embed-text: Good balance of performance and embedding quality
  • all-minilm: Compact and fast, with lower embedding quality

Security Considerations

  • API Keys: Stored securely in VS Code's encrypted storage
  • Code Privacy: Only small code snippets sent for embedding (not full files)
  • Local Processing: All parsing happens locally
  • Qdrant Security: Use authentication for production deployments

Current Limitations

  • File Size: 1MB maximum per file
  • Markdown: Not currently supported due to parsing complexity
  • Single Workspace: One workspace at a time
  • Dependencies: Requires external services (embedding provider + Qdrant)
  • Language Coverage: Limited to Tree-sitter supported languages

Using the Search Feature

Once indexed, Roo can use the codebase_search tool to find relevant code:

Example Queries:

  • "How is user authentication handled?"
  • "Database connection setup"
  • "Error handling patterns"
  • "API endpoint definitions"

The tool provides Roo with:

  • Relevant code snippets
  • File paths and line numbers
  • Similarity scores
  • Contextual information

Privacy & Security

  • Code stays local: Only small code snippets sent for embedding
  • Embeddings are numeric: Not human-readable representations
  • Secure storage: API keys encrypted in VS Code storage
  • Local option: Use Ollama for completely local processing
  • Access control: Respects existing file permissions

Future Enhancements

Planned improvements:

  • Additional embedding providers
  • Improved markdown and documentation support
  • Multi-workspace indexing
  • Enhanced filtering and configuration options
  • Team sharing capabilities
  • Integration with VS Code's native search