RAG Arena &
Dynamic Routing

Compare multiple knowledge bases side-by-side, discover which performs best for each question type, and let the system automatically route future queries to the winning source.

Request demo Explore AutoRAG

Side-by-Side Knowledge Comparison

Send a question to multiple RAG configurations simultaneously. See how different knowledge bases respond, then pick the winner.

"What are the recommended dosing guidelines for this medication?"

Variant A

Qdrant

Retrieving context...

Based on the clinical documentation, the recommended starting dose is 10mg daily, with titration up to 40mg based on patient response. Key monitoring parameters include...

Relevance: 0.94Hallucination: 0.021.2s

Variant B

Cloudflare Vectorize

Retrieving context...

Dosing should begin at the lowest effective dose. The product label indicates 10-20mg as the typical range. Patients should be monitored for adverse effects during the first two weeks...

Relevance: 0.87Hallucination: 0.052.1s

How RAG Arena Works

A structured process for finding the best knowledge base configuration for every question type.

Configure Variants

Set up 2-8 arena variants, each pointing to a different RAG vector or knowledge base.

Run in Parallel

Send questions to all variants simultaneously. Each retrieves context from its own knowledge base.

Compare & Score

View responses side-by-side with inline quality scores for relevance, hallucination, and completeness.

Learn & Route

The system learns which knowledge base wins for each question type and routes future queries automatically.

Dynamic RAG Routing

Once arena results establish preferences, incoming questions are automatically routed to the best-performing knowledge base — no manual configuration needed.

Incoming

User Question

→

Semantic Match

Route Lookup

→

Best Match

Qdrant (0.94)

→

Response

Optimized Answer

Built for Enterprise RAG

Everything you need to optimize retrieval quality at scale.

Multi-Provider Support

Compare across Qdrant, Cloudflare Vectorize, Couchbase, MongoDB Atlas, and more in a single arena experiment.

Inline Quality Scoring

Every response is scored for relevance, hallucination, correctness, and completeness in real time.

Auto-Fix Integration

Arena results feed into the QA auto-fix loop, automatically learning optimal routing with quality thresholds.

Full Audit Trail

Every routing decision is tracked with source (arena, auto-fix, or manual), scores, and timestamps in MongoDB.

Fast Routing via KV

Learned preferences stored in Cloudflare KV for sub-millisecond lookups. MongoDB maintains the full history.

Arena Presets

Save, name, and reuse arena configurations. Export as portable JSON for sharing across teams.

Supported Vector Providers

Run arena experiments across any combination of vector databases.

Qdrant

Cloudflare Vectorize

Couchbase

Vertex AI Vector Search

MongoDB Atlas

Redis

PageIndex

RAG Retrieval Architectures

Route queries to different retrieval strategies based on question type. Each architecture optimizes for different use cases.

Vector Search

Semantic similarity matching via dense embeddings. Best for natural language questions and conceptual lookups.

Hybrid (BM25 + Vector)

Combines keyword matching with semantic search using Reciprocal Rank Fusion. Best for precise technical queries.

Reranking

Initial retrieval followed by cross-encoder reranking for precision. Best for high-stakes answers requiring accuracy.

Agentic RAG

LLM-driven routing across multiple knowledge bases and tools. Best for complex multi-step questions requiring synthesis.

Find your optimal RAG configuration

Stop guessing which knowledge base performs best. Let RAG Arena show you the data, then let Dynamic Routing handle the rest.

Request demo Explore Quality Assurance

RAG Arena &Dynamic Routing