Skip to main content
Request demo

RAG Arena &
Dynamic Routing

Compare multiple knowledge bases side-by-side, discover which performs best for each question type, and let the system automatically route future queries to the winning source.

Side-by-Side Knowledge Comparison

Send a question to multiple RAG configurations simultaneously. See how different knowledge bases respond, then pick the winner.

"What are the recommended dosing guidelines for this medication?"
Variant AQdrantQdrant
Retrieving context...
Based on the clinical documentation, the recommended starting dose is 10mg daily, with titration up to 40mg based on patient response. Key monitoring parameters include...
Variant BCloudflareCloudflare Vectorize
Retrieving context...
Dosing should begin at the lowest effective dose. The product label indicates 10-20mg as the typical range. Patients should be monitored for adverse effects during the first two weeks...

How RAG Arena Works

A structured process for finding the best knowledge base configuration for every question type.

1

Configure Variants

Set up 2-8 arena variants, each pointing to a different RAG vector or knowledge base.

2

Run in Parallel

Send questions to all variants simultaneously. Each retrieves context from its own knowledge base.

3

Compare & Score

View responses side-by-side with inline quality scores for relevance, hallucination, and completeness.

4

Learn & Route

The system learns which knowledge base wins for each question type and routes future queries automatically.

Dynamic RAG Routing

Once arena results establish preferences, incoming questions are automatically routed to the best-performing knowledge base — no manual configuration needed.

Incoming
User Question
Semantic Match
Route Lookup
Best Match
Qdrant (0.94)
Response
Optimized Answer

Built for Enterprise RAG

Everything you need to optimize retrieval quality at scale.

Multi-Provider Support

Compare across Qdrant, Cloudflare Vectorize, Couchbase, MongoDB Atlas, and more in a single arena experiment.

Inline Quality Scoring

Every response is scored for relevance, hallucination, correctness, and completeness in real time.

Auto-Fix Integration

Arena results feed into the QA auto-fix loop, automatically learning optimal routing with quality thresholds.

Full Audit Trail

Every routing decision is tracked with source (arena, auto-fix, or manual), scores, and timestamps in MongoDB.

Fast Routing via KV

Learned preferences stored in Cloudflare KV for sub-millisecond lookups. MongoDB maintains the full history.

Arena Presets

Save, name, and reuse arena configurations. Export as portable JSON for sharing across teams.

Supported Vector Providers

Run arena experiments across any combination of vector databases.

QdrantQdrant
CloudflareCloudflare Vectorize
CouchbaseCouchbase
GoogleVertex AI Vector Search
MongoDB Atlas
Redis
PageIndexPageIndex

RAG Retrieval Architectures

Route queries to different retrieval strategies based on question type. Each architecture optimizes for different use cases.

VECTOR SEARCHQEmbedR

Vector Search

Semantic similarity matching via dense embeddings. Best for natural language questions and conceptual lookups.

HYBRID SEARCHQVectorBM25RRF

Hybrid (BM25 + Vector)

Combines keyword matching with semantic search using Reciprocal Rank Fusion. Best for precise technical queries.

RERANKINGQRetrieveCross-encoderrerank3

Reranking

Initial retrieval followed by cross-encoder reranking for precision. Best for high-stakes answers requiring accuracy.

AGENTIC RAGQAgentRouterKB 1KB 2ToolsSyn

Agentic RAG

LLM-driven routing across multiple knowledge bases and tools. Best for complex multi-step questions requiring synthesis.

Find your optimal RAG configuration

Stop guessing which knowledge base performs best. Let RAG Arena show you the data, then let Dynamic Routing handle the rest.