Arena mode

Arena Mode lets you send the same message to two different AI models at once and compare their responses in a split view. Use it to evaluate model quality, test new models against your current default, or gather preference data across your team.

Enabling arena mode

Open any chat conversation.
Click the Swords icon in the chat input toolbar. The icon highlights when Arena Mode is active.
Two model dropdowns appear above the input, labeled A and B with vs between them.
Select a model for each side. The dropdowns show all models available to you based on your organization’s governance settings and the active agent’s supported models.
Type a message and send it.

To disable Arena Mode, click the Swords icon again. All arena state (model selections, threads, verdict) is cleared.

Note: Arena Mode requires at least two available models. If only one model is configured, the model selector is hidden and the toggle is disabled.

Split view

After sending a message, the chat area splits into two columns:

Column	Content
Left (A)	Messages from Model A’s thread
Right (B)	Messages from Model B’s thread

Each column has a header showing the model label and name. Both columns scroll independently and support the full set of chat features including approvals, file attachments, and message actions. You can continue sending messages while in Arena Mode. Each new message is sent to both models in parallel.

Recording a verdict

Once both models have responded, a verdict bar appears below the split view with four options:

Verdict	Effect
A is better	Records Model A as the preferred response
B is better	Records Model B as the preferred response and makes Thread B the active branch
Tie	Records that both responses were equally good
Both bad	Records that neither response was satisfactory

Verdicts are stored as feedback with metadata including the verdict choice, Model A ID, and Model B ID. Once recorded, the verdict buttons are disabled for that comparison round.

How it works

When you send a message in Arena Mode, the platform:

Creates two separate threads (or reuses existing arena threads).
Copies conversation history to both threads if this is the first arena message in an existing conversation.
Sends the same message to both models in parallel. Each model responds in its own thread without seeing the other’s output.
Creates a branch link so Thread B is tracked as a variant of Thread A.

This ensures a fair comparison — neither model is influenced by the other’s response.

Start here

Cloud

Self-hosted

Platform

Tutorials

Development

Legal

Enabling arena mode

Split view

Recording a verdict

How it works

Start here

Cloud

Self-hosted

Platform

Tutorials

Development

Legal

​Enabling arena mode

​Split view

​Recording a verdict

​How it works

Enabling arena mode

Split view

Recording a verdict

How it works