Debate Championship For LLM (Basic Rules)

Date

January 9, 2026

Type

Phonon

Antecedent

No prior reading is required.

Emergence

While reading an article about ‘Humanity’s Last Test’, I wondered if it would be possible to orchestrate an intellectual contest between two Large Language Models (LLMs), presided over by a third LLM acting as the judge. To test this, I established a specific set of rules for an automated debate.

Stabilization

Below is the framework for the debate competition. I designed the protocol to be as unbiased, simple, and free from human intervention as possible.

Assign two LLMs (A and B) as players and one LLM (C) as the referee.
Model C generates 10 pros-and-cons discussion topics.

‣

Prompt (C)

Ask A and B for their stance on each of the ten topics.

‣

Prompt (A, B)

Select the first contending topic and order A, B to make a claim and supporting reasoning for that opinion.

‣

Prompt (A, B)

Exchange opinions and ask each model to refute the other's claims.

‣

Prompt (A, B)

Exchange the rebuttals and ask for a counter-rebuttal.

‣

Prompt (A, B)

Based on the interactions, have A and B draw their final conclusions.

‣

Prompt (A, B)

Finally, provide Model C with the summarized transcript and ask it to select the winner.

‣

Prompt (C)

Convergence

I am looking forward to the fascinating results this competition will yield. I will write a follow-up article to post the results of the actual debates.

Descendant

The following link leads to the competition results.

Debate Championship For LLM (ChatGPT vs Gemini; Copilot)

Title	Type
Debate Championship For LLM (ChatGPT vs Gemini; Copilot)	Phonon
Debate Championship For LLM (Basic Rules)	Phonon
How to Summon Cubic Dice With Only Players’ Brains	Tachyon
The Surprising Worth of Easy Problems in Test Scoring	Phonon
Reverse Engineering My Personal Classical Music Preference	Gluon
Seeking the Hidden Unknown Chess Openings	Tachyon
A Pedestrian's Guide to Harsh Winter	Phonon
On the Usefulness of a Crosswalk Without Traffic Lights	Lepton