Multi-Agent Adversarial Debate Pipeline — Claude Workflow Catalog
An 8-phase multi-agent pipeline that uses structured adversarial debate to produce high-quality, well-researched outputs. Combines blind deliberation, web search diversity, confidence-weighted synthesis, and independent verification to defend against sycophancy, consensus inertia, and error cascade.
Tags: research, ai, debate, multi-agent
526 total tokens across 2 prompts; estimated cost $0.0016. Token counts are approximate (OpenAI cl100k_base encoding, within ~15% of Claude); pricing last verified 2026-04-12.
Pipeline
Failure Mode Coverage
[Coverage matrix, rendered interactively on the original page: phases 1-8 against eleven failure modes -- Sycophancy, Majority Tyranny, Bluster Effect, Error Cascade, Context Drift, Consensus Inertia, No New Signal, Role Bleed, Complexity Creep, Decision Oscillation, Coordinator Hedging. Each phase's coverage is listed in its "Defends:" line below.]
Phases
Phase 1
Input Preparation
Compose clean context for this round
Three components assembled: (1) Original constraints -- verbatim, never summarized. (2) Current synthesis -- compressed into Decided / Active / Minority Report. (3) Role-specific instruction with toke…
Defends: Context drift, Consensus inertia
If compressed context unavailable, fall back to last auto-saved synthesis. If none exists, start from original input.
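The Phase 1 fallback chain can be sketched in TypeScript (the dashboard's target language). The `Synthesis` shape and all names here are illustrative assumptions, not part of the actual prompt templates:

```typescript
// Sketch of Phase 1 input preparation. All names are illustrative assumptions.
interface Synthesis { decided: string[]; active: string[]; minority: string[] }

interface RoundContext {
  originalConstraints: string;      // verbatim, never summarized
  synthesis: Synthesis | null;      // compressed Decided / Active / Minority Report
  roleInstruction: string;          // role-specific instruction for this agent
}

function prepareInput(
  originalConstraints: string,
  compressed: Synthesis | null,     // output of Phase 6, if it succeeded
  lastAutoSaved: Synthesis | null,  // fallback: last auto-saved synthesis
  roleInstruction: string,
): RoundContext {
  // Fallback chain: compressed context -> last auto-save -> original input only.
  const synthesis = compressed ?? lastAutoSaved ?? null;
  return { originalConstraints, synthesis, roleInstruction };
}
```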
Phase 2
Independent Proposals + Research
Agents respond blind -- each critiques before proposing
Each agent responds in isolation -- no agent sees others. Web search enabled. Every agent names one weakness before proposing. Default: 2 Optimizers + Challenger + Simplifier (4 agents, matching DeepM…
Optimizer AOptimizer BChallengerSimplifier
Defends: Sycophancy, Majority tyranny, Bluster effect, No new signal, Complexity creep
2 retries per agent. Continue with N-1 if one fails (min 2). Auto-save each response. Never skip Challenger.
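A minimal sketch of the blind fan-out with these retry rules, assuming a `callAgent` stand-in for the real model API call (the function itself is hypothetical; the retry count, N-1 continuation, and Challenger rule come from the workflow):

```typescript
// Sketch of the Phase 2 blind fan-out. `callAgent` stands in for the real
// model API call; retry and fallback numbers come from the workflow rules.
type AgentRole = "Optimizer A" | "Optimizer B" | "Challenger" | "Simplifier";

interface Proposal { role: AgentRole; weakness: string; proposal: string }

async function runBlindRound(
  roles: AgentRole[],
  callAgent: (role: AgentRole) => Promise<Proposal>,
): Promise<Proposal[]> {
  const results: Proposal[] = [];
  for (const role of roles) {
    let got: Proposal | null = null;
    // 1 attempt + 2 retries per agent; agents never see each other's output.
    for (let attempt = 0; attempt < 3 && got === null; attempt++) {
      try { got = await callAgent(role); } catch { /* retry */ }
    }
    if (got) results.push(got); // an auto-save of each response would go here
    else if (role === "Challenger") throw new Error("Never skip Challenger");
  }
  // Continue with N-1 if one agent fails, but never with fewer than 2.
  if (results.length < 2) throw new Error("Fewer than 2 agents responded");
  return results;
}
```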
Phase 3
Challenge Register + Arbitration
Challenges logged with steel man -- arbitrated by Coordinator in same call as Phase 04
Challenger objections extracted into numbered register with: steel man, objection, evidence, confidence. Relevant agents defend or concede. Phases 03 and 04 are a SINGLE Coordinator API call. The Coor…
Defends: Decision oscillation, Bluster effect
If defense calls fail, challenge marked 'undefended' but NOT auto-adopted. Coordinator still decides.
Phase 4
Coordinator Synthesizes
Single API call that handles both challenge arbitration and synthesis. Receives: all blind agent proposals, the challenge register with defenses, Simplifier flags, dimension weights, and a ROSTER NOTE…
Most critical call. 3 retries. If fails, save raw responses as fallback. Surface to user.
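One way to model an entry in the challenge register described above; the field names and the `registerChallenge` helper are assumptions drawn from the phase description, not from the actual templates:

```typescript
// Assumed shape of one entry in the Phase 3 challenge register.
interface Challenge {
  id: number;
  steelMan: string;       // strongest version of the position being attacked
  objection: string;
  evidence: string;
  confidence: number;     // 1-10, reported by the Challenger
  defense: string | null; // null => "undefended", but NOT auto-adopted
  ruling: "upheld" | "overruled" | "pending"; // set by the Coordinator
}

// Undefended challenges stay pending for the Coordinator; nothing auto-adopts.
function registerChallenge(
  register: Challenge[],
  entry: Omit<Challenge, "id" | "ruling">,
): Challenge {
  const c: Challenge = { id: register.length + 1, ruling: "pending", ...entry };
  register.push(c);
  return c;
}
```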
Phase 5
Independent Verification
Score, diff, sunset, trajectory -- checks synthesis against raw proposals
One API call, structured JSON output. The Verifier sees raw agent proposals alongside the Coordinator's synthesis. If the Coordinator silently dropped an agent's high-confidence point, the raw_vs_synt…
If fails, proceed but flag verification skipped. Double-verify next round.
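The raw-vs-synthesis diff the Verifier performs might look like the following sketch; the naive substring match and the confidence threshold are illustrative assumptions:

```typescript
// Sketch of the Verifier's raw-vs-synthesis check: flag high-confidence
// agent points that never made it into the Coordinator's synthesis.
interface RawPoint { agent: string; point: string; confidence: number }

function findDroppedPoints(
  proposals: RawPoint[],
  synthesis: string,
  minConfidence = 8, // assumed threshold for "high confidence"
): RawPoint[] {
  return proposals.filter(
    p => p.confidence >= minConfidence && !synthesis.includes(p.point),
  );
}
```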
Phase 6
Compress Context + Minority Report
Decided / active / minority / original -- 800 token hard cap with triage
Compress to four sections with TRIAGE PRIORITY if 800 tokens is tight: (1) ORIGINAL CONSTRAINTS -- verbatim, never cut, always first priority. (2) DECIDED -- locked decisions, bullet points only. (3)…
Defends: Context drift, Consensus inertia
If fails, pass raw synthesis. Always auto-save to storage.
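A sketch of the triage under the 800-token cap. The chars/4 token estimate is a crude assumption; the cap, the four sections, and the cut order (minority first, original never) come from the workflow:

```typescript
// Sketch of Phase 6 compression with triage.
const estTokens = (s: string): number => Math.ceil(s.length / 4); // crude estimate

function compressContext(
  original: string,   // verbatim, never cut, always first
  decided: string[],
  active: string[],
  minority: string[],
  capTokens = 800,
): string {
  const buckets = {
    decided: [...decided], active: [...active], minority: [...minority],
  };
  const render = () =>
    [
      `ORIGINAL CONSTRAINTS\n${original}`,
      `DECIDED\n${buckets.decided.join("\n")}`,
      `ACTIVE\n${buckets.active.join("\n")}`,
      `MINORITY REPORT\n${buckets.minority.join("\n")}`,
    ].join("\n\n");
  // Triage cut order when over the cap: minority, then active, then decided.
  for (const key of ["minority", "active", "decided"] as const) {
    while (estTokens(render()) > capTokens && buckets[key].length > 0) {
      buckets[key].pop();
    }
  }
  return render();
}
```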
Phase 7
Decision Gate + Human Review
Continue / fork / stop -- with oscillation detection + human override
Five checks: (1) OSCILLATION DETECTION -- hash key decisions and compare to prior rounds. (2) HUMAN REVIEW -- optional pause. (3) CONTINUE -- scores improved, active items remain. (4) FORK -- incompat…
Defends: Sycophancy, Complexity creep
Lightweight check, no API call. Cannot fail.
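The oscillation check can be sketched as hashing the top 5 key decisions and comparing against prior rounds; the normalization details (trim, lowercase, sort) are assumptions:

```typescript
import { createHash } from "node:crypto";

// Sketch of the Phase 7 oscillation check.
function decisionHash(decisions: string[]): string {
  const top5 = decisions.slice(0, 5).map(d => d.trim().toLowerCase()).sort();
  return createHash("sha256").update(top5.join("|")).digest("hex");
}

function detectOscillation(current: string[], priorHashes: string[]): boolean {
  // A hash match with any prior round means we are cycling between states.
  return priorHashes.includes(decisionHash(current));
}
```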
Phase 8
Output Formatting
Clean deliverable with scoring context and tradeoff transparency
Separate API call. Receives synthesis + verification JSON. The formatter strips debate rhetoric but retains actionable metadata: dimension scores, known tradeoffs, constraint renegotiations, and open…
Output Formatter
2 retries. If fails, present raw synthesis with scores appended. Save regardless.
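The Phase 8 fallback path might be sketched as follows; the function and the `DIMENSION SCORES` label are illustrative assumptions:

```typescript
// Sketch of the Phase 8 fallback: if the formatter call fails after its two
// retries, present the raw synthesis with dimension scores appended.
function presentOutput(
  rawSynthesis: string,
  scores: Record<string, number>,
  formatted: string | null, // null => formatter call failed
): string {
  if (formatted !== null) return formatted;
  const scoreLines = Object.entries(scores)
    .map(([dim, s]) => `${dim}: ${s}`)
    .join("\n");
  return `${rawSynthesis}\n\nDIMENSION SCORES\n${scoreLines}`;
}
```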
Prompts
How To Use These Prompts
Read first -- paste order, customization guide, and API budget.
~235 tokens
Failure Modes
Sycophancy
Agents agree more each round. RLHF rewards agreement over accuracy.
Majority Tyranny
If most agents agree (right or wrong), dissenters conform.
Error Cascade
Small errors compound 17x through unstructured chains.
Context Drift
Original constraints forgotten. Semantic mutation shifts language.
Consensus Inertia
Wrong decisions solidify. Existing choices stay by default.
No New Signal
Same model, different personas = same knowledge, no diversity.
Role Bleed
Shared training biases regardless of persona.
Complexity Creep
Each round adds without resistance. By round 5, unusable.
Key Principles
Blind Deliberation — Agents never see each other. Eliminates sycophancy, conformity, bluster.
Web Search Per Agent — Different search terms = genuine new information. Primary diversity source.
Mandatory Critique Before Proposal — Every agent names one weakness before suggesting. Distributes challenger energy.
Steel Man Before Objection — Challenger articulates strongest version before attacking. Filters cheap shots.
Confidence-Weighted Decisions — Agents report 1-10 confidence. Coordinator weights evidence over rhetoric, referencing dimension weights (1-3) for tradeoffs. From ReConcile.
Dimension Weighting — Each scoring dimension has a weight (1-3) encoding user priorities. Coordinator references weights when tradeoffs conflict.
Simplifier With Veto — Tests against the "overwhelmed user on their worst day" standard. Without active complexity fighting, every round adds bloat.
Simulator Rotation — An end-user persona swaps in for Optimizer B via the useSimulator flag. Drives more structural changes than any expert.
Challenge Register + Combined Arbitration — Logged, arbitrated, decided in a single Coordinator call combined with synthesis. No auto-adoption. Prevents oscillation.
Constraint Renegotiation — If a constraint is infeasible, agents must formally flag it: name it, explain why, propose modified version. Never silently drop.
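The confidence-weighted, dimension-weighted evaluation above can be sketched as a simple aggregation. The exact formula is an assumption for illustration; the 1-10 confidence scale and 1-3 dimension weights come from the workflow:

```typescript
// Sketch of confidence- and dimension-weighted aggregation (formula assumed).
interface DimScore { dimension: string; score: number; confidence: number } // 1-10

function weightedScore(
  scores: DimScore[],
  dimWeights: Record<string, number>, // 1-3, encodes user priorities
): number {
  let total = 0;
  let weightSum = 0;
  for (const { dimension, score, confidence } of scores) {
    // Evidence weight = user priority for the dimension * agent confidence.
    const w = (dimWeights[dimension] ?? 1) * confidence;
    total += score * w;
    weightSum += w;
  }
  return weightSum === 0 ? 0 : total / weightSum;
}
```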
Research Sources
Talk Isn't Always Cheap -- Wynn et al. (2025): Debate decreases accuracy over time. Models favor agreement over challenging flawed reasoning.
7-phase pipeline with blind deliberation, web search, challenge register
v2.1 (Session 1, base): 8 failure modes, 8 principles, 7 research sources
v3.0 (Deep audit, critical): Added Simplifier role -- complexity creep was undefended
v3.0 (Deep audit, critical): Added Phase 08 Output Formatting -- raw synthesis not usable
v3.0 (Deep audit, novel): Mandatory critique-before-proposal for all agents
v3.0 (Deep audit, novel): Steel man requirement for Challenger
v3.3 (Current, critical): Phases 03+04 combined into single Coordinator call -- saves 1 API call per round
HOW TO USE THESE PROMPT TEMPLATES
============================================
PASTE ORDER (new Claude conversation or Project):
1. MASTER PROMPT -- paste first. Replace the three [REPLACE] sections.
2. AGENT DEFINITIONS -- paste with: "Use these agent definitions."
3. SCORING DIMENSIONS -- paste with: "Use these scoring dimensions."
4. ITERATION MODES -- paste with: "Use these iteration modes."
5. STRUCTURAL RULES -- paste with: "Embed this as CTX constant."
CUSTOMIZATION:
- [REPLACE: ...] markers show what to customize.
- The 7 structural ROLES are universal. Keep them for any project.
- The 11 structural RULES are universal. Keep as-is.
- SCORING DIMENSIONS and AGENT EXPERTISE are domain-specific. Replace fully.
API CALL BUDGET (revised v3.3):
- Per round: ~7-9 calls
- Full 3-round debate: ~25-31 total + 1 output formatter
- Estimated time: 3-6 minutes per full debate
- Estimated cost: ~$0.50-2.00 per debate with Sonnet
Master Prompt
Paste first. Instructs Claude to build the debate dashboard.
~291 tokens
Build me a React (.jsx) multi-agent adversarial debate dashboard. Single-file default-export React component.
PROJECT CONTEXT
Domain: [REPLACE: describe your problem domain]
Starting input: [REPLACE: paste your starting material]
User priorities: [REPLACE: list quality priorities and tradeoff preferences]
ARCHITECTURE -- 8-PHASE PIPELINE (v3.3)
Phase 01: Input Preparation
Phase 02: Independent Proposals + Research
Phase 03+04: Challenge Register + Coordinator Synthesis (COMBINED CALL)
Phase 05: Independent Verification
Phase 06: Compress Context + Minority Report
Phase 07: Decision Gate
Phase 08: Output Formatting
STRUCTURAL RULES:
(1) Agents are BLIND
(2) Must name one weakness BEFORE proposing
(3) Challenger must STEEL MAN before objecting
(4) Simplifier has VETO
(5) Challenges REGISTERED and individually arbitrated
(6) Overruled positions PRESERVED in Minority Report
(7) Coordinator evaluates by confidence + evidence, not rhetoric
(8) Word limits enforced per agent
(9) Be specific: exact numbers, measurements, specifications
(10) Max 3 rounds. Auto-stop on score delta < 0.5
(11) CONSTRAINT RENEGOTIATION: formally flag, never silently drop
Decision Oscillation
Round N undoes Round N-1. Binding rules flip decisions.
Coordinator Hedging
Averages positions instead of deciding. Non-committal.
Minority Report — Overruled dissent preserved with evidence. Auto-resurfaced if scores drop next round.
Context Compression — Decided/Active/Minority/Original (verbatim). Hard cap at 800 tokens. Triage cut order defined.
Score Trajectory — Track trends across rounds, not just current values. Flag declining dimensions and tradeoffs.
Verifier Independence — Verifier sees BOTH the Coordinator's synthesis AND raw agent proposals. Catches points the Coordinator silently dropped.
Oscillation Detection — Hash top 5 key decisions per round. If hash matches a prior round, auto-fork. Prevents cycling between two states.
Max 3 Rounds + Auto-Stop — Sycophancy intensifies after round 2-3. Auto-stop on delta < 0.5 or oscillation.
Decision Gates + Human Override — Continue/Fork/Stop. Some questions need real-world data, not more debate. Human can pause, review, inject constraints at any gate.
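The stopping rule in the two principles above can be sketched as a single gate function; the shape of the inputs is an assumption, while the 3-round cap, the 0.5 delta, and the oscillation trigger come from the workflow:

```typescript
// Sketch of the decision-gate stopping rule: max 3 rounds, auto-stop when the
// overall score improves by less than 0.5, or when oscillation is detected.
function shouldStop(
  round: number,            // 1-based round counter
  scoreHistory: number[],   // overall score per completed round
  oscillating: boolean,     // result of the oscillation hash check
): boolean {
  if (round >= 3 || oscillating) return true;
  const n = scoreHistory.length;
  if (n < 2) return false;
  return Math.abs(scoreHistory[n - 1] - scoreHistory[n - 2]) < 0.5;
}
```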
When Helpfulness Backfires -- Nature Digital Medicine (2025): Up to 100% sycophantic compliance. Explicit rejection hints improve critical reasoning.
Towards Understanding Sycophancy -- Sharma et al. (ICLR 2024): Sycophancy is general RLHF behavior. Matching user views preferred even when incorrect.
Rethinking Multi-Agent Workflows -- Xu et al. (2026): Single-agent matches homogeneous multi-agent. Only wins with genuinely different information.
Reliable Decision-Making -- Lee et al. / Hitachi (AAAI 2025): Multi-agent achieves zero variance. Spoke-and-wheel outperforms decentralized debate.
AI Agent Reflection Patterns -- Zylos Research (2026): Critics should use specific evaluation criteria, not open-ended "find problems". Oscillation safeguards: max iteration limits, measurable improvement tracking, state-hash deduplication.
v3.3 (Current, critical): Verifier independence -- now receives raw agent proposals alongside synthesis
v3.3 (Current, novel): DIMS scale fields injected into Verifier prompt at build time -- scoring anchored consistently across rounds