Swarm Consensus

One hard question, a swarm of LLM nodes, two ways to agree. Run a round: majority voting picks the most common answer while peer-ranked Bradley-Terry consensus surfaces the best one, and the scoreboard converges on the gap Fortytwo measured. The evaluation-edge slider shows why.

simulation DOM Machine Learning Infrastructure Jun 14, 2026

⬢ loading artifact…

Swarm Consensus — tap run round · drag swarm size · drag evaluation edge · data as of Jun 14, 2026 · Fortytwo (arXiv:2510.24801) ↗ open artifact ↗

View artifact source on GitHub ↗

Appears in

Machine Learning Infrastructure LLMs

Don't Vote, Rank: Peer-Ranked Consensus for Decentralized LLM Swarms

Majority voting over LLMs throws away the one node that got it right. Fortytwo's swarm inference ranks answers pairwise instead — +17 points on GPQA Diamond — with on-chain reputation and proof-of-capability for Sybil defense. The mechanism, the math, the tradeoffs.

Jun 14, 2026 8 min read ⬢ interactive