One hard question, a swarm of LLM nodes, two ways to agree. Run a round: majority voting picks the most common answer, while peer-ranked Bradley-Terry consensus surfaces the answer the nodes judge best, and the scoreboard converges on the gap Fortytwo measured. The evaluation-edge slider shows why.
Majority voting over LLMs can throw away the one node that got it right. Fortytwo's swarm inference ranks answers pairwise instead (+17 points on GPQA Diamond over majority voting), with on-chain reputation and proof-of-capability for Sybil defense. The mechanism, the math, the tradeoffs.
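To make the contrast concrete, here is a minimal sketch of the two aggregation rules. It is not Fortytwo's implementation: the node answers, the pairwise win counts, and the function names are all made up for illustration, and the Bradley-Terry fit uses the textbook MM (Zermelo) update rather than anything specific to swarm inference.

```python
from collections import Counter


def majority_vote(answers):
    """Return the most common answer among the nodes."""
    return Counter(answers).most_common(1)[0][0]


def bradley_terry(wins, iters=200):
    """Fit Bradley-Terry strengths with the standard MM (Zermelo) update.

    wins[i][j] is the number of pairwise comparisons in which
    candidate i was judged better than candidate j.
    """
    n = len(wins)
    p = [1.0 / n] * n
    for _ in range(iters):
        new_p = []
        for i in range(n):
            total_wins = sum(wins[i])
            # Sum over opponents that candidate i was actually compared with.
            denom = sum(
                (wins[i][j] + wins[j][i]) / (p[i] + p[j])
                for j in range(n)
                if j != i and wins[i][j] + wins[j][i] > 0
            )
            new_p.append(total_wins / denom if denom > 0 else p[i])
        total = sum(new_p)
        p = [x / total for x in new_p]  # normalize so strengths sum to 1
    return p


# Hypothetical round: five nodes answer, only one of them picks "A".
node_answers = ["B", "B", "B", "A", "C"]
candidates = ["A", "B", "C"]

# Hypothetical peer judgments: each pair of candidates is compared 5 times.
# Row = winner, column = loser, in the order of `candidates`.
wins = [
    [0, 4, 4],  # "A" beats "B" 4 times and "C" 4 times
    [1, 0, 3],  # "B" beats "A" once and "C" 3 times
    [1, 2, 0],  # "C" beats "A" once and "B" twice
]

vote_winner = majority_vote(node_answers)
strengths = bradley_terry(wins)
bt_winner = candidates[max(range(len(candidates)), key=strengths.__getitem__)]

print("majority vote picks:", vote_winner)
print("Bradley-Terry picks:", bt_winner)
```

In this toy round the vote goes to "B" because three nodes chose it, while the pairwise ranking selects "A" because it wins most of its head-to-head comparisons. That only helps when nodes judge answers more reliably than they produce them, which is an assumption of the sketch and not something it demonstrates.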