Core component · 03

Consensus arbitration

When two or more experts run, they return competing proposals. Arbitration is the protocol that folds those proposals into one committed result - deterministically, with a recorded support level and a list of dissenters, so every decision can be explained and replayed.

The proposal contract

Each engaged expert returns a Proposal. Beyond the answer itself, a proposal carries the metadata arbitration needs: self-reported confidence, the route weight the gate assigned it, and a stable digest used to group identical answers. An expert's voting power is its route weight times its confidence - the gate's trust, scaled by the expert's own.

proposal.py

```python
# A Proposal is one expert's answer plus everything arbitration needs
# to weigh, compare, and audit it.
from dataclasses import dataclass, field
from typing import Any

@dataclass(frozen=True)
class Proposal:
    expert_id: str
    payload: Any            # the actual answer (text, patch, tool call, ...)
    confidence: float       # expert's self-reported certainty, 0..1
    route_weight: float     # gate weight carried over from selection
    cost: float             # tokens / wall-clock consumed
    digest: str             # stable hash of payload, used for grouping

    @property
    def vote(self) -> float:
        # an expert's voting power blends how the gate ranked it
        # with how sure it is of its own answer.
        return self.route_weight * self.confidence
```

Proposal

Field	Type	Description
payload	Any	The expert's actual answer - text, a patch, a tool call.
confidence	float	Self-reported certainty in 0..1.
route_weight	float	Normalized gate weight carried over from selection.
digest	str	Stable hash of the payload; equal digests are treated as the same answer.
vote	float	Derived voting power: route_weight × confidence.
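
The contract only requires that the digest be stable, so that equal answers hash to equal strings. How the hub computes it is not shown here; the sketch below is one possible approach, assuming JSON-serializable payloads. The helper name `stable_digest` and the choice of SHA-256 over canonical JSON are illustrative assumptions, not the library's API.

```python
import hashlib
import json
from typing import Any


def stable_digest(payload: Any) -> str:
    """Hash a JSON-serializable payload so equal answers get equal digests.

    Hypothetical helper: the real digest function may differ.
    """
    # sort_keys plus fixed separators give one canonical serialization,
    # so dict key order and whitespace cannot change the hash.
    canonical = json.dumps(
        payload, sort_keys=True, separators=(",", ":"), ensure_ascii=False
    )
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


# Two tool calls that differ only in key order land in the same group.
first = stable_digest({"tool": "search", "args": {"q": "x", "k": 3}})
second = stable_digest({"args": {"k": 3, "q": "x"}, "tool": "search"})
assert first == second
```

Free-text answers would likely need normalization (whitespace, casing) before hashing, or they will rarely group; that step is left out of this sketch.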

Arbitration protocols

The protocol is declared at configuration time, never inferred from the proposals. Four ship in the box; each implements the same arbitrate(proposals) -> Decision interface.

weighted-quorum

Weighted quorum

Group identical answers, sum each group's voting power (route weight × confidence), and commit the heaviest group only if it clears the quorum threshold. The default - robust when experts can genuinely agree.

best for: factual answers, classification, tool-call selection
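
As a worked example of the arithmetic, the sketch below tallies three proposals by hand. The expert names, answers, weights, and confidences are made-up values, and the answer string stands in for the digest to keep the example short.

```python
from collections import defaultdict

# (expert_id, answer, route_weight, confidence): illustrative values only.
proposals = [
    ("expert-a", "paris", 0.5, 0.9),   # vote 0.45
    ("expert-b", "paris", 0.3, 0.8),   # vote 0.24
    ("expert-c", "lyon", 0.2, 0.5),    # vote 0.10
]
QUORUM = 0.66

# Sum voting power (route weight x confidence) per distinct answer.
weight_by_answer: dict[str, float] = defaultdict(float)
for _expert, answer, route_weight, confidence in proposals:
    weight_by_answer[answer] += route_weight * confidence

total = sum(weight_by_answer.values())
winner = max(weight_by_answer, key=weight_by_answer.get)
support = weight_by_answer[winner] / total   # 0.69 / 0.79, about 0.87
committed = support >= QUORUM                # True: 0.87 clears 0.66
```

Had expert-b answered "lyon" instead, "paris" would hold 0.45 of 0.79 (about 57%) and the arbiter would refuse to commit.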

ranked-runoff

Ranked runoff

Each expert returns a ranked list rather than a single answer. The arbiter runs instant-runoff rounds, eliminating the option with the fewest top-choice votes each round until one option holds a majority. Avoids splitting the vote across near-duplicate answers.

best for: ranked recommendations, candidate shortlists
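
The runoff rounds can be sketched as follows. This is a minimal, unweighted instant-runoff under stated assumptions: each ballot counts once (the shipped protocol may weight ballots by voting power), and ties break lexically so replays agree. The function name `instant_runoff` is hypothetical.

```python
from collections import Counter


def instant_runoff(ballots: list[list[str]]) -> str:
    """Return the winner of an unweighted instant-runoff over ranked ballots."""
    remaining = {option for ballot in ballots for option in ballot}
    if not remaining:
        raise ValueError("no options to rank")
    while True:
        # Each ballot counts for its highest-ranked option still in the race.
        tallies = Counter({option: 0 for option in remaining})
        for ballot in ballots:
            for option in ballot:
                if option in remaining:
                    tallies[option] += 1
                    break
        active = sum(tallies.values())
        # Leader: most votes, lexically first on a tie.
        leader = min(remaining, key=lambda o: (-tallies[o], o))
        if tallies[leader] * 2 > active or len(remaining) == 1:
            return leader
        # Eliminate the weakest option: fewest votes, lexically last on a tie.
        loser = max(remaining, key=lambda o: (-tallies[o], o))
        remaining.discard(loser)


ballots = [
    ["a", "b", "c"],
    ["a", "c", "b"],
    ["b", "c", "a"],
    ["c", "b", "a"],
    ["c", "b", "a"],
]
# Round 1: a=2, b=1, c=2, so b is eliminated and its ballot moves to c.
# Round 2: a=2, c=3, so c holds a majority.
assert instant_runoff(ballots) == "c"
```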

synthesis

Synthesis

No single proposal wins; a designated synthesizer expert receives all proposals (weighted) and produces a merged answer. The merge step itself is logged as a proposal so the result stays auditable.

best for: long-form drafting, multi-part plans
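
A rough sketch of the synthesis flow, under loose assumptions: the synthesizer is modeled as a plain callable, inputs are ordered by voting power, and the merge is appended to an audit log alongside the proposals it consumed. `Weighted`, `synthesize`, and the log shape are hypothetical names chosen for illustration.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass(frozen=True)
class Weighted:
    expert_id: str
    payload: str
    vote: float  # route_weight * confidence


def synthesize(
    inputs: list[Weighted],
    merge: Callable[[list[tuple[str, float]]], str],
    synthesizer_id: str = "synthesizer",
) -> tuple[str, list[dict]]:
    """Merge all proposals and log the merge step as one more entry."""
    total = sum(w.vote for w in inputs)
    if total <= 0:
        raise ValueError("nothing to synthesize: no voting power")
    # Strongest proposals first; expert id breaks ties deterministically.
    ordered = sorted(inputs, key=lambda w: (-w.vote, w.expert_id))
    weighted = [(w.payload, w.vote / total) for w in ordered]
    merged = merge(weighted)
    audit_log = [{"expert_id": w.expert_id, "payload": w.payload} for w in ordered]
    audit_log.append({"expert_id": synthesizer_id, "payload": merged})
    return merged, audit_log


# Stand-in merge function; a real synthesizer would be an expert call.
merged, log = synthesize(
    [Weighted("b", "beta", 0.2), Weighted("a", "alpha", 0.6)],
    merge=lambda parts: " | ".join(text for text, _share in parts),
)
assert merged == "alpha | beta"
```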

first-quorum

First-to-quorum

Streaming variant: commit as soon as arriving proposals cross the quorum, cancelling the still-running experts. Trades a little accuracy for tail-latency wins.

best for: latency-critical, redundant experts
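
One way to decide when arriving proposals have "crossed the quorum" is sketched below. It assumes a conservative stopping rule that is not confirmed by the text above: because confidence is at most 1, an expert still running can add at most its route weight, so the leader is committed once its share of that worst-case total clears the quorum. The function name and tuple layout are hypothetical.

```python
from collections import defaultdict
from typing import Iterable, Optional


def first_to_quorum(
    arrivals: Iterable[tuple[str, str, float, float]],
    planned_weights: dict[str, float],
    quorum: float = 0.66,
) -> Optional[tuple[str, list[str]]]:
    """Consume (expert_id, digest, route_weight, confidence) in arrival order.

    Returns (winning digest, experts to cancel), or None if the quorum
    is never crossed.
    """
    pending = dict(planned_weights)          # experts still running
    by_digest: dict[str, float] = defaultdict(float)
    arrived_total = 0.0
    for expert_id, digest, route_weight, confidence in arrivals:
        pending.pop(expert_id, None)
        vote = route_weight * confidence
        by_digest[digest] += vote
        arrived_total += vote
        # Worst case: every pending expert votes at full confidence.
        ceiling = arrived_total + sum(pending.values())
        leader = min(by_digest, key=lambda d: (-by_digest[d], d))
        if ceiling > 0 and by_digest[leader] / ceiling >= quorum:
            return leader, sorted(pending)
    return None


planned = {"a": 0.5, "b": 0.3, "c": 0.2}
arrivals = [("a", "x", 0.5, 1.0), ("b", "x", 0.3, 0.9), ("c", "y", 0.2, 0.5)]
# After a: 0.50 / 1.00 is below quorum. After b: 0.77 / 0.97 clears it,
# so c is cancelled before its proposal is read.
assert first_to_quorum(arrivals, planned) == ("x", ["c"])
```

A looser rule, such as comparing against only the votes that have arrived, would commit earlier at the cost of occasionally committing an answer that full arbitration would have rejected.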

Weighted-quorum reference

The default protocol, in full. It groups proposals by digest, sums voting power per group, and commits the heaviest group only if it clears quorum. Tie-breaks are deterministic: highest single vote, then lexical expert id - so the same proposals always commit the same answer.

arbiter.py

```python
# Arbiter: a set of proposals -> one committed Decision.
from collections import defaultdict
from dataclasses import dataclass

from proposal import Proposal  # the dataclass defined in proposal.py


class UnderQuorum(Exception):
    """Raised when surviving proposals can't meet the required quorum."""


@dataclass(frozen=True)
class Decision:
    consensus: object
    support: float          # fraction of total vote behind the winner
    engaged: list[str]
    dissenting: list[str]
    protocol: str


@dataclass
class WeightedQuorumArbiter:
    quorum: float = 0.66    # winner must hold >= this share of total vote

    def arbitrate(self, proposals: list[Proposal]) -> Decision:
        if not proposals:
            raise UnderQuorum("no proposals survived dispatch")

        # group identical answers by digest, sum their voting power.
        groups: dict[str, list[Proposal]] = defaultdict(list)
        for p in proposals:
            groups[p.digest].append(p)

        total = sum(p.vote for p in proposals)
        if total <= 0:
            # every vote was zero: there is no share to compute, so no quorum.
            raise UnderQuorum("proposals carried no voting power")

        # heaviest group first; ties fall through to _tiebreak. The sort is
        # ascending on a negated weight so expert ids keep lexical order.
        ranked = sorted(
            groups.values(),
            key=lambda g: (-sum(p.vote for p in g), *_tiebreak(g)),
        )
        winner = ranked[0]
        support = sum(p.vote for p in winner) / total
        if support < self.quorum:
            raise UnderQuorum(f"top answer held {support:.0%} < {self.quorum:.0%}")

        dissent = [p.expert_id for g in ranked[1:] for p in g]
        return Decision(
            consensus=winner[0].payload,
            support=support,
            engaged=[p.expert_id for p in proposals],
            dissenting=dissent,
            protocol="weighted-quorum",
        )


def _tiebreak(group: list[Proposal]) -> tuple[float, str]:
    # deterministic: highest single vote, then lexical expert id.
    # the vote is negated so that an ascending sort ranks the highest first.
    top = min(group, key=lambda p: (-p.vote, p.expert_id))
    return (-top.vote, top.expert_id)
```

Go data-plane arbiter

The production arbiter is the same algorithm in Go, with the identical tie-break ordering so a result never diverges between planes.

arbiter.go

```go
// Go data-plane arbiter mirrors the reference tie-break exactly so a
// dispatch commits the same answer in either plane.
package arbiter

import "sort"

// Proposal and ErrUnderQuorum are declared elsewhere in this package.

type Decision struct {
    Consensus any
    Support   float64
    Protocol  string
}

func WeightedQuorum(props []Proposal, quorum float64) (Decision, error) {
    if len(props) == 0 {
        return Decision{}, ErrUnderQuorum
    }
    groups := map[string][]Proposal{}
    for _, p := range props {
        groups[p.Digest] = append(groups[p.Digest], p)
    }
    type bucket struct {
        digest string
        vote   float64
        top    Proposal
    }
    var bs []bucket
    var total float64
    for d, g := range groups {
        var v float64
        top := g[0]
        for _, p := range g {
            v += p.Vote()
            // top = highest single vote, lexically smallest id on a tie
            if p.Vote() > top.Vote() ||
                (p.Vote() == top.Vote() && p.ExpertID < top.ExpertID) {
                top = p
            }
        }
        total += v
        bs = append(bs, bucket{d, v, top})
    }
    if total <= 0 {
        // without this guard, 0/0 is NaN and "NaN < quorum" is false,
        // so a zero-vote dispatch would be committed.
        return Decision{}, ErrUnderQuorum
    }
    sort.Slice(bs, func(i, j int) bool {
        if bs[i].vote != bs[j].vote {
            return bs[i].vote > bs[j].vote
        }
        if bs[i].top.Vote() != bs[j].top.Vote() {
            return bs[i].top.Vote() > bs[j].top.Vote() // highest single vote
        }
        return bs[i].top.ExpertID < bs[j].top.ExpertID // then lexical expert id
    })
    support := bs[0].vote / total
    if support < quorum {
        return Decision{}, ErrUnderQuorum
    }
    return Decision{Consensus: bs[0].top.Payload, Support: support,
        Protocol: "weighted-quorum"}, nil
}
```

Under-quorum fails loud

If the heaviest group can't reach quorum - because experts disagreed or too many were dropped for exceeding their budget - the arbiter raises UnderQuorum instead of committing a weak majority. Handle it by escalating, retrying with a wider plan, or falling back to a single trusted expert.
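
The recovery options above could be chained roughly as follows. This is a sketch, not the hub's API: `arbitrate_with_fallback` and the callables passed to it are hypothetical, and `UnderQuorum` is redefined locally only so the example runs on its own.

```python
from typing import Callable, TypeVar

T = TypeVar("T")


class UnderQuorum(Exception):
    """Stand-in for the arbiter's exception so this sketch is self-contained."""


def arbitrate_with_fallback(
    attempts: list[Callable[[], T]],
    fallback: Callable[[], T],
) -> T:
    """Try each plan in order; use the fallback if none reaches quorum."""
    for attempt in attempts:
        try:
            return attempt()
        except UnderQuorum:
            continue  # this plan fell short; try the next, wider one
    return fallback()


def narrow_plan() -> str:
    raise UnderQuorum("top answer held 52% < 66%")


def wider_plan() -> str:
    return "answer from wider plan"


result = arbitrate_with_fallback(
    attempts=[narrow_plan, wider_plan],
    fallback=lambda: "answer from trusted expert",
)
assert result == "answer from wider plan"
```

Only UnderQuorum is caught, so any other failure in a plan still propagates to the caller.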

Acting on a decision

A Decision reports its support - the share of total vote behind the winner - so callers can treat strong and weak consensus differently. Escalating a weak decision into a Peer-Consult session is a common pattern.

commit.py

```python
from moe_hub import Arbiter

# protocol is declared, never inferred from the proposals.
arbiter = Arbiter(protocol="weighted-quorum", quorum=0.66)

decision = arbiter.arbitrate(session.collect())
if decision.support >= 0.9:
    commit(decision.consensus)            # strong agreement
else:
    escalate(decision, to="peer-consult") # weak: ask for a human or peer pass
```
