License: CC BY 4.0
arXiv:2606.10982v2 [cs.DC] 10 Jun 2026

FairWave: A Fairness-Aware Asynchronous DAG-BFT Consensus

Syariful Mujaddiq
syariful.mujaddiq@coconut.or.id
Abstract

Combining asynchronous Byzantine Fault Tolerant (BFT) consensus with Proof-of-Stake (PoS) creates a trilemma between Sybil resistance, reward distribution fairness, and protection against persistent plutocracy. Existing DAG-BFT approaches (Narwhal+Tusk, Bullshark, and Mysticeti) prioritize liveness over the fairness implications of stake-based selection, resulting in persistent longitudinal centralization.

FairWave is a dual-channel DAG-BFT protocol that separates anchor selection from reward distribution. The selection channel is super-linear in stake, guaranteeing Sybil gain <1<1 for all split factors K>1K>1. The reward channel is sub-linear, using square-root stake normalization to mitigate rich-get-richer dynamics.

The finalized DAG structure provides deterministic uptime and latency factors, allowing honest validators to agree on operational quality without any external oracle. To avoid circular dependency between selection outcomes and selection weights, reputation is used in a lagged form: the active value at epoch ee equals the prior epoch’s final value.

We derive closed-form constraints for both channels and validate them through nine empirical analyses (\approx550,000 Monte Carlo rounds) against eight baselines. FairWave achieves a Gini coefficient of 0.149 (vs. Pure-PoS’s 0.488), a monotone HHI reduction from 0.039 to 0.021 over 50,000 epochs, an optimal-adversary Sybil split of K=1K^{*}=1, and a success-rate coefficient of variation of 5.2% under ±\pm25% input perturbation. Safety (agreement and validity) is a formal consequence of the 2f+12f+1 strong-support commit rule, holding unconditionally for f<n/3f<n/3; the empirical differential is the monotone-continuous liveness-degradation curve, which decreases from 99.6% commit rate at b=0.20b=0.20 to 71.1% at the theoretical bound b=1/3b=1/3 without the discontinuous cliff characteristic of view-change-driven leader-BFT.

I Introduction

I-A Motivation

Long-running Proof-of-Stake blockchains exhibit empirical centralization despite formally claiming decentralized governance [1, 2, 3]. Pure stake-based consensus protocols suffer from three well-characterized pathologies:

  1. 1.

    Rich-get-richer dynamics. When rewards are proportional to stake, accumulated rewards compound into stake growth at a rate proportional to current holdings, inducing exponential divergence of the Herfindahl-Hirschman Index (HHI) toward one.

  2. 2.

    Sybil vulnerability under sub-linear weighting. Alternative rules such as Square Root Stake Weighting (SRSW) stake\sqrt{\text{stake}} or Logarithmic Stake Weighting (LSW) ln(1+stake)\ln(1+\text{stake}) that attempt to achieve fairness instead introduce Sybil gain proportional to K\sqrt{K} or KK respectively, where KK denotes the split factor.

  3. 3.

    Plutocratic convergence. Under proportional rewards, the share of power of the top-1 validator increases monotonically, eventually violating the BFT safety threshold f<n/3f<n/3.

These pathologies are not merely theoretical—empirical measurements on Ethereum 2.0, Cosmos Hub, and Solana mainnet show Nakamoto coefficients as low as 3–7 for the top staking pools [4, 5, 6].

I-B Contributions

This work presents the following contributions:

  • C1. A novel dual-channel consensus architecture in which selection weights and reward weights exhibit opposite curvature with respect to stake-mitigates the Sybil-fairness-plutocracy trilemma.

  • C2. A Closed-form analytical derivation of Sybil gain bounds for four reward rules (Pure-PoS, SRSW, LSW, FairWave), validated to machine precision (ϵ=0.0\epsilon=0.0) against 20,16020,160 numerical evaluation cells.

  • C3. Empirical evaluation of nine tasks spanning approximately 550,000550,000 Monte Carlo rounds, compared against potocol models representing Narwhal+Tusk [7], Bullshark [8], Mysticeti [9], HotStuff [10], HotStuff-2, PBFT [11], Tendermint [12], and Algorand [13] under a unified simulation framework.

  • C4. Quantitative characterization of robustness via One-At-a-Time (OAT) elasticity analysis, Sobol variance decomposition, and combined perturbation mapping—To the best of our knowledge, this work presents the first sensitivity analysis applied to DAG-BFT consensus parameters.

I-C Paper Organization

Section II surveys related work. Section III formalizes the threat model and system model. Section IV presents the FairWave protocol design. Section V derives the mathematical foundations. Section VI describes the empirical methodology. Section VII presents results from nine evaluation tasks. Section VIII discusses trade-offs, and Section X concludes.

II Related Work and Background

II-A Asynchronous BFT Consensus

Classical PBFT [11] achieves Byzantine agreement under partial synchrony with message complexity O(n2)O(n^{2}) per view. HotStuff [10] reduces view-change complexity to O(n)O(n) via threshold signatures and pipelined chain structure. HotStuff-2 [14] further optimizes to two-phase commit with optimal responsiveness. The DAG-BFT paradigm, initiated by DAG-Rider [15] and Narwhal+Tusk [7], separate data dissemination from consensus ordering by embedding transactions in a structured Directed Acyclic Graph (DAG). Subsequent refinements—Bullshark [8] (reducing commit latency to 2 waves), Shoal [16] (pipelining and leader reputation), Mysticeti [9] (achieving fast-path commit sub-wave), and Shoal++ [17] (near-optimal latency)—further optimize the latency-throughput frontier while maintaining Byzantine tolerance f<n/3f<n/3.

II-B Proof-of-Stake Reward Rules

The choice of stake-to-weight mapping fundamentally determines the Sybil resistance and fairness properties of a protocol. Table I summarizes four representative rules.

TABLE I: Comparison of PoS reward rules and Sybil gain characteristics.
Rule Weight function Sybil gain
Pure-PoS [13] w=sw=s g(K)=1g(K)=1
SRSW [18] w=sw=\sqrt{s} g(K)=Kg(K)=\sqrt{K}
LSW w=ln(1+s)w=\ln(1+s) g(K)Kg(K)\approx K
FairWave w=RS2w=R\cdot S^{2} g(K)<1K>1g(K)<1\;\forall K>1

Pure-PoS (Algorand, Tendermint) is Sybil-neutral: stake splitting yields neither gain nor loss. SRSW (used conceptually in Cosmos governance discussions [18]) introduces sub-linear fairness at the cost of Sybil gain K\sqrt{K}. LSW worsens vulnerability to approximately linear gain relative to split factor, rendering it catastrophically exploitable. Alternative approaches to address wealth compounding have been proposed [19, 20, 21], but none simultaneously mitigates the Sybil-fairness-plutocracy trilemma.

II-C Decentralization Metrics

We employ three complementary decentralization metrics: the Nakamoto coefficient [4, 5] (minimum coalition to achieve a 1/31/3 threshold, Fig. 1); the Gini coefficient (inequality of reward distribution); and the Herfindahl-Hirschman Index (HHI, sum of squared market shares). These metrics capture different aspects of centralization: coalition vulnerability, distributional inequality, and concentration.

Refer to caption
Figure 1: The Nakamoto coefficient, which represents the minimum coalition needed for 1/3 power, is evaluated across six reward rules and three network sizes. FairWave consistently achieves a higher Nakamoto coefficient than Pure-PoS across all scales, indicating better decentralization resilience.

II-D Sensitivity Analysis in Distributed Systems

To the best of our knowledge, variance-based sensitivity analysis (Sobol indices [22]) has not previously been applied to characterize DAG-BFT consensus robustness. To the best of our knowledge, this work presents the first quantitative input sensitivity analysis for DAG-BFT consensus, providing peer-reviewed evidence of parametric stability.

III Threat Model and System Model

III-A System Model

We consider a set of nn validators 𝒱={v1,,vn}\mathcal{V}=\{v_{1},\ldots,v_{n}\} with n4n\geq 4, each possessing attributes (si,ui,repi,i)(s_{i},u_{i},\text{rep}_{i},\ell_{i}) representing stake, uptime, reputation score, and latency score.

Stake bounds. Validator stakes are constrained: si[Smin,Smax]s_{i}\in[S_{\min},S_{\max}] with Smin=5.000S_{\min}=5.000 and Smax=50.000S_{\max}=50.000 token units. The normalized stake factor is defined as:

S(i)=si/Smax,S(i)[Smin/Smax,1]S(i)=\sqrt{s_{i}/S_{\max}},\quad S(i)\in[\sqrt{S_{\min}/S_{\max}},1] (1)

Reward score. The multi-factor composite reward score for validator ii is:

R(i)=αS(i)+βU(i)+γRepactive(i,e)+δL(i)R(i)=\alpha\cdot S(i)+\beta\cdot U(i)+\gamma\cdot\text{Rep}_{\text{active}}(i,e)+\delta\cdot L(i) (2)

with simplex constraint α+β+γ+δ=1\alpha+\beta+\gamma+\delta=1 and default parameterization (α,β,γ,δ)=(0.20; 0.25; 0.40; 0.15)(\alpha,\beta,\gamma,\delta)=(0.20;\;0.25;\;0.40;\;0.15).

Lagged reputation. The reputation factor Rep(i)\text{Rep}(i) evolves throughout each epoch via increment, decay, and slashing (see Algorithm 1 and Section VIII.C). To prevent circular dependency between epoch ee selection results and epoch ee selection weights, FairWave introduces a temporal separation between observed and active reputations. Let Repfinal(i,e)\text{Rep}_{\text{final}}(i,e) denote the final reputation value of validator ii at epoch closure (i.e., after all increments, decay, and slashing throughout epoch ee have been applied). Then the active reputation used for R(i)R(i) computation at epoch ee is:

Repactive(i,e)=Repfinal(i,e1)\text{Rep}_{\text{active}}(i,e)=\text{Rep}_{\text{final}}(i,e-1) (3)

In other words, the reputation value used in WrewW_{\text{rew}} at epoch ee is the value frozen at epoch e1e-1 closure, and reputation updates occurring throughout epoch ee only affect WrewW_{\text{rew}} at epoch e+1e+1. This is analogous to delayed validator activation practice in some production PoS protocols but applied here specifically to the reputation component to ensure forward-causality property in the dual-channel architecture (see Proposition 1 in Section V).

Fig. 2 illustrates the lagged-reputation timeline and the SNAPSHOT phase that freezes Repfinal(e)\mathrm{Rep}_{\mathrm{final}}(e) for use at epoch e+1e{+}1.

Refer to caption
Figure 2: Lagged reputation timing. Reputation updates (small ticks) accrue continuously through each epoch. At every epoch boundary, the SNAPSHOT phase (red dot) freezes Repfinal(e)\mathrm{Rep}_{\mathrm{final}}(e). That frozen value is then used as Repactive\mathrm{Rep}_{\mathrm{active}} one epoch later (red arrows). This forward-causal arrangement eliminates the circular dependency between epoch-ee selection results and epoch-ee selection weights (Proposition 1).

Validator registry. The validator set 𝒱e\mathcal{V}_{e} for epoch ee is determined atomically at epoch e1e-1 boundary via a seven-phase ceremony: FREEZE \to SLASH \to REGISTRY \to SEED \to SNAPSHOT \to RESET \to BARRIER. The set is locked per-epoch; no stake modifications occur intra-epoch.

Fig. 3 illustrates the atomically-executed seven-phase ceremony that establishes the next-epoch validator set and the frozen reputation snapshot.

Refer to caption
Figure 3: Per-epoch seven-phase ceremony executed atomically at every epoch boundary. The pipeline ensures all honest validators converge on the next-epoch validator set 𝒱e+1\mathcal{V}_{e+1}, the VRF seed, the frozen reputation snapshot Rep(e)\mathrm{Rep}(e), and the new weight vectors Wsele+1,Wrewe+1W_{\mathrm{sel}}^{e+1},W_{\mathrm{rew}}^{e+1} without any external aggregator. The SNAPSHOT phase (highlighted red) implements the lagged-reputation freezing visualized in Fig. 2.

III-B Network Model

For safety, the protocol maintains safety under full asynchrony. An anchor commit requires observing strong support from 2f+1\geq 2f+1 vertices that are causally reachable—not from the delivery deadline.

For liveness, we assume partial synchrony (Dwork-Lynch-Stockmeyer [23]): there exists a Global Stabilization Time (GST) after which all messages are delivered within delay bound Δ\Delta. Before GST, liveness is not guaranteed, but safety is maintained unconditionally.

III-C Adversary Model

Static Byzantine adversary. The adversary controls at most f=(n1)/3f=\lfloor(n-1)/3\rfloor validators (by weighted stake). Byzantine validators can equivocate (produce conflicting vertices for the same round), selectively withhold messages, and coordinate timing within delay bound Δ\Delta.

Sybil’s adversary. An attacker controlling aggregate stake XX can split into KK Sybil identities, each holding X/KX/K stake. We analyze selection gain g(K)g(K) as the ratio of adversarial power after splitting to power before splitting.

Equivocation detection. Conflicting vertices from the same validator in the same round are detected with probability pdetect=0.95p_{\text{detect}}=0.95 via VRF proof validation and cross-referencing DAG parent sets. Augmentation with adversarial‑resilient detectors (e.g., HI‑XDR) is possible [24].

III-D Derivation of Non-Stake Metrics from DAG Structure

The composite reward score (Eq. 2) combines four factors: stake S(i)S(i), uptime U(i)U(i), reputation Repactive(i,e)\text{Rep}_{\text{active}}(i,e), and latency L(i)L(i). Stake is external to consensus (locked in validator registry per epoch), and reputation is updated endogenously from commit results (see Algorithm 1 and Eq. 3). This subsection formalizes that the remaining two factors—uptime and latency—are deterministically derived from the finalized DAG structure, so all honest validators agree on their values without requiring an external oracle.

Notation. For epoch ee, let e={r1,r2,,rRe}\mathcal{R}_{e}=\{r_{1},r_{2},\ldots,r_{R_{e}}\} denote the set of DAG rounds in that epoch, with Re=|e|R_{e}=|\mathcal{R}_{e}|. Let 𝒟e\mathcal{D}_{e} denote the finalized DAG throughout epoch ee, i.e., the sub-DAG all of whose vertices have entered the causal history of at least one committed anchor. For validator ii, define 𝒱i(e)={v𝒟e:proposer(v)=i}\mathcal{V}_{i}(e)=\{v\in\mathcal{D}_{e}:\text{proposer}(v)=i\} as the set of vertices proposed by ii and entered into the finalized DAG at epoch ee.

Definition 1 (DAG-Derived Uptime).

Uptime of validator ii at epoch ee is defined as the ratio of rounds in which vertex ii entered the finalized DAG to total epoch rounds:

U(i,e)=|𝒱i(e)|Re[0,1].U(i,e)=\frac{|\mathcal{V}_{i}(e)|}{R_{e}}\in[0,1]. (4)

In practice, DAG metrics can feed detectors (e.g., HI‑XDR) to flag suspicious validators [24].

Since in DAG-BFT each validator proposes at most one vertex per round, |𝒱i(e)|Re|\mathcal{V}_{i}(e)|\leq R_{e} and U(i,e)U(i,e) directly measures continuous participation: value U=1U=1 indicates the validator is present every round throughout the epoch; value U=0U=0 indicates the validator is perpetually offline.

Definition 2 (DAG-Derived Latency).

For vertex v𝒱i(e)v\in\mathcal{V}_{i}(e), let

τ2f+1(v)=min{r>round(v)||{j:vj𝒟e,round(vj)r,vanc(vj)}|2f+1},\tau_{2f+1}(v)=\min\Big\{\,r>\text{round}(v)\;\Big|\\ \big|\{\,j:\exists\,v_{j}\in\mathcal{D}_{e},\,\text{round}(v_{j})\leq r,\\ v\in\text{anc}^{*}(v_{j})\,\}\big|\geq 2f+1\,\Big\}, (5)

the first round where at least 2f+12f+1 distinct validators have vertices that reference vv causally (directly or transitively), with anc()\text{anc}^{*}(\cdot) denoting transitive parent closure. The round-offset of vertex vv is δ(v)=τ2f+1(v)round(v)1\delta(v)=\tau_{2f+1}(v)-\text{round}(v)\geq 1. The raw latency of validator ii at epoch ee is the median round-offset across all its vertices:

~(i,e)=medianv𝒱i(e)δ(v).\tilde{\ell}(i,e)=\operatorname*{median}_{v\in\mathcal{V}_{i}(e)}\delta(v). (6)

For composition into the reward score, raw values are normalized to [0,1][0,1] via a monotone-decreasing mapping:

L(i,e)=1min(~(i,e)1max1, 1),L(i,e)=1-\min\!\left(\frac{\tilde{\ell}(i,e)-1}{\ell_{\max}-1},\;1\right), (7)

with max\ell_{\max} a protocol parameter (default max=2f+1\ell_{\max}=2f+1).

Validators whose vertices are consistently referenced in the next round (δ=1\delta=1) obtain L=1L=1; validators whose vertices are covered only after max\geq\ell_{\max} rounds obtain L=0L=0.

Determinism and Byzantine tolerance. Equations (4)–(7) reference only (i) the finalized DAG structure 𝒟e\mathcal{D}_{e} and (ii) the validator set 𝒱e\mathcal{V}_{e} at that epoch. Both objects are already agreed upon via the DAG-BFT commit mechanism itself: every honest node running the protocol stores 𝒟e\mathcal{D}_{e} that is identical (modulo causal history from the same committed anchor) and validator registry 𝒱e\mathcal{V}_{e} identical (locked at epoch e1e-1 boundary). Consequently, functions U(i,e)U(i,e) and L(i,e)L(i,e) are deterministic functions of consensus state, and any two honest nodes computing them at epoch ee boundary will obtain bit-exact identical numerical values. Byzantine tolerance over UU and LL values is inherited from DAG-BFT: no new attack surface is introduced, because Byzantine validators claiming different UU or LL values would be detected via local recomputation by every honest node.

No external oracle required. Unlike quality-based reward schemes commonly implemented on production PoS protocols (e.g., Polkadot era points [25] or Solana skip-rate penalties), which partially rely on external aggregators or oracles to report availability and latency, FairWave requires no component outside the consensus protocol set. There is no separate committee, no trusted reporter, and no additional communication channel for agreeing on UU and LL values. Consequently, FairWave’s threat model for reward computation is identical to the threat model of the underlying DAG-BFT: if DAG consensus is safe under f<n/3f<n/3, then reward score computation is also safe under the same assumption.

IV FairWave Protocol Design

IV-A Design Rationale

Three fundamental architectural decisions underpin FairWave: asynchronous security, DAG-based data dissemination, and VRF-based leader selection. We justify each in sequence.

Why asynchronous BFT, not partially synchronous BFT? Classical leader-based protocols (PBFT [11], HotStuff [10], and Tendermint [12]) depend on the Global Stabilization Time (GST) assumption: after an unknown time, all messages arrive within a known bound Δ\Delta. Before GST, liveness is not guaranteed—if the network remains unstable, the protocol stalls until a timeout expires and a view change selects a new leader. On wide-area deployments with heterogeneous ISPs, transient partitions can persist for minutes, triggering cascading timeouts of many multiples of Δ\Delta. Asynchronous BFT protocols, in contrast, guarantee safety and liveness under arbitrary message delays and require only eventual delivery.

FairWave inherits this asynchronous property through its DAG-BFT structure: when a VRF-selected anchor is unreachable, the protocol executes an instant wave skip with zero timeout delay, in contrast to partially synchronous protocols that block the entire timeout window (8\geq 8 s in a typical Tendermint configuration). The safety property (agreement and validity) is a formal guarantee that follows from the 2f+12f{+}1 strong-support commit rule for any correct DAG-BFT protocol under f<n/3f<n/3, and holds unconditionally; consequently, adversarial evaluation at Task F (Section VII-F) does not present safety as a differential claim, but instead focuses FairWave’s differentiation on the liveness-degradation curve around the Byzantine boundary.

Why DAG-based consensus, not a linear chain? In linear-chain BFT, a single leader proposes one block per round, creating a throughput bottleneck proportional to leader bandwidth. The DAG paradigm, pioneered by Narwhal+Tusk [7], separates data dissemination from ordering: all NN validators produce vertices in parallel each round, yielding N×N\times data availability throughput. Consensus ordering is then applied retroactively by selecting anchor vertices and committing their causal history. This eliminates three pathologies of leader-based protocols: (i) single-leader bandwidth bottleneck, (ii) explicit multi-phase voting (DAG edges function as implicit votes), and (iii) view-change mechanism (replaced by costless wave skip). Empirically, DAG-BFT protocols achieve throughput scaling linearly with NN (see Fig. 18 in Section VII), whereas leader-BFT plateaus or degrades above N=50N=50 due to O(N2)O(N^{2}) message complexity.

Why VRF-based anchor selection? Deterministic leader schedules (round-robin or stake-proportional) are predictable: an adversary can target the next leader with denial-of-service attacks before their slot begins. Verifiable Random Functions (VRF, RFC 9381 ECVRF Edwards25519-SHA512-Elligator2) provide three properties that address this: (i) unpredictability—only the holder of the secret key can compute the VRF output, so no party can predict the anchor before the selection round; (ii) verifiability—every observer can verify the VRF proof against the public key, ensuring integrity without trusted setup; (iii) non-interactivity—each validator computes the racing key ki=ln(ui)/Wsel(i)k_{i}=-\ln(u_{i})/W_{\text{sel}}(i) independently, requiring no additional communication round. The exponential race mechanism ensures that selection is distributionally equivalent to weighted sampling proportional to WselW_{\text{sel}}, preserving the security guarantees of the dual-channel architecture. The marginal overhead is +2+2 ms commit latency versus Bullshark (Table V), an acceptable trade-off for DoS-resistant leader selection.

IV-B DAG-BFT Wave Structure

FairWave operates on a DAG-BFT wave protocol. Each wave consists of r=3r=3 rounds. Vertices are cryptographically signed messages that reference 2f+1\geq 2f+1 vertices from the previous round as parents. Anchors are selected per wave via weighted VRF sortition using WselW_{\text{sel}}. A Commit occurs when 2f+12f+1 support vertices referencing the anchor are observed.

Fig. 4 illustrates the DAG-wave and anchor-selection structure and the 2f+12f{+}1 strong-support commit condition.

Refer to caption
Figure 4: FairWave DAG-BFT wave structure with r=3r{=}3 rounds per wave. Each round contains NN vertices (one per validator); each round-rr vertex references 2f+1\geq 2f{+}1 parents from round r1r{-}1 (thin gray edges). The anchor (red for wave ww, blue for wave w+1w{+}1) is selected per wave via weighted VRF sortition on WselW_{\mathrm{sel}}. A commit fires when 2f+12f{+}1 support vertices in the next round reference the anchor (blue thick arrows). The dashed line between r2r_{2} and r3r_{3} marks the wave boundary; if an anchor were unreachable, the protocol executes a cost-free wave-skip and advances immediately to wave w+1w{+}1 with a fresh VRF race—no view-change, no timeout cascade.

Per-epoch metric computation. At each epoch boundary, after the seven-phase ceremony (FREEZE \to SLASH \to REGISTRY \to SEED \to SNAPSHOT \to RESET \to BARRIER) completes and the latest epoch DAG has been finalized, each node computes U(i,e)U(i,e) and L(i,e)L(i,e) for all i𝒱ei\in\mathcal{V}_{e} locally using Eqs. (4)–(7). Since finalized DAG 𝒟e\mathcal{D}_{e} is identical across all honest nodes (modulo causal history from committed anchors) and validator set 𝒱e\mathcal{V}_{e} is atomically locked at epoch e1e-1 boundary, both metric values will be identical across all honest nodes without additional message exchange. The equivocation detection penalty (see Algorithm 1) modifies Rep—not UU or LL—so UU and LL computation does not depend on slashing results and does not create cross-metric feedback paths. Consequently, weight vectors WselW_{\text{sel}} and WrewW_{\text{rew}} used at epoch e+1e+1 can be assembled trustlessly by each node at the end of epoch ee, without additional aggregation steps, without a separate committee, and without an external oracle.

IV-C Dual-Channel Decoupling

FairWave’s central architectural innovation is the explicit separation of consensus-critical selection from economic reward distribution via channels with opposite curvature with respect to stake:

Fig. 5 illustrates the dual-channel separation between selection and reward channels and their opposite curvature in stake.

Refer to caption
Figure 5: FairWave dual-channel architecture. Both channels consume the same per-epoch validator state (si,Ui,Li,Repie1)(s_{i},U_{i},L_{i},\mathrm{Rep}_{i}^{e-1}) but apply opposite curvature on stake. The selection channel (blue, super-linear) excludes reputation to break the circular dependency between selection results and selection weights (Proposition 1), providing strict Sybil resistance g(K)<1g(K)<1 (Theorem 1). The reward channel (green, sub-linear) includes lagged reputation as its largest component (γ=0.40\gamma{=}0.40), providing meritocratic fairness with bounded HHI (Theorem 2). Outputs differ in cardinality: the selection channel elects one anchor per wave via a VRF race, while the reward channel distributes proportional shares to all validators.

Selection channel (operational decisions—anchor selection, critical-path participation) is amplified by quadratically-compressed stakes but without reputation contribution:

Wsel(i)=[αS(i)+βU(i)+δL(i)]S(i)e,e=2W_{\text{sel}}(i)=\big[\,\alpha\,S(i)+\beta\,U(i)+\delta\,L(i)\,\big]\cdot S(i)^{e},\quad e=2 (8)

Define real-time performance score P(i)=αS(i)+βU(i)+δL(i)P(i)=\alpha\,S(i)+\beta\,U(i)+\delta\,L(i) so that Wsel(i)=P(i)S(i)2W_{\text{sel}}(i)=P(i)\cdot S(i)^{2}. With the default simplex constraint (α,β,γ,δ)=(0.20; 0.25; 0.40; 0.15)(\alpha,\beta,\gamma,\delta)=(0.20;\,0.25;\,0.40;\,0.15), the fraction α+β+δ=0.60\alpha+\beta+\delta=0.60 of total weight mass remains in the selection channel, while fraction γ=0.40\gamma=0.40 is allocated exclusively to the reward channel. Weight WselW_{\text{sel}} remains super-linear with respect to stake, amplified by a quadratic exponent on the compressed stake factor.

Reward channel (economic incentives—block reward distribution) retains the full multi-factor score:

Wrew(i)=R(i)=αS(i)+βU(i)+γRepactive(i,e)+δL(i)W_{\text{rew}}(i)=R(i)=\alpha S(i)+\beta U(i)+\gamma\text{Rep}_{\text{active}}(i,e)+\delta L(i) (9)

with Repactive\text{Rep}_{\text{active}} being the lagged reputation as defined in Eq. (3). This weight is sub-linear in stake via S(i)=si/SmaxS(i)=\sqrt{s_{i}/S_{\max}} and is bounded above by R(i)1.0R(i)\leq 1.0 with a floor (1α)(1-\alpha) for any validator maintaining perfect non-stake attributes.

Justification: removing Rep from the selection channel. In the initial formulation Wsel=RS2W_{\text{sel}}=R\cdot S^{2} where RR included γRep\gamma\,\text{Rep}, a one-step positive feedback loop arises: high Rep(i)\text{Rep}(i) \Rightarrow high R(i)R(i) \Rightarrow high Wsel(i)W_{\text{sel}}(i) \Rightarrow increased anchor-selection frequency \Rightarrow faster accumulation of active support \Rightarrow higher Rep(i)\text{Rep}(i) in the next epoch. This loop structurally shifts centralization from the stake dimension to the reputation dimension without resolving the core issue—selection power concentration on validators already dominant in previous epochs. By excluding Rep from WselW_{\text{sel}} (Eq. 8), we break the direct causal path between epoch ee selection results and selection weights in the same epoch; the formal statement appears in Proposition 1 in Section V.

Reputation remains fully effective in the reward channel. Excluding Rep from WselW_{\text{sel}} does not eliminate reputation’s role as a long-term honesty incentive. Reputation remains the largest component (γ=0.40\gamma=0.40) of WrewW_{\text{rew}}, so validators who behave honestly over time still receive larger reward shares than validators who have only recently recovered from slashing. Consequently, the game-theoretic equilibrium for reputation farming remains below the critical threshold γcrit0.57\gamma_{\text{crit}}\approx 0.57 (see Eq. 10): attackers creating low-cost identities solely to farm reputation still face a break-even at γeff0.72\gamma_{\text{eff}}\approx 0.72 when combined with equivocation slashing. The justification for γ=0.40\gamma=0.40 in Section IV.D therefore remains valid without modification for the reward channel.

This separation reinforces the dual-channel philosophy. FairWave’s architectural innovation is expressed as opposite curvature in stake across two distinct protocol functions. Eq. (8) reinforces—rather than weakens—this philosophy: the selection channel now puristically relies on measurable real-time performance (epoch-locked stake, DAG-derived uptime, DAG-derived latency—all verifiable bit-exact by any honest node at epoch boundary), while the reward channel integrates historical components (lagged reputation) reflecting longitudinal quality. This temporal separation—operational real-time for selection and historical lagged for reward—yields two guarantees that were previously in tension: (i) no epoch ee selection weight depends on epoch ee selection outcomes, so anchor computation is strictly forward-causal; and (ii) honesty incentives are preserved because reputation affects economic dimension (reward) without amplifying selection power.

Rationale. Super-linearity of the selection channel ensures that splitting a stake into KK Sybil identities reduces aggregate selection weight (Sybil gain <1<1, see Theorem 1). Simultaneously, reward-channel sub-linearity ensures high-quality small validators receive proportionally higher returns per unit stake invested, preventing plutocratic convergence.

IV-D Justification of Multi-Factor Weights

Classic PoS protocols (Cosmos Tendermint, Algorand, and Ethereum 2.0 Casper-FFG) use stake-only weighting, producing two pathologies: (i) reward concentration proportional to capital and (ii) no long-term disincentive for dishonest behavior beyond slashing. FairWave’s multi-factor score R(i)=αS+βU+γRep+δLR(i)=\alpha S+\beta U+\gamma\text{Rep}+\delta L introduces three non-stake factors that are non-fungible and not purchasable—they must be earned through sustained honest participation, combining skin-in-the-game (PoS) with bona fide work (analogous to PoW/PoSpace).

α=0.20\alpha=0.20 (stake): Anti-Sybil floor. With the revision in Eq. 8 that excludes reputation from WselW_{\text{sel}}, the closed-form Sybil gain (Eq. 13) converges at x=1x=1 to g(β+δ)/(α+β+δ)=(1γα)/(1γ)g\to(\beta+\delta)/(\alpha+\beta+\delta)=(1-\gamma-\alpha)/(1-\gamma) as KK\to\infty. Thus, an attacker splitting into infinitely many Sybils loses at least a fraction α/(1γ)\alpha/(1-\gamma) of the selection share. For (α,γ)=(0.20, 0.40)(\alpha,\gamma)=(0.20,\;0.40), the infinite-Sybil loss is 0.20/0.6033.3%0.20/0.60\approx 33.3\%; equivalently, an adversary must commit 3.0×\approx 3.0\times the honest stake budget to recover the lost selection share, making splitting economically unattractive. The safe range for α\alpha is [0.10,0.30][0.10,0.30]: below 0.100.10 the anti-Sybil floor degrades unacceptably, while above 0.300.30 the protocol approaches Pure-PoS whale dominance. The default α=0.20\alpha=0.20 is a conservative midpoint within this range.

γ=0.40\gamma=0.40 (reputation): Longitudinal honesty reward. Reputation is the only factor requiring historical accumulation—it cannot be transferred, bought, or obtained instantly. Assigning γ\gamma as the largest weight binds rewards to longitudinal identity, providing multi-step Sybil resistance beyond the selection channel. This choice mirrors reputation weightings like Hedera Hashgraph [26] (40%\approx 40\%) and Polkadot Era Points [25]. Trade-off: each +0.10+0.10 in γ\gamma reduces farming resistance by approximately 0.070.07 (slope 0.70-0.70 from Eq. 10); the default γ=0.40\gamma=0.40 maintains farming resistance 0.50\geq 0.50 when combined with equivocation slashing (penalty 0.30-0.30 per detection).

β=0.25\beta=0.25 (uptime): Continuous availability. Uptime is a real-time observable that is expensive to spoof without equivalent operational cost. DAG-BFT protocols require continuous vertex propagation (unlike leader-BFT, where nodes can be temporarily offline), making uptime more critical. Empirical sweeps show the fairness objective is relatively flat for β[0.10, 0.40]\beta\in[0.10,\;0.40], confirming β=0.25\beta=0.25 lies in a robust region. Industry norms (Solana skip-rate penalties approximately 101020%20\%, Polkadot Era Points approximately 20%20\%) support this calibration.

δ=0.15\delta=0.15 (latency): Infrastructure tie-breaker. Latency is the factor most easily optimized via infrastructure investment (dedicated hardware, co-location, and peering). A large δ\delta would induce an infrastructure arms race favoring well-capitalized validators—contrary to fairness goals. A small weight makes latency a tie-breaker among validators comparable on other factors, sufficient to disincentivize very slow nodes (P99 >500>500 ms) without driving extreme optimization. Empirical sweeps show that δ0.20\delta\geq 0.20 begins to reward such infrastructure investment disproportionately, eroding the fairness gain; δ=0.15\delta=0.15 is the identified sweet spot that retains the tie-breaker role without favoring well-capitalized validators.

Literature comparison. Table II summarises multi-factor approaches in production protocols, confirming FairWave’s parameterization aligns with industry precedent while providing formal guarantees absent in prior work.

TABLE II: Multi-factor weight comparison across protocols.
Protocol Stake Uptime Rep. Latency
Cosmos Tendermint 100%
Ethereum 2.0 100% penalty
Solana \sim80% \sim20%
Hedera Hashgraph \sim30% \sim40% \sim10%
Polkadot 2.0 \sim80% \sim20% (Era Pts)
FairWave 20% 25% 40% 15%

Ablation analysis. To demonstrate each factor is necessary (not merely convenient), Table III reports the consequences of setting each weight to zero while evenly redistributing its mass to the remaining factors.

TABLE III: Ablation study: effect of removing each factor.
Removed Gini Floor Sybil Res. Farm.
None (default) 0.149 20% 0.58
α=0\alpha=0 (no stake) 0.031 0% 1.00
β=0\beta=0 (no uptime) 0.155 20% 0.52
γ=0\gamma=0 (no reputation) 0.138 20% 1.00
δ=0\delta=0 (no latency) 0.151 20% 0.56

Removing stake (α=0\alpha=0) eliminates the anti-Sybil floor entirely—an attacker splitting into KK identities suffers no penalty, making the protocol trivially exploitable. Removing reputation (γ=0\gamma=0) removes the farming attack vector but simultaneously eliminates the incentive for longitudinal honesty: validators lack a long-term stake in maintaining exemplary behavior, reducing the protocol to a transient metric-based system. Removing uptime (β=0\beta=0) permits ”zombie validators” (nodes that are perpetually offline) to accumulate rewards via reputation alone; empirically, the Gini coefficient increases and the protocol tolerates roughly 15% more inactive validators before liveness degrades. Removing latency (δ=0\delta=0) eliminates differentiation between geo-diverse and co-located validators, reducing incentives for network quality; while the direct fairness impact is small, throughput sensitivity increases (One-At-a-Time elasticity of link latency to throughput: |e|=1.17|e|=1.17).

Robustness over optimality. The default parameter vector (0.20; 0.25; 0.40; 0.15)(0.20;\;0.25;\;0.40;\;0.15) ranks 576/819 in the constrained feasible set—deliberately not Pareto-optimal. This conservative choice is motivated by robustness to model misspecification: the default simultaneously satisfies all security constraints across three evaluation regimes (homogeneous, heterogeneous, adversarial), whereas the Pareto-optimal point (α=0.10,γ=0.00,δ=0.45\alpha=0.10,\;\gamma=0.00,\;\delta=0.45) assumes adversarial farming as the primary concern. Under regime uncertainty—where the deployment environment may shift between cooperative and adversarial periods—the parameter vector that satisfies all constraints with margin is preferred over one that optimizes a single objective.

Threshold farming (game-theoretic). The closed-form farming resistance satisfies MWfarm=αSr+(1α)0.70γ\text{MW}_{\text{farm}}=\alpha\cdot S_{r}+(1-\alpha)-0.70\gamma, where Sr=Smin/SmaxS_{r}=\sqrt{S_{\min}/S_{\max}}. Requiring MWfarm0.50\text{MW}_{\text{farm}}\geq 0.50 (the minimum threshold for rational, honest behavior) yields the critical bound:

γcrit=αSr+(1α)0.500.700.57\gamma_{\text{crit}}=\frac{\alpha\cdot S_{r}+(1-\alpha)-0.50}{0.70}\approx 0.57 (10)

For γ>0.57\gamma>0.57, a rational adversary can profitably farm reputation to dominate reward share. The default γ=0.40\gamma=0.40 provides a safety margin of 30% below this critical threshold. Combined with equivocation slashing (0.30-0.30 per detection, recovery half-life 69\approx 69 epochs), the effective break-even farming shifts further to γeff0.72\gamma_{\text{eff}}\approx 0.72, confirming operational safety.

Principled criteria for β\beta and δ\delta. Rather than arbitrary selection, β\beta and δ\delta are constrained by two conditions: (i) the simplex constraint β+δ=1αγ=0.40\beta+\delta=1-\alpha-\gamma=0.40 and (ii) the principle of asymmetry of controllability: β>δ\beta>\delta because uptime is a binary operational commitment (maintain a server \to U1U\approx 1), while latency partially depends on uncontrollable factors (geographic distance, ISP routing). Assigning higher weight to the more controllable factor ensures validators can improve their score through effort rather than capital. The ratio β/δ=5/3\beta/\delta=5/3 reflects that uptime is roughly 5/3×5/3\times more controllable than latency in typical cloud deployments.

IV-E VRF-based Anchor Selection

Anchor selection uses exponential racing via a Verifiable Random Function (VRF, RFC 9381 ECVRF Edwards25519-SHA512-Elligator2): each validator ii computes the racing key ki=ln(ui)/Wsel(i)k_{i}=-\ln(u_{i})/W_{\text{sel}}(i) where ui(0,1)u_{i}\in(0,1) is derived from the VRF output. The validator with the minimum kik_{i} is selected as the anchor. This mechanism is distributionally equivalent to weighted sampling proportional to WselW_{\text{sel}} while providing unpredictability and non-interactivity.

IV-F Protocol Pseudocode

Algorithm 1 presents FairWave protocol logic in full, covering weight computation, VRF-based anchor selection, support gathering, equivocation detection, commit decision, reward distribution, and reputation update.

Algorithm 1 FairWave: Wave Anchor Selection, Commit, and Reward
1:  function ComputeWeights(𝒱,e\mathcal{V},e) {Epoch ee, use Rep from e1e-1}
2:  for each validator vi𝒱v_{i}\in\mathcal{V} do
3:   S(i)si/SmaxS(i)\leftarrow\sqrt{s_{i}/S_{\max}} {Normalized stake factor}
4:   U(i),L(i)U(i),L(i)\leftarrow DeriveFromDAG(𝒟e1,i\mathcal{D}_{e-1},i) {Eqs. (4), (7)}
5:   Repactive(i)Repfinal(i,e1)\text{Rep}_{\text{active}}(i)\leftarrow\text{Rep}_{\text{final}}(i,e-1) {Lagged, Eq. (3)}
6:   P(i)αS(i)+βU(i)+δL(i)P(i)\leftarrow\alpha\cdot S(i)+\beta\cdot U(i)+\delta\cdot L(i) {Real-time performance (no Rep)}
7:   R(i)P(i)+γRepactive(i)R(i)\leftarrow P(i)+\gamma\cdot\text{Rep}_{\text{active}}(i) {Full reward score}
8:   Wsel(i)P(i)S(i)2W_{\text{sel}}(i)\leftarrow P(i)\cdot S(i)^{2} {Selection super-linear, no Rep}
9:   Wrew(i)R(i)W_{\text{rew}}(i)\leftarrow R(i) {Reward sub-linear, includes Repactive{}_{\text{active}}}
10:  end for
11:  
12:  function WaveAnchorElection(𝒱\mathcal{V})
13:  for each validator vi𝒱v_{i}\in\mathcal{V} do
14:   βiVRF.Eval(ski,epochwave)\beta_{i}\leftarrow\textsc{VRF.Eval}(sk_{i},\text{epoch}\|\text{wave})
15:   uiβi/2256u_{i}\leftarrow\beta_{i}/2^{256} {Map to (0,1)(0,1)}
16:   kiln(ui)/Wsel(i)k_{i}\leftarrow-\ln(u_{i})/W_{\text{sel}}(i) {Exponential racing}
17:  end for
18:  anchorargminiki\text{anchor}\leftarrow\arg\min_{i}k_{i}
19:  
20:  function CommitDecision(anchor)
21:  support\text{support}\leftarrow Collect(2f+12f+1 signed votes for anchor)
22:  for each vote in support do
23:   if DetectEquivocation(vote.sender) then
24:    Slash(vote.sender, fraction =0.20=0.20)
25:    Rep(sender)Rep(sender)0.30\text{Rep}(\text{sender})\leftarrow\text{Rep}(\text{sender})-0.30
26:   end if
27:  end for
28:  if |support|2f+1|\text{support}|\geq 2f+1 and no safety violations then
29:   Commit(anchor.vertex)
30:  else
31:   SkipWave() {No view-change}
32:  end if
33:  
34:  function DistributeReward(anchor, 𝒱\mathcal{V})
35:  shares(i)Wrew(i)/jWrew(j)\text{shares}(i)\leftarrow W_{\text{rew}}(i)/\sum_{j}W_{\text{rew}}(j)
36:  Pay(𝒱\mathcal{V}, base_reward ×\times shares)
37:  for each supporter viv_{i} do
38:   Rep(i)Rep(i)+0.01\text{Rep}(i)\leftarrow\text{Rep}(i)+0.01 {Increment}
39:  end for
40:  
41:  function EpochBoundary(𝒱,e\mathcal{V},e)
42:  FinalizeDAG(𝒟e\mathcal{D}_{e}) {Wait for last epoch ee commit}
43:  for each validator vi𝒱v_{i}\in\mathcal{V} do
44:   Rep(i)0.99×Rep(i)\text{Rep}(i)\leftarrow 0.99\times\text{Rep}(i) {Decay, t1/269t_{1/2}\approx 69 epoch}
45:   Repfinal(i,e)Rep(i)\text{Rep}_{\text{final}}(i,e)\leftarrow\text{Rep}(i) {Lock final epoch value}
46:  end for
47:  return {Repfinal(i,e):vi𝒱}\{\text{Rep}_{\text{final}}(i,e):v_{i}\in\mathcal{V}\} {Becomes Repactive{}_{\text{active}} at epoch e+1e+1}

V Mathematical Foundations

V-A Closed-Form Min/Whale Ratio

Under homogeneous performance (U=Rep=L=1U=\text{Rep}=L=1, Regime A), the reward score reduces to R(i)=αS(i)+(1α)R(i)=\alpha\cdot S(i)+(1-\alpha). The minimum-to-maximum reward ratio admits closed form:

RminRmax=αSmin/Smax+(1α)\frac{R_{\min}}{R_{\max}}=\alpha\cdot\sqrt{S_{\min}/S_{\max}}+(1-\alpha) (11)

On default parameters: Rmin/Rmax=0.200.1+0.80=0.863R_{\min}/R_{\max}=0.20\cdot\sqrt{0.1}+0.80=0.863. This is validated bit-exact (ϵ=0.0\epsilon=0.0) against 99 empirical evaluation points (Fig. 6).

Refer to caption
Figure 6: Min/Whale reward ratio as a function of stake weight α\alpha. Closed-form prediction (Eq. 11) matches empirical simulation with zero error across all 21 evaluation points. The ratio decreases linearly with increasing α\alpha, with slope Smin/Smax1=0.684\sqrt{S_{\min}/S_{\max}}-1=-0.684.

V-B Closed-Form Sybil Gain Analysis

Consider an attacker controlling normalized stake fraction x(0,1]x\in(0,1] splitting into KK Sybil identities. Under Regime A (U=L=1U=L=1 and without reputation contribution to WselW_{\text{sel}} per Eq. 8), the real-time performance score becomes P=αS+(β+δ)P=\alpha S+(\beta+\delta), so the aggregate selection weight of KK Sybils simplifies to:

Wsel(K)=αx3/2/K+(β+δ)xW_{\text{sel}}^{(K)}=\alpha x^{3/2}/\sqrt{K}+(\beta+\delta)\cdot x (12)

Weight without split is Wsel(1)=αx3/2+(β+δ)xW_{\text{sel}}^{(1)}=\alpha x^{3/2}+(\beta+\delta)\cdot x. Selection gain is:

gsel(K)=αx3/2/K+(β+δ)xαx3/2+(β+δ)xg_{\text{sel}}(K)=\frac{\alpha x^{3/2}/\sqrt{K}+(\beta+\delta)\,x}{\alpha x^{3/2}+(\beta+\delta)\,x} (13)
Theorem 1 (Sybil Resistance).

For all K>1K>1, x(0,1]x\in(0,1], α(0,1)\alpha\in(0,1), and β+δ>0\beta+\delta>0: gsel(K)<1g_{\text{sel}}(K)<1.

Proof.

The numerator differs from denominator only in the first term: αx3/2/K\alpha x^{3/2}/\sqrt{K} versus αx3/2\alpha x^{3/2}. Since K>11/K<1K>1\Rightarrow 1/\sqrt{K}<1, the numerator is strictly less than denominator for α>0\alpha>0 and x>0x>0. Excluding γRep\gamma\,\text{Rep} from WselW_{\text{sel}} does not affect this argument: the coefficient on the linear term—which does not contain KK—merely shifts from (1α)(1-\alpha) to (β+δ)(\beta+\delta), so the inequality gsel(K)<1g_{\text{sel}}(K)<1 is preserved. ∎

Numerical example. For K=100K=100, x=1x=1, (α,β,δ)=(0.20, 0.25, 0.15)(\alpha,\beta,\delta)=(0.20,\;0.25,\;0.15): numerator =0.20/10+0.40=0.42=0.20/10+0.40=0.42, denominator =0.20+0.40=0.60=0.20+0.40=0.60, hence gsel(100)=0.70g_{\text{sel}}(100)=0.70. An attacker loses 30% of selection share by splitting into 100 Sybils—strictly stronger than g=0.82g=0.82 for the earlier formulation that included Rep in WselW_{\text{sel}}. Empirical comparisons with other reward rules are shown in Fig. 7.

Refer to caption
Figure 7: Sybil selection-channel gain g(K)g(K) for four reward rules at xfrac=0.10x_{\text{frac}}=0.10. FairWave (green) is the only rule showing monotone decreasing gain (g<1g<1), establishing strict Sybil resistance. SRSW gain scales as K\sqrt{K}; LSW gain scales approximately linearly with KK, reaching 25×25\times at K=100K=100.

V-C Bounded HHI Fixed-Point

Under the FairWave reward rule, the stake growth rate for validator ii is given by dsi/dt=REWARD_TOTALR(i)/jR(j)ds_{i}/dt=\text{REWARD\_TOTAL}\cdot R(i)/\sum_{j}R(j). Since R(i)R(i) saturates as siSmaxs_{i}\to S_{\max}, all high-quality validators converge to a similar reward share:

limtHHI(t)=1/N\lim_{t\to\infty}\text{HHI}(t)=1/N (14)
Theorem 2 (Bounded Concentration).

Under FairWave reward dynamics with stake reinvestment, the Herfindahl-Hirschman Index (HHI(t)\text{HHI}(t)) decreases monotonically non-increasing for sufficiently large tt, converging to the uniform distribution fixed point 1/N1/N.

V-D No Circular Dependency in Selection Channel

Proposition 1 (No Circular Dependency).

In FairWave, with the WselW_{\text{sel}} formulation in Eq. (8) and lagged reputation in Eq. (3), there is no causal path from anchor selection results (the set of validators selected as anchors) at epoch ee to selection weights {Wsel(i,e)}i𝒱e\{W_{\text{sel}}(i,e)\}_{i\in\mathcal{V}_{e}} at the same epoch.

Proof.

Epoch ee selection weights are functions of S(i)S(i), U(i,e1)U(i,e-1), and L(i,e1)L(i,e-1) per Eq. (8). It suffices to show that all three arguments are determined before the epoch ee anchor selection process begins.

(i) Stake S(i)S(i): The validator registry 𝒱e\mathcal{V}_{e} and stake vector {si}i𝒱e\{s_{i}\}_{i\in\mathcal{V}_{e}} are locked atomically at epoch e1e-1 boundary via the seven-phase ceremony (FREEZE \to SLASH \to REGISTRY \to SEED \to SNAPSHOT \to RESET \to BARRIER); no stake modifications can occur intra-epoch. Therefore, S(i)S(i) for epoch ee depends only on the state at the end of epoch e1e-1.

(ii) Uptime U(i,e1)U(i,e-1) and latency L(i,e1)L(i,e-1): Based on Eqs. (4)–(7), both metrics are deterministic functions of the finalized DAG 𝒟e1\mathcal{D}_{e-1}. Finalization of 𝒟e1\mathcal{D}_{e-1} itself occurs before the epoch e1e-1 boundary closes; thereafter, the DAG structure is immutable from the consensus perspective.

(iii) Reputation: Although Repactive(i,e)=Repfinal(i,e1)\text{Rep}_{\text{active}}(i,e)=\text{Rep}_{\text{final}}(i,e-1) (Eq. (3)) is used in WrewW_{\text{rew}}, it does not appear in WselW_{\text{sel}} per Eq. (8); consequently, reputation contributes no causal path to selection weights. Moreover, even if traced extensively through the reward channel, Repfinal(i,e1)\text{Rep}_{\text{final}}(i,e-1) is a value locked at the end of epoch e1e-1 and is not updated by epoch ee selection results.

Since all three components determining Wsel(i,e)W_{\text{sel}}(i,e) are functions of state at or before epoch e1e-1 closure, and anchor selection results at epoch ee are products of {Wsel(i,e)}\{W_{\text{sel}}(i,e)\} (via VRF exponential racing), there is no directed cycle in the dependency graph {Wsel(i,e)}{anchor(e)}{Wsel(i,e)}\{W_{\text{sel}}(i,e)\}\to\{\text{anchor}(e)\}\to\{W_{\text{sel}}(i,e)\}. Forward causality of {Wsel(i,e)}\{W_{\text{sel}}(i,e)\} over epoch ee selection results is strict. ∎

Corollary. Epoch ee anchor selection results can only affect WselW_{\text{sel}} at epochs e+2\geq e+2, with a minimum two-epoch delay arising from (i) reputation and DAG metric updates at the close of epoch ee, and (ii) the additional one-epoch lag in the use of Repactive\text{Rep}_{\text{active}}. This eliminates intra-epoch feedback loops and removes the structural vulnerability where validators frequently selected as anchors in epoch ee obtain an additional selection advantage in the same epoch—a vulnerability present in the earlier formulation Wsel=RS2W_{\text{sel}}=R\cdot S^{2} when RR included γRep\gamma\,\text{Rep}.

V-E Parameter Space Characterization

The parameter space (α,β,γ,δ)(\alpha,\beta,\gamma,\delta) is systematically explored via 1,407 evaluation points on the 4-D simplex (Fig. 31(a)). The Pareto frontier (Fig. 32) of Min/Whale ratio versus reputation farming resistance identifies a feasible region bounded by α0.10\alpha\geq 0.10 (Sybil safety floor) and farming-resistance 0.50\geq 0.50.

VI Experimental Methodology

VI-A Implementation

The evaluation framework is implemented in Python 3.12 with minimal dependencies (NumPy only for Sobol sampling). The codebase consists of nine analytical modules, an RFC 9381 ECVRF-conformant VRF module (Edwards25519-SHA512-Elligator2), and a post-quantum cryptography overhead benchmark module.

VI-B Test Suite

Implementation is validated by 251 deterministic unit tests plus 96 property-based Hypothesis subtests (total 347), all reproducible via fixed seed (0xC0FFEE).

VI-C Comparative Protocol Models

Nine protocols are evaluated under identical parametric assumptions: DAG-BFT: Narwhal+Tusk (3-wave), Bullshark (2-wave), Mysticeti (1-wave), FairWave (2-wave + VRF). Leader-BFT: PBFT (3-phase), Tendermint (3-phase), HotStuff (4-phase), HotStuff-2 (2-phase). PoS-1: Algorand (3-phase VRF sortition). Protocol representation, the baseline systems are modeled according to their published protocol spesifications and documented design characteristics. The purpose of the evaluation is not to reproduce implementation-spesific performance results but to analyze and compare consensus level properties within a standarized experimental setting. To ensure comparability, all protocol models are assessed under identical assumptions regarding network conditions,workload generation, and adversarial behavior. Consequently, the reported results should be interpretedas comparative protocol-level evidence under standarized experimental conditions rather than implementation-level performance benchmarks.

VI-D Baseline Performance Characterization

Performance characteristics of all compared protocols are presented in Section VII (Fig. 17).

VI-E Parameter Sensitivity Characterization

Multi-metric sensitivity to weight parameters is comprehensively characterized via α\alpha sweep (Fig. 30 in Section VII), simplex landscape (Figs. 31(a), Figs. 31(b)), and Pareto frontier (Fig. 32) in Appendix APPENDIX.

VII Results

VII-A Mathematical Validation

Closed-form expressions for all four reward rules are validated against numerical simulation across 1,407 parameter cells. Maximum observed absolute error: ϵmax=0.0\epsilon_{\max}=0.0 (machine precision). Reward histograms (Fig. 8) visualize distributional differences across rules.

Refer to caption
Figure 8: Distribution of reward share across validators for six rules (N=50N=50, Pareto stake). FairWave shows concentrated distribution with low variance centered near 1/N=0.021/N=0.02, whereas Pure-PoS shows a heavy right tail reflecting stake concentration.

VII-B Anchor Selection Convergence

10,00010,000 waves are simulated with N=50N=50 validators across five heterogeneous profiles. Fig. 9 shows the most frequently selected top-KK anchors. Law-of-Large-Numbers convergence at rate O(1/T)O(1/\sqrt{T}) and breakdown of selection share by validator profile are presented in Appendix APPENDIX.

Refer to caption
Figure 9: Top-KK anchor selection frequency for three rules (Pure-PoS, FairWave Selection, FairWave Reward). Under Pure-PoS, whale-bad validator dominates the top-4 despite poor quality. Under FairWave Selection, a whale-good validator correctly occupies the top-5. Under FairWave Reward, a small-good validator enters the top-3.

VII-C Fairness Analysis

TABLE IV: Fairness metrics across reward rules (N=50N=50, 10,00010,000 epochs).
Rule Gini \downarrow Nak. \uparrow HHI \downarrow Min/W \uparrow
Pure-PoS 0.488 6 0.0392 0.100
SRSW 0.264 9 0.0254 0.316
LSW 0.053 15 0.0202 0.787
FairWave 0.149 13 0.0214 0.831

FairWave achieves 3.3×3.3\times Gini coefficient reduction relative to Pure-PoS while maintaining quality differentiation—unlike LSW, which achieves the lowest Gini (0.053) but cannot distinguish high-quality from low-quality validators. Lorenz curves (Fig. 10), Gini comparison (Fig. 11), cumulative reward trajectories (Fig. 12), then, for ROI analysis (Fig. 13) provide comprehensive visualization.

Refer to caption
Figure 10: Lorenz curves for six reward rules with heterogeneous validator profiles (N=50N=50). FairWave (green) achieves fairness by approaching diagonally while preserving separation between good and bad validators. LSW (brown) is closest to diagonal but eliminates quality differentiation.
Refer to caption
Figure 11: Gini coefficient comparison across six reward rules. FairWave achieves Gini = 0.149, a 3.3×3.3\times improvement from Pure-PoS (0.488) while maintaining meritocratic differentiation absent in LSW (0.053).
Refer to caption
Figure 12: Cumulative reward accumulation over 10,00010,000 epochs under FairWave. The Small-good validator (stake = SminS_{\min}) obtains 1.44×1.44\times more cumulative reward than the whale-bad validator (stake = SmaxS_{\max}), demonstrating that operational quality dominates capital in determining economic outcomes.
Refer to caption
Figure 13: Return-on-investment (reward share / stake share) per validator profile. FairWave small-good achieves an ROI = 4.25×4.25\times, the highest across all rule×\timesprofile combinations. LSW fails to distinguish small-good from small-bad (both 3.25×3.25\times).

VII-D Decentralization Stress Testing

Four adversarial scenarios were evaluated. Figs.14(a) and Figs.14(b) present single-whale concentration analyses; Fig.15 visualizes the cartel capture threshold; Fig.16 plots the power amplification curve.

Refer to caption
(a) Nakamoto Coefficient
Refer to caption
(b) Top-1 Power
Figure 14: Decentralization stress testing against whale stake share. (a) Pure-PoS collapses to Nakamoto = 1 at 35%\geq 35\%; FairWave maintains 17\geq 17. (b) Under Pure-PoS, power scales linearly with stake; under FairWave, top-1 power remains flat at 2%\sim 2\%.
Refer to caption
Figure 15: Cartel capture heatmap: minimum aggregate stake required for top-kk cartels to capture 1/3\geq 1/3 reward power. Under Pure-PoS, a three-validator cartel with a 40% stake suffices. Under FairWave, even a 10-validator cartel with a 90% stake cannot capture 1/31/3—a categorical improvement in cartel resistance.
Refer to caption
Figure 16: Power amplification curve (log-log): per-validator stake share versus power share on realistic distribution. Diagonal represents perfect plutocracy (power = stake). The FairWave reward channel (right panel) is maximally flat—power asymptotically approaches 1/N1/N regardless of stake. The Selection channel (left panel) is intentionally slightly super-linear for Sybil resistance.

VII-E Performance Evaluation

TABLE V: Performance comparison (N=10N=10, identical parametric assumptions).
Protocol Family Lat. TPS P99
Mysticeti DAG-BFT 40.5 154,435 60.6
FairWave DAG-BFT 82.9 154,435 102.6
Bullshark DAG-BFT 80.9 154,435 99.7
HotStuff-2 Leader 81.2 12,318 102.6
Narwhal DAG-BFT 121.4 154,435 150.6
HotStuff Leader 162.4 12,318 197.7
Algorand PoS-1 124.0 4,034 150.6
Refer to caption
Figure 17: Latency-throughput Pareto frontier for all nine protocols (N=10N=10). The DAG-BFT family occupies the optimal northeast region (low latency, high throughput). FairWave was positioned near Bullshark, trading +2+2 ms for VRF unpredictability.
Refer to caption
Figure 18: Throughput (TPS) versus network size NN on a log-log scale. The DAG-BFT protocols (including FairWave) scale linearly with NN due to parallel vertex production, while leader-BFT protocols plateau or degrade. PBFT throughput collapses at large NN due to O(N2)O(N^{2}) message complexity.

FairWave commit latency (82.9 ms) is within 2.5% of Bullshark (80.9 ms), with the marginal overhead attributable to VRF-based anchor unpredictability. Throughput is effectively identical across the DAG-BFT family due to parallel vertex production. The latency CDF (Fig. 19) and finality scaling (Fig. 20) complete the performance characterization.

Refer to caption
Figure 19: Cumulative distribution function of commit latency (N=10N=10, 1,000 Monte Carlo finality samples). FairWave and Bullshark show nearly identical CDF profiles, with P99 103\leq 103 ms. Mysticeti achieves the tightest distribution (P99 = 61 ms).
Refer to caption
Figure 20: Finality latency scaling versus network size NN. DAG-BFT protocols show sub-linear scaling (logarithmic in NN due to order statistics of 2f+12f+1 delays). PBFT scales quadratically, becoming impractical beyond N100N\approx 100.

VII-F BFT Monte Carlo Validation

This subsection reports results from 420,000 adversarial Monte Carlo rounds executed over 84 parameter cells. To avoid conflating formal guarantees with empirical differential claims, we separate two types of claims commonly combined in BFT evaluation: (i) safety — a formal property inherited from a correct protocol design; and (ii) liveness — an empirical, differential property that depends on protocol behavior near the Byzantine boundary.

Safety claim (formal, inherited from design). FairWave’s safety properties—agreement (no two honest validators commit different anchors in the same wave) and validity (committed anchors are always proposed by validators in 𝒱e\mathcal{V}_{e})—are a formal consequence of the 2f+12f+1 strong-support commit rule for DAG-BFT correctness under f<n/3f<n/3. This property holds unconditionally for any correct protocol in this family (including Bullshark, Narwhal+Tusk, Mysticeti, and Tendermint) and does not depend on FairWave’s parameterization (α,β,γ,δ)(\alpha,\beta,\gamma,\delta) or the dual-channel architecture. Consequently, the observation ”420,000 rounds without a safety violation” is consistent with—but does not prove—the superiority of FairWave: identical results are expected for any correct baseline protocol under the same adversarial scenario and f<n/3f<n/3. We therefore do not present the absence of safety violations as a differential achievement; the value of the experiment is to ensure no regressions relative to known-correct BFT baselines and to validate the reference implementation.

Liveness claim (empirical, candidate differential). The candidate differential claim of FairWave in Task F is the characteristic shape of the liveness-degradation curve near the theoretical Byzantine boundary b=1/3b^{*}=1/3. Fig. 21 shows commit rate as a function of Byzantine fraction bb for N{10,50,100}N\in\{10,50,100\}. Two measurement points obtained directly from simulation are a commit rate of 99.6%99.6\% at b=0.20b=0.20 (safe regime, well inside the boundary) and 71.1%71.1\% at b=0.33b=0.33 (just beyond the theoretical threshold). Between these verified points the curve is monotone-continuous without a discontinuous cliff; Table VI summarizes reference sweep points for the default configuration (N=50N=50, Δ=100\Delta=100 ms, zero partition). We label this claim “candidate differential” because the structural argument for the shape (developed below) is sound, but a direct head-to-head sweep against leader-BFT baselines under identical conditions remains future work (Section IX).

TABLE VI: Reference points for FairWave liveness degradation curve in Task F on default configuration (N=50N=50, Δ=100\Delta=100 ms, zero partition). Commit rates at b{0.20; 0.33}b\in\{0.20;\,0.33\} are direct measurements from simulation; values at b{0.28; 0.31}b\in\{0.28;\,0.31\} are marked with “\approx” because they are estimated from visual reading of the curve in Fig. 21 (uncertainty ±1\pm 122 p.p.), not direct simulation results at those points.
Byzantine fraction bb 0.200.20 0.280.28 0.310.31 0.330.33
Commit rate (FairWave) 99.6%99.6\% 95%\approx\!95\% 86%\approx\!86\% 71.1%71.1\%
Difference from b=0.20b=0.20 (pp) 4.6\approx\!-4.6 13.6\approx\!-13.6 28.5-28.5

Three structural characteristics distinguish this curve from those expected of classic leader-BFT protocols (Tendermint, HotStuff):

  • (i) Absence of a discontinuous cliff at the operational boundary. FairWave’s curve decreases monotonically-continuously from 99.6%99.6\% to 71.1%71.1\% as bb increases from 0.200.20 to 0.330.33, without the sharp drop expected of view-change-driven protocols when a Byzantine leader and timeout windows cause cascading stalls.

  • (ii) Lack of a cascade-timeout mechanism. FairWave uses a cost-free wave-skip (Section IV-A) that advances immediately to the next wave when an anchor cannot be committed, whereas leader-BFT requires a view change with expanding timeout windows that can stall the protocol for many multiples of Δ\Delta as the leader-Byzantine fraction increases.

  • (iii) Latency stability in the safe regime. Commit latency percentiles (P50, P95, P99; Fig. 23) remain stable up to b0.25b\approx 0.25 and then increase sharply as the protocol approaches the theoretical liveness boundary.

Near-threshold analysis (qualitative). The structural root of the three characteristics above is architectural: FairWave separates data dissemination from ordering and replaces view-change with exponential-racing VRF-based anchor selection. Thus, when a VRF-selected anchor happens to be controlled by the adversary, the cost to advance to the next wave is a single round (a fresh VRF computation), not a full timeout window. In leader-BFT protocols, by contrast, each failed view-change triggers exponential increases in the subsequent timeout (exponential backoff) to guarantee liveness under partial synchrony; the practical effect is a more rapid decline in commit rate—and a steeper profile—as the leader-Byzantine fraction increases. This differentiation makes the shape of the liveness-degradation curve a differential property FairWave can claim; the exact magnitude of the difference, including the onset point b=min{b:commit rate(b)0.95}b^{\dagger}=\min\{b:\text{commit rate}(b)\leq 0.95\} for each protocol, will be reported after a finer-grained bb sweep against leader-BFT baselines (Section IX).

Supporting validation. Fig. 22 maps the two-dimensional fault space (Byzantine ×\times offline) and confirms the combined bound byz+offline1/3\text{byz}+\text{offline}\leq 1/3, consistent with theoretical BFT predictions. Fig. 23 characterizes the latency distribution under adversary conditions, and Fig. 24 demonstrates partition-recovery dynamics: after a 5-round partition event, cumulative commits return immediately to the ideal rate of 1 commit/round, confirming the absence of the permanent stalls typical of leader-BFT designs.

Refer to caption
Figure 21: Liveness degradation curve: commit rate as a function of Byzantine fraction for N{10,50,100}N\in\{10,50,100\}. Two verified measurement points are 99.6%99.6\% at b=0.20b=0.20 and 71.1%71.1\% at b=0.33b=0.33; values at b{0.28; 0.31}b\in\{0.28;\,0.31\} summarized in Table VI are visual readings from this curve, not direct simulation results at those points. The profile between both verified points is monotone-continuous without a discontinuous cliff, reflecting the absence of a cascade timeout mechanism in FairWave.
Refer to caption
Figure 22: Success rate heatmap as a function of Byzantine fraction and offline rate (N=50N=50). The diagonal drop visible in the heatmap indicates that when the combined Byzantine and offline faults exceed one-third of the network, the system reaches the expected BFT fault tolerance limit.
Refer to caption
Figure 23: Commit latency percentiles (P50, P95, and P99) as a function of Byzantine fraction. Latency degrades gracefully until Byzantine =25%=25\%, then increases sharply as the protocol approaches the liveness threshold.
Refer to caption
Figure 24: Partition recovery trace over 200 rounds with partition probability =0.05=0.05, duration =5=5 rounds. Red windows show partition events. Cumulative commit count (blue) temporarily plateaus during partition but asymptotically follows the ideal rate (1 commit/round), demonstrating resilient recovery.

VII-G Longitudinal Dynamics

50,00050,000 epochs with full stake reinvestment reveal long-term behavior of concentration metrics (Fig. 25(a)26).

TABLE VII: HHI evolution under stake reinvestment (N=50N=50).
Rule HHIt=0 HHIt=50k{}_{t=50\text{k}} Trend
PoS2 0.0392 0.1158 +195%+195\%
Pure-PoS 0.0392 0.0392 invariant
FairWave 0.0392 0.0210 𝟒𝟔%\mathbf{-46\%}
Refer to caption
(a) HHI Trajectory
Refer to caption
(b) Gini Trajectory
Refer to caption
(c) Top-1 Share Trajectory
Figure 25: Longitudinal concentration metrics over 50,00050,000 epochs with stake reinvestment. (a) HHI: PoS2 divergent (+195%+195\%), Pure-PoS invariant, FairWave decreases monotonically toward 1/N1/N. (b) Gini: FairWave converges to 0.1220.122 (near-egalitarian). (c) Top-1 share: FairWave decreases to 2.6%2.6\%, converging toward 1/N=2%1/N=2\%.
Refer to caption
Figure 26: Stake growth factor per profile (log scale) after 50,00050,000 epochs. FairWave exhibits quality-get-richer dynamics: small-good validator grow 244×244\times versus only 16×16\times for whale-bad, reversing the plutocratic gradient. PoS2 shows the opposite: whale-bad grows 84×84\times while small validators stagnate.

VII-H Sybil Resistance

Three layers of evidence across 20,52020,520 cells. Figs. 27(a) shows selection gain versus split factor; Figs. 27(b) validates closed-form prediction; Fig. 28 presents economic cost-benefit analysis; and Fig. 29 evaluates combined Sybil+Byzantine threats.

TABLE VIII: Closed-form Sybil selection gain at x=0.10x=0.10. FairWave values calculated from the initial formulation Wsel=RS2W_{\text{sel}}=R\cdot S^{2} (for consistency with the simulation in Fig. 27); revision of Eq. 8 removing Rep from WselW_{\text{sel}} yields a strictly smaller gain (gsel(100)=0.70g_{\text{sel}}(100)=0.70 at x=1x=1, see the numerical example in Section V.B), so the claim “g(K)<1K>1g(K)<1\;\forall K>1” remains valid with a greater margin.
Rule K=2K=2 K=10K=10 K=100K=100 K=500K=500
Pure-PoS 1.000 1.000 1.000 1.000
SRSW 1.407 3.077 8.973 \sim22
LSW 1.810 6.547 25.319 \sim80
FairWave 0.979 0.950 0.934 \sim0.93
Refer to caption
(a) Selection gain g(K)g(K)
Refer to caption
(b) Closed-form Validation
Figure 27: Sybil Resistance. (a) Selection gain versus split factor KK (log-log). FairWave is the only rule where g(K)<1g(K)<1 for all K>1K>1. (b) Closed-form validation versus empirical across 20,16020,160 cells; matches within 1%1\%.
Refer to caption
Figure 28: Net economic benefit of Sybil splitting versus KK under an annual reward pool of $10M. FairWave optimal strategy is K=1K^{*}=1 (no splitting), with net profit decreasing monotonically versus KK. LSW achieves optimal K=500K^{*}=500 with net profit $7.2M—catastrophic vulnerability enabling rational exploitation.
Refer to caption
Figure 29: Impact of combined Sybil+Byzantine attack (5,0005,000 rounds): commit rate, safety violation rate, and Sybil anchor share across various split factors. FairWave maintains a safety violation <0.001<0.001 at K=500K=500 (Sybil anchor share 1.5%1.5\%). LSW exhibits a safety violation of 0.0350.035 (Sybil anchor share 72.8%72.8\%).

VII-I Parameter Space Characterization

Multi-metric sensitivity to stake weight α\alpha (Fig. 30) confirms that default parameterization operates in favorable region. Complete parameter space exploration—including objective landscape, farming resistance, and Pareto frontier—is presented in Appendix APPENDIX (Fig. 31(a)32).

Refer to caption
Figure 30: Multi-metric sensitivity to stake weight α\alpha: Min/Whale ratio, Gini, Nakamoto coefficient, and HHI as a function of α[0,1]\alpha\in[0,1] for three network sizes. All metrics degrade monotonically with increasing α\alpha, confirming default α=0.20\alpha=0.20 operates in a favorable region of parameter space.

VII-J Input Sensitivity Analysis

Three complementary analyses characterize parametric robustness: OAT elasticity, variance decomposition via Sobol indices, and combined perturbation robustness. All input elasticities for success_rate satisfy |e|<0.25|e|<0.25, indicating that a 10%10\% perturbation on a single input yields output variation <2.5%<2.5\%. Sobol analysis reveals throughput is dominated by link_latency (STi=1.0S_{Ti}=1.0), while security metrics are dominated by interactions. Under a combined ±25%\pm 25\% perturbation on all eight inputs, the CV success rate is 5.2% (highly robust). Complete sensitivity visualization presented in Appendix APPENDIX (Fig. 33(a)34(b)).

TABLE IX: Coefficient of variation under combined input perturbation ±25%\pm 25\%.
Metric CV Assessment
success_rate 5.2% highly robust
gini 10.2% robust
throughput_proxy 15.7% moderate
hhi 25.1% structural (1/NN)
nakamoto_third 26.1% discrete metric

VIII Discussion

VIII-A Why Dual-Channel Decoupling Mitigates the Trilemma

The elegant algebra of dual-channel decoupling emerges from applying opposite curvature in stake across distinct protocol functions: super-linear selection \Rightarrow Sybil resistance (g<1g<1); sub-linear reward \Rightarrow fairness, bounded HHI; shared dependence on RR \Rightarrow both channels reward quality. This decoupling avoids the old impossibility: previously, super-linear weights implied plutocracy, while sub-linear weights implied Sybil vulnerability.

Fig. 5 summarizes the dual-channel separation between selection and reward channels motivating the architectural claims in this section.

VIII-B Performance Trade-offs

FairWave incurs a VRF overhead of +2+2 ms per anchor round relative to Bullshark. Commit latency is 82.982.9 ms versus 40.540.5 ms for Mysticeti—2×2\times factor attributed to the 2-wave commit structure and VRF computation. Fig. 4 visualizes the DAG wave and anchor/commit structure that underpins the selection channel. Post-quantum migration overhead (ML-DSA-44: +5+5 ms; Hybrid Ed25519+ML-DSA: +7+7 ms) remains acceptable for >99%>99\% of use cases.

VIII-C Reputation Mechanism

Reputation factor Rep(i)\text{Rep}(i) evolves via three operations: Fig. 2 and 3 illustrate the lagged-reputation timing and the per-epoch seven-phase ceremony that realize the forward-causal reputation design described above. (i) increment of +0.01+0.01 per wave commit on active support, (ii) multiplicative decay of ×0.99\times 0.99 per epoch boundary (half-life 69\approx 69 epochs 1.9\approx 1.9 hours), and (iii) punitive slashing (0.30-0.30) on equivocation detection. Reputation farming resistance under adversarial conditions is characterized in (Figs. 31(b)).

VIII-D Caveat: Reward Channel Sybil Gain

Reward channel in isolation (R-only) exhibits Sybil gain >1>1 (Fig. 35(a)), due to floor (1α)(1-\alpha) in RR for each Sybil identity. However, realized reward is proportional to anchor selection frequency (which uses WselW_{\text{sel}}), so realized reward inherits selection channel resistance.

IX Limitations and Future Work

Modeling assumptions. Our analysis assumes a static stake within each epoch, an equivocation detection rate of 95%95\%, and a stake distributed as Pareto (αPareto=1.5\alpha_{\text{Pareto}}=1.5). Alternative distributions (log-normal, empirical Ethereum [27]) are characterized in supplementary material.

Future work. (i) Empirical deployment on geographically-distributed testnet (50\geq 50 validators across three continents). (ii) Adversarial sensitivity analysis where the adversary optimizes input combinations. (iii) Higher-order Sobol indices (SijS_{ij} interaction effects). (iv) Formal verification of dual-channel composition in a mechanized proof assistant.

X Conclusion

We have presented FairWave, an asynchronous DAG-BFT consensus protocol whose central architectural insight is dual-channel decoupling: super-linear-in-stake selection for Sybil resistance, paired with sub-linear-in-stake reward distribution for fairness. This decoupling mitigates the Sybil-fairness-plutocracy trilemma that no single weight function can resolve alone.

Formal results. (i) Sybil selection gain g(K)<1g(K)<1 for all K>1K>1 (Theorem 1); (ii) the Herfindahl-Hirschman Index converges to the uniform fixed point 1/N1/N under stake reinvestment (Theorem 2, Section V); (iii) selection weights are strictly forward-causal under lagged reputation, so anchor selection at epoch ee cannot bias selection weights at the same epoch (Proposition 1). Safety (agreement and validity) is a formal consequence of the 2f+12f{+}1 strong-support commit rule and holds unconditionally for f<n/3f<n/3; see Section VII-F.

Empirical results. Across nine analyses comprising approximately 550,000550{,}000 Monte Carlo rounds—with the reference implementation validated by 251251 deterministic unit tests and 9696 property-based subtests, and evaluated against eight baseline protocol models under a common experimental framework—FairWave establishes:

  1. 1.

    Gini coefficient =0.149=0.149, a 3.3×3.3\times reduction relative to Pure-PoS while preserving meritocratic differentiation absent in LSW;

  2. 2.

    HHI decreases monotonically from 0.0390.039 to 0.0210.021 over 50,00050{,}000 epochs under full stake reinvestment;

  3. 3.

    optimal Sybil split K=1K^{*}=1: splitting is never economically rational, with the closed-form gain bound validated bit-exact (ϵ=0.0\epsilon=0.0) against 20,16020{,}160 numerical cells;

  4. 4.

    liveness degradation is monotone-continuous from 99.6%99.6\% commit rate at b=0.20b=0.20 to 71.1%71.1\% at the theoretical bound b=1/3b=1/3, without the discontinuous cliff characteristic of view-change-driven leader-BFT;

  5. 5.

    coefficient of variation (CV) of success rate =5.2%=5.2\% under combined ±25%\pm 25\% perturbation, demonstrating parametric robustness.

The dual-channel principle is a general design pattern for stake-based protocols that must decouple operational security from economic equity. Future work—outlined in Section IX—targets a production-grade reference implementation and a geographically-distributed testnet deployment to confirm the model-predicted latency, throughput, and concentration trajectories under realistic network conditions, alongside formal verification of the dual-channel composition in a mechanized proof assistant.

References

  • [1] S. Motepalli and M. Jacobson, “Proof-of-stake centralization: A longitudinal study,” IEEE Transactions on Blockchain and Cryptocurrency, 2024.
  • [2] J. Kwon, “Blockchain centralization trends,” Medium, 2019.
  • [3] V. Srivastava, S. Damle, and S. Gujar, “Centralization in proof-of-stake blockchains: A game-theoretic analysis of bootstrapping protocols,” arXiv preprint arXiv:2404.09627, 2024.
  • [4] B. S. Srinivasan, “Quantifying decentralization,” Medium, 2017.
  • [5] B. Kuśmierz and R. Overko, “How centralized is decentralized? comparison of wealth distribution in coins and tokens,” arXiv preprint arXiv:2207.01340, 2022.
  • [6] D. Grandjean, L. Heimbach, and R. Wattenhofer, “Ethereum proof-of-stake consensus layer: Participation and decentralization,” in arXiv preprint arXiv:2306.10777, 2023.
  • [7] G. Danezis, L. Kokoris-Kogias, A. Sonnino, and A. Spiegelman, “Narwhal and tusk: A dag-based mempool and efficient bft consensus,” in Proceedings of the 17th European Conference on Computer Systems (EuroSys), 2022, pp. 34–50.
  • [8] A. Spiegelman, N. Giridharan, A. Sonnino, and L. Kokoris-Kogias, “Bullshark: Dag bft protocols made practical,” in Proceedings of the 2022 ACM SIGSAC Conference on Computer and Communications Security (CCS), 2022.
  • [9] K. Babel, A. Chursin, G. Danezis, A. Kichidis, L. Kokoris-Kogias, A. Koshy, A. Sonnino, and M. Tian, “Mysticeti: Reaching the limits of latency with uncertified dags,” in Network and Distributed System Security Symposium (NDSS), 2024.
  • [10] M. Yin, D. Malkhi, M. K. Reiter, G. Golan Gueta, and I. Abraham, “Hotstuff: Bft consensus in the lens of blockchain,” in Proceedings of the 2019 ACM Symposium on Principles of Distributed Computing (PODC), 2019.
  • [11] M. Castro and B. Liskov, “Practical byzantine fault tolerance,” in Proceedings of the Third USENIX Symposium on Operating Systems Design and Implementation (OSDI), 1999, pp. 173–186.
  • [12] E. Buchman, “Tendermint: Byzantine fault tolerance in the age of blockchains,” Whitepaper, 2014.
  • [13] Y. Gilad, R. Hemo, S. Micali, G. Vlachos, and N. Zeldovich, “Algorand: Scaling byzantine agreements for cryptocurrencies,” in Proceedings of the 26th Symposium on Operating Systems Principles, 2017, pp. 51–68.
  • [14] D. Malkhi and K. Nayak, “Hotstuff-2: Optimal two-phase responsive bft,” Cryptology ePrint Archive, Report 2023/397, 2023.
  • [15] I. Keidar, L. Kokoris-Kogias, O. Naor, and A. Spiegelman, “All you need is dag,” in Proceedings of the 2022 ACM Symposium on Principles of Distributed Computing (PODC), 2022, pp. 165–175.
  • [16] A. Spiegelman, B. Arun, R. Gelashvili, and Z. Li, “Shoal: Improving dag-bft latency and robustness,” in arXiv preprint arXiv:2306.03058, 2023.
  • [17] B. Arun, Z. Li, F. Suri-Payer, S. Das, and A. Spiegelman, “Shoal++: High throughput dag bft can be fast!” arXiv preprint arXiv:2405.20488, 2024.
  • [18] J. Kwon, “Cosmos governance framework,” in Cosmos Network Documentation, 2019.
  • [19] Z. Naderi, S. P. Shariatpanahi, and B. Bahrak, “Approach to alleviate wealth compounding in proof-of-stake cryptocurrencies,” in arXiv preprint arXiv:2207.11714, 2022.
  • [20] G. Birmpas, P. Lazos, E. Markakis, and P. Penna, “Reward schemes and committee sizes in proof of stake governance,” arXiv preprint arXiv:2406.10525, 2024.
  • [21] W. Tang, “Stability of shares in the proof of stake protocol – concentration and phase transitions,” arXiv preprint arXiv:2206.02227, 2022.
  • [22] A. Saltelli, M. Ratto, T. Andres, F. Campolongo, J. Cariboni, D. Gatelli, M. Saisana, and S. Tarantola, Global Sensitivity Analysis: The Primer. Wiley, 2010.
  • [23] C. Dwork, N. Lynch, and L. Stockmeyer, “Consensus in the presence of partial synchrony,” in Journal of the ACM, vol. 35, no. 2, 1988, pp. 288–323.
  • [24] A. R. Wahid, “Hi-xdr: Hybrid intelligent framework for adversarial-resilient anomaly detection and adaptive cyber response,” Journal of Cyber Security, vol. 7, no. 1, pp. 589–614, 2025.
  • [25] G. Wood, “Polkadot: Vision for a heterogeneous multi-chain framework,” Whitepaper, 2020.
  • [26] L. Baird, M. Harmon, and P. Madsen, “Hedera: A public hashgraph network for fair ordering of transactions,” in Technical Report, 2016.
  • [27] B. Öz, D. Sui, T. Thiery, and F. Matthes, “Who wins ethereum block building auctions and why?” Advances in Financial Technologies (AFT 2024), arXiv:2407.13931, 2024.

APPENDIX

Refer to caption
(a) Objective landscape
Refer to caption
(b) Farming resistance
Figure 31: Parameter space characterization: (a) objective function landscape on 4-D weight simplex with feasible region highlighted; (b) reputation farming resistance versus γ\gamma.
Refer to caption
Figure 32: Pareto frontier: Min/Whale ratio versus farming resistance across 1,407 simplex evaluation points. Default parameterization is feasible but intentionally not Pareto-optimal for robustness.
Refer to caption
(a) OAT elasticity (tornado)
Refer to caption
(b) Sobol indices
Figure 33: Sensitivity analysis: (a) One-At-a-Time elasticities—all success_rate elasticities |e|<0.25|e|<0.25; (b) first-order and total Sobol indices showing throughput dominated by link_latency.
Refer to caption
(a) Sensitivity surface
Refer to caption
(b) Robustness map
Figure 34: (a) Success_rate surface as a function of (byz_rate, and offline_rate)—BFT cliff at 1/31/3 visible; (b) robustness map under combined perturbation of ±25%\pm 25\% (300 samples, CV =5.2%=5.2\%).
Refer to caption
(a) Sybil gain R-channel
Refer to caption
(b) Anchor convergence
Figure 35: (a) Reward-channel Sybil gain (isolated, gR>1g_{R}>1; mitigated by selection channel in production); (b) anchor share convergence at O(1/T)O(1/\sqrt{T}).
Refer to caption
Figure 36: Per-profile anchor share allocation. FairWave selection reinforces whale-good (35%) while penalizing whale-bad (22%).