License: CC BY 4.0
arXiv:2606.11632v1 [cs.CR] 10 Jun 2026

Sovereign Assurance Boundary: Certificate-Bound Admission for Agentic Infrastructure

Jun He
OpenKedge.io
   Deying Yu
OpenKedge.io
Abstract

Agentic infrastructure creates a control-plane authorization problem: non-deterministic reasoning systems may propose high-stakes mutations to production resources, but existing access controls (IAM), policy engines, consensus protocols, and audit logs either enforce static permissions or record actions only after execution. This paper introduces the Sovereign Assurance Boundary (SAB), a certificate-bound runtime admission boundary for autonomous execution authority in agentic infrastructure. SAB intercepts agent proposals at an assurance airlock, compiles them into typed execution contracts CC, binds those contracts to cryptographic evidence digests H(E)H(E) and policy versions, and routes them through consequence-aware certification paths. Successful admission emits a signed Sovereign Assurance Certificate (Ω\Omega) that is valid only for a scoped execution identity, revocation epoch, and validity window. A sovereign execution broker verifies the certificate Ω\Omega and performs fresh pre-execution revocation and drift checks before invoking infrastructure APIs. We describe the airlock–broker architecture, formalize the admission and revocation invariants, and report preliminary feasibility measurements from a Go prototype over 2,500 admission attempts. The resulting model prevents autonomous reasoning alone from mutating state and turns delegated execution authority into a certificate-bound, evidence-bound, revocable, and replayable runtime artifact.

1 Introduction

Large language model (LLM) agents are increasingly connected to operational workflows rather than advisory interfaces, allowing them to scale resources, rotate credentials, deploy code, and export diagnostic data [yao-react, schick-toolformer]. In this setting, agentic infrastructure needs an admission mechanism that decides when a proposed action may become execution authority. Granting non-deterministic reasoning processes direct mutation rights over production control planes creates a safety and authorization risk: agents can become the proximate cause of physical or digital state transitions. Without a runtime admission boundary, individually plausible actions can accumulate into availability failures, data exposure, policy bypasses, or loss of forensic context.

Sovereign Assurance Boundary (SAB) interposes between agentic reasoning and high-stakes infrastructure mutation. We use “sovereign” operationally: the institution retains local control over policy evaluation, evidence retention, signing keys, revocation state, broker identity, and execution authority. It is not an agent monitor or a general governance process; it defines a certificate-bound admission boundary where an agent proposal is not treated as ambient authority. Under SAB, a proposal is eligible for execution only after it is compiled into a typed execution contract CC, bound to an evidence digest H(E)H(E) and policy version, assigned a consequence score (C)\mathcal{R}(C) and autonomy level (C)\mathcal{L}(C), routed through a consequence-aware certification path CertPath(C)\operatorname{CertPath}(C), encoded as a signed certificate Ω\Omega, checked against active revocation state, and verified by a sovereign execution broker immediately before mutating the target system. The mechanism converts autonomous execution authority from an ambient permission into a revocable, evidence-bound runtime artifact.

SABseparates action-level and system-level assurance. Semantic Quorum Assurance (SQA[sqa-paper] serves as the action-level certification primitive. While SQA validates individual proposed actions, SAB composes SQA with evidence, policy, human approval, sovereign execution identity, revocation, and replay. SAB is independent of the SQA implementation: any validator mesh returning signed action-level attestations can instantiate ΓSQA\Gamma_{\mathrm{SQA}}. Action-level checks address unsafe individual mutations; system-level assurance binds execution to active policy, evidence, revocation state, and broker-enforcement paths.

Running Example.

Consider an operations agent, 𝒜ops\mathcal{A}_{\text{ops}}, tasked with resolving a service degradation. To restore reachability and diagnose the issue, 𝒜ops\mathcal{A}_{\text{ops}} proposes reconfiguring network security groups, scaling up compute nodes, rotating database credentials, rolling back a deployment, and exporting diagnostic logs. If an infrastructure stack lets that proposal directly invoke raw IAM permissions, autonomous execution of the sequence introduces systemic risk: the network change may expose private services, the resource expansion may exceed budget or quota constraints, credential rotation may trigger auth failures, the rollback may reintroduce vulnerabilities, and the log export may leak personally identifiable information (PII). The example shows why action-level checks alone do not bound institutional risk.

What SAB Is Not.

SABdiffers from existing controls in several ways:

  • Not an IAM replacement: IAM authorizes identities and requests; SAB certifies whether an autonomous proposal is semantically justified by evidence before a broker invokes those credentials.

  • Not a formal verification replacement: SAB does not prove arbitrary safety, but composes formal proofs (when available) as inputs to its evidence digests.

  • Not a consensus protocol: Consensus (e.g., Raft) agrees on state transitions; SAB evaluates whether a transition is institutionally authorized under current context.

  • Not only logging: Logs record events post-execution; SAB binds pre-execution evidence and policy versions in the certificate Ω\Omega, then links outcomes through a ledger-recorded outcome record OO, making the admission process replayable.

  • Not only human-in-the-loop approval: SAB is a system-level runtime admission boundary that cryptographically binds human signatures to specific contracts and evidence snapshots, preventing generic rubber-stamping.

We make four contributions:

  1. 1.

    Certificate-bound admission for autonomous infrastructure. We identify a control-plane authorization gap that arises when non-deterministic agents are allowed to propose production mutations, and define the Sovereign Assurance Boundary (SAB) as a runtime admission boundary that converts agent proposals into certificate-bound execution authority.

  2. 2.

    A concrete airlock–broker architecture. We introduce the assurance airlock, typed execution contract CC, evidence digest H(E)H(E), the certificate Ω\Omega, and sovereign execution broker. Together, these mechanisms separate proposal, admission, and execution so that agent reasoning alone cannot directly mutate infrastructure state under the broker-enforced execution model.

  3. 3.

    Consequence-aware certification and revocation. We formalize consequence scoring (C)\mathcal{R}(C), autonomy levels (C)\mathcal{L}(C), certification paths CertPath(C)\operatorname{CertPath}(C), monotone path selection, broker-side verification, and revocation epochs ρrev\rho_{\mathrm{rev}}. These mechanisms support escalation, rejection, suspension, and revocation before execution.

  4. 4.

    Prototype and feasibility evaluation. We implement an initial Go prototype with contract serialization, OPA/Rego policy checks, Ed25519 certificate signing, a PostgreSQL-backed append-only ledger, revocation-epoch distribution, broker verification, and replay tooling. Using 500 labeled execution contracts over 2,500 admission attempts, we report preliminary local feasibility measurements for admission latency, broker verification, revocation propagation, routing accuracy, replay completeness, and certificate overhead.

2 Background, Motivation, and System Requirements

2.1 Background and Gap

Securing agentic infrastructure requires integrating model-generated proposals with existing safety and security mechanisms.

Runtime Assurance.

Traditional runtime assurance separates an advanced controller from a monitor or fallback controller to prevent safety envelope violations [simplex-architecture, bloem-shield-synthesis]. In cloud and database administration, mutations are discrete, policy-laden configuration changes. SAB adapts the runtime-assurance pattern by evaluating the institutional admissibility of proposals rather than continuous physical invariants.

Assurance Cases.

Assurance cases present structured safety, security, or compliance arguments by linking claims to evidence [rushby-assurance]. They are usually static, design-time artifacts. SAB moves part of that structure into runtime admission by cryptographically binding telemetry, policies, and validator signatures to SAB certificates.

Access Control (IAM) and Policy Engines.

Zero trust and identity management systems (IAM) verify caller identities and enforce structural policy compliance (e.g., via OPA Rego) [rose-zerotrust, aws-iam, opa-rego]. While IAM dictates whether an identity can call an API, it cannot determine whether an autonomous proposal is semantically justified by live evidence. SAB treats access control as a prerequisite, interposing a semantic certification boundary before credentials are used.

Consensus and Byzantine Fault Tolerance.

Consensus protocols (e.g., Paxos, Raft, PBFT) replicate logs and enforce order across replicas [lamport-paxos, ongaro-raft]. They ensure agreement on state transitions but do not evaluate the semantic correctness or safety of those transitions. Semantic Quorum Assurance (SQA) certifies the semantic admissibility of individual proposed actions [sqa-paper]; SAB composes SQA into a system-level certification boundary.

AI Safety and Tool Governance.

Agent safety tools (sandboxing, output filtering, model self-critiques) constrain agent execution environments [yao-react, owasp-llm-top10]. These mechanisms lack cryptographically signed, institutionally bound execution credentials or durable audit trails. SAB wraps these tools as untrusted validators and binds their collective output into SAB certificates.

2.2 Motivation: Why Action Certification Is Not Enough

Semantic Quorum Assurance (SQA) verifies locally whether an action is semantically admissible (e.g., whether a security-group change fits an active outage ticket). High-stakes infrastructure also needs a system-level check: whether the agent operates under active policies, fresh evidence, and current revocation rules.

A sequence of individually approved steps (e.g., opening a firewall, expanding an IAM role, rolling back a service, and exporting logs) can collectively create high-consequence availability, compliance, or security failures. Table 1 summarizes how SAB composes these security controls. SAB mediates the certificate-bound transition from proposal to execution, checking that the certificate Ω\Omega contains the evidence and validation epoch required by policy.

Table 1: Admission controls for agentic infrastructure. SAB certifies the institutional admission context surrounding individual actions.
Control Question Artifact Failure if Missing
Policy engine Is the caller authorized for this operation? Authorization token Static permissions are treated as ambient authority.
SQA Is the proposed action semantically admissible? Action certificate Locally unsafe mutations are executed.
Human approval Does an operator approve this step? Approval record Approvals do not bind evidence or policy epochs.
Audit logging What happened after execution? Event log Accountability is reconstructed post-hoc.
SAB May this proposal become execution authority under institutional rules? SAB certificate Ω\Omega Individually safe actions cross into execution authority without system context.

2.3 System Requirements

To govern high-stakes agentic infrastructure, SAB must satisfy seven system requirements:

R1: Proposal/Execution Separation.

Agent runtimes must have no standing credentials or direct execution privileges, treating them purely as action proposers.

R2: Contractual Admission.

Raw agent commands must compile into a strongly-typed execution contract CC defining target resources, preconditions, postconditions, and rollbacks.

R3: Evidence-Bound Decisions.

All admissions must bind to a cryptographic digest H(E)H(E) of the telemetry and system state active at decision time.

R4: Consequence-Aware Routing.

Proposals must route to validation paths dynamically based on a consequence score (C)\mathcal{R}(C) (blast radius, irreversibility, data risk, and uncertainty).

R5: Sovereign Execution.

Admitted contracts must execute exclusively through an institutionally controlled broker that scopes credentials and validates the certificate Ω\Omega.

R6: Revocability.

The system must support scoped withdrawal, enabling key rotation, validator quarantine, and domain suspension.

R7: Replayability.

SAB must record the certificate Ω\Omega and, after execution, outcome record OO in an append-only log, with the ledger binding OO to Ω\Omega.

3 SAB Architecture and Trust Model

The Sovereign Assurance Boundary (SAB) intercepts agent proposals before they can mutate production infrastructure. The system model separates unprivileged agent reasoning from privileged sovereign execution.

3.1 System Model and Assumptions

We model the agent runtime as an untrusted proposer. It may be compromised, misaligned, or faulty, and has no standing administrative credentials. Downstream infrastructure APIs accept state mutations exclusively from the sovereign execution broker.

The trusted computing base (TCB) consists of the assurance airlock, policy engine, evidence ledger, revocation controller, private signing keys, and sovereign execution broker. The contract compiler, evidence collector, consequence scorer, SQA validator mesh, and approval console run outside the TCB but are security-relevant; Table 2 maps each component to its trust boundary and core systems function. The evidence collector is untrusted: while SAB detects unsigned or malformed evidence, signed false evidence from compromised authorized sources remains a residual risk. Similarly, the SQA mesh is an external, untrusted oracle: the airlock policy engine verifies validators’ signed attestations ΓSQA\Gamma_{\mathrm{SQA}} instead of trusting validator internal execution.

SAB assumes authenticated component identities, a versioned policy store, bounded clock skew, append-only ledger integrity, and timely propagation of revocation state. Let pkSABpk_{\mathrm{SAB}} denote the airlock’s public verification key and σSAB\sigma_{\mathrm{SAB}} its signature on the certificate Ω\Omega.

LEGEND:SAB Trusted TCB (Contribution)SAB Security-Relevant (Contribution)External / Baseline SystemsAgentruntimeContractcompilerEvidencecollectorConsequencescorerAssuranceairlockSovereignbrokerInfrastructurePolicyengineSQA validatormeshApprovalconsoleReplay/auditserviceEvidenceledgerRevocationcontrollerproposalCCH(E)H(E),\mathcal{R},\mathcal{L}Ω\Omegascopedmutation
Figure 1: Reference architecture for the Sovereign Assurance Boundary.
Table 2: SAB Component Reference.
Component Trust Boundary Core Systems Function
Agent Runtime Untrusted / Outer Proposes mutations as unprivileged JSON requests.
Contract Compiler Security-relevant / Airlock Normalizes natural-language or tool calls into structured contracts CC.
Evidence Collector Security-relevant / Airlock Assembles telemetry, dependency logs, and tickets to compute evidence digest H(E)H(E).
Consequence Scorer Security-relevant / Airlock Estimates blast radius and data risk; determines autonomy levels (C)\mathcal{L}(C).
Policy Engine Trusted Control Plane Enforces static rules and routes remaining ones to certification paths.
Assurance Airlock Trusted Control Plane Enforces path predicates, collects approvals, and issues signed SAB certificates Ω\Omega.
SQA Validator Mesh Fallible / Outer Mesh Evaluates action semantics as decoupled, untrusted signing oracles.
Approval Console Security-relevant / Outer Presents structured contracts and evidence summaries for operator multi-signature.
Sovereign Broker Trusted Control Plane Verifies signatures and revocation status of Ω\Omega; executes scoped actions through IAM.
Evidence Ledger Trusted Control Plane Durably records SAB certificates and bound outcome records OO in an append-only cryptographic log.
Revocation Controller Trusted Control Plane Monitors system state; advances revocation epochs ρrev\rho_{\mathrm{rev}} and manages suspensions.
Replay Service Security-relevant / Outer Re-evaluates ledger records to reconstruct and audit historical admission decisions.

3.2 SAB Invariant Properties

Under the trust model in Section 3.1, SAB enforces five authority-boundary invariants:

P1: Proposal-Execution Decoupling Invariant.

Let Mutate(r,op)\mathrm{Mutate}(r,op) denote a state-changing operation on resource rr. Under the assumption that the infrastructure API accepts mutations only from the sovereign broker, no agent proposal can directly trigger Mutate(r,op)\mathrm{Mutate}(r,op) without first being admitted by the airlock:

Execute(Ω)Admit(C)\mathrm{Execute}(\Omega)\implies\mathrm{Admit}(C)

where CC is the compiled execution contract corresponding to the proposal.

P2: Certificate-Bound Execution Invariant.

The sovereign execution broker executes a contract CC only after verifying the certificate Ω\Omega and its airlock signature:

Execute(Ω)Verify(pkSAB,Ω,σSAB)CΩ\mathrm{Execute}(\Omega)\implies\operatorname{Verify}(pk_{\mathrm{SAB}},\Omega,\sigma_{\mathrm{SAB}})\land C\in\Omega

where the certificate is bound to the specific contract CC, policy version PverP_{\mathrm{ver}}, execution identity IDexecID_{\mathrm{exec}}, and validity window TvalidT_{\mathrm{valid}}.

P3: Evidence-Binding Invariant.

Approvals and validator attestations (ΓSQA,Γhuman\Gamma_{\mathrm{SQA}},\Gamma_{\mathrm{human}}) are bound cryptographically to CC and the evidence digest H(E)H(E):

Approve(C,H(E))Sign(Γ,CH(E))\operatorname{Approve}(C,H(E))\implies\mathrm{Sign}(\Gamma,C\mathbin{\|}H(E))

This binding prevents approval reuse across different contracts or stale environmental contexts.

P4: Monotone Path Invariant.

A contract CC is admitted only if all validation steps required by its selected certification path CertPath(C)\operatorname{CertPath}(C) are satisfied:

Admit(C)VerifyPath(C,CertPath(C))\operatorname{Admit}(C)\implies\operatorname{VerifyPath}(C,\operatorname{CertPath}(C))

where CertPath(C)\operatorname{CertPath}(C) depends on the consequence score (C)\mathcal{R}(C), autonomy level (C)\mathcal{L}(C), and policy version PverP_{\mathrm{ver}}. Policy may escalate a contract to stronger assurance or rejection, but lower-assurance paths cannot override requirements imposed by action class, consequence score, jurisdiction, or incident mode.

P5: Ephemeral Authority Invariant.

The sovereign execution broker rejects any certificate Ω\Omega if the current revocation epoch ρactive\rho_{\mathrm{active}} is greater than the certificate’s revocation epoch ρrev\rho_{\mathrm{rev}}, or if the execution time texect_{\mathrm{exec}} falls outside the validity window TvalidT_{\mathrm{valid}}:

Execute(Ω)ρrev=ρactivetexecTvalid\mathrm{Execute}(\Omega)\implies\rho_{\mathrm{rev}}=\rho_{\mathrm{active}}\land t_{\mathrm{exec}}\in T_{\mathrm{valid}}

The broker therefore rejects certificates from stale epochs.

3.3 Proof Sketch

Goal.

Establish that an untrusted agent runtime cannot directly execute a state-changing operation except through a broker-verified certificate satisfying the active policy path.

Assumptions.

1. Infrastructure APIs accept mutations exclusively from the sovereign execution broker. 2. The broker verifies certificate signatures using the airlock public verification key pkSABpk_{\mathrm{SAB}} and rejects invalid signatures. 3. Airlock private key remains uncompromised. 4. Active policy PverP_{\mathrm{ver}} dictates path requirements.

Argument.

To execute a mutation, the broker must verify the certificate Ω\Omega such that CΩC\in\Omega and Verify(pkSAB,Ω,σSAB)\operatorname{Verify}(pk_{\mathrm{SAB}},\Omega,\sigma_{\mathrm{SAB}}) (by P2). Because the airlock private key is secure, only the airlock could have produced σSAB\sigma_{\mathrm{SAB}}. The airlock emits this signature only if Admit(C)\operatorname{Admit}(C) evaluates to true. By P4, Admit(C)\operatorname{Admit}(C) evaluates to true only if all path requirements VerifyPath(C,CertPath(C))\operatorname{VerifyPath}(C,\operatorname{CertPath}(C)) are satisfied under the policy PverP_{\mathrm{ver}}. Under the assumptions above, an agent proposal cannot execute unless it compiles to a contract CC satisfying the path checks mandated by institutional policy. Broker-enforced execution leaves agent reasoning with no direct mutation path (P1).

3.4 Broker Verification Checks

Before invoking infrastructure APIs, the sovereign broker evaluates the pre-execution checks in Table 3. The checks cover certificate validity, contract identity, revocation state, policy epoch, state drift, and scoped credential issuance.

Table 3: Broker verification checks.
Broker Check Input Failure Behavior
Signature Verification Certificate Ω\Omega, public key pkSABpk_{\mathrm{SAB}}, signature σSAB\sigma_{\mathrm{SAB}} Reject execution, abort
Contract Match Contract CC, certificate contents CΩC\in\Omega Reject execution, abort
Validity Window Current time texect_{\mathrm{exec}}, validity range TvalidT_{\mathrm{valid}} Reject execution, abort
Revocation Epoch Certificate epoch ρrev\rho_{\mathrm{rev}}, active epoch ρactive\rho_{\mathrm{active}} Reject, fail-closed for L3/L4L_{3}/L_{4}
Policy Epoch Certificate policy version PverP_{\mathrm{ver}}, active PactiveP_{\mathrm{active}} Reject execution, abort
State Drift Check Bound evidence EE, live state StexecS_{t_{\mathrm{exec}}}, threshold Reject, fail-closed for L3/L4L_{3}/L_{4}
Credential Issuance Scoped identity IDexecID_{\mathrm{exec}}, target resource API Reject execution, abort

3.5 Worked Admission Example

Consider an operations agent 𝒜ops\mathcal{A}_{\text{ops}} proposing an ingress network rule CfwC_{\mathrm{fw}} to resolve an outage. Table 4 traces this proposal through the reference architecture.

Table 4: Worked example stages for security-group admission.
Stage Component Output / Artifact Example Value Failure Check
1. Compile Contract Compiler Execution Contract CfwC_{\mathrm{fw}} Add ingress rule; target: sg-01; port: 443; rollback: delete rule Malformed syntax, unsupported API
2. Bind Evidence Evidence Collector Evidence Chain Digest H(Efw)H(E_{\mathrm{fw}}) Telemetry (high latency); target config; incident ticket ID: INC-101 Stale telemetry, missing incident ticket
3. Score Consequence Consequence Scorer Consequence Score (Cfw)=8.5\mathcal{R}(C_{\mathrm{fw}})=8.5; autonomy level (Cfw)=L4\mathcal{L}(C_{\mathrm{fw}})=L_{4} Score calculation failure
4. Route Path Policy Engine Certification Path CertPath(Cfw)=SQAPlusHuman\operatorname{CertPath}(C_{\mathrm{fw}})=\mathrm{SQAPlusHuman} Monotone path mismatch
5. Airlock Check Assurance Airlock SAB certificate Ωfw\Omega_{\mathrm{fw}} Signed bundle including CfwC_{\mathrm{fw}}, H(Efw)H(E_{\mathrm{fw}}), ΓSQA\Gamma_{\mathrm{SQA}}, Γhuman\Gamma_{\mathrm{human}} Validator veto, missing human signature
6. Exec Check Sovereign Broker Broker execution decision Verify σSAB\sigma_{\mathrm{SAB}} with pkSABpk_{\mathrm{SAB}}, check Φdrift(Efw,Stexec)\Phi_{\mathrm{drift}}(E_{\mathrm{fw}},S_{t_{\mathrm{exec}}}), verify ρrev\rho_{\mathrm{rev}} Drift limit exceeded, stale epoch
7. Record Log Evidence Ledger Ledger transaction Record OfwO_{\mathrm{fw}} bound to Ωfw\Omega_{\mathrm{fw}} Ledger write failure

The example has two failure cases:

  • Airlock policy rejection: If 𝒜ops\mathcal{A}_{\text{ops}} attempts to open port 22 (SSH) to the public internet (0.0.0.0/0), the consequence scorer maps this high-risk action to L4L_{4}. The airlock’s security-archetype validator detects a policy violation and triggers a critical veto, returning Reject\mathrm{Reject}.

  • Broker-side verification failure: If CfwC_{\mathrm{fw}} is admitted at time t1t_{1}, but before execution at t2t_{2} the active revocation epoch advances from ρ1\rho_{1} to ρ2\rho_{2}, the broker detects that ρrev=ρ1<ρ2\rho_{\mathrm{rev}}=\rho_{1}<\rho_{2} and aborts the execution. Similarly, if the broker’s drift-check predicate Φdrift(Efw,St2)\Phi_{\mathrm{drift}}(E_{\mathrm{fw}},S_{t_{2}}) evaluates to false (indicating that the target security group’s state has drifted since admission), the execution is aborted.

4 Certificate-Bound Admission Pipeline

The admission pipeline evaluates proposals, scores consequence, routes them to certification paths, and issues signed certificates that the broker can verify before execution.

4.1 Formal Admission Interface

Let CC be an execution contract, EE be the evidence bound to that contract, StS_{t} be the observed infrastructure state at time tt, (C)\mathcal{R}(C) be the consequence score, and (C)\mathcal{L}(C) be the assigned autonomy level; the certificate Ω\Omega is the resulting admission artifact. The airlock implements the admission function:

Airlock(C,E,St)ΩorReject.\operatorname{Airlock}(C,E,S_{t})\rightarrow\Omega\;\;\text{or}\;\;\mathrm{Reject}.

The function returns Ω\Omega only when the proposal satisfies the required certification path. Otherwise, it returns Reject\mathrm{Reject}. To prevent time-of-check to time-of-use (TOCTOU) risks, the broker evaluates a state drift-check predicate Φdrift(E,Stexec)\Phi_{\mathrm{drift}}(E,S_{t_{\mathrm{exec}}}) immediately before executing the mutation, verifying that the live state at execution time StexecS_{t_{\mathrm{exec}}} has not deviated from the bound evidence EE beyond policy safety margins.

4.2 Consequence Scoring and Autonomy Levels

The consequence score (C)\mathcal{R}(C) quantifies the operational, compliance, and security risk of the contract:

(C)=αB(C)+βP(C)+γD(C)+δI(C)+ηU(C).\mathcal{R}(C)=\alpha B(C)+\beta P(C)+\gamma D(C)+\delta I(C)+\eta U(C). (1)

The terms measure specific risk vectors:

  • Blast radius B(C)B(C): Affected resources, services, and network zones.

  • Privilege expansion P(C)P(C): Elevated credentials or access scopes required.

  • Data sensitivity D(C)D(C): Exposure risk of regulated data or PII.

  • Irreversibility I(C)I(C): Computational cost and difficulty of rolling back.

  • Uncertainty U(C)U(C): Lack of telemetry or confidence in evidence.

To tune the weights (α,β,γ,δ,η\alpha,\beta,\gamma,\delta,\eta), security engineers run an offline adjudication loop replaying historical incidents to tune classifications against expert consensus, biasing weights to escalate rather than under-classify when uncertainty U(C)U(C) is high.

From the consequence score, active policy 𝒫inst\mathcal{P}_{\mathrm{inst}}, and jurisdictional constraints 𝒥\mathcal{J}, the airlock maps the contract to an autonomy level:

(C)=Λ((C),𝒫inst,𝒥).\mathcal{L}(C)=\Lambda(\mathcal{R}(C),\mathcal{P}_{\mathrm{inst}},\mathcal{J}).

Table 5 defines the consequence-aware autonomy levels.

Table 5: Consequence-aware autonomy levels.
Level Class Example Action Assurance and Validation Obligation
L0L_{0} Advisory Diagnostic query Read-only; agent cannot compile or execute.
L1L_{1} Manual Draft Batch restarts Agent drafts contract; manual operator submission required.
L2L_{2} Bounded Autonomous Stateless restart Reversible, local action; OPA policy checks.
L3L_{3} Governed Autonomous Autoscaling or rollback Moderate blast radius; requires SQA and evidence binding.
L4L_{4} High-Stakes Autonomous Security group, log export High blast radius; SQA, human multi-sig, revocation checks.
L5L_{5} Prohibited Deleting audit logs Categorically denied autonomous execution.

4.3 Certification Paths

After assigning (C)\mathcal{L}(C), the airlock routes the contract through a certification path:

CertPath(C){Reject,PolicyOnly,LightweightQuorum,SQA,SQAPlusHuman,Prohibited}.\operatorname{CertPath}(C)\rightarrow\left\{\begin{array}[]{@{}l@{}}\mathrm{Reject},\ \mathrm{PolicyOnly},\\ \mathrm{LightweightQuorum},\ \mathrm{SQA},\\ \mathrm{SQAPlusHuman},\ \mathrm{Prohibited}\end{array}\right\}. (2)

The selected path determines the validation requirements:

  • PolicyOnly: Requires a valid contract, evidence digest, and a successful policy engine (e.g. OPA) check.

  • LightweightQuorum: Requires multiple automated validator checks (syntax checks, telemetry checks, and dependency checks).

  • SQA: Requires Semantic Quorum Assurance and validator diversity constraints.

  • SQAPlusHuman: Requires SQA, critical-archetype veto checks, and operator multi-signature.

  • Prohibited / Reject: Categorically disallowed or failed admission attempts.

Decoupled SQA Oracle.

To keep the SQA validator mesh outside SAB’s trusted computing base, validator nodes run as external, isolated processes. They receive a read-only input bundle containing CC and EE, and hold no credentials. Each validator ii independently evaluates the proposal and generates a cryptographically signed attestation:

Γi=Sign(keyi,CH(E)voteimetadatai).\Gamma_{i}=\mathrm{Sign}(\mathrm{key}_{i},C\mathbin{\|}H(E)\mathbin{\|}\mathrm{vote}_{i}\mathbin{\|}\mathrm{metadata}_{i}).

The airlock verifies validator signatures and checks diversity, quorum size, and veto requirements over the aggregated attestations ΓSQA={Γ1,Γ2,}\Gamma_{\mathrm{SQA}}=\{\Gamma_{1},\Gamma_{2},\dots\}. Even under compromise, individual validators cannot forge approvals without private keys, nor can they directly mutate infrastructure.

4.4 Admission Predicate

To admit a contract, the airlock evaluates the admission predicate:

Admit(C)=\displaystyle\operatorname{Admit}(C)={} Φpolicy(C)Φevidence(C,E)\displaystyle\Phi_{\mathrm{policy}}(C)\land\Phi_{\mathrm{evidence}}(C,E) (3)
Φassurance(C)Φauthority(C).\displaystyle\land\Phi_{\mathrm{assurance}}(C)\land\Phi_{\mathrm{authority}}(C).
  • Φpolicy(C)\Phi_{\mathrm{policy}}(C): Policy permits the action class and target resource.

  • Φevidence(C,E)\Phi_{\mathrm{evidence}}(C,E): Required evidence exists, is fresh, and has trusted provenance.

  • Φassurance(C)\Phi_{\mathrm{assurance}}(C): The selected assurance path completed successfully.

  • Φauthority(C)\Phi_{\mathrm{authority}}(C): The broker identity, validity window, and revocation state are valid.

4.5 SAB Certificates

When the admission predicate is satisfied, the airlock issues the certificate Ω\Omega:

Ω=\displaystyle\Omega=\langle C,H(E),(C),(C),\displaystyle C,\ H(E),\ \mathcal{R}(C),\ \mathcal{L}(C), (4)
CertPath(C),ΓSQA,\displaystyle\operatorname{CertPath}(C),\ \Gamma_{\mathrm{SQA}},
Γhuman,IDexec,Tvalid,\displaystyle\Gamma_{\mathrm{human}},\ ID_{\mathrm{exec}},\ T_{\mathrm{valid}},
Pver,ρrev,σSAB.\displaystyle P_{\mathrm{ver}},\ \rho_{\mathrm{rev}},\ \sigma_{\mathrm{SAB}}\rangle.

where IDexecID_{\mathrm{exec}} is the execution identity, TvalidT_{\mathrm{valid}} is the validity window, PverP_{\mathrm{ver}} is the policy version, ρrev\rho_{\mathrm{rev}} is the revocation epoch, and σSAB\sigma_{\mathrm{SAB}} is the airlock signature. After execution, the broker emits a separate outcome record:

O=Ω,status,texec,H(Spost),result,σbroker,O=\langle\Omega,\ \mathrm{status},\ t_{\mathrm{exec}},\ H(S_{\mathrm{post}}),\ \mathrm{result},\ \sigma_{\mathrm{broker}}\rangle, (5)

where σbroker\sigma_{\mathrm{broker}} is the broker signature. The ledger stores OO and binds it to Ω\Omega.

Properties.

  • Replayability: The replay service reconstructs the decision using CC, H(E)H(E), (C)\mathcal{R}(C), (C)\mathcal{L}(C), CertPath(C)\operatorname{CertPath}(C), PverP_{\mathrm{ver}}, and ρrev\rho_{\mathrm{rev}}.

  • Non-repudiation: Approvals are cryptographically bound to CC and H(E)H(E), preventing reuse or extraction.

  • Temporal Validity: The certificate expires when the validity window TvalidT_{\mathrm{valid}} closes or evidence freshness drops.

  • Outcome Binding: Post-execution observations are bound to Ω\Omega through outcome record OO, not embedded in the original certificate.

Table 6 compares SAB certificates with standard change-auditing artifacts.

Table 6: Comparison of certification and auditing artifacts.
Artifact Records Limitation
Audit log Executed event, actor, time Lacks admission context.
Policy decision record Policy engine query result Omits semantic evidence and validator votes.
SQA quorum Validator votes and reasoning Lacks system-level context and revocation epoch.
SAB certificate Full pre-execution admission state Outcomes require ledger-bound records and replay overhead.

5 Revocation, Suspension, and Incident Response

Certificate-bound authority must be withdrawable. An incident may require immediate suspension across an action class, domain, validator, or execution path, so revocation and suspension are part of the control plane rather than an external operations procedure.

5.1 Revocation Triggers and Withdrawal Mechanisms

The controller monitors signals from the ledger, validator mesh, policy engine, broker, and compliance tools. Triggers apply globally or to a scoped domain dd (action class, service, region), as shown in Table 7.

Table 7: System revocation triggers.
Trigger Class Primary Signal Institutional Consequence
Validator Degradation Validator agreement rate or accuracy drops below threshold Downgrade autonomy level, quarantine validator
Evidence Outage/Staleness Telemetry streams, incident tickets, or configs become stale/unavailable Airlock rejects new proposals in affected domain
Policy Drift/Rollback Updated policy version PverP_{\mathrm{ver}} published or rolled back Invalidate active certificates, re-evaluate paths
Environmental Outage System monitor detects live intrusion or active severity-1 outage Advance revocation epoch ρrev\rho_{\mathrm{rev}}, suspend domain
Compromised Identity Suspected leak of signing keys or broker execution credential Revoke pending certificates, rotate execution keys
Operational Anomaly Operator triggers emergency hold or manually rolls back change Immediate human-only fallback, cancel active certificates

When a trigger fires, the controller applies one or more scoped withdrawal mechanisms (Table 8). Withdrawal is enforced at admission (the airlock consults suspension state before emitting Ω\Omega) and execution (the broker checks revocation state before execution).

Table 8: Authority withdrawal mechanisms.
Mechanism Enforcement Point Operational Effect
Level Downgrade Consequence Scorer Caps maximum permitted autonomy level (C)\mathcal{L}(C), routing actions to human approval.
Domain Suspension Assurance Airlock Categorically rejects all future proposals matching the suspended domain dd.
Certificate Invalidation Sovereign Broker Broker rejects execution of unexecuted certificates matching the revoked class.
Validator Quarantine SQA Mesh Removes compromised or stale validators from participating in quorum validation.
Identity Rotation Sovereign Broker Rotates execution credentials, invalidating certificates bound to stale keys.
Human-only Fallback Assurance Airlock Bypasses automated routing, requiring manual approval for all actions.
Replay Sweep Audit Service Re-evaluates recent certificates under updated policy to identify active safety violations.

If the broker cannot obtain fresh revocation state (e.g., due to a network partition), it fails closed, rejecting all high-consequence (L3,L4L_{3},L_{4}) executions. Revocation updates are signed, versioned, and distributed as monotonically increasing epochs. Deployments that cache revocation state must bound cache lifetime and record the revocation epoch used for every broker decision.

5.2 Formal Semantics

Let dd be a domain or action class. Suspension is a future-admission invariant:

Suspended(d)Cd,Admit(C)=False.\operatorname{Suspended}(d)\implies\forall C\in d,\;\operatorname{Admit}(C)=\mathrm{False}. (6)

The invariant is evaluated before path selection, preventing suspended domains from being bypassed.

Certificate revocation is an execution invariant:

Revoked(Ω)Execute(Ω)=False.\operatorname{Revoked}(\Omega)\implies\operatorname{Execute}(\Omega)=\mathrm{False}. (7)

Equation 7 applies to pending or unexecuted certificates. Completed executions cannot be undone; instead, the ledger marks Ω\Omega for post-hoc response and compensating actions.

5.3 Incident Response Workflow and Override Risk

During an incident, the controller identifies the affected domain dd, installs a suspension or downgrade rule, and forces new proposals to human-only escalation. The broker invalidates pending certificates in dd, while the replay service sweeps recent executions to link outcomes back to context.

Emergency overrides that bypass SAB remove the evidence and identity constraints required for accountability. Emergency operations should instead pass through SAB under stricter constraints: narrow scopes, short-lived certificates, and mandatory post-hoc replay. If an override is abused, the controller revokes pending certificates, quarantines keys, and replays the executions under standard policy.

6 Case Studies

The following case studies trace SAB’s certificate-bound admission boundary across four operational settings (Table 9).

Table 9: Case-study summary.
Case Proposed Action Risk Certification Path Outcome
Enterprise Cloud Operations Open security-group rule during outage Public exposure, privilege expansion, incident masking SQAPlusHuman or Reject Vetoed, escalated, or executed through broker with Ω\Omega
Generated-Code Deployment Deploy agent-generated patch Security regression, failed rollout, irreversible change SQA or SQAPlusHuman Certified artifact deployed with rollback and outcome evidence
Sovereign AI Cloud Execute plan proposed by external foundation models Jurisdictional leakage, external execution authority SQAPlusHuman under local policy Local certificate Ω\Omega and sovereign execution identity required
Regulated Data Workflow Export logs or datasets for diagnosis PII exposure, retention violation, cross-boundary transfer SQAPlusHuman, Reject, or Prohibited Redacted export, human approval, or denial with replay record

6.1 Case Study 1: Enterprise Cloud Operations

Scenario and Path.

An operations agent 𝒜ops\mathcal{A}_{\text{ops}} detects network latency and proposes adding an ingress rule in a security group to bypass a blocked gateway. The compiler normalizes this into contract CfwC_{\mathrm{fw}}, which the evidence collector binds to service-health metrics, incident ticket ID, and security-group configurations. Because modifying firewall rules alters production network posture, the scorer assigns a high blast-radius score, mapping CfwC_{\mathrm{fw}} to autonomy level L4L_{4}. If the proposed source range is public (0.0.0.0/0), the policy engine routes the contract to SQAPlusHuman. SQA validators review the proposal; a security-archetype validator holds veto power. If approved, the airlock signs certificate Ωfw\Omega_{\mathrm{fw}}. The broker verifies Ωfw\Omega_{\mathrm{fw}} and evaluates the state drift-check predicate Φdrift\Phi_{\mathrm{drift}} before executing the API call.

Failure Prevented and Artifact.

This path is intended to stop a latency diagnosis from becoming an overbroad network exposure or an unevidenced API call. The ledger records the contract, evidence digest H(E)H(E), validator signatures, and broker decision, allowing post-hoc verification that the change was semantically justified.

6.2 Case Study 2: Generated-Code Deployment

Scenario and Path.

A software engineering agent generates a bug patch and proposes deploying it to a Kubernetes cluster. The proposal is compiled into contract CC, which includes source diffs, target service, and rollback plans. The collector binds CC to build provenance, test results, and vulnerability scans. The scorer distinguishes low-risk canary rollouts (routed to SQA) from privileged code changes or database migrations (escalated to SQAPlusHuman). The airlock evaluates PDD (Protocol-Driven Development) invariants, verifying that the generated patch matches interface definitions. The broker deploys only certified manifests and monitors rollout health, recording outcomes.

Failure Prevented and Artifact.

The broker blocks unverified code from reaching production by rejecting proposals with missing rollbacks, failing tests, or unapproved dependencies. The certificate records all build and test evidence, enabling post-deployment replay.

6.3 Case Study 3: Sovereign AI Cloud

Scenario and Path.

A sovereign operator uses a mixed model stack (external and domestic models) for planning. An external model proposes restarting services and exporting diagnostic telemetry. The local contract compiler decomposes the plan into separate contracts, and the collector binds local evidence (data-residency tags, configuration states). Sensitive context is held locally. The local policy engine enforces jurisdictional rules, rejecting contracts that violate residency policies or cross borders. Valid proposals are reviewed by local SQA validators, and high-stakes actions route to human operator multi-signature. The local broker executes approved contracts using domestic identities.

Failure Prevented and Artifact.

The local broker path prevents external planning models from directly acquiring execution authority. External reasoning systems cannot obtain credentials, bypass jurisdictional policies, or execute mutations without local admission and replay. Certificate Ω\Omega records the domestic validation path and broker identity, making sovereign execution verifiable.

6.4 Case Study 4: Regulated Data Workflow

Scenario and Path.

During an incident, an agent proposes exporting database logs for offline analysis. The logs may contain PII or regulated customer records. The compiler translates this into a data-export contract detailing source tables, destinations, and redaction rules. The collector binds schema metadata and legal-hold status. The scorer assigns a high data-sensitivity term, raising uncertainty if payloads are unclassified. If policy forbids autonomous export of the data class, the path is Prohibited or Reject. If export is permitted with redaction, the airlock enforces a minimization contract, routing the redacted export through compliance validators and human multi-signature.

Failure Prevented and Artifact.

This path is intended to prevent autonomous leakage of regulated data under the guise of diagnostic utility. The airlock rejects overbroad exports or destinations lacking compliance controls, binding redaction attestations and human approvals to the export outcome.

7 Evaluation Methodology and Targets

We evaluate SAB’s control-plane feasibility and performance tradeoffs with a Go prototype and three workload families.

7.1 Prototype Implementation

Our prototype of SAB in Go contains a stateless admission service, Protocol Buffer and JSON Schema contract definitions, an evidence-digest generator extracting Kubernetes and Active Directory logs, OPA/Rego policy evaluation, Ed25519 signing for Ω\Omega, a PostgreSQL ledger, prototype target adapters, revocation epoch distribution, and a replay command-line interface. The target adapters simulate actions on Kubernetes and AWS/GCP APIs.

7.2 Evaluation Methodology and Baselines

We compare SAB against standard baseline security controls (Table 10) using 500 execution contracts: 200 cloud-infrastructure contracts (security groups, scaling, restarts, credential rotations), 150 CI/CD contracts (rollbacks, patch deployments, manifest updates), and 150 data-governance contracts (migrations, log exports, resource deletion).

Table 10: System baselines and comparison matrix.
Baseline Mechanism Assurance Limitation
Static IAM/Policy Checks static roles (e.g., AWS IAM, OPA) Context-unaware; permits structurally valid but semantically unsafe actions.
Human Approval Only Routes all proposals to human change queue Introduces high latency; prone to rubber-stamping due to review fatigue.
Audit Logging Only Records actions to security log after execution No runtime prevention; cannot halt unauthorized actions before they occur.
LLM Self-Review Proposing model critiques its own output Susceptible to hallucinations and adversarial jailbreaks; no institutional control.
SQA without SAB Quorum validation of individual proposed actions Lacks system-level context: no evidence binding, revocation, or execution decoupling.
SAB (No Revocation) Airlock admission without broker revocation checks Vulnerable to stale certificates during outages or policy changes.
SAB (No Routing) Fixed high-stakes certification path for all actions High operational friction for low-risk actions; excessive latency.

7.3 Experimental Setup

Table 11 reports preliminary measurements from a single-node local workstation testbed, not a production cloud deployment. The topology contains one admission service, one policy engine, one PostgreSQL-backed ledger, one revocation service, three broker processes, three target-service adapters, and two workload-driver processes. The 500 contracts are independently labeled by three expert systems/security engineers for expected admission decision, certification path, and safety outcome, with disagreements resolved by majority vote and discussion. We ran five full-workload trials (2,500 admission attempts). Live local measurements include admission, OPA evaluation, Ed25519 signing, ledger writes, broker verification, revocation-epoch propagation, and SQA latency using a three-validator mesh with 2-of-3 quorum, Ed25519-signed attestations, and a security-archetype veto. Kubernetes, Active Directory, and cloud-control-plane evidence is replayed from captured or synthetic traces; target mutations are simulated by service adapters. Unless a row states otherwise, latency results are p50; SQA admission reports p50/p95, and rate metrics are percentages.

7.4 Target Metrics and Expected Tradeoffs

Table 11 reports the evaluation targets and measured results, grouped into latency overheads, routing accuracy, and storage metrics.

Table 11: Preliminary, workload-dependent evaluation results. Metric names specify p50, p95, mean, or percentage.
Metric Prototype Target Measured Result Expected Tradeoffs
PolicyOnly admission latency (p50) <5<5 ms 1.84 ms Fast path for low-consequence rules (OPA check).
SQA admission latency (p50/p95) report p50/p95 by validator config 185 ms / 380 ms Parallel validator queries.
Broker verification latency (p50) <1<1 ms 0.15 ms Signature and active revocation epoch check.
Revocation propagation latency (p95) <100<100 ms 12.4 ms Delay to propagate new ρrev\rho_{\mathrm{rev}} to local brokers.
Schema replay completeness (%) 100%100\% 100%100\% Fraction of ledger entries with replayable schema context.
Unsafe admission rate (%) lower is better 0.4% Admissions that violate safety policy or human intent.
Routing accuracy (%) agreement with expert-labeled oracle 96.8% Agreement of consequence scorer with expert-labeled paths.
Human approval reduction (%) >60%>60\% 68.2% Actions automated compared to a human-only queue.
Certificate storage overhead (mean) <10<10 KB / action 3.4 KB Space required on the evidence ledger per contract.

These measurements indicate control-plane feasibility on a single workstation, not production cloud performance. SAB trades latency for assurance, additional control-plane components for replayability, and occasional false escalation for lower under-classification risk. SQA and manual reviews add latency; evidence collection and ledger writes add storage and processing overhead while supporting audit and replay.

8 Security Analysis

SAB mitigates authorization risks by decoupling proposal generation from execution through certificate-bound admission. The analysis defines the threat model, examines ten threat categories, and states the authority-boundary invariants.

8.1 Threat Model and TCB Boundaries

The threat model assumes that the agent runtime may be compromised (e.g., via prompt injection or credential theft) or faulty. Upstream evidence sources, SQA validators, and operators may also fail. We assume the integrity of the core control-plane components. Table 12 maps the TCB boundaries.

Table 12: Trusted and untrusted components in the SAB threat model.
Component TCB Status Failure Mitigation
Agent runtime Untrusted proposer Airlock interposition, rate limiting
SQA validators Untrusted external oracles Diversity, vetoes, signing keys
Evidence collector Outer engine: untrusted; signature verifier: trusted Freshness, cryptographic provenance verification
Policy / revocation Trusted control plane Versioned policies, epoch gating
Airlock and broker Trusted control plane Hardened runtimes, key rotations
Evidence ledger Trusted control plane Cryptographic chains, backups

8.2 Threat Analysis

Table 13 maps SAB’s controls and residual risks across ten threat categories: agent compromise, validator failure, policy degradation, operational bypass, and authority creep.

Table 13: Threats, controls, and residual risks.
Threat SAB Control Residual Risk
Malicious proposer agent Decouples proposal from execution; requires contract CC and signed certificate Ω\Omega. Bad proposals may consume validator and operator review capacity.
Compromised validator Enforces validator diversity and critical archetype vetoes; signature quarantine. A compromised validator can approve unsafe actions until quarantined.
Correlated validator failure Diversity rules across model architectures and prompt structures. Shared latent training biases may still fail to catch novel semantic errors.
Stale/manipulated evidence Binds evidence digests H(E)H(E); enforces freshness and provenance checks. Upstream telemetry sources may report inaccurate state.
Forged/replayed certificate Sovereign broker verifies σSAB\sigma_{\mathrm{SAB}} with pkSABpk_{\mathrm{SAB}}, validity window TvalidT_{\mathrm{valid}}, and epoch ρrev\rho_{\mathrm{rev}}. Compromise of airlock private key requires key rotation.
Policy downgrade attack Binds signed, versioned policy releases PverP_{\mathrm{ver}}; enforces monotone routing. Authorized policy updates may be overly permissive.
Human rubber-stamping Structured evidence presentation; role separation; approval trends analysis. Humans may still sign off negligent changes under operational pressure.
Emergency-mode abuse Scoped emergency authority; short-lived certificates; mandatory post-hoc replay. Social or operational abuse of emergency procedures.
Jurisdictional bypass Location checks in policies; regional checks at the sovereign broker. Incomplete or inaccurate asset tagging.
Autonomy creep Autonomy level caps (C)\mathcal{L}(C); periodic ledger audits. Slow normalization of deviance.

8.3 Authority-Boundary Invariants

The threat analysis yields four invariants for high-stakes agentic infrastructure:

  • Proposal is not authority: Agent proposals must compile to CC and be admitted by the airlock before execution.

  • Approval is evidence-bound: Validator and human approvals must be cryptographically bound to CC and H(E)H(E).

  • Execution is sovereign: Infrastructure changes must execute through IDexecID_{\mathrm{exec}} at the broker, never via agent credentials.

  • Authority is revocable and replayable: Certificates, validators, and autonomy levels must be revocable; decisions must be reconstructable from the ledger.

Under the stated trust assumptions, admitted autonomous actions are bounded by contract, evidence, certification path, broker verification, and revocation state, and remain auditable through the ledger.

9 Discussion and Limitations

SAB limits when model reasoning can transition into system mutations. It does not guarantee that planning models are correct or aligned. The main operational questions concern review granularity, proportional assurance, sovereignty, composition, and residual risk.

9.1 Operational Discussion

Human Oversight and the Unit of Review.

Traditional human-in-the-loop controls are prone to fatigue and rubber-stamping, especially when operators are presented with raw reasoning transcripts or unconstrained tool calls. SAB changes the unit of review: operators approve a normalized contract CC, cryptographic evidence digest H(E)H(E), consequence level (C)\mathcal{L}(C), and validator outputs. Approval is tied to a concrete contract, evidence snapshot, and policy context.

Proportional Assurance.

Autonomy should be granted dynamically. The same agent may restart a stateless container (routed to PolicyOnly), but require SQA for rollbacks, human multi-signature for firewall modifications, and be prohibited from deleting databases. Consequence-aware routing directs latency and validator overhead to operations where mistakes have material operational, legal, or security outcomes.

Operational Sovereignty.

SAB defines sovereignty operationally: the institution must maintain local control over policy evaluation, evidence retention, signing keys, revocation state, broker identity, and execution authority. External foundation models are restricted to proposal generation; they hold no production credentials and cannot bypass local certification and revocation.

Composition with OpenKedge Layers.

In the OpenKedge stack, OpenKedge provides bounded mutation interfaces; SAL separates planning from execution; VAI provides execution identity and evidence chains; PDD validates generated code against schemas and protocol obligation invariants; and SQA evaluates action-level semantics. SAB composes these layers into a system-level admission boundary.

9.2 System Limitations

SAB’s security claims depend on several operational and design boundaries:

  • Consequence Scoring Errors: Consequence classification ((C)\mathcal{R}(C)) is an operational classifier. It may underestimate risk when dependencies are hidden, or create bottlenecks by over-estimating risk and over-escalating low-risk tasks to human review.

  • Evidence Staleness and Incompleteness: SAB relies on external telemetry and log sources. Stale or incomplete evidence can cause the airlock to certify actions under assumptions that no longer hold.

  • Validator and Policy Calibration: SQA relies on validators whose reliability degrades after model, prompt, or workload changes. Additionally, SAB cannot correct policies that are incomplete, stale, or contradictory.

  • Prototype Generality: Prototype results are workload-dependent and should not be interpreted as general performance guarantees.

  • Emergency Bypass Risk: Outages may force operators to bypass normal assurance rules. While SAB records emergency overrides and enforces short validity windows, emergency access remains a critical threat vector.

  • Control-Plane Attack Surface: SAB introduces high-value security targets (signing keys, the ledger, and the broker). Compromise of these trusted components invalidates all security properties.

These limitations motivate an incremental deployment strategy, starting with high-consequence domains (IAM, network policy, and deployments) and expanding as evidence pipelines and validator models mature.

10 Related Work

We position SAB relative to systems security and safety mechanisms that partially address proposal admission, execution control, and auditability:

Runtime Assurance.

Safety-monitor architectures (e.g., Simplex) supervise autonomous controllers and interpose fallback modes to prevent physical safety envelope violations [simplex-architecture, bloem-shield-synthesis, phan-neural-simplex]. The same interposition pattern can be applied to cloud and software control planes, where the monitored objects are discrete configuration changes, evidence freshness, and policy epochs.

Assurance Cases.

Dependability and safety cases structure claims, assumptions, and evidence for pre-deployment system certification [rushby-assurance]. SAB brings a similar claim-evidence structure into runtime admission: approvals bind to specific contracts, evidence snapshots, and revocation state.

Distributed Consensus.

Paxos, Raft, and PBFT protocols ensure ordered agreement on state transitions across replicated nodes [lamport-paxos, ongaro-raft, castro-pbft]. They do not verify whether those transitions are semantically correct or compliant with institutional rules. SAB treats authority as a separate admission question.

Access Control (IAM) and Policy Engines.

Zero trust, IAM, and policy-as-code engines (e.g., OPA) enforce static permissions at API endpoints [rose-zerotrust, aws-iam, opa-rego, kubernetes-admission]. They are necessary inputs, but they authorize identities rather than validating autonomous proposals against live evidence. SAB adds proposal certification before broker execution.

AI Safety and Tool Governance.

Agent safety tools (filters, sandboxes, critiques) reduce dangerous outputs from planning models [yao-react, inan-llamaguard]. They run as unprivileged utilities and cannot serve as control-plane security boundaries. In SAB, safety models can act as untrusted validators in the SQA mesh, but authority is released only by the airlock and broker.

Audit Logs and Evidence.

Audit trails and transparency logs capture events for post-incident forensics [w3c-prov]. Certificate Ω\Omega binds pre-execution admission evidence, policy versions, and validator outputs; ledger-bound outcome record OO links subsequent observations to Ω\Omega, making admission decisions replayable. Audit logs record executed events; the SAB certificate records the admission context that authorized execution.

Sovereign AI Governance.

Sovereignty frameworks dictate local resource control and residency rules [nist-ai-rmf]. In SAB, those requirements appear as local policy evaluation, domestic validator meshes, and regional execution brokers, so external or mixed planning models cannot bypass local control boundaries.

Existing mechanisms authorize identities, replicate decisions, monitor runtime behavior, or record events. SAB combines these mechanisms into an admission boundary that decides when autonomous proposals may become execution authority.

11 Conclusion

Agentic infrastructure needs an authorization boundary between plausible model-generated proposals and privileged production mutations. The Sovereign Assurance Boundary (SAB) addresses that boundary through certificate-bound admission: agent outputs compile into execution contracts CC; admission decisions bind those contracts to evidence digests H(E)H(E), certification paths, policy versions, revocation epochs, and scoped broker identities; and execution proceeds only after the sovereign broker verifies the certificate Ω\Omega.

The paper’s core claim is intentionally narrow: SAB is not a replacement for IAM, formal verification, consensus, or human review, but a runtime admission path that composes those mechanisms so autonomous reasoning alone cannot become execution authority. The prototype and case studies show that the airlock–broker model can be implemented and measured across cloud operations, generated-code deployment, sovereign AI cloud settings, and regulated data workflows, while the limitations identify the remaining dependencies on evidence freshness, policy quality, validator calibration, and trusted control-plane components. SAB therefore treats delegated execution authority as certificate-bound, evidence-bound, revocable, and replayable, rather than as an ambient permission.

References