Sovereign Assurance Boundary: Certificate-Bound Admission for Agentic Infrastructure

Jun He
OpenKedge.io Deying Yu
OpenKedge.io

Abstract

Agentic infrastructure creates a control-plane authorization problem: non-deterministic reasoning systems may propose high-stakes mutations to production resources, but existing access controls (IAM), policy engines, consensus protocols, and audit logs either enforce static permissions or record actions only after execution. This paper introduces the Sovereign Assurance Boundary (SAB), a certificate-bound runtime admission boundary for autonomous execution authority in agentic infrastructure. SAB intercepts agent proposals at an assurance airlock, compiles them into typed execution contracts $C$ , binds those contracts to cryptographic evidence digests $H(E)$ and policy versions, and routes them through consequence-aware certification paths. Successful admission emits a signed Sovereign Assurance Certificate ( $\Omega$ ) that is valid only for a scoped execution identity, revocation epoch, and validity window. A sovereign execution broker verifies the certificate $\Omega$ and performs fresh pre-execution revocation and drift checks before invoking infrastructure APIs. We describe the airlock–broker architecture, formalize the admission and revocation invariants, and report preliminary feasibility measurements from a Go prototype over 2,500 admission attempts. The resulting model prevents autonomous reasoning alone from mutating state and turns delegated execution authority into a certificate-bound, evidence-bound, revocable, and replayable runtime artifact.

1 Introduction

Large language model (LLM) agents are increasingly connected to operational workflows rather than advisory interfaces, allowing them to scale resources, rotate credentials, deploy code, and export diagnostic data [yao-react, schick-toolformer]. In this setting, agentic infrastructure needs an admission mechanism that decides when a proposed action may become execution authority. Granting non-deterministic reasoning processes direct mutation rights over production control planes creates a safety and authorization risk: agents can become the proximate cause of physical or digital state transitions. Without a runtime admission boundary, individually plausible actions can accumulate into availability failures, data exposure, policy bypasses, or loss of forensic context.

Sovereign Assurance Boundary (SAB) interposes between agentic reasoning and high-stakes infrastructure mutation. We use “sovereign” operationally: the institution retains local control over policy evaluation, evidence retention, signing keys, revocation state, broker identity, and execution authority. It is not an agent monitor or a general governance process; it defines a certificate-bound admission boundary where an agent proposal is not treated as ambient authority. Under SAB, a proposal is eligible for execution only after it is compiled into a typed execution contract $C$ , bound to an evidence digest $H(E)$ and policy version, assigned a consequence score $\mathcal{R}(C)$ and autonomy level $\mathcal{L}(C)$ , routed through a consequence-aware certification path $\operatorname{CertPath}(C)$ , encoded as a signed certificate $\Omega$ , checked against active revocation state, and verified by a sovereign execution broker immediately before mutating the target system. The mechanism converts autonomous execution authority from an ambient permission into a revocable, evidence-bound runtime artifact.

SABseparates action-level and system-level assurance. Semantic Quorum Assurance (SQA) [sqa-paper] serves as the action-level certification primitive. While SQA validates individual proposed actions, SAB composes SQA with evidence, policy, human approval, sovereign execution identity, revocation, and replay. SAB is independent of the SQA implementation: any validator mesh returning signed action-level attestations can instantiate $\Gamma_{\mathrm{SQA}}$ . Action-level checks address unsafe individual mutations; system-level assurance binds execution to active policy, evidence, revocation state, and broker-enforcement paths.

Running Example.

Consider an operations agent, $\mathcal{A}_{\text{ops}}$ , tasked with resolving a service degradation. To restore reachability and diagnose the issue, $\mathcal{A}_{\text{ops}}$ proposes reconfiguring network security groups, scaling up compute nodes, rotating database credentials, rolling back a deployment, and exporting diagnostic logs. If an infrastructure stack lets that proposal directly invoke raw IAM permissions, autonomous execution of the sequence introduces systemic risk: the network change may expose private services, the resource expansion may exceed budget or quota constraints, credential rotation may trigger auth failures, the rollback may reintroduce vulnerabilities, and the log export may leak personally identifiable information (PII). The example shows why action-level checks alone do not bound institutional risk.

What SAB Is Not.

SABdiffers from existing controls in several ways:

•

Not an IAM replacement: IAM authorizes identities and requests; SAB certifies whether an autonomous proposal is semantically justified by evidence before a broker invokes those credentials.
•

Not a formal verification replacement: SAB does not prove arbitrary safety, but composes formal proofs (when available) as inputs to its evidence digests.
•

Not a consensus protocol: Consensus (e.g., Raft) agrees on state transitions; SAB evaluates whether a transition is institutionally authorized under current context.
•

Not only logging: Logs record events post-execution; SAB binds pre-execution evidence and policy versions in the certificate $\Omega$ , then links outcomes through a ledger-recorded outcome record $O$ , making the admission process replayable.
•

Not only human-in-the-loop approval: SAB is a system-level runtime admission boundary that cryptographically binds human signatures to specific contracts and evidence snapshots, preventing generic rubber-stamping.

We make four contributions:

1.

Certificate-bound admission for autonomous infrastructure. We identify a control-plane authorization gap that arises when non-deterministic agents are allowed to propose production mutations, and define the Sovereign Assurance Boundary (SAB) as a runtime admission boundary that converts agent proposals into certificate-bound execution authority.
2.

A concrete airlock–broker architecture. We introduce the assurance airlock, typed execution contract $C$ , evidence digest $H(E)$ , the certificate $\Omega$ , and sovereign execution broker. Together, these mechanisms separate proposal, admission, and execution so that agent reasoning alone cannot directly mutate infrastructure state under the broker-enforced execution model.
3.

Consequence-aware certification and revocation. We formalize consequence scoring $\mathcal{R}(C)$ , autonomy levels $\mathcal{L}(C)$ , certification paths $\operatorname{CertPath}(C)$ , monotone path selection, broker-side verification, and revocation epochs $\rho_{\mathrm{rev}}$ . These mechanisms support escalation, rejection, suspension, and revocation before execution.
4.

Prototype and feasibility evaluation. We implement an initial Go prototype with contract serialization, OPA/Rego policy checks, Ed25519 certificate signing, a PostgreSQL-backed append-only ledger, revocation-epoch distribution, broker verification, and replay tooling. Using 500 labeled execution contracts over 2,500 admission attempts, we report preliminary local feasibility measurements for admission latency, broker verification, revocation propagation, routing accuracy, replay completeness, and certificate overhead.

2 Background, Motivation, and System Requirements

2.1 Background and Gap

Securing agentic infrastructure requires integrating model-generated proposals with existing safety and security mechanisms.

Runtime Assurance.

Traditional runtime assurance separates an advanced controller from a monitor or fallback controller to prevent safety envelope violations [simplex-architecture, bloem-shield-synthesis]. In cloud and database administration, mutations are discrete, policy-laden configuration changes. SAB adapts the runtime-assurance pattern by evaluating the institutional admissibility of proposals rather than continuous physical invariants.

Assurance Cases.

Assurance cases present structured safety, security, or compliance arguments by linking claims to evidence [rushby-assurance]. They are usually static, design-time artifacts. SAB moves part of that structure into runtime admission by cryptographically binding telemetry, policies, and validator signatures to SAB certificates.

Access Control (IAM) and Policy Engines.

Zero trust and identity management systems (IAM) verify caller identities and enforce structural policy compliance (e.g., via OPA Rego) [rose-zerotrust, aws-iam, opa-rego]. While IAM dictates whether an identity can call an API, it cannot determine whether an autonomous proposal is semantically justified by live evidence. SAB treats access control as a prerequisite, interposing a semantic certification boundary before credentials are used.

Consensus and Byzantine Fault Tolerance.

Consensus protocols (e.g., Paxos, Raft, PBFT) replicate logs and enforce order across replicas [lamport-paxos, ongaro-raft]. They ensure agreement on state transitions but do not evaluate the semantic correctness or safety of those transitions. Semantic Quorum Assurance (SQA) certifies the semantic admissibility of individual proposed actions [sqa-paper]; SAB composes SQA into a system-level certification boundary.

AI Safety and Tool Governance.

Agent safety tools (sandboxing, output filtering, model self-critiques) constrain agent execution environments [yao-react, owasp-llm-top10]. These mechanisms lack cryptographically signed, institutionally bound execution credentials or durable audit trails. SAB wraps these tools as untrusted validators and binds their collective output into SAB certificates.

2.2 Motivation: Why Action Certification Is Not Enough

Semantic Quorum Assurance (SQA) verifies locally whether an action is semantically admissible (e.g., whether a security-group change fits an active outage ticket). High-stakes infrastructure also needs a system-level check: whether the agent operates under active policies, fresh evidence, and current revocation rules.

A sequence of individually approved steps (e.g., opening a firewall, expanding an IAM role, rolling back a service, and exporting logs) can collectively create high-consequence availability, compliance, or security failures. Table 1 summarizes how SAB composes these security controls. SAB mediates the certificate-bound transition from proposal to execution, checking that the certificate $\Omega$ contains the evidence and validation epoch required by policy.

Table 1: Admission controls for agentic infrastructure. SAB certifies the institutional admission context surrounding individual actions.

Control	Question	Artifact	Failure if Missing
Policy engine	Is the caller authorized for this operation?	Authorization token	Static permissions are treated as ambient authority.
SQA	Is the proposed action semantically admissible?	Action certificate	Locally unsafe mutations are executed.
Human approval	Does an operator approve this step?	Approval record	Approvals do not bind evidence or policy epochs.
Audit logging	What happened after execution?	Event log	Accountability is reconstructed post-hoc.
SAB	May this proposal become execution authority under institutional rules?	SAB certificate $\Omega$	Individually safe actions cross into execution authority without system context.

2.3 System Requirements

To govern high-stakes agentic infrastructure, SAB must satisfy seven system requirements:

R1: Proposal/Execution Separation.: Agent runtimes must have no standing credentials or direct execution privileges, treating them purely as action proposers.
R2: Contractual Admission.: Raw agent commands must compile into a strongly-typed execution contract $C$ defining target resources, preconditions, postconditions, and rollbacks.
R3: Evidence-Bound Decisions.: All admissions must bind to a cryptographic digest $H(E)$ of the telemetry and system state active at decision time.
R4: Consequence-Aware Routing.: Proposals must route to validation paths dynamically based on a consequence score $\mathcal{R}(C)$ (blast radius, irreversibility, data risk, and uncertainty).
R5: Sovereign Execution.: Admitted contracts must execute exclusively through an institutionally controlled broker that scopes credentials and validates the certificate $\Omega$ .
R6: Revocability.: The system must support scoped withdrawal, enabling key rotation, validator quarantine, and domain suspension.
R7: Replayability.: SAB must record the certificate $\Omega$ and, after execution, outcome record $O$ in an append-only log, with the ledger binding $O$ to $\Omega$ .

3 SAB Architecture and Trust Model

The Sovereign Assurance Boundary (SAB) intercepts agent proposals before they can mutate production infrastructure. The system model separates unprivileged agent reasoning from privileged sovereign execution.

3.1 System Model and Assumptions

We model the agent runtime as an untrusted proposer. It may be compromised, misaligned, or faulty, and has no standing administrative credentials. Downstream infrastructure APIs accept state mutations exclusively from the sovereign execution broker.

The trusted computing base (TCB) consists of the assurance airlock, policy engine, evidence ledger, revocation controller, private signing keys, and sovereign execution broker. The contract compiler, evidence collector, consequence scorer, SQA validator mesh, and approval console run outside the TCB but are security-relevant; Table 2 maps each component to its trust boundary and core systems function. The evidence collector is untrusted: while SAB detects unsigned or malformed evidence, signed false evidence from compromised authorized sources remains a residual risk. Similarly, the SQA mesh is an external, untrusted oracle: the airlock policy engine verifies validators’ signed attestations $\Gamma_{\mathrm{SQA}}$ instead of trusting validator internal execution.

SAB assumes authenticated component identities, a versioned policy store, bounded clock skew, append-only ledger integrity, and timely propagation of revocation state. Let $pk_{\mathrm{SAB}}$ denote the airlock’s public verification key and $\sigma_{\mathrm{SAB}}$ its signature on the certificate $\Omega$ .

Figure 1: Reference architecture for the Sovereign Assurance Boundary.

Table 2: SAB Component Reference.

Component	Trust Boundary	Core Systems Function
Agent Runtime	Untrusted / Outer	Proposes mutations as unprivileged JSON requests.
Contract Compiler	Security-relevant / Airlock	Normalizes natural-language or tool calls into structured contracts $C$ .
Evidence Collector	Security-relevant / Airlock	Assembles telemetry, dependency logs, and tickets to compute evidence digest $H(E)$ .
Consequence Scorer	Security-relevant / Airlock	Estimates blast radius and data risk; determines autonomy levels $\mathcal{L}(C)$ .
Policy Engine	Trusted Control Plane	Enforces static rules and routes remaining ones to certification paths.
Assurance Airlock	Trusted Control Plane	Enforces path predicates, collects approvals, and issues signed SAB certificates $\Omega$ .
SQA Validator Mesh	Fallible / Outer Mesh	Evaluates action semantics as decoupled, untrusted signing oracles.
Approval Console	Security-relevant / Outer	Presents structured contracts and evidence summaries for operator multi-signature.
Sovereign Broker	Trusted Control Plane	Verifies signatures and revocation status of $\Omega$ ; executes scoped actions through IAM.
Evidence Ledger	Trusted Control Plane	Durably records SAB certificates and bound outcome records $O$ in an append-only cryptographic log.
Revocation Controller	Trusted Control Plane	Monitors system state; advances revocation epochs $\rho_{\mathrm{rev}}$ and manages suspensions.
Replay Service	Security-relevant / Outer	Re-evaluates ledger records to reconstruct and audit historical admission decisions.

3.2 SAB Invariant Properties

Under the trust model in Section 3.1, SAB enforces five authority-boundary invariants:

P1: Proposal-Execution Decoupling Invariant.

Let $\mathrm{Mutate}(r,op)$ denote a state-changing operation on resource $r$ . Under the assumption that the infrastructure API accepts mutations only from the sovereign broker, no agent proposal can directly trigger $\mathrm{Mutate}(r,op)$ without first being admitted by the airlock:

\mathrm{Execute}(\Omega)\implies\mathrm{Admit}(C)

where $C$ is the compiled execution contract corresponding to the proposal.

P2: Certificate-Bound Execution Invariant.

The sovereign execution broker executes a contract $C$ only after verifying the certificate $\Omega$ and its airlock signature:

\mathrm{Execute}(\Omega)\implies\operatorname{Verify}(pk_{\mathrm{SAB}},\Omega,\sigma_{\mathrm{SAB}})\land C\in\Omega

where the certificate is bound to the specific contract $C$ , policy version $P_{\mathrm{ver}}$ , execution identity $ID_{\mathrm{exec}}$ , and validity window $T_{\mathrm{valid}}$ .

P3: Evidence-Binding Invariant.

Approvals and validator attestations ( $\Gamma_{\mathrm{SQA}},\Gamma_{\mathrm{human}}$ ) are bound cryptographically to $C$ and the evidence digest $H(E)$ :

\operatorname{Approve}(C,H(E))\implies\mathrm{Sign}(\Gamma,C\mathbin{\|}H(E))

This binding prevents approval reuse across different contracts or stale environmental contexts.

P4: Monotone Path Invariant.

A contract $C$ is admitted only if all validation steps required by its selected certification path $\operatorname{CertPath}(C)$ are satisfied:

\operatorname{Admit}(C)\implies\operatorname{VerifyPath}(C,\operatorname{CertPath}(C))

where $\operatorname{CertPath}(C)$ depends on the consequence score $\mathcal{R}(C)$ , autonomy level $\mathcal{L}(C)$ , and policy version $P_{\mathrm{ver}}$ . Policy may escalate a contract to stronger assurance or rejection, but lower-assurance paths cannot override requirements imposed by action class, consequence score, jurisdiction, or incident mode.

P5: Ephemeral Authority Invariant.

The sovereign execution broker rejects any certificate $\Omega$ if the current revocation epoch $\rho_{\mathrm{active}}$ is greater than the certificate’s revocation epoch $\rho_{\mathrm{rev}}$ , or if the execution time $t_{\mathrm{exec}}$ falls outside the validity window $T_{\mathrm{valid}}$ :

\mathrm{Execute}(\Omega)\implies\rho_{\mathrm{rev}}=\rho_{\mathrm{active}}\land t_{\mathrm{exec}}\in T_{\mathrm{valid}}

The broker therefore rejects certificates from stale epochs.

3.3 Proof Sketch

Goal.

Establish that an untrusted agent runtime cannot directly execute a state-changing operation except through a broker-verified certificate satisfying the active policy path.

Assumptions.

1. Infrastructure APIs accept mutations exclusively from the sovereign execution broker. 2. The broker verifies certificate signatures using the airlock public verification key $pk_{\mathrm{SAB}}$ and rejects invalid signatures. 3. Airlock private key remains uncompromised. 4. Active policy $P_{\mathrm{ver}}$ dictates path requirements.

Argument.

To execute a mutation, the broker must verify the certificate $\Omega$ such that $C\in\Omega$ and $\operatorname{Verify}(pk_{\mathrm{SAB}},\Omega,\sigma_{\mathrm{SAB}})$ (by P2). Because the airlock private key is secure, only the airlock could have produced $\sigma_{\mathrm{SAB}}$ . The airlock emits this signature only if $\operatorname{Admit}(C)$ evaluates to true. By P4, $\operatorname{Admit}(C)$ evaluates to true only if all path requirements $\operatorname{VerifyPath}(C,\operatorname{CertPath}(C))$ are satisfied under the policy $P_{\mathrm{ver}}$ . Under the assumptions above, an agent proposal cannot execute unless it compiles to a contract $C$ satisfying the path checks mandated by institutional policy. Broker-enforced execution leaves agent reasoning with no direct mutation path (P1).

3.4 Broker Verification Checks

Before invoking infrastructure APIs, the sovereign broker evaluates the pre-execution checks in Table 3. The checks cover certificate validity, contract identity, revocation state, policy epoch, state drift, and scoped credential issuance.

Table 3: Broker verification checks.

Broker Check	Input	Failure Behavior
Signature Verification	Certificate $\Omega$ , public key $pk_{\mathrm{SAB}}$ , signature $\sigma_{\mathrm{SAB}}$	Reject execution, abort
Contract Match	Contract $C$ , certificate contents $C\in\Omega$	Reject execution, abort
Validity Window	Current time $t_{\mathrm{exec}}$ , validity range $T_{\mathrm{valid}}$	Reject execution, abort
Revocation Epoch	Certificate epoch $\rho_{\mathrm{rev}}$ , active epoch $\rho_{\mathrm{active}}$	Reject, fail-closed for $L_{3}/L_{4}$
Policy Epoch	Certificate policy version $P_{\mathrm{ver}}$ , active $P_{\mathrm{active}}$	Reject execution, abort
State Drift Check	Bound evidence $E$ , live state $S_{t_{\mathrm{exec}}}$ , threshold	Reject, fail-closed for $L_{3}/L_{4}$
Credential Issuance	Scoped identity $ID_{\mathrm{exec}}$ , target resource API	Reject execution, abort

3.5 Worked Admission Example

Consider an operations agent $\mathcal{A}_{\text{ops}}$ proposing an ingress network rule $C_{\mathrm{fw}}$ to resolve an outage. Table 4 traces this proposal through the reference architecture.

Table 4: Worked example stages for security-group admission.

Stage	Component	Output / Artifact	Example Value	Failure Check
1. Compile	Contract Compiler	Execution Contract $C_{\mathrm{fw}}$	Add ingress rule; target: sg-01; port: 443; rollback: delete rule	Malformed syntax, unsupported API
2. Bind Evidence	Evidence Collector	Evidence Chain Digest $H(E_{\mathrm{fw}})$	Telemetry (high latency); target config; incident ticket ID: INC-101	Stale telemetry, missing incident ticket
3. Score Consequence	Consequence Scorer	Consequence Score	$\mathcal{R}(C_{\mathrm{fw}})=8.5$ ; autonomy level $\mathcal{L}(C_{\mathrm{fw}})=L_{4}$	Score calculation failure
4. Route Path	Policy Engine	Certification Path	$\operatorname{CertPath}(C_{\mathrm{fw}})=\mathrm{SQAPlusHuman}$	Monotone path mismatch
5. Airlock Check	Assurance Airlock	SAB certificate $\Omega_{\mathrm{fw}}$	Signed bundle including $C_{\mathrm{fw}}$ , $H(E_{\mathrm{fw}})$ , $\Gamma_{\mathrm{SQA}}$ , $\Gamma_{\mathrm{human}}$	Validator veto, missing human signature
6. Exec Check	Sovereign Broker	Broker execution decision	Verify $\sigma_{\mathrm{SAB}}$ with $pk_{\mathrm{SAB}}$ , check $\Phi_{\mathrm{drift}}(E_{\mathrm{fw}},S_{t_{\mathrm{exec}}})$ , verify $\rho_{\mathrm{rev}}$	Drift limit exceeded, stale epoch
7. Record Log	Evidence Ledger	Ledger transaction	Record $O_{\mathrm{fw}}$ bound to $\Omega_{\mathrm{fw}}$	Ledger write failure

The example has two failure cases:

•

Airlock policy rejection: If $\mathcal{A}_{\text{ops}}$ attempts to open port 22 (SSH) to the public internet (0.0.0.0/0), the consequence scorer maps this high-risk action to $L_{4}$ . The airlock’s security-archetype validator detects a policy violation and triggers a critical veto, returning $\mathrm{Reject}$ .
•

Broker-side verification failure: If $C_{\mathrm{fw}}$ is admitted at time $t_{1}$ , but before execution at $t_{2}$ the active revocation epoch advances from $\rho_{1}$ to $\rho_{2}$ , the broker detects that $\rho_{\mathrm{rev}}=\rho_{1}<\rho_{2}$ and aborts the execution. Similarly, if the broker’s drift-check predicate $\Phi_{\mathrm{drift}}(E_{\mathrm{fw}},S_{t_{2}})$ evaluates to false (indicating that the target security group’s state has drifted since admission), the execution is aborted.

4 Certificate-Bound Admission Pipeline

The admission pipeline evaluates proposals, scores consequence, routes them to certification paths, and issues signed certificates that the broker can verify before execution.

4.1 Formal Admission Interface

Let $C$ be an execution contract, $E$ be the evidence bound to that contract, $S_{t}$ be the observed infrastructure state at time $t$ , $\mathcal{R}(C)$ be the consequence score, and $\mathcal{L}(C)$ be the assigned autonomy level; the certificate $\Omega$ is the resulting admission artifact. The airlock implements the admission function:

\operatorname{Airlock}(C,E,S_{t})\rightarrow\Omega\;\;\text{or}\;\;\mathrm{Reject}.

The function returns $\Omega$ only when the proposal satisfies the required certification path. Otherwise, it returns $\mathrm{Reject}$ . To prevent time-of-check to time-of-use (TOCTOU) risks, the broker evaluates a state drift-check predicate $\Phi_{\mathrm{drift}}(E,S_{t_{\mathrm{exec}}})$ immediately before executing the mutation, verifying that the live state at execution time $S_{t_{\mathrm{exec}}}$ has not deviated from the bound evidence $E$ beyond policy safety margins.

4.2 Consequence Scoring and Autonomy Levels

The consequence score $\mathcal{R}(C)$ quantifies the operational, compliance, and security risk of the contract:

\mathcal{R}(C)=\alpha B(C)+\beta P(C)+\gamma D(C)+\delta I(C)+\eta U(C).

(1)

The terms measure specific risk vectors:

•

Blast radius $B(C)$ : Affected resources, services, and network zones.
•

Privilege expansion $P(C)$ : Elevated credentials or access scopes required.
•

Data sensitivity $D(C)$ : Exposure risk of regulated data or PII.
•

Irreversibility $I(C)$ : Computational cost and difficulty of rolling back.
•

Uncertainty $U(C)$ : Lack of telemetry or confidence in evidence.

To tune the weights ( $\alpha,\beta,\gamma,\delta,\eta$ ), security engineers run an offline adjudication loop replaying historical incidents to tune classifications against expert consensus, biasing weights to escalate rather than under-classify when uncertainty $U(C)$ is high.

From the consequence score, active policy $\mathcal{P}_{\mathrm{inst}}$ , and jurisdictional constraints $\mathcal{J}$ , the airlock maps the contract to an autonomy level:

\mathcal{L}(C)=\Lambda(\mathcal{R}(C),\mathcal{P}_{\mathrm{inst}},\mathcal{J}).

Table 5 defines the consequence-aware autonomy levels.

Table 5: Consequence-aware autonomy levels.

Level	Class	Example Action	Assurance and Validation Obligation
$L_{0}$	Advisory	Diagnostic query	Read-only; agent cannot compile or execute.
$L_{1}$	Manual Draft	Batch restarts	Agent drafts contract; manual operator submission required.
$L_{2}$	Bounded Autonomous	Stateless restart	Reversible, local action; OPA policy checks.
$L_{3}$	Governed Autonomous	Autoscaling or rollback	Moderate blast radius; requires SQA and evidence binding.
$L_{4}$	High-Stakes Autonomous	Security group, log export	High blast radius; SQA, human multi-sig, revocation checks.
$L_{5}$	Prohibited	Deleting audit logs	Categorically denied autonomous execution.

4.3 Certification Paths

After assigning $\mathcal{L}(C)$ , the airlock routes the contract through a certification path:

\operatorname{CertPath}(C)\rightarrow\left\{\begin{array}[]{@{}l@{}}\mathrm{Reject},\ \mathrm{PolicyOnly},\\ \mathrm{LightweightQuorum},\ \mathrm{SQA},\\ \mathrm{SQAPlusHuman},\ \mathrm{Prohibited}\end{array}\right\}.

(2)

The selected path determines the validation requirements:

•

PolicyOnly: Requires a valid contract, evidence digest, and a successful policy engine (e.g. OPA) check.
•

LightweightQuorum: Requires multiple automated validator checks (syntax checks, telemetry checks, and dependency checks).
•

SQA: Requires Semantic Quorum Assurance and validator diversity constraints.
•

SQAPlusHuman: Requires SQA, critical-archetype veto checks, and operator multi-signature.
•

Prohibited / Reject: Categorically disallowed or failed admission attempts.

Decoupled SQA Oracle.

To keep the SQA validator mesh outside SAB’s trusted computing base, validator nodes run as external, isolated processes. They receive a read-only input bundle containing $C$ and $E$ , and hold no credentials. Each validator $i$ independently evaluates the proposal and generates a cryptographically signed attestation:

\Gamma_{i}=\mathrm{Sign}(\mathrm{key}_{i},C\mathbin{\|}H(E)\mathbin{\|}\mathrm{vote}_{i}\mathbin{\|}\mathrm{metadata}_{i}).

The airlock verifies validator signatures and checks diversity, quorum size, and veto requirements over the aggregated attestations $\Gamma_{\mathrm{SQA}}=\{\Gamma_{1},\Gamma_{2},\dots\}$ . Even under compromise, individual validators cannot forge approvals without private keys, nor can they directly mutate infrastructure.

4.4 Admission Predicate

To admit a contract, the airlock evaluates the admission predicate:

	$\displaystyle\operatorname{Admit}(C)={}$	$\displaystyle\Phi_{\mathrm{policy}}(C)\land\Phi_{\mathrm{evidence}}(C,E)$		(3)
		$\displaystyle\land\Phi_{\mathrm{assurance}}(C)\land\Phi_{\mathrm{authority}}(C).$		(3)

•

$\Phi_{\mathrm{policy}}(C)$ : Policy permits the action class and target resource.
•

$\Phi_{\mathrm{evidence}}(C,E)$ : Required evidence exists, is fresh, and has trusted provenance.
•

$\Phi_{\mathrm{assurance}}(C)$ : The selected assurance path completed successfully.
•

$\Phi_{\mathrm{authority}}(C)$ : The broker identity, validity window, and revocation state are valid.

4.5 SAB Certificates

When the admission predicate is satisfied, the airlock issues the certificate $\Omega$ :

$\displaystyle\Omega=\langle$	$\displaystyle C,\ H(E),\ \mathcal{R}(C),\ \mathcal{L}(C),$	(4)
	$\displaystyle\operatorname{CertPath}(C),\ \Gamma_{\mathrm{SQA}},$
	$\displaystyle\Gamma_{\mathrm{human}},\ ID_{\mathrm{exec}},\ T_{\mathrm{valid}},$
	$\displaystyle P_{\mathrm{ver}},\ \rho_{\mathrm{rev}},\ \sigma_{\mathrm{SAB}}\rangle.$

where $ID_{\mathrm{exec}}$ is the execution identity, $T_{\mathrm{valid}}$ is the validity window, $P_{\mathrm{ver}}$ is the policy version, $\rho_{\mathrm{rev}}$ is the revocation epoch, and $\sigma_{\mathrm{SAB}}$ is the airlock signature. After execution, the broker emits a separate outcome record:

O=\langle\Omega,\ \mathrm{status},\ t_{\mathrm{exec}},\ H(S_{\mathrm{post}}),\ \mathrm{result},\ \sigma_{\mathrm{broker}}\rangle,

(5)

where $\sigma_{\mathrm{broker}}$ is the broker signature. The ledger stores $O$ and binds it to $\Omega$ .

Properties.

•

Replayability: The replay service reconstructs the decision using $C$ , $H(E)$ , $\mathcal{R}(C)$ , $\mathcal{L}(C)$ , $\operatorname{CertPath}(C)$ , $P_{\mathrm{ver}}$ , and $\rho_{\mathrm{rev}}$ .
•

Non-repudiation: Approvals are cryptographically bound to $C$ and $H(E)$ , preventing reuse or extraction.
•

Temporal Validity: The certificate expires when the validity window $T_{\mathrm{valid}}$ closes or evidence freshness drops.
•

Outcome Binding: Post-execution observations are bound to $\Omega$ through outcome record $O$ , not embedded in the original certificate.

Table 6 compares SAB certificates with standard change-auditing artifacts.

Table 6: Comparison of certification and auditing artifacts.

Artifact	Records	Limitation
Audit log	Executed event, actor, time	Lacks admission context.
Policy decision record	Policy engine query result	Omits semantic evidence and validator votes.
SQA quorum	Validator votes and reasoning	Lacks system-level context and revocation epoch.
SAB certificate	Full pre-execution admission state	Outcomes require ledger-bound records and replay overhead.

5 Revocation, Suspension, and Incident Response

Certificate-bound authority must be withdrawable. An incident may require immediate suspension across an action class, domain, validator, or execution path, so revocation and suspension are part of the control plane rather than an external operations procedure.

5.1 Revocation Triggers and Withdrawal Mechanisms

The controller monitors signals from the ledger, validator mesh, policy engine, broker, and compliance tools. Triggers apply globally or to a scoped domain $d$ (action class, service, region), as shown in Table 7.

Table 7: System revocation triggers.

Trigger Class	Primary Signal	Institutional Consequence
Validator Degradation	Validator agreement rate or accuracy drops below threshold	Downgrade autonomy level, quarantine validator
Evidence Outage/Staleness	Telemetry streams, incident tickets, or configs become stale/unavailable	Airlock rejects new proposals in affected domain
Policy Drift/Rollback	Updated policy version $P_{\mathrm{ver}}$ published or rolled back	Invalidate active certificates, re-evaluate paths
Environmental Outage	System monitor detects live intrusion or active severity-1 outage	Advance revocation epoch $\rho_{\mathrm{rev}}$ , suspend domain
Compromised Identity	Suspected leak of signing keys or broker execution credential	Revoke pending certificates, rotate execution keys
Operational Anomaly	Operator triggers emergency hold or manually rolls back change	Immediate human-only fallback, cancel active certificates

When a trigger fires, the controller applies one or more scoped withdrawal mechanisms (Table 8). Withdrawal is enforced at admission (the airlock consults suspension state before emitting $\Omega$ ) and execution (the broker checks revocation state before execution).

Table 8: Authority withdrawal mechanisms.

Mechanism	Enforcement Point	Operational Effect
Level Downgrade	Consequence Scorer	Caps maximum permitted autonomy level $\mathcal{L}(C)$ , routing actions to human approval.
Domain Suspension	Assurance Airlock	Categorically rejects all future proposals matching the suspended domain $d$ .
Certificate Invalidation	Sovereign Broker	Broker rejects execution of unexecuted certificates matching the revoked class.
Validator Quarantine	SQA Mesh	Removes compromised or stale validators from participating in quorum validation.
Identity Rotation	Sovereign Broker	Rotates execution credentials, invalidating certificates bound to stale keys.
Human-only Fallback	Assurance Airlock	Bypasses automated routing, requiring manual approval for all actions.
Replay Sweep	Audit Service	Re-evaluates recent certificates under updated policy to identify active safety violations.

If the broker cannot obtain fresh revocation state (e.g., due to a network partition), it fails closed, rejecting all high-consequence ( $L_{3},L_{4}$ ) executions. Revocation updates are signed, versioned, and distributed as monotonically increasing epochs. Deployments that cache revocation state must bound cache lifetime and record the revocation epoch used for every broker decision.

5.2 Formal Semantics

Let $d$ be a domain or action class. Suspension is a future-admission invariant:

\operatorname{Suspended}(d)\implies\forall C\in d,\;\operatorname{Admit}(C)=\mathrm{False}.

(6)

The invariant is evaluated before path selection, preventing suspended domains from being bypassed.

Certificate revocation is an execution invariant:

\operatorname{Revoked}(\Omega)\implies\operatorname{Execute}(\Omega)=\mathrm{False}.

(7)

Equation 7 applies to pending or unexecuted certificates. Completed executions cannot be undone; instead, the ledger marks $\Omega$ for post-hoc response and compensating actions.

5.3 Incident Response Workflow and Override Risk

During an incident, the controller identifies the affected domain $d$ , installs a suspension or downgrade rule, and forces new proposals to human-only escalation. The broker invalidates pending certificates in $d$ , while the replay service sweeps recent executions to link outcomes back to context.

Emergency overrides that bypass SAB remove the evidence and identity constraints required for accountability. Emergency operations should instead pass through SAB under stricter constraints: narrow scopes, short-lived certificates, and mandatory post-hoc replay. If an override is abused, the controller revokes pending certificates, quarantines keys, and replays the executions under standard policy.

6 Case Studies

The following case studies trace SAB’s certificate-bound admission boundary across four operational settings (Table 9).

Table 9: Case-study summary.

Case	Proposed Action	Risk	Certification Path	Outcome
Enterprise Cloud Operations	Open security-group rule during outage	Public exposure, privilege expansion, incident masking	SQAPlusHuman or Reject	Vetoed, escalated, or executed through broker with $\Omega$
Generated-Code Deployment	Deploy agent-generated patch	Security regression, failed rollout, irreversible change	SQA or SQAPlusHuman	Certified artifact deployed with rollback and outcome evidence
Sovereign AI Cloud	Execute plan proposed by external foundation models	Jurisdictional leakage, external execution authority	SQAPlusHuman under local policy	Local certificate $\Omega$ and sovereign execution identity required
Regulated Data Workflow	Export logs or datasets for diagnosis	PII exposure, retention violation, cross-boundary transfer	SQAPlusHuman, Reject, or Prohibited	Redacted export, human approval, or denial with replay record

6.1 Case Study 1: Enterprise Cloud Operations

Scenario and Path.

An operations agent $\mathcal{A}_{\text{ops}}$ detects network latency and proposes adding an ingress rule in a security group to bypass a blocked gateway. The compiler normalizes this into contract $C_{\mathrm{fw}}$ , which the evidence collector binds to service-health metrics, incident ticket ID, and security-group configurations. Because modifying firewall rules alters production network posture, the scorer assigns a high blast-radius score, mapping $C_{\mathrm{fw}}$ to autonomy level $L_{4}$ . If the proposed source range is public (0.0.0.0/0), the policy engine routes the contract to SQAPlusHuman. SQA validators review the proposal; a security-archetype validator holds veto power. If approved, the airlock signs certificate $\Omega_{\mathrm{fw}}$ . The broker verifies $\Omega_{\mathrm{fw}}$ and evaluates the state drift-check predicate $\Phi_{\mathrm{drift}}$ before executing the API call.

Failure Prevented and Artifact.

This path is intended to stop a latency diagnosis from becoming an overbroad network exposure or an unevidenced API call. The ledger records the contract, evidence digest $H(E)$ , validator signatures, and broker decision, allowing post-hoc verification that the change was semantically justified.

6.2 Case Study 2: Generated-Code Deployment

Scenario and Path.

A software engineering agent generates a bug patch and proposes deploying it to a Kubernetes cluster. The proposal is compiled into contract $C$ , which includes source diffs, target service, and rollback plans. The collector binds $C$ to build provenance, test results, and vulnerability scans. The scorer distinguishes low-risk canary rollouts (routed to SQA) from privileged code changes or database migrations (escalated to SQAPlusHuman). The airlock evaluates PDD (Protocol-Driven Development) invariants, verifying that the generated patch matches interface definitions. The broker deploys only certified manifests and monitors rollout health, recording outcomes.

Failure Prevented and Artifact.

The broker blocks unverified code from reaching production by rejecting proposals with missing rollbacks, failing tests, or unapproved dependencies. The certificate records all build and test evidence, enabling post-deployment replay.

6.3 Case Study 3: Sovereign AI Cloud

Scenario and Path.

A sovereign operator uses a mixed model stack (external and domestic models) for planning. An external model proposes restarting services and exporting diagnostic telemetry. The local contract compiler decomposes the plan into separate contracts, and the collector binds local evidence (data-residency tags, configuration states). Sensitive context is held locally. The local policy engine enforces jurisdictional rules, rejecting contracts that violate residency policies or cross borders. Valid proposals are reviewed by local SQA validators, and high-stakes actions route to human operator multi-signature. The local broker executes approved contracts using domestic identities.

Failure Prevented and Artifact.

The local broker path prevents external planning models from directly acquiring execution authority. External reasoning systems cannot obtain credentials, bypass jurisdictional policies, or execute mutations without local admission and replay. Certificate $\Omega$ records the domestic validation path and broker identity, making sovereign execution verifiable.

6.4 Case Study 4: Regulated Data Workflow

Scenario and Path.

During an incident, an agent proposes exporting database logs for offline analysis. The logs may contain PII or regulated customer records. The compiler translates this into a data-export contract detailing source tables, destinations, and redaction rules. The collector binds schema metadata and legal-hold status. The scorer assigns a high data-sensitivity term, raising uncertainty if payloads are unclassified. If policy forbids autonomous export of the data class, the path is Prohibited or Reject. If export is permitted with redaction, the airlock enforces a minimization contract, routing the redacted export through compliance validators and human multi-signature.

Failure Prevented and Artifact.

This path is intended to prevent autonomous leakage of regulated data under the guise of diagnostic utility. The airlock rejects overbroad exports or destinations lacking compliance controls, binding redaction attestations and human approvals to the export outcome.

7 Evaluation Methodology and Targets

We evaluate SAB’s control-plane feasibility and performance tradeoffs with a Go prototype and three workload families.

7.1 Prototype Implementation

Our prototype of SAB in Go contains a stateless admission service, Protocol Buffer and JSON Schema contract definitions, an evidence-digest generator extracting Kubernetes and Active Directory logs, OPA/Rego policy evaluation, Ed25519 signing for $\Omega$ , a PostgreSQL ledger, prototype target adapters, revocation epoch distribution, and a replay command-line interface. The target adapters simulate actions on Kubernetes and AWS/GCP APIs.

7.2 Evaluation Methodology and Baselines

We compare SAB against standard baseline security controls (Table 10) using 500 execution contracts: 200 cloud-infrastructure contracts (security groups, scaling, restarts, credential rotations), 150 CI/CD contracts (rollbacks, patch deployments, manifest updates), and 150 data-governance contracts (migrations, log exports, resource deletion).

Table 10: System baselines and comparison matrix.

Baseline	Mechanism	Assurance Limitation
Static IAM/Policy	Checks static roles (e.g., AWS IAM, OPA)	Context-unaware; permits structurally valid but semantically unsafe actions.
Human Approval Only	Routes all proposals to human change queue	Introduces high latency; prone to rubber-stamping due to review fatigue.
Audit Logging Only	Records actions to security log after execution	No runtime prevention; cannot halt unauthorized actions before they occur.
LLM Self-Review	Proposing model critiques its own output	Susceptible to hallucinations and adversarial jailbreaks; no institutional control.
SQA without SAB	Quorum validation of individual proposed actions	Lacks system-level context: no evidence binding, revocation, or execution decoupling.
SAB (No Revocation)	Airlock admission without broker revocation checks	Vulnerable to stale certificates during outages or policy changes.
SAB (No Routing)	Fixed high-stakes certification path for all actions	High operational friction for low-risk actions; excessive latency.

7.3 Experimental Setup

Table 11 reports preliminary measurements from a single-node local workstation testbed, not a production cloud deployment. The topology contains one admission service, one policy engine, one PostgreSQL-backed ledger, one revocation service, three broker processes, three target-service adapters, and two workload-driver processes. The 500 contracts are independently labeled by three expert systems/security engineers for expected admission decision, certification path, and safety outcome, with disagreements resolved by majority vote and discussion. We ran five full-workload trials (2,500 admission attempts). Live local measurements include admission, OPA evaluation, Ed25519 signing, ledger writes, broker verification, revocation-epoch propagation, and SQA latency using a three-validator mesh with 2-of-3 quorum, Ed25519-signed attestations, and a security-archetype veto. Kubernetes, Active Directory, and cloud-control-plane evidence is replayed from captured or synthetic traces; target mutations are simulated by service adapters. Unless a row states otherwise, latency results are p50; SQA admission reports p50/p95, and rate metrics are percentages.

7.4 Target Metrics and Expected Tradeoffs

Table 11 reports the evaluation targets and measured results, grouped into latency overheads, routing accuracy, and storage metrics.

Table 11: Preliminary, workload-dependent evaluation results. Metric names specify p50, p95, mean, or percentage.

Metric	Prototype Target	Measured Result	Expected Tradeoffs
PolicyOnly admission latency (p50)	$<5$ ms	1.84 ms	Fast path for low-consequence rules (OPA check).
SQA admission latency (p50/p95)	report p50/p95 by validator config	185 ms / 380 ms	Parallel validator queries.
Broker verification latency (p50)	$<1$ ms	0.15 ms	Signature and active revocation epoch check.
Revocation propagation latency (p95)	$<100$ ms	12.4 ms	Delay to propagate new $\rho_{\mathrm{rev}}$ to local brokers.
Schema replay completeness (%)	$100\%$	$100\%$	Fraction of ledger entries with replayable schema context.
Unsafe admission rate (%)	lower is better	0.4%	Admissions that violate safety policy or human intent.
Routing accuracy (%)	agreement with expert-labeled oracle	96.8%	Agreement of consequence scorer with expert-labeled paths.
Human approval reduction (%)	$>60\%$	68.2%	Actions automated compared to a human-only queue.
Certificate storage overhead (mean)	$<10$ KB / action	3.4 KB	Space required on the evidence ledger per contract.

These measurements indicate control-plane feasibility on a single workstation, not production cloud performance. SAB trades latency for assurance, additional control-plane components for replayability, and occasional false escalation for lower under-classification risk. SQA and manual reviews add latency; evidence collection and ledger writes add storage and processing overhead while supporting audit and replay.

8 Security Analysis

SAB mitigates authorization risks by decoupling proposal generation from execution through certificate-bound admission. The analysis defines the threat model, examines ten threat categories, and states the authority-boundary invariants.

8.1 Threat Model and TCB Boundaries

The threat model assumes that the agent runtime may be compromised (e.g., via prompt injection or credential theft) or faulty. Upstream evidence sources, SQA validators, and operators may also fail. We assume the integrity of the core control-plane components. Table 12 maps the TCB boundaries.

Table 12: Trusted and untrusted components in the SAB threat model.

Component	TCB Status	Failure Mitigation
Agent runtime	Untrusted proposer	Airlock interposition, rate limiting
SQA validators	Untrusted external oracles	Diversity, vetoes, signing keys
Evidence collector	Outer engine: untrusted; signature verifier: trusted	Freshness, cryptographic provenance verification
Policy / revocation	Trusted control plane	Versioned policies, epoch gating
Airlock and broker	Trusted control plane	Hardened runtimes, key rotations
Evidence ledger	Trusted control plane	Cryptographic chains, backups

8.2 Threat Analysis

Table 13 maps SAB’s controls and residual risks across ten threat categories: agent compromise, validator failure, policy degradation, operational bypass, and authority creep.

Table 13: Threats, controls, and residual risks.

Threat	SAB Control	Residual Risk
Malicious proposer agent	Decouples proposal from execution; requires contract $C$ and signed certificate $\Omega$ .	Bad proposals may consume validator and operator review capacity.
Compromised validator	Enforces validator diversity and critical archetype vetoes; signature quarantine.	A compromised validator can approve unsafe actions until quarantined.
Correlated validator failure	Diversity rules across model architectures and prompt structures.	Shared latent training biases may still fail to catch novel semantic errors.
Stale/manipulated evidence	Binds evidence digests $H(E)$ ; enforces freshness and provenance checks.	Upstream telemetry sources may report inaccurate state.
Forged/replayed certificate	Sovereign broker verifies $\sigma_{\mathrm{SAB}}$ with $pk_{\mathrm{SAB}}$ , validity window $T_{\mathrm{valid}}$ , and epoch $\rho_{\mathrm{rev}}$ .	Compromise of airlock private key requires key rotation.
Policy downgrade attack	Binds signed, versioned policy releases $P_{\mathrm{ver}}$ ; enforces monotone routing.	Authorized policy updates may be overly permissive.
Human rubber-stamping	Structured evidence presentation; role separation; approval trends analysis.	Humans may still sign off negligent changes under operational pressure.
Emergency-mode abuse	Scoped emergency authority; short-lived certificates; mandatory post-hoc replay.	Social or operational abuse of emergency procedures.
Jurisdictional bypass	Location checks in policies; regional checks at the sovereign broker.	Incomplete or inaccurate asset tagging.
Autonomy creep	Autonomy level caps $\mathcal{L}(C)$ ; periodic ledger audits.	Slow normalization of deviance.

8.3 Authority-Boundary Invariants

The threat analysis yields four invariants for high-stakes agentic infrastructure:

•

Proposal is not authority: Agent proposals must compile to $C$ and be admitted by the airlock before execution.
•

Approval is evidence-bound: Validator and human approvals must be cryptographically bound to $C$ and $H(E)$ .
•

Execution is sovereign: Infrastructure changes must execute through $ID_{\mathrm{exec}}$ at the broker, never via agent credentials.
•

Authority is revocable and replayable: Certificates, validators, and autonomy levels must be revocable; decisions must be reconstructable from the ledger.

Under the stated trust assumptions, admitted autonomous actions are bounded by contract, evidence, certification path, broker verification, and revocation state, and remain auditable through the ledger.

9 Discussion and Limitations

SAB limits when model reasoning can transition into system mutations. It does not guarantee that planning models are correct or aligned. The main operational questions concern review granularity, proportional assurance, sovereignty, composition, and residual risk.

9.1 Operational Discussion

Human Oversight and the Unit of Review.

Traditional human-in-the-loop controls are prone to fatigue and rubber-stamping, especially when operators are presented with raw reasoning transcripts or unconstrained tool calls. SAB changes the unit of review: operators approve a normalized contract $C$ , cryptographic evidence digest $H(E)$ , consequence level $\mathcal{L}(C)$ , and validator outputs. Approval is tied to a concrete contract, evidence snapshot, and policy context.

Proportional Assurance.

Autonomy should be granted dynamically. The same agent may restart a stateless container (routed to PolicyOnly), but require SQA for rollbacks, human multi-signature for firewall modifications, and be prohibited from deleting databases. Consequence-aware routing directs latency and validator overhead to operations where mistakes have material operational, legal, or security outcomes.

Operational Sovereignty.

SAB defines sovereignty operationally: the institution must maintain local control over policy evaluation, evidence retention, signing keys, revocation state, broker identity, and execution authority. External foundation models are restricted to proposal generation; they hold no production credentials and cannot bypass local certification and revocation.

Composition with OpenKedge Layers.

In the OpenKedge stack, OpenKedge provides bounded mutation interfaces; SAL separates planning from execution; VAI provides execution identity and evidence chains; PDD validates generated code against schemas and protocol obligation invariants; and SQA evaluates action-level semantics. SAB composes these layers into a system-level admission boundary.

9.2 System Limitations

SAB’s security claims depend on several operational and design boundaries:

•

Consequence Scoring Errors: Consequence classification ( $\mathcal{R}(C)$ ) is an operational classifier. It may underestimate risk when dependencies are hidden, or create bottlenecks by over-estimating risk and over-escalating low-risk tasks to human review.
•

Evidence Staleness and Incompleteness: SAB relies on external telemetry and log sources. Stale or incomplete evidence can cause the airlock to certify actions under assumptions that no longer hold.
•

Validator and Policy Calibration: SQA relies on validators whose reliability degrades after model, prompt, or workload changes. Additionally, SAB cannot correct policies that are incomplete, stale, or contradictory.
•

Prototype Generality: Prototype results are workload-dependent and should not be interpreted as general performance guarantees.
•

Emergency Bypass Risk: Outages may force operators to bypass normal assurance rules. While SAB records emergency overrides and enforces short validity windows, emergency access remains a critical threat vector.
•

Control-Plane Attack Surface: SAB introduces high-value security targets (signing keys, the ledger, and the broker). Compromise of these trusted components invalidates all security properties.

These limitations motivate an incremental deployment strategy, starting with high-consequence domains (IAM, network policy, and deployments) and expanding as evidence pipelines and validator models mature.

10 Related Work

We position SAB relative to systems security and safety mechanisms that partially address proposal admission, execution control, and auditability:

Runtime Assurance.

Safety-monitor architectures (e.g., Simplex) supervise autonomous controllers and interpose fallback modes to prevent physical safety envelope violations [simplex-architecture, bloem-shield-synthesis, phan-neural-simplex]. The same interposition pattern can be applied to cloud and software control planes, where the monitored objects are discrete configuration changes, evidence freshness, and policy epochs.

Assurance Cases.

Dependability and safety cases structure claims, assumptions, and evidence for pre-deployment system certification [rushby-assurance]. SAB brings a similar claim-evidence structure into runtime admission: approvals bind to specific contracts, evidence snapshots, and revocation state.

Distributed Consensus.

Paxos, Raft, and PBFT protocols ensure ordered agreement on state transitions across replicated nodes [lamport-paxos, ongaro-raft, castro-pbft]. They do not verify whether those transitions are semantically correct or compliant with institutional rules. SAB treats authority as a separate admission question.

Access Control (IAM) and Policy Engines.

Zero trust, IAM, and policy-as-code engines (e.g., OPA) enforce static permissions at API endpoints [rose-zerotrust, aws-iam, opa-rego, kubernetes-admission]. They are necessary inputs, but they authorize identities rather than validating autonomous proposals against live evidence. SAB adds proposal certification before broker execution.

AI Safety and Tool Governance.

Agent safety tools (filters, sandboxes, critiques) reduce dangerous outputs from planning models [yao-react, inan-llamaguard]. They run as unprivileged utilities and cannot serve as control-plane security boundaries. In SAB, safety models can act as untrusted validators in the SQA mesh, but authority is released only by the airlock and broker.

Audit Logs and Evidence.

Audit trails and transparency logs capture events for post-incident forensics [w3c-prov]. Certificate $\Omega$ binds pre-execution admission evidence, policy versions, and validator outputs; ledger-bound outcome record $O$ links subsequent observations to $\Omega$ , making admission decisions replayable. Audit logs record executed events; the SAB certificate records the admission context that authorized execution.

Sovereign AI Governance.

Sovereignty frameworks dictate local resource control and residency rules [nist-ai-rmf]. In SAB, those requirements appear as local policy evaluation, domestic validator meshes, and regional execution brokers, so external or mixed planning models cannot bypass local control boundaries.

Existing mechanisms authorize identities, replicate decisions, monitor runtime behavior, or record events. SAB combines these mechanisms into an admission boundary that decides when autonomous proposals may become execution authority.

11 Conclusion

Agentic infrastructure needs an authorization boundary between plausible model-generated proposals and privileged production mutations. The Sovereign Assurance Boundary (SAB) addresses that boundary through certificate-bound admission: agent outputs compile into execution contracts $C$ ; admission decisions bind those contracts to evidence digests $H(E)$ , certification paths, policy versions, revocation epochs, and scoped broker identities; and execution proceeds only after the sovereign broker verifies the certificate $\Omega$ .

The paper’s core claim is intentionally narrow: SAB is not a replacement for IAM, formal verification, consensus, or human review, but a runtime admission path that composes those mechanisms so autonomous reasoning alone cannot become execution authority. The prototype and case studies show that the airlock–broker model can be implemented and measured across cloud operations, generated-code deployment, sovereign AI cloud settings, and regulated data workflows, while the limitations identify the remaining dependencies on evidence freshness, policy quality, validator calibration, and trusted control-plane components. SAB therefore treats delegated execution authority as certificate-bound, evidence-bound, revocable, and replayable, rather than as an ambient permission.

Sovereign Assurance Boundary: Certificate-Bound Admission for Agentic Infrastructure

Abstract

1 Introduction

Running Example.

What SAB Is Not.

2 Background, Motivation, and System Requirements

2.1 Background and Gap

Runtime Assurance.

Assurance Cases.

Access Control (IAM) and Policy Engines.

Consensus and Byzantine Fault Tolerance.

AI Safety and Tool Governance.

2.2 Motivation: Why Action Certification Is Not Enough

2.3 System Requirements

3 SAB Architecture and Trust Model

3.1 System Model and Assumptions

3.2 SAB Invariant Properties

3.3 Proof Sketch

Goal.

Assumptions.

Argument.

3.4 Broker Verification Checks

3.5 Worked Admission Example

4 Certificate-Bound Admission Pipeline

4.1 Formal Admission Interface

4.2 Consequence Scoring and Autonomy Levels

4.3 Certification Paths

Decoupled SQA Oracle.

4.4 Admission Predicate

4.5 SAB Certificates

Properties.

5 Revocation, Suspension, and Incident Response

5.1 Revocation Triggers and Withdrawal Mechanisms

5.2 Formal Semantics

5.3 Incident Response Workflow and Override Risk

6 Case Studies

6.1 Case Study 1: Enterprise Cloud Operations

Scenario and Path.

Failure Prevented and Artifact.

6.2 Case Study 2: Generated-Code Deployment

Scenario and Path.

Failure Prevented and Artifact.

6.3 Case Study 3: Sovereign AI Cloud

Scenario and Path.

Failure Prevented and Artifact.

6.4 Case Study 4: Regulated Data Workflow

Scenario and Path.

Failure Prevented and Artifact.

7 Evaluation Methodology and Targets

7.1 Prototype Implementation

7.2 Evaluation Methodology and Baselines

7.3 Experimental Setup

7.4 Target Metrics and Expected Tradeoffs

8 Security Analysis

8.1 Threat Model and TCB Boundaries

8.2 Threat Analysis

8.3 Authority-Boundary Invariants

9 Discussion and Limitations

9.1 Operational Discussion

Human Oversight and the Unit of Review.

Proportional Assurance.

Operational Sovereignty.

Composition with OpenKedge Layers.

9.2 System Limitations

10 Related Work

Runtime Assurance.

Assurance Cases.

Distributed Consensus.

Access Control (IAM) and Policy Engines.

AI Safety and Tool Governance.

Audit Logs and Evidence.

Sovereign AI Governance.

11 Conclusion

References