Sovereign Assurance Boundary: Certificate-Bound Admission for Agentic Infrastructure
Abstract
Agentic infrastructure creates a control-plane authorization problem: non-deterministic reasoning systems may propose high-stakes mutations to production resources, but existing access controls (IAM), policy engines, consensus protocols, and audit logs either enforce static permissions or record actions only after execution. This paper introduces the Sovereign Assurance Boundary (SAB), a certificate-bound runtime admission boundary for autonomous execution authority in agentic infrastructure. SAB intercepts agent proposals at an assurance airlock, compiles them into typed execution contracts , binds those contracts to cryptographic evidence digests and policy versions, and routes them through consequence-aware certification paths. Successful admission emits a signed Sovereign Assurance Certificate () that is valid only for a scoped execution identity, revocation epoch, and validity window. A sovereign execution broker verifies the certificate and performs fresh pre-execution revocation and drift checks before invoking infrastructure APIs. We describe the airlock–broker architecture, formalize the admission and revocation invariants, and report preliminary feasibility measurements from a Go prototype over 2,500 admission attempts. The resulting model prevents autonomous reasoning alone from mutating state and turns delegated execution authority into a certificate-bound, evidence-bound, revocable, and replayable runtime artifact.
1 Introduction
Large language model (LLM) agents are increasingly connected to operational workflows rather than advisory interfaces, allowing them to scale resources, rotate credentials, deploy code, and export diagnostic data [yao-react, schick-toolformer]. In this setting, agentic infrastructure needs an admission mechanism that decides when a proposed action may become execution authority. Granting non-deterministic reasoning processes direct mutation rights over production control planes creates a safety and authorization risk: agents can become the proximate cause of physical or digital state transitions. Without a runtime admission boundary, individually plausible actions can accumulate into availability failures, data exposure, policy bypasses, or loss of forensic context.
Sovereign Assurance Boundary (SAB) interposes between agentic reasoning and high-stakes infrastructure mutation. We use “sovereign” operationally: the institution retains local control over policy evaluation, evidence retention, signing keys, revocation state, broker identity, and execution authority. It is not an agent monitor or a general governance process; it defines a certificate-bound admission boundary where an agent proposal is not treated as ambient authority. Under SAB, a proposal is eligible for execution only after it is compiled into a typed execution contract , bound to an evidence digest and policy version, assigned a consequence score and autonomy level , routed through a consequence-aware certification path , encoded as a signed certificate , checked against active revocation state, and verified by a sovereign execution broker immediately before mutating the target system. The mechanism converts autonomous execution authority from an ambient permission into a revocable, evidence-bound runtime artifact.
SABseparates action-level and system-level assurance. Semantic Quorum Assurance (SQA) [sqa-paper] serves as the action-level certification primitive. While SQA validates individual proposed actions, SAB composes SQA with evidence, policy, human approval, sovereign execution identity, revocation, and replay. SAB is independent of the SQA implementation: any validator mesh returning signed action-level attestations can instantiate . Action-level checks address unsafe individual mutations; system-level assurance binds execution to active policy, evidence, revocation state, and broker-enforcement paths.
Running Example.
Consider an operations agent, , tasked with resolving a service degradation. To restore reachability and diagnose the issue, proposes reconfiguring network security groups, scaling up compute nodes, rotating database credentials, rolling back a deployment, and exporting diagnostic logs. If an infrastructure stack lets that proposal directly invoke raw IAM permissions, autonomous execution of the sequence introduces systemic risk: the network change may expose private services, the resource expansion may exceed budget or quota constraints, credential rotation may trigger auth failures, the rollback may reintroduce vulnerabilities, and the log export may leak personally identifiable information (PII). The example shows why action-level checks alone do not bound institutional risk.
What SAB Is Not.
SABdiffers from existing controls in several ways:
-
•
Not an IAM replacement: IAM authorizes identities and requests; SAB certifies whether an autonomous proposal is semantically justified by evidence before a broker invokes those credentials.
-
•
Not a formal verification replacement: SAB does not prove arbitrary safety, but composes formal proofs (when available) as inputs to its evidence digests.
-
•
Not a consensus protocol: Consensus (e.g., Raft) agrees on state transitions; SAB evaluates whether a transition is institutionally authorized under current context.
-
•
Not only logging: Logs record events post-execution; SAB binds pre-execution evidence and policy versions in the certificate , then links outcomes through a ledger-recorded outcome record , making the admission process replayable.
-
•
Not only human-in-the-loop approval: SAB is a system-level runtime admission boundary that cryptographically binds human signatures to specific contracts and evidence snapshots, preventing generic rubber-stamping.
We make four contributions:
-
1.
Certificate-bound admission for autonomous infrastructure. We identify a control-plane authorization gap that arises when non-deterministic agents are allowed to propose production mutations, and define the Sovereign Assurance Boundary (SAB) as a runtime admission boundary that converts agent proposals into certificate-bound execution authority.
-
2.
A concrete airlock–broker architecture. We introduce the assurance airlock, typed execution contract , evidence digest , the certificate , and sovereign execution broker. Together, these mechanisms separate proposal, admission, and execution so that agent reasoning alone cannot directly mutate infrastructure state under the broker-enforced execution model.
-
3.
Consequence-aware certification and revocation. We formalize consequence scoring , autonomy levels , certification paths , monotone path selection, broker-side verification, and revocation epochs . These mechanisms support escalation, rejection, suspension, and revocation before execution.
-
4.
Prototype and feasibility evaluation. We implement an initial Go prototype with contract serialization, OPA/Rego policy checks, Ed25519 certificate signing, a PostgreSQL-backed append-only ledger, revocation-epoch distribution, broker verification, and replay tooling. Using 500 labeled execution contracts over 2,500 admission attempts, we report preliminary local feasibility measurements for admission latency, broker verification, revocation propagation, routing accuracy, replay completeness, and certificate overhead.
2 Background, Motivation, and System Requirements
2.1 Background and Gap
Securing agentic infrastructure requires integrating model-generated proposals with existing safety and security mechanisms.
Runtime Assurance.
Traditional runtime assurance separates an advanced controller from a monitor or fallback controller to prevent safety envelope violations [simplex-architecture, bloem-shield-synthesis]. In cloud and database administration, mutations are discrete, policy-laden configuration changes. SAB adapts the runtime-assurance pattern by evaluating the institutional admissibility of proposals rather than continuous physical invariants.
Assurance Cases.
Assurance cases present structured safety, security, or compliance arguments by linking claims to evidence [rushby-assurance]. They are usually static, design-time artifacts. SAB moves part of that structure into runtime admission by cryptographically binding telemetry, policies, and validator signatures to SAB certificates.
Access Control (IAM) and Policy Engines.
Zero trust and identity management systems (IAM) verify caller identities and enforce structural policy compliance (e.g., via OPA Rego) [rose-zerotrust, aws-iam, opa-rego]. While IAM dictates whether an identity can call an API, it cannot determine whether an autonomous proposal is semantically justified by live evidence. SAB treats access control as a prerequisite, interposing a semantic certification boundary before credentials are used.
Consensus and Byzantine Fault Tolerance.
Consensus protocols (e.g., Paxos, Raft, PBFT) replicate logs and enforce order across replicas [lamport-paxos, ongaro-raft]. They ensure agreement on state transitions but do not evaluate the semantic correctness or safety of those transitions. Semantic Quorum Assurance (SQA) certifies the semantic admissibility of individual proposed actions [sqa-paper]; SAB composes SQA into a system-level certification boundary.
AI Safety and Tool Governance.
Agent safety tools (sandboxing, output filtering, model self-critiques) constrain agent execution environments [yao-react, owasp-llm-top10]. These mechanisms lack cryptographically signed, institutionally bound execution credentials or durable audit trails. SAB wraps these tools as untrusted validators and binds their collective output into SAB certificates.
2.2 Motivation: Why Action Certification Is Not Enough
Semantic Quorum Assurance (SQA) verifies locally whether an action is semantically admissible (e.g., whether a security-group change fits an active outage ticket). High-stakes infrastructure also needs a system-level check: whether the agent operates under active policies, fresh evidence, and current revocation rules.
A sequence of individually approved steps (e.g., opening a firewall, expanding an IAM role, rolling back a service, and exporting logs) can collectively create high-consequence availability, compliance, or security failures. Table 1 summarizes how SAB composes these security controls. SAB mediates the certificate-bound transition from proposal to execution, checking that the certificate contains the evidence and validation epoch required by policy.
| Control | Question | Artifact | Failure if Missing |
|---|---|---|---|
| Policy engine | Is the caller authorized for this operation? | Authorization token | Static permissions are treated as ambient authority. |
| SQA | Is the proposed action semantically admissible? | Action certificate | Locally unsafe mutations are executed. |
| Human approval | Does an operator approve this step? | Approval record | Approvals do not bind evidence or policy epochs. |
| Audit logging | What happened after execution? | Event log | Accountability is reconstructed post-hoc. |
| SAB | May this proposal become execution authority under institutional rules? | SAB certificate | Individually safe actions cross into execution authority without system context. |
2.3 System Requirements
To govern high-stakes agentic infrastructure, SAB must satisfy seven system requirements:
- R1: Proposal/Execution Separation.
-
Agent runtimes must have no standing credentials or direct execution privileges, treating them purely as action proposers.
- R2: Contractual Admission.
-
Raw agent commands must compile into a strongly-typed execution contract defining target resources, preconditions, postconditions, and rollbacks.
- R3: Evidence-Bound Decisions.
-
All admissions must bind to a cryptographic digest of the telemetry and system state active at decision time.
- R4: Consequence-Aware Routing.
-
Proposals must route to validation paths dynamically based on a consequence score (blast radius, irreversibility, data risk, and uncertainty).
- R5: Sovereign Execution.
-
Admitted contracts must execute exclusively through an institutionally controlled broker that scopes credentials and validates the certificate .
- R6: Revocability.
-
The system must support scoped withdrawal, enabling key rotation, validator quarantine, and domain suspension.
- R7: Replayability.
-
SAB must record the certificate and, after execution, outcome record in an append-only log, with the ledger binding to .
3 SAB Architecture and Trust Model
The Sovereign Assurance Boundary (SAB) intercepts agent proposals before they can mutate production infrastructure. The system model separates unprivileged agent reasoning from privileged sovereign execution.
3.1 System Model and Assumptions
We model the agent runtime as an untrusted proposer. It may be compromised, misaligned, or faulty, and has no standing administrative credentials. Downstream infrastructure APIs accept state mutations exclusively from the sovereign execution broker.
The trusted computing base (TCB) consists of the assurance airlock, policy engine, evidence ledger, revocation controller, private signing keys, and sovereign execution broker. The contract compiler, evidence collector, consequence scorer, SQA validator mesh, and approval console run outside the TCB but are security-relevant; Table 2 maps each component to its trust boundary and core systems function. The evidence collector is untrusted: while SAB detects unsigned or malformed evidence, signed false evidence from compromised authorized sources remains a residual risk. Similarly, the SQA mesh is an external, untrusted oracle: the airlock policy engine verifies validators’ signed attestations instead of trusting validator internal execution.
SAB assumes authenticated component identities, a versioned policy store, bounded clock skew, append-only ledger integrity, and timely propagation of revocation state. Let denote the airlock’s public verification key and its signature on the certificate .
| Component | Trust Boundary | Core Systems Function |
|---|---|---|
| Agent Runtime | Untrusted / Outer | Proposes mutations as unprivileged JSON requests. |
| Contract Compiler | Security-relevant / Airlock | Normalizes natural-language or tool calls into structured contracts . |
| Evidence Collector | Security-relevant / Airlock | Assembles telemetry, dependency logs, and tickets to compute evidence digest . |
| Consequence Scorer | Security-relevant / Airlock | Estimates blast radius and data risk; determines autonomy levels . |
| Policy Engine | Trusted Control Plane | Enforces static rules and routes remaining ones to certification paths. |
| Assurance Airlock | Trusted Control Plane | Enforces path predicates, collects approvals, and issues signed SAB certificates . |
| SQA Validator Mesh | Fallible / Outer Mesh | Evaluates action semantics as decoupled, untrusted signing oracles. |
| Approval Console | Security-relevant / Outer | Presents structured contracts and evidence summaries for operator multi-signature. |
| Sovereign Broker | Trusted Control Plane | Verifies signatures and revocation status of ; executes scoped actions through IAM. |
| Evidence Ledger | Trusted Control Plane | Durably records SAB certificates and bound outcome records in an append-only cryptographic log. |
| Revocation Controller | Trusted Control Plane | Monitors system state; advances revocation epochs and manages suspensions. |
| Replay Service | Security-relevant / Outer | Re-evaluates ledger records to reconstruct and audit historical admission decisions. |
3.2 SAB Invariant Properties
Under the trust model in Section 3.1, SAB enforces five authority-boundary invariants:
- P1: Proposal-Execution Decoupling Invariant.
-
Let denote a state-changing operation on resource . Under the assumption that the infrastructure API accepts mutations only from the sovereign broker, no agent proposal can directly trigger without first being admitted by the airlock:
where is the compiled execution contract corresponding to the proposal.
- P2: Certificate-Bound Execution Invariant.
-
The sovereign execution broker executes a contract only after verifying the certificate and its airlock signature:
where the certificate is bound to the specific contract , policy version , execution identity , and validity window .
- P3: Evidence-Binding Invariant.
-
Approvals and validator attestations () are bound cryptographically to and the evidence digest :
This binding prevents approval reuse across different contracts or stale environmental contexts.
- P4: Monotone Path Invariant.
-
A contract is admitted only if all validation steps required by its selected certification path are satisfied:
where depends on the consequence score , autonomy level , and policy version . Policy may escalate a contract to stronger assurance or rejection, but lower-assurance paths cannot override requirements imposed by action class, consequence score, jurisdiction, or incident mode.
- P5: Ephemeral Authority Invariant.
-
The sovereign execution broker rejects any certificate if the current revocation epoch is greater than the certificate’s revocation epoch , or if the execution time falls outside the validity window :
The broker therefore rejects certificates from stale epochs.
3.3 Proof Sketch
Goal.
Establish that an untrusted agent runtime cannot directly execute a state-changing operation except through a broker-verified certificate satisfying the active policy path.
Assumptions.
1. Infrastructure APIs accept mutations exclusively from the sovereign execution broker. 2. The broker verifies certificate signatures using the airlock public verification key and rejects invalid signatures. 3. Airlock private key remains uncompromised. 4. Active policy dictates path requirements.
Argument.
To execute a mutation, the broker must verify the certificate such that and (by P2). Because the airlock private key is secure, only the airlock could have produced . The airlock emits this signature only if evaluates to true. By P4, evaluates to true only if all path requirements are satisfied under the policy . Under the assumptions above, an agent proposal cannot execute unless it compiles to a contract satisfying the path checks mandated by institutional policy. Broker-enforced execution leaves agent reasoning with no direct mutation path (P1).
3.4 Broker Verification Checks
Before invoking infrastructure APIs, the sovereign broker evaluates the pre-execution checks in Table 3. The checks cover certificate validity, contract identity, revocation state, policy epoch, state drift, and scoped credential issuance.
| Broker Check | Input | Failure Behavior |
|---|---|---|
| Signature Verification | Certificate , public key , signature | Reject execution, abort |
| Contract Match | Contract , certificate contents | Reject execution, abort |
| Validity Window | Current time , validity range | Reject execution, abort |
| Revocation Epoch | Certificate epoch , active epoch | Reject, fail-closed for |
| Policy Epoch | Certificate policy version , active | Reject execution, abort |
| State Drift Check | Bound evidence , live state , threshold | Reject, fail-closed for |
| Credential Issuance | Scoped identity , target resource API | Reject execution, abort |
3.5 Worked Admission Example
Consider an operations agent proposing an ingress network rule to resolve an outage. Table 4 traces this proposal through the reference architecture.
| Stage | Component | Output / Artifact | Example Value | Failure Check |
|---|---|---|---|---|
| 1. Compile | Contract Compiler | Execution Contract | Add ingress rule; target: sg-01; port: 443; rollback: delete rule | Malformed syntax, unsupported API |
| 2. Bind Evidence | Evidence Collector | Evidence Chain Digest | Telemetry (high latency); target config; incident ticket ID: INC-101 | Stale telemetry, missing incident ticket |
| 3. Score Consequence | Consequence Scorer | Consequence Score | ; autonomy level | Score calculation failure |
| 4. Route Path | Policy Engine | Certification Path | Monotone path mismatch | |
| 5. Airlock Check | Assurance Airlock | SAB certificate | Signed bundle including , , , | Validator veto, missing human signature |
| 6. Exec Check | Sovereign Broker | Broker execution decision | Verify with , check , verify | Drift limit exceeded, stale epoch |
| 7. Record Log | Evidence Ledger | Ledger transaction | Record bound to | Ledger write failure |
The example has two failure cases:
-
•
Airlock policy rejection: If attempts to open port 22 (SSH) to the public internet (0.0.0.0/0), the consequence scorer maps this high-risk action to . The airlock’s security-archetype validator detects a policy violation and triggers a critical veto, returning .
-
•
Broker-side verification failure: If is admitted at time , but before execution at the active revocation epoch advances from to , the broker detects that and aborts the execution. Similarly, if the broker’s drift-check predicate evaluates to false (indicating that the target security group’s state has drifted since admission), the execution is aborted.
4 Certificate-Bound Admission Pipeline
The admission pipeline evaluates proposals, scores consequence, routes them to certification paths, and issues signed certificates that the broker can verify before execution.
4.1 Formal Admission Interface
Let be an execution contract, be the evidence bound to that contract, be the observed infrastructure state at time , be the consequence score, and be the assigned autonomy level; the certificate is the resulting admission artifact. The airlock implements the admission function:
The function returns only when the proposal satisfies the required certification path. Otherwise, it returns . To prevent time-of-check to time-of-use (TOCTOU) risks, the broker evaluates a state drift-check predicate immediately before executing the mutation, verifying that the live state at execution time has not deviated from the bound evidence beyond policy safety margins.
4.2 Consequence Scoring and Autonomy Levels
The consequence score quantifies the operational, compliance, and security risk of the contract:
| (1) |
The terms measure specific risk vectors:
-
•
Blast radius : Affected resources, services, and network zones.
-
•
Privilege expansion : Elevated credentials or access scopes required.
-
•
Data sensitivity : Exposure risk of regulated data or PII.
-
•
Irreversibility : Computational cost and difficulty of rolling back.
-
•
Uncertainty : Lack of telemetry or confidence in evidence.
To tune the weights (), security engineers run an offline adjudication loop replaying historical incidents to tune classifications against expert consensus, biasing weights to escalate rather than under-classify when uncertainty is high.
From the consequence score, active policy , and jurisdictional constraints , the airlock maps the contract to an autonomy level:
Table 5 defines the consequence-aware autonomy levels.
| Level | Class | Example Action | Assurance and Validation Obligation |
|---|---|---|---|
| Advisory | Diagnostic query | Read-only; agent cannot compile or execute. | |
| Manual Draft | Batch restarts | Agent drafts contract; manual operator submission required. | |
| Bounded Autonomous | Stateless restart | Reversible, local action; OPA policy checks. | |
| Governed Autonomous | Autoscaling or rollback | Moderate blast radius; requires SQA and evidence binding. | |
| High-Stakes Autonomous | Security group, log export | High blast radius; SQA, human multi-sig, revocation checks. | |
| Prohibited | Deleting audit logs | Categorically denied autonomous execution. |
4.3 Certification Paths
After assigning , the airlock routes the contract through a certification path:
| (2) |
The selected path determines the validation requirements:
-
•
PolicyOnly: Requires a valid contract, evidence digest, and a successful policy engine (e.g. OPA) check.
-
•
LightweightQuorum: Requires multiple automated validator checks (syntax checks, telemetry checks, and dependency checks).
-
•
SQA: Requires Semantic Quorum Assurance and validator diversity constraints.
-
•
SQAPlusHuman: Requires SQA, critical-archetype veto checks, and operator multi-signature.
-
•
Prohibited / Reject: Categorically disallowed or failed admission attempts.
Decoupled SQA Oracle.
To keep the SQA validator mesh outside SAB’s trusted computing base, validator nodes run as external, isolated processes. They receive a read-only input bundle containing and , and hold no credentials. Each validator independently evaluates the proposal and generates a cryptographically signed attestation:
The airlock verifies validator signatures and checks diversity, quorum size, and veto requirements over the aggregated attestations . Even under compromise, individual validators cannot forge approvals without private keys, nor can they directly mutate infrastructure.
4.4 Admission Predicate
To admit a contract, the airlock evaluates the admission predicate:
| (3) | ||||
-
•
: Policy permits the action class and target resource.
-
•
: Required evidence exists, is fresh, and has trusted provenance.
-
•
: The selected assurance path completed successfully.
-
•
: The broker identity, validity window, and revocation state are valid.
4.5 SAB Certificates
When the admission predicate is satisfied, the airlock issues the certificate :
| (4) | ||||
where is the execution identity, is the validity window, is the policy version, is the revocation epoch, and is the airlock signature. After execution, the broker emits a separate outcome record:
| (5) |
where is the broker signature. The ledger stores and binds it to .
Properties.
-
•
Replayability: The replay service reconstructs the decision using , , , , , , and .
-
•
Non-repudiation: Approvals are cryptographically bound to and , preventing reuse or extraction.
-
•
Temporal Validity: The certificate expires when the validity window closes or evidence freshness drops.
-
•
Outcome Binding: Post-execution observations are bound to through outcome record , not embedded in the original certificate.
Table 6 compares SAB certificates with standard change-auditing artifacts.
| Artifact | Records | Limitation |
|---|---|---|
| Audit log | Executed event, actor, time | Lacks admission context. |
| Policy decision record | Policy engine query result | Omits semantic evidence and validator votes. |
| SQA quorum | Validator votes and reasoning | Lacks system-level context and revocation epoch. |
| SAB certificate | Full pre-execution admission state | Outcomes require ledger-bound records and replay overhead. |
5 Revocation, Suspension, and Incident Response
Certificate-bound authority must be withdrawable. An incident may require immediate suspension across an action class, domain, validator, or execution path, so revocation and suspension are part of the control plane rather than an external operations procedure.
5.1 Revocation Triggers and Withdrawal Mechanisms
The controller monitors signals from the ledger, validator mesh, policy engine, broker, and compliance tools. Triggers apply globally or to a scoped domain (action class, service, region), as shown in Table 7.
| Trigger Class | Primary Signal | Institutional Consequence |
|---|---|---|
| Validator Degradation | Validator agreement rate or accuracy drops below threshold | Downgrade autonomy level, quarantine validator |
| Evidence Outage/Staleness | Telemetry streams, incident tickets, or configs become stale/unavailable | Airlock rejects new proposals in affected domain |
| Policy Drift/Rollback | Updated policy version published or rolled back | Invalidate active certificates, re-evaluate paths |
| Environmental Outage | System monitor detects live intrusion or active severity-1 outage | Advance revocation epoch , suspend domain |
| Compromised Identity | Suspected leak of signing keys or broker execution credential | Revoke pending certificates, rotate execution keys |
| Operational Anomaly | Operator triggers emergency hold or manually rolls back change | Immediate human-only fallback, cancel active certificates |
When a trigger fires, the controller applies one or more scoped withdrawal mechanisms (Table 8). Withdrawal is enforced at admission (the airlock consults suspension state before emitting ) and execution (the broker checks revocation state before execution).
| Mechanism | Enforcement Point | Operational Effect |
|---|---|---|
| Level Downgrade | Consequence Scorer | Caps maximum permitted autonomy level , routing actions to human approval. |
| Domain Suspension | Assurance Airlock | Categorically rejects all future proposals matching the suspended domain . |
| Certificate Invalidation | Sovereign Broker | Broker rejects execution of unexecuted certificates matching the revoked class. |
| Validator Quarantine | SQA Mesh | Removes compromised or stale validators from participating in quorum validation. |
| Identity Rotation | Sovereign Broker | Rotates execution credentials, invalidating certificates bound to stale keys. |
| Human-only Fallback | Assurance Airlock | Bypasses automated routing, requiring manual approval for all actions. |
| Replay Sweep | Audit Service | Re-evaluates recent certificates under updated policy to identify active safety violations. |
If the broker cannot obtain fresh revocation state (e.g., due to a network partition), it fails closed, rejecting all high-consequence () executions. Revocation updates are signed, versioned, and distributed as monotonically increasing epochs. Deployments that cache revocation state must bound cache lifetime and record the revocation epoch used for every broker decision.
5.2 Formal Semantics
Let be a domain or action class. Suspension is a future-admission invariant:
| (6) |
The invariant is evaluated before path selection, preventing suspended domains from being bypassed.
Certificate revocation is an execution invariant:
| (7) |
Equation 7 applies to pending or unexecuted certificates. Completed executions cannot be undone; instead, the ledger marks for post-hoc response and compensating actions.
5.3 Incident Response Workflow and Override Risk
During an incident, the controller identifies the affected domain , installs a suspension or downgrade rule, and forces new proposals to human-only escalation. The broker invalidates pending certificates in , while the replay service sweeps recent executions to link outcomes back to context.
Emergency overrides that bypass SAB remove the evidence and identity constraints required for accountability. Emergency operations should instead pass through SAB under stricter constraints: narrow scopes, short-lived certificates, and mandatory post-hoc replay. If an override is abused, the controller revokes pending certificates, quarantines keys, and replays the executions under standard policy.
6 Case Studies
The following case studies trace SAB’s certificate-bound admission boundary across four operational settings (Table 9).
| Case | Proposed Action | Risk | Certification Path | Outcome |
|---|---|---|---|---|
| Enterprise Cloud Operations | Open security-group rule during outage | Public exposure, privilege expansion, incident masking | SQAPlusHuman or Reject | Vetoed, escalated, or executed through broker with |
| Generated-Code Deployment | Deploy agent-generated patch | Security regression, failed rollout, irreversible change | SQA or SQAPlusHuman | Certified artifact deployed with rollback and outcome evidence |
| Sovereign AI Cloud | Execute plan proposed by external foundation models | Jurisdictional leakage, external execution authority | SQAPlusHuman under local policy | Local certificate and sovereign execution identity required |
| Regulated Data Workflow | Export logs or datasets for diagnosis | PII exposure, retention violation, cross-boundary transfer | SQAPlusHuman, Reject, or Prohibited | Redacted export, human approval, or denial with replay record |
6.1 Case Study 1: Enterprise Cloud Operations
Scenario and Path.
An operations agent detects network latency and proposes adding an ingress rule in a security group to bypass a blocked gateway. The compiler normalizes this into contract , which the evidence collector binds to service-health metrics, incident ticket ID, and security-group configurations. Because modifying firewall rules alters production network posture, the scorer assigns a high blast-radius score, mapping to autonomy level . If the proposed source range is public (0.0.0.0/0), the policy engine routes the contract to SQAPlusHuman. SQA validators review the proposal; a security-archetype validator holds veto power. If approved, the airlock signs certificate . The broker verifies and evaluates the state drift-check predicate before executing the API call.
Failure Prevented and Artifact.
This path is intended to stop a latency diagnosis from becoming an overbroad network exposure or an unevidenced API call. The ledger records the contract, evidence digest , validator signatures, and broker decision, allowing post-hoc verification that the change was semantically justified.
6.2 Case Study 2: Generated-Code Deployment
Scenario and Path.
A software engineering agent generates a bug patch and proposes deploying it to a Kubernetes cluster. The proposal is compiled into contract , which includes source diffs, target service, and rollback plans. The collector binds to build provenance, test results, and vulnerability scans. The scorer distinguishes low-risk canary rollouts (routed to SQA) from privileged code changes or database migrations (escalated to SQAPlusHuman). The airlock evaluates PDD (Protocol-Driven Development) invariants, verifying that the generated patch matches interface definitions. The broker deploys only certified manifests and monitors rollout health, recording outcomes.
Failure Prevented and Artifact.
The broker blocks unverified code from reaching production by rejecting proposals with missing rollbacks, failing tests, or unapproved dependencies. The certificate records all build and test evidence, enabling post-deployment replay.
6.3 Case Study 3: Sovereign AI Cloud
Scenario and Path.
A sovereign operator uses a mixed model stack (external and domestic models) for planning. An external model proposes restarting services and exporting diagnostic telemetry. The local contract compiler decomposes the plan into separate contracts, and the collector binds local evidence (data-residency tags, configuration states). Sensitive context is held locally. The local policy engine enforces jurisdictional rules, rejecting contracts that violate residency policies or cross borders. Valid proposals are reviewed by local SQA validators, and high-stakes actions route to human operator multi-signature. The local broker executes approved contracts using domestic identities.
Failure Prevented and Artifact.
The local broker path prevents external planning models from directly acquiring execution authority. External reasoning systems cannot obtain credentials, bypass jurisdictional policies, or execute mutations without local admission and replay. Certificate records the domestic validation path and broker identity, making sovereign execution verifiable.
6.4 Case Study 4: Regulated Data Workflow
Scenario and Path.
During an incident, an agent proposes exporting database logs for offline analysis. The logs may contain PII or regulated customer records. The compiler translates this into a data-export contract detailing source tables, destinations, and redaction rules. The collector binds schema metadata and legal-hold status. The scorer assigns a high data-sensitivity term, raising uncertainty if payloads are unclassified. If policy forbids autonomous export of the data class, the path is Prohibited or Reject. If export is permitted with redaction, the airlock enforces a minimization contract, routing the redacted export through compliance validators and human multi-signature.
Failure Prevented and Artifact.
This path is intended to prevent autonomous leakage of regulated data under the guise of diagnostic utility. The airlock rejects overbroad exports or destinations lacking compliance controls, binding redaction attestations and human approvals to the export outcome.
7 Evaluation Methodology and Targets
We evaluate SAB’s control-plane feasibility and performance tradeoffs with a Go prototype and three workload families.
7.1 Prototype Implementation
Our prototype of SAB in Go contains a stateless admission service, Protocol Buffer and JSON Schema contract definitions, an evidence-digest generator extracting Kubernetes and Active Directory logs, OPA/Rego policy evaluation, Ed25519 signing for , a PostgreSQL ledger, prototype target adapters, revocation epoch distribution, and a replay command-line interface. The target adapters simulate actions on Kubernetes and AWS/GCP APIs.
7.2 Evaluation Methodology and Baselines
We compare SAB against standard baseline security controls (Table 10) using 500 execution contracts: 200 cloud-infrastructure contracts (security groups, scaling, restarts, credential rotations), 150 CI/CD contracts (rollbacks, patch deployments, manifest updates), and 150 data-governance contracts (migrations, log exports, resource deletion).
| Baseline | Mechanism | Assurance Limitation |
|---|---|---|
| Static IAM/Policy | Checks static roles (e.g., AWS IAM, OPA) | Context-unaware; permits structurally valid but semantically unsafe actions. |
| Human Approval Only | Routes all proposals to human change queue | Introduces high latency; prone to rubber-stamping due to review fatigue. |
| Audit Logging Only | Records actions to security log after execution | No runtime prevention; cannot halt unauthorized actions before they occur. |
| LLM Self-Review | Proposing model critiques its own output | Susceptible to hallucinations and adversarial jailbreaks; no institutional control. |
| SQA without SAB | Quorum validation of individual proposed actions | Lacks system-level context: no evidence binding, revocation, or execution decoupling. |
| SAB (No Revocation) | Airlock admission without broker revocation checks | Vulnerable to stale certificates during outages or policy changes. |
| SAB (No Routing) | Fixed high-stakes certification path for all actions | High operational friction for low-risk actions; excessive latency. |
7.3 Experimental Setup
Table 11 reports preliminary measurements from a single-node local workstation testbed, not a production cloud deployment. The topology contains one admission service, one policy engine, one PostgreSQL-backed ledger, one revocation service, three broker processes, three target-service adapters, and two workload-driver processes. The 500 contracts are independently labeled by three expert systems/security engineers for expected admission decision, certification path, and safety outcome, with disagreements resolved by majority vote and discussion. We ran five full-workload trials (2,500 admission attempts). Live local measurements include admission, OPA evaluation, Ed25519 signing, ledger writes, broker verification, revocation-epoch propagation, and SQA latency using a three-validator mesh with 2-of-3 quorum, Ed25519-signed attestations, and a security-archetype veto. Kubernetes, Active Directory, and cloud-control-plane evidence is replayed from captured or synthetic traces; target mutations are simulated by service adapters. Unless a row states otherwise, latency results are p50; SQA admission reports p50/p95, and rate metrics are percentages.
7.4 Target Metrics and Expected Tradeoffs
Table 11 reports the evaluation targets and measured results, grouped into latency overheads, routing accuracy, and storage metrics.
| Metric | Prototype Target | Measured Result | Expected Tradeoffs |
|---|---|---|---|
| PolicyOnly admission latency (p50) | ms | 1.84 ms | Fast path for low-consequence rules (OPA check). |
| SQA admission latency (p50/p95) | report p50/p95 by validator config | 185 ms / 380 ms | Parallel validator queries. |
| Broker verification latency (p50) | ms | 0.15 ms | Signature and active revocation epoch check. |
| Revocation propagation latency (p95) | ms | 12.4 ms | Delay to propagate new to local brokers. |
| Schema replay completeness (%) | Fraction of ledger entries with replayable schema context. | ||
| Unsafe admission rate (%) | lower is better | 0.4% | Admissions that violate safety policy or human intent. |
| Routing accuracy (%) | agreement with expert-labeled oracle | 96.8% | Agreement of consequence scorer with expert-labeled paths. |
| Human approval reduction (%) | 68.2% | Actions automated compared to a human-only queue. | |
| Certificate storage overhead (mean) | KB / action | 3.4 KB | Space required on the evidence ledger per contract. |
These measurements indicate control-plane feasibility on a single workstation, not production cloud performance. SAB trades latency for assurance, additional control-plane components for replayability, and occasional false escalation for lower under-classification risk. SQA and manual reviews add latency; evidence collection and ledger writes add storage and processing overhead while supporting audit and replay.
8 Security Analysis
SAB mitigates authorization risks by decoupling proposal generation from execution through certificate-bound admission. The analysis defines the threat model, examines ten threat categories, and states the authority-boundary invariants.
8.1 Threat Model and TCB Boundaries
The threat model assumes that the agent runtime may be compromised (e.g., via prompt injection or credential theft) or faulty. Upstream evidence sources, SQA validators, and operators may also fail. We assume the integrity of the core control-plane components. Table 12 maps the TCB boundaries.
| Component | TCB Status | Failure Mitigation |
|---|---|---|
| Agent runtime | Untrusted proposer | Airlock interposition, rate limiting |
| SQA validators | Untrusted external oracles | Diversity, vetoes, signing keys |
| Evidence collector | Outer engine: untrusted; signature verifier: trusted | Freshness, cryptographic provenance verification |
| Policy / revocation | Trusted control plane | Versioned policies, epoch gating |
| Airlock and broker | Trusted control plane | Hardened runtimes, key rotations |
| Evidence ledger | Trusted control plane | Cryptographic chains, backups |
8.2 Threat Analysis
Table 13 maps SAB’s controls and residual risks across ten threat categories: agent compromise, validator failure, policy degradation, operational bypass, and authority creep.
| Threat | SAB Control | Residual Risk |
|---|---|---|
| Malicious proposer agent | Decouples proposal from execution; requires contract and signed certificate . | Bad proposals may consume validator and operator review capacity. |
| Compromised validator | Enforces validator diversity and critical archetype vetoes; signature quarantine. | A compromised validator can approve unsafe actions until quarantined. |
| Correlated validator failure | Diversity rules across model architectures and prompt structures. | Shared latent training biases may still fail to catch novel semantic errors. |
| Stale/manipulated evidence | Binds evidence digests ; enforces freshness and provenance checks. | Upstream telemetry sources may report inaccurate state. |
| Forged/replayed certificate | Sovereign broker verifies with , validity window , and epoch . | Compromise of airlock private key requires key rotation. |
| Policy downgrade attack | Binds signed, versioned policy releases ; enforces monotone routing. | Authorized policy updates may be overly permissive. |
| Human rubber-stamping | Structured evidence presentation; role separation; approval trends analysis. | Humans may still sign off negligent changes under operational pressure. |
| Emergency-mode abuse | Scoped emergency authority; short-lived certificates; mandatory post-hoc replay. | Social or operational abuse of emergency procedures. |
| Jurisdictional bypass | Location checks in policies; regional checks at the sovereign broker. | Incomplete or inaccurate asset tagging. |
| Autonomy creep | Autonomy level caps ; periodic ledger audits. | Slow normalization of deviance. |
8.3 Authority-Boundary Invariants
The threat analysis yields four invariants for high-stakes agentic infrastructure:
-
•
Proposal is not authority: Agent proposals must compile to and be admitted by the airlock before execution.
-
•
Approval is evidence-bound: Validator and human approvals must be cryptographically bound to and .
-
•
Execution is sovereign: Infrastructure changes must execute through at the broker, never via agent credentials.
-
•
Authority is revocable and replayable: Certificates, validators, and autonomy levels must be revocable; decisions must be reconstructable from the ledger.
Under the stated trust assumptions, admitted autonomous actions are bounded by contract, evidence, certification path, broker verification, and revocation state, and remain auditable through the ledger.
9 Discussion and Limitations
SAB limits when model reasoning can transition into system mutations. It does not guarantee that planning models are correct or aligned. The main operational questions concern review granularity, proportional assurance, sovereignty, composition, and residual risk.
9.1 Operational Discussion
Human Oversight and the Unit of Review.
Traditional human-in-the-loop controls are prone to fatigue and rubber-stamping, especially when operators are presented with raw reasoning transcripts or unconstrained tool calls. SAB changes the unit of review: operators approve a normalized contract , cryptographic evidence digest , consequence level , and validator outputs. Approval is tied to a concrete contract, evidence snapshot, and policy context.
Proportional Assurance.
Autonomy should be granted dynamically. The same agent may restart a stateless container (routed to PolicyOnly), but require SQA for rollbacks, human multi-signature for firewall modifications, and be prohibited from deleting databases. Consequence-aware routing directs latency and validator overhead to operations where mistakes have material operational, legal, or security outcomes.
Operational Sovereignty.
SAB defines sovereignty operationally: the institution must maintain local control over policy evaluation, evidence retention, signing keys, revocation state, broker identity, and execution authority. External foundation models are restricted to proposal generation; they hold no production credentials and cannot bypass local certification and revocation.
Composition with OpenKedge Layers.
In the OpenKedge stack, OpenKedge provides bounded mutation interfaces; SAL separates planning from execution; VAI provides execution identity and evidence chains; PDD validates generated code against schemas and protocol obligation invariants; and SQA evaluates action-level semantics. SAB composes these layers into a system-level admission boundary.
9.2 System Limitations
SAB’s security claims depend on several operational and design boundaries:
-
•
Consequence Scoring Errors: Consequence classification () is an operational classifier. It may underestimate risk when dependencies are hidden, or create bottlenecks by over-estimating risk and over-escalating low-risk tasks to human review.
-
•
Evidence Staleness and Incompleteness: SAB relies on external telemetry and log sources. Stale or incomplete evidence can cause the airlock to certify actions under assumptions that no longer hold.
-
•
Validator and Policy Calibration: SQA relies on validators whose reliability degrades after model, prompt, or workload changes. Additionally, SAB cannot correct policies that are incomplete, stale, or contradictory.
-
•
Prototype Generality: Prototype results are workload-dependent and should not be interpreted as general performance guarantees.
-
•
Emergency Bypass Risk: Outages may force operators to bypass normal assurance rules. While SAB records emergency overrides and enforces short validity windows, emergency access remains a critical threat vector.
-
•
Control-Plane Attack Surface: SAB introduces high-value security targets (signing keys, the ledger, and the broker). Compromise of these trusted components invalidates all security properties.
These limitations motivate an incremental deployment strategy, starting with high-consequence domains (IAM, network policy, and deployments) and expanding as evidence pipelines and validator models mature.
10 Related Work
We position SAB relative to systems security and safety mechanisms that partially address proposal admission, execution control, and auditability:
Runtime Assurance.
Safety-monitor architectures (e.g., Simplex) supervise autonomous controllers and interpose fallback modes to prevent physical safety envelope violations [simplex-architecture, bloem-shield-synthesis, phan-neural-simplex]. The same interposition pattern can be applied to cloud and software control planes, where the monitored objects are discrete configuration changes, evidence freshness, and policy epochs.
Assurance Cases.
Dependability and safety cases structure claims, assumptions, and evidence for pre-deployment system certification [rushby-assurance]. SAB brings a similar claim-evidence structure into runtime admission: approvals bind to specific contracts, evidence snapshots, and revocation state.
Distributed Consensus.
Paxos, Raft, and PBFT protocols ensure ordered agreement on state transitions across replicated nodes [lamport-paxos, ongaro-raft, castro-pbft]. They do not verify whether those transitions are semantically correct or compliant with institutional rules. SAB treats authority as a separate admission question.
Access Control (IAM) and Policy Engines.
Zero trust, IAM, and policy-as-code engines (e.g., OPA) enforce static permissions at API endpoints [rose-zerotrust, aws-iam, opa-rego, kubernetes-admission]. They are necessary inputs, but they authorize identities rather than validating autonomous proposals against live evidence. SAB adds proposal certification before broker execution.
AI Safety and Tool Governance.
Agent safety tools (filters, sandboxes, critiques) reduce dangerous outputs from planning models [yao-react, inan-llamaguard]. They run as unprivileged utilities and cannot serve as control-plane security boundaries. In SAB, safety models can act as untrusted validators in the SQA mesh, but authority is released only by the airlock and broker.
Audit Logs and Evidence.
Audit trails and transparency logs capture events for post-incident forensics [w3c-prov]. Certificate binds pre-execution admission evidence, policy versions, and validator outputs; ledger-bound outcome record links subsequent observations to , making admission decisions replayable. Audit logs record executed events; the SAB certificate records the admission context that authorized execution.
Sovereign AI Governance.
Sovereignty frameworks dictate local resource control and residency rules [nist-ai-rmf]. In SAB, those requirements appear as local policy evaluation, domestic validator meshes, and regional execution brokers, so external or mixed planning models cannot bypass local control boundaries.
Existing mechanisms authorize identities, replicate decisions, monitor runtime behavior, or record events. SAB combines these mechanisms into an admission boundary that decides when autonomous proposals may become execution authority.
11 Conclusion
Agentic infrastructure needs an authorization boundary between plausible model-generated proposals and privileged production mutations. The Sovereign Assurance Boundary (SAB) addresses that boundary through certificate-bound admission: agent outputs compile into execution contracts ; admission decisions bind those contracts to evidence digests , certification paths, policy versions, revocation epochs, and scoped broker identities; and execution proceeds only after the sovereign broker verifies the certificate .
The paper’s core claim is intentionally narrow: SAB is not a replacement for IAM, formal verification, consensus, or human review, but a runtime admission path that composes those mechanisms so autonomous reasoning alone cannot become execution authority. The prototype and case studies show that the airlock–broker model can be implemented and measured across cloud operations, generated-code deployment, sovereign AI cloud settings, and regulated data workflows, while the limitations identify the remaining dependencies on evidence freshness, policy quality, validator calibration, and trusted control-plane components. SAB therefore treats delegated execution authority as certificate-bound, evidence-bound, revocable, and replayable, rather than as an ambient permission.