Core Concepts
The ideas behind the Governance SDK, explained for someone who's never used it.
Enforcement gates vs observability traces
Most LLM tools (LangSmith, Langfuse, Helicone) are observability platforms — they show you what happened after the fact. The Governance SDK is different: it provides enforcement gates that run before the LLM call fires.
Observability (others)
Agent calls LLM → LLM responds → tool logs the call → you see it in a dashboard later
Enforcement (us)
Agent wants to call LLM → SDK checks scope + budget + HITL → allowed? → LLM call fires → result audit-logged
The HMAC chain
Every audit row stores an HMAC-SHA256 computed over its own immutable fields plus the previous row's HMAC. This creates a chain where modifying any byte of any past row breaks that row's HMAC and every row that follows it.
The HMAC secret lives in your secrets manager (GOVERNANCE_AUDIT_SECRET). Without it, an attacker with database access cannot forge valid rows. With it, the chain is verifiable from the database alone — no trust in the application required.
The HMAC covers every immutable field: event_id, session_id, agent_id, parent_event_id, kind, model, input_hash, output_hash, metadata, prev_hash, and created_at. Modifying any of these — not just the chain link — breaks the verification.
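The chaining principle above can be sketched with the standard library. This is a minimal illustration only: the field set is abbreviated and the JSON serialization is this sketch's own assumption, not the SDK's actual byte layout.

```python
import hashlib
import hmac
import json

def row_hmac(secret: bytes, fields: dict, prev_hash: str) -> str:
    """Compute a row's HMAC over its immutable fields plus the previous row's HMAC."""
    # Canonical JSON is an assumption for this sketch; the real wire format may differ.
    payload = json.dumps({**fields, "prev_hash": prev_hash}, sort_keys=True).encode()
    return hmac.new(secret, payload, hashlib.sha256).hexdigest()

def verify(secret: bytes, rows: list[dict]) -> bool:
    """Re-derive every link; modifying any byte breaks that row and all rows after it."""
    prev = "genesis"
    for row in rows:
        fields = {k: v for k, v in row.items() if k != "hmac"}
        if row_hmac(secret, fields, prev) != row["hmac"]:
            return False
        prev = row["hmac"]
    return True

secret = b"example-secret-at-least-32-bytes!"
rows, prev = [], "genesis"
for i in range(3):
    fields = {"event_id": i, "kind": "tool.call"}
    h = row_hmac(secret, fields, prev)
    rows.append({**fields, "hmac": h})
    prev = h

assert verify(secret, rows)
rows[1]["kind"] = "tool.call.forged"  # tamper with one past row
assert not verify(secret, rows)       # that row and the chain fail verification
```

Without the secret, an attacker cannot recompute valid HMACs for forged rows, which is why the secret must live outside the database.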
On-demand chain verification (v0.5.3)
The SDK does not automatically re-verify the chain on every read — verification is O(n) in events and is opt-in. Three ways to check integrity:
- await sdk.audit.verify_chain(session_id=..., from_seq=..., to_seq=...) — explicit verification with an optional partial range. Returns True when the chain is clean; raises ChainIntegrityError with the bad sequence number on tamper.
- GovernanceSDK(verify_chain_on_read=True) — config flag that verifies the chain on every get_events() call. Off by default because verification is O(n). Turn on for compliance reporting and post-incident review.
- ReportGenerator(audit_module=sdk.audit).generate_article12(verify_chain=True) — chain verification inside an Article 12 report. Sets chain_integrity_status to verified or failed. Raises ValueError immediately if the audit_module was not wired — no silent degradation.
Kill switch enforcement (v0.5.4)
When an operator halts an agent via the console (or writes the kill marker directly to the agent's presence row), the SDK fail-closes that agent's next scope.check() call with AgentKilledError. No tool invocation, no API call, no LLM request. The check reads from a 5-second TTL in-memory cache, so the hot path stays fast.
Active by default. No configuration required if you use the default enable_scope=True and enable_presence=True. The SDK auto-wires PresenceModule into ScopeModule at sdk.start(). If you pass enable_presence=False to keep the SDK lightweight, the kill switch is silently disabled — an operator clicking Halt will write the marker but no scope call will read it.
- await sdk.presence.assert_alive(agent_id) — raises AgentKilledError (a RuntimeError subclass, NOT a ScopeViolation) with the operator, timestamp, and reason from the kill marker. A bare except ScopeViolation will let kills propagate. Call directly only if you are building a custom enforcement path.
- await sdk.presence.is_killed(agent_id) — observation only (returns bool, does not raise). Use for conditional logic where you want to handle the kill yourself. Not a substitute for enforcement — use assert_alive() for that.
- If the governance database is unreachable, the kill cache holds the last known state and a warning is logged. Already-killed agents stay killed, live agents stay live, and your host application keeps running. The refresh path never raises out to the caller.
- v0.5.4 scope: only scope.check() calls assert_alive() today. Cost gates, HITL gates, and the LLM-wrapper paths get the same wiring in v0.6. Workaround for wrappers in v0.5.4: if your agent makes LLM calls via wrap_openai()/wrap_anthropic(), add an explicit await sdk.scope.check(agent_id, tool='chat.completions.create') (or messages.create) before the call to get kill-switch coverage today.
- Terminology note: the SDK, wire route, audit event kind, and metadata fields still use kill everywhere (_killed_by, agent.killed, AgentKilledError). The console UI says Halt. The full kill→halt rename lands in v0.6 with backward-compat aliases and a migration script for the audit event kind.
See the Presence API reference for full signatures and the Console guide for the halt button.
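The 5-second TTL cache on the kill-check hot path can be sketched as follows. This is a minimal illustration of the caching behavior described above, not the SDK's internal implementation; the class and parameter names are this sketch's own.

```python
import time

class KillCache:
    """Caches kill markers so the hot path stays off the database between refreshes."""
    def __init__(self, fetch, ttl: float = 5.0):
        self._fetch = fetch   # callable returning the current set of killed agent ids
        self._ttl = ttl
        self._killed: set[str] = set()
        self._loaded_at = float("-inf")

    def is_killed(self, agent_id: str) -> bool:
        now = time.monotonic()
        if now - self._loaded_at > self._ttl:
            try:
                self._killed = self._fetch()
                self._loaded_at = now
            except Exception:
                # Database unreachable: hold the last known state, never raise.
                pass
        return agent_id in self._killed

cache = KillCache(fetch=lambda: {"rogue-agent"})
assert cache.is_killed("rogue-agent")
assert not cache.is_killed("billing-agent")
```

Note the failure-mode match with the text: a refresh error keeps the last known state, so already-killed agents stay killed and live agents stay live.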
Opt-in modules and startup signals
Every module is opt-in via a GovernanceConfig flag. When disabled, the module is not constructed at all — accessing it raises AttributeError, not a silent no-op. This is an architectural invariant: a disabled module is loudly absent, never quietly present-but-inert.
# Lightweight deployment — drop loop detection and presence tracking
async with GovernanceSDK(
database_url="postgresql://...",
enable_loop=False,
enable_presence=False,
) as sdk:
... # sdk.loop and sdk.presence raise AttributeError on access
# Audit-only deployment — no wrappers, no warning
async with GovernanceSDK(
database_url="postgresql://...",
warn_on_no_wrappers=False, # suppress the wrap_openai/wrap_anthropic startup warning
) as sdk:
...
# Full enforcement with declared-ceiling budget projection
async with GovernanceSDK(
database_url="postgresql://...",
default_max_tokens=4096, # ceiling used by the projected budget gate
verify_chain_on_read=True, # raise on first broken HMAC link at read time
) as sdk:
    ...

warn_on_no_wrappers (default True) emits a structlog warning at sdk.start() when no wrap_openai or wrap_anthropic has been registered. This is the only startup signal that enforcement is not covering your LLM calls — silencing it in a production deployment that expects wrappers will hide a misconfiguration. Set to False only in intentionally wrapper-free deployments.
What the SDK does NOT protect against
Enforcement covers every action routed through the SDK. Deployers and security reviewers should read these carve-outs before treating the SDK as a complete security boundary:
- Direct client bypass. Any code path that calls anthropic.Anthropic() or openai.OpenAI() directly — without wrap_anthropic() or wrap_openai() — is invisible to all SDK gates. Budget, scope, and audit are all bypassed. An LLM-generated tool function that instantiates its own client is not governed.
- Process-level bypass. The SDK provides in-process enforcement gates. It does not provide kernel-level, network-level, or process-isolation-level enforcement. A second Python process that bypasses the wrappers entirely is not governed.
- Streaming cost precision. Streaming calls are budget-gated using the declared max_tokens value before the stream opens. Actual token usage is recorded from the stream's final usage object. If the API does not return a usage object, the SDK falls back to max_tokens as the tracked value — actual usage may differ.
- On-demand tampering detection only. The HMAC chain detects tampering when verification is explicitly run or with verify_chain_on_read=True. It does not alert as tampering occurs, and cannot prevent deletion of the entire chain by a privileged DBA who also controls the HMAC secret.
- HITL non-blocking mode. When a HITL gate is configured with blocking=False, the gate raises ApprovalPending and the caller is responsible for not proceeding. The SDK cannot prevent a caller who ignores ApprovalPending from proceeding anyway.
- Tool invocations inside LLM responses. Scope enforcement gates the LLM API call itself (using a sentinel action name). It does not inspect tool calls returned inside the LLM's response. Tool execution must be gated at the tool-execution layer separately.
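One way to close the last carve-out is to gate your own tool dispatcher before executing anything the LLM asked for. A minimal sketch with an injected allowlist; in a real deployment the membership check would be a call to sdk.scope.check(), and this ScopeViolation class is a local stand-in for the SDK's exception.

```python
class ScopeViolation(RuntimeError):
    """Stand-in for the SDK's exception, for a self-contained sketch."""

def dispatch(tool_calls, allowed: frozenset, registry: dict):
    """Gate each tool call returned by the LLM before executing it."""
    results = []
    for call in tool_calls:
        name = call["name"]
        if name not in allowed:  # real deployments: await sdk.scope.check(...)
            raise ScopeViolation(f"tool not in scope: {name}")
        results.append(registry[name](**call["args"]))
    return results

registry = {"search": lambda q: f"results for {q}"}
out = dispatch([{"name": "search", "args": {"q": "refunds"}}],
               frozenset({"search"}), registry)
assert out == ["results for refunds"]
```

The point is structural: the gate sits between the LLM's response and your tool execution, so a tool call the model hallucinated out of scope never runs.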
The eight modules
Audit + Provenance
sdk.auditImmutable event log with HMAC chain. Every other module writes here. The substrate everything else builds on.
Scope Enforcement
sdk.scopeWhitelist tools and APIs per agent. Exact match or explicit prefix. No regex. Violations auto-logged. Hidden tool policies remove tools from the LLM's context entirely.
Cost Tracking
sdk.costToken, USD, and time caps at per-session and per-agent-daily scopes. Built-in pricing for 24 models with per-model cost breakdown. Pre-call check raises BudgetExceeded. Fail-closed by default.
HITL Gates
sdk.gatesSigned single-use approval tokens. Blocking or non-blocking. Grant/deny from the console GUI or programmatically.
Loop Detection
sdk.loopSliding-window detection of repeated tool calls per session. Configurable thresholds and actions (raise or log). Kills runaway agents before they burn budget.
Agent Presence
sdk.presenceHeartbeat-based lifecycle tracking with three states: Live, Idle, Unresponsive. Automatic stale detection. No background worker required.
Behavioral Contracts
sdk.contractsDeclarative pre/post conditions that wrap any tool call. Enforce budget, scope, and approval requirements in a single context manager.
EU AI Act Compliance
sdk.complianceAutomated Article 12 evidence reports from your audit data. Seven-section mapping with compliant/partial/non-compliant status per requirement.
Non-breaking guarantee
The SDK distinguishes between observation surfaces (which never raise) and enforcement surfaces (which raise by contract):
# OBSERVATION — never raises, even if Postgres is down:
await sdk.audit.log(event) # returns record or placeholder
await sdk.cost.track(agent, sid) # logs and continues
await sdk.gates.request(...) # returns request object
await sdk.presence.heartbeat(agent) # updates last_seen
await sdk.loop.check(agent, sid) # returns bool
# ENFORCEMENT — raises by contract (that's the gate):
await sdk.scope.check(agent, tool="x") # raises ScopeViolation
await sdk.cost.check_or_raise(agent, sid) # raises BudgetExceeded
await sdk.gates.wait_for(request_id, timeout=60) # raises ApprovalTimeout / ApprovalDenied
await sdk.loop.record_call(agent, sid, "tool_name") # raises LoopDetected (action="raise")

Fail-closed cost gate
If cost.check_or_raise() cannot read the counter (database unreachable, query timeout), it denies the call rather than allowing it. This prevents an attacker from draining budgets by taking down the cost store. The failure is audit-logged (if the audit substrate is reachable) so operators can see what happened. Pass cost_fail_open=True at SDK init if you prefer availability over safety.
Fail-closed (default)
Cost store unreachable → call denied. Safer: an attacker cannot drain budgets by taking down the store. A budget.check_failed audit event is written so operators can investigate.
Fail-open (opt-in)
Cost store unreachable → call allowed. Higher availability, but a budget gap until the store recovers. Set cost_fail_open=True at SDK init. Not recommended for production.
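The fail-closed versus fail-open decision can be sketched as a pure function. The function and exception here are local stand-ins for illustration, not the SDK's internals.

```python
class BudgetExceeded(RuntimeError):
    """Stand-in for the SDK's exception, for a self-contained sketch."""

def check_budget(read_counter, limit_usd: float, fail_open: bool = False) -> None:
    """Deny when the counter is unreadable, unless fail-open was opted into."""
    try:
        spent = read_counter()
    except Exception:
        if fail_open:
            return  # availability over safety: allow while the store is down
        raise BudgetExceeded("cost store unreachable; failing closed")
    if spent >= limit_usd:
        raise BudgetExceeded(f"spent ${spent:.2f} of ${limit_usd:.2f}")

check_budget(lambda: 1.50, limit_usd=2.00)  # under budget: no raise

def store_down():
    raise ConnectionError("store unreachable")

check_budget(store_down, limit_usd=2.00, fail_open=True)  # fail-open: allowed
```

Failing closed turns "attacker takes down the cost store" into denied calls rather than unlimited spend, which is why it is the default.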
JSONL durable fallback
When the primary Postgres store is unreachable, audit events are written to a local JSONL file (~/.codeatelier_governance/audit_fallback.jsonl by default). This ensures that events survive process crashes and network partitions — no audit event is silently dropped.
When Postgres goes down: Events are appended to the JSONL file with a chain.degraded_start marker so auditors can identify the gap.
When Postgres recovers: The SDK automatically drains the fallback file back into the primary store and emits a recovery marker. The chain discontinuity is permanently visible.
Configure the path: Pass fallback_path="/your/path.jsonl" at SDK init.
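The drain step can be sketched as follows: read each JSONL line back as an event, then remove the file so nothing is replayed twice. A minimal illustration, not the SDK's recovery code; real recovery also emits the markers described above.

```python
import json
from pathlib import Path

def drain_fallback(path: Path) -> list[dict]:
    """Read buffered events back out of the JSONL fallback file, one JSON object per line."""
    if not path.exists():
        return []
    events = [json.loads(line) for line in path.read_text().splitlines() if line.strip()]
    path.unlink()  # drained: delete so events are not replayed on the next recovery
    return events

p = Path("audit_fallback_demo.jsonl")
p.write_text('{"kind": "chain.degraded_start"}\n{"kind": "tool.call"}\n')
events = drain_fallback(p)
assert [e["kind"] for e in events] == ["chain.degraded_start", "tool.call"]
assert not p.exists()
```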
Weak secret rejection
The SDK rejects audit secrets that are too short (<32 bytes) or too weak (fewer than 8 unique bytes). This catches the common "I forgot to set the env var" footgun — placeholder values like "x" * 64 are rejected at initialization with a clear error message explaining how to generate a real key.
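The rejection rule can be sketched directly from the two thresholds above; the function name and error wording here are this sketch's own, not the SDK's.

```python
import secrets

def validate_audit_secret(secret: bytes) -> None:
    """Reject secrets that are too short or have too little byte diversity."""
    if len(secret) < 32:
        raise ValueError("audit secret must be at least 32 bytes; "
                         "generate one with secrets.token_hex(32)")
    if len(set(secret)) < 8:
        raise ValueError("audit secret has fewer than 8 unique bytes; "
                         "placeholder values like 'x' * 64 are rejected")

validate_audit_secret(secrets.token_bytes(32))  # a real random key passes
for bad in (b"short", b"x" * 64):               # too short / too uniform
    try:
        validate_audit_secret(bad)
    except ValueError:
        pass
    else:
        raise AssertionError("weak secret was accepted")
```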
export GOVERNANCE_AUDIT_SECRET=$(python -c 'import secrets; print(secrets.token_hex(32))')

Hidden tool policies
Scope enforcement has two layers: allowed tools block execution at call time, while hidden tools go further by removing tools from the LLM's context entirely. A blocked tool produces a ScopeViolation when called; a hidden tool never appears in the tool list the LLM sees, so the agent cannot even attempt to call it.
allowed_tools (block at call time)
Agent sees the tool → tries to call it → ScopeViolation raised → audit-logged
hidden_tools (remove from context)
Tool removed from list via filter_tools() → agent never knows it exists → cannot attempt the call
sdk.scope.register(ScopePolicy(
agent_id="support-agent",
allowed_tools=frozenset({"search", "reply"}),
hidden_tools=frozenset({"delete_account", "escalate_admin"}),
))
# Before passing tools to the LLM:
visible_tools = sdk.scope.filter_tools("support-agent", all_tools)
# "delete_account" and "escalate_admin" are not in visible_tools

Built-in model pricing
The SDK ships a pricing table for 24 models from OpenAI, Anthropic, Google, Meta, and Mistral. Instead of computing and passing usd= manually, call track_usage() with the model name and token counts. The SDK auto-computes the cost.
Prefix matching: Versioned model names like gpt-4o-2024-05-13 automatically match the gpt-4o entry. Longest prefix wins.
Unknown models: Return $0.00 instead of raising. The host application continues unblocked. You can still pass usd= manually via track() for models not in the table.
Supported models: gpt-4o, gpt-4o-mini, gpt-4-turbo, gpt-4, gpt-3.5-turbo, o1, o1-mini, o3-mini, claude-opus-4-6, claude-sonnet-4-6, claude-haiku-4-5, claude-3-opus, claude-3-sonnet, claude-3-haiku, claude-3.5-sonnet, claude-3.5-haiku, gemini-1.5-pro, gemini-1.5-flash, gemini-2.0-flash, llama-3.1-405b/70b/8b, mistral-large/medium/small.
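The longest-prefix-wins lookup can be sketched as below. The per-token prices in this table are illustrative placeholders, NOT the SDK's real pricing data.

```python
# Illustrative per-million-token prices (input $/M, output $/M) — placeholders only.
PRICING = {
    "gpt-4o": (2.50, 10.00),
    "gpt-4o-mini": (0.15, 0.60),
}

def cost_usd(model: str, input_tokens: int, output_tokens: int) -> float:
    """Longest-prefix match so versioned model names resolve to their base entry."""
    matches = [name for name in PRICING if model.startswith(name)]
    if not matches:
        return 0.0  # unknown model: $0.00, never raise — host app continues
    name = max(matches, key=len)  # longest prefix wins: gpt-4o-mini beats gpt-4o
    inp, out = PRICING[name]
    return (input_tokens * inp + output_tokens * out) / 1_000_000

# Versioned name resolves to the same entry as the base name:
assert cost_usd("gpt-4o-2024-05-13", 1000, 1000) == cost_usd("gpt-4o", 1000, 1000)
# "gpt-4o-mini-…" matches both entries; the longer prefix wins:
assert cost_usd("gpt-4o-mini-2024-07-18", 0, 1_000_000) == 0.60
assert cost_usd("totally-unknown-model", 5000, 5000) == 0.0
```

Without longest-prefix-wins, "gpt-4o-mini-2024-07-18" would ambiguously match both "gpt-4o" and "gpt-4o-mini"; taking the longest match resolves it to the more specific entry.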
Loop and anomaly detection
A common failure mode in agentic systems is a runaway loop — the agent calls the same tool over and over, burning tokens without making progress. The loop detection module catches this pattern using a sliding window per (session, tool) pair.
action="raise"
Raises LoopDetected and emits a loop.detected audit event. The agent is stopped immediately. Use for production safety.
action="log"
Emits the loop.detected audit event but does not raise. The agent continues. Use for monitoring before enforcing.
Detection is scoped to (session_id, tool_name). Tool names are normalized to lowercase so Search_Web and search_web are treated as the same tool. The sliding window only counts calls within the last window_seconds.
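The sliding-window mechanism can be sketched as a small class keyed by (session_id, lowercased tool name). A minimal illustration of the behavior described above, not the SDK's implementation; the threshold default here is this sketch's own.

```python
import time
from collections import defaultdict, deque

class LoopDetector:
    """Sliding-window repeat counter per (session_id, tool_name)."""
    def __init__(self, threshold: int = 5, window_seconds: float = 60.0):
        self.threshold = threshold
        self.window = window_seconds
        self._calls = defaultdict(deque)  # (session, tool) -> call timestamps

    def record_call(self, session_id: str, tool_name: str, now=None) -> bool:
        now = time.monotonic() if now is None else now
        key = (session_id, tool_name.lower())  # Search_Web == search_web
        q = self._calls[key]
        q.append(now)
        while q and now - q[0] > self.window:  # drop calls outside the window
            q.popleft()
        return len(q) >= self.threshold        # True -> loop detected

d = LoopDetector(threshold=3, window_seconds=60)
assert not d.record_call("s1", "Search_Web", now=0)
assert not d.record_call("s1", "search_web", now=1)
assert d.record_call("s1", "SEARCH_WEB", now=2)        # 3rd call inside the window
assert not d.record_call("s1", "search_web", now=120)  # old calls have expired
```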
Agent presence
In a multi-agent deployment, knowing which agents are alive is critical. The presence module provides a heartbeat-based lifecycle with three states, stored in your existing Postgres. No background worker, no Redis pub/sub — just a table and a few methods.
Live
Agent is actively sending heartbeats. Last seen within the timeout window.
Idle
Agent explicitly marked itself idle. Still running, but waiting for work.
Unresponsive
No heartbeat received within the timeout. Marked by check_stale().
# Agent startup:
await sdk.presence.heartbeat("billing-agent") # status: Live
# During processing (call periodically):
await sdk.presence.heartbeat("billing-agent") # still Live
# Between tasks:
await sdk.presence.mark_idle("billing-agent") # status: Idle
# Health check (run from a cron or /health endpoint):
stale = await sdk.presence.check_stale(timeout_seconds=300)
# Agents with no heartbeat in 5 min → Unresponsive
# Graceful shutdown:
await sdk.presence.close_agent("billing-agent") # removed

Policy hot-reload
In production, you should not need to restart your application to update a scope or budget policy. With hot-reload enabled, the SDK polls the governance_policies table at a configurable interval and atomically replaces in-memory policies when changes are detected.
sdk = GovernanceSDK(
database_url=os.environ["DATABASE_URL"],
hot_reload=True, # opt-in
hot_reload_interval=30, # poll every 30 seconds (default)
)
# Update a policy in Postgres (e.g. via the console or a migration).
# Within 30 seconds, the running SDK picks up the change — no restart.

Opt-in only: Hot-reload is off by default. Explicit is better than implicit.
In-process: Uses an asyncio task — no background thread, no worker process, no Redis pub/sub.
Atomic replacement: Scope and cost policies are swapped atomically. No partial state.
What reloads: Scope policies and cost/budget policies stored in the governance_policies table.
Session time limits
Beyond token and USD caps, you can set a maximum session duration with per_session_seconds. The SDK tracks when each session started and enforces the limit in check_or_raise(). Valid range: 0 to 86,400 seconds (24 hours).
sdk.cost.register(BudgetPolicy(
agent_id="interview-agent",
per_session_seconds=300, # 5 minutes max
per_session_usd=2.00, # dollar cap still applies too
))
# 6 minutes later...
await sdk.cost.check_or_raise("interview-agent", session_id)
# raises BudgetExceeded: session time limit exceeded: 360s > 300s limit

Self-approval prevention
When an agent sends a heartbeat with operator_id, the console uses that field to enforce self-approval prevention on HITL gates. The operator who owns the agent cannot approve their own agent's requests. This is fail-closed:
operator_id matches user
HTTP 403 — blocked. The operator cannot approve their own agent's actions.
operator_id is NULL
Blocked with audit warning. If the agent didn't register an operator, approval is denied to prevent bypass.
operator_id differs
Allowed. A different operator is approving — verified safe.
# At agent startup, bind the agent to its operator:
await sdk.presence.heartbeat(
"billing-agent",
operator_id="alice@company.com",
metadata={"version": "1.2.0"},
)

Chain fork detection
Beyond verifying each row's HMAC, the chain verification also detects forks — when two events share the same prev_hash. This catches an attacker who inserts a parallel branch into the chain (e.g. splicing a second root or injecting events at a branch point).
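The fork check can be sketched as a duplicate count over prev_hash values; any hash claimed as predecessor by more than one event means a spliced branch. A minimal illustration of the idea, not the SDK's verifier.

```python
from collections import Counter

def find_forks(rows: list[dict]) -> list[str]:
    """Return prev_hash values claimed by more than one event — a chain fork."""
    counts = Counter(row["prev_hash"] for row in rows)
    return [h for h, n in counts.items() if n > 1]

chain = [
    {"event_id": 1, "prev_hash": "genesis"},
    {"event_id": 2, "prev_hash": "aaa"},
    {"event_id": 3, "prev_hash": "aaa"},  # spliced parallel branch at "aaa"
]
assert find_forks(chain) == ["aaa"]
assert find_forks(chain[:2]) == []  # a linear chain has no duplicates
```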
Sync vs Async
The SDK is async-first (GovernanceSDK), but many Python web frameworks (Flask, Django, FastAPI sync routes) run synchronous code. The GovernanceSDKSync wrapper solves this by spinning up a daemon thread with a dedicated asyncio event loop. Every async method becomes a blocking sync call via asyncio.run_coroutine_threadsafe.
GovernanceSDK (async)
For FastAPI, async frameworks, or any code with a running event loop. Uses async with and await.
GovernanceSDKSync (sync)
For Flask, Django, or synchronous code. Uses with and direct method calls. Same API surface, no await.
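The daemon-thread mechanism behind the sync wrapper can be sketched in a few lines. This is an illustration of the run_coroutine_threadsafe pattern the text describes, not GovernanceSDKSync itself; the class and coroutine names here are this sketch's own.

```python
import asyncio
import threading

class SyncBridge:
    """Run coroutines from sync code on a dedicated loop in a daemon thread."""
    def __init__(self):
        self._loop = asyncio.new_event_loop()
        threading.Thread(target=self._loop.run_forever, daemon=True).start()

    def run(self, coro):
        # Block the calling (sync) thread until the coroutine finishes on the loop.
        return asyncio.run_coroutine_threadsafe(coro, self._loop).result()

async def check_scope(agent_id: str, tool: str) -> bool:
    await asyncio.sleep(0)  # stand-in for a real async SDK call
    return tool == "send_email"

bridge = SyncBridge()
assert bridge.run(check_scope("a", "send_email")) is True  # plain blocking call
```

Because every call funnels through one loop on one thread, sync callers in Flask or Django never need a running event loop of their own.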
import os
from codeatelier_governance import GovernanceSDKSync, AuditEvent
with GovernanceSDKSync(database_url=os.environ["DATABASE_URL"]) as sdk:
sdk.audit.log(AuditEvent(agent_id="a", kind="tool.call"))
sdk.scope.check("a", tool="send_email") # blocking
sdk.cost.check_or_raise("a", session_id) # blocking

SSE streaming for the console
The governance console uses Server-Sent Events (SSE) for real-time event streaming. Instead of polling the API, the console frontend opens a persistent connection to GET /api/stream/events and receives audit events, presence changes, and approval updates as they happen. SSE is authenticated — the stream requires either a session cookie or an x-governance-token header.
Event types: audit (new audit events), presence (agent status changes), approval (gate resolutions), cost (budget threshold alerts).
Reconnection: Standard SSE reconnection via the browser's EventSource API. No special client code needed.
Authentication: Session cookie or x-governance-token header required. Expired sessions return 401.
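For clients outside the browser (where EventSource is unavailable), the SSE framing is simple to parse by hand: "event:" and "data:" lines accumulate until a blank line dispatches the record. A minimal parser sketch; the payloads below are illustrative, not real console output.

```python
def parse_sse(raw: str) -> list[dict]:
    """Parse SSE text into records; a blank line terminates each record."""
    records, event, data = [], "message", []
    for line in raw.splitlines():
        if line.startswith("event:"):
            event = line[len("event:"):].strip()
        elif line.startswith("data:"):
            data.append(line[len("data:"):].strip())
        elif line == "":  # blank line dispatches the accumulated record
            if data:
                records.append({"event": event, "data": "\n".join(data)})
            event, data = "message", []
    return records

stream = (
    "event: presence\n"
    'data: {"agent_id": "billing-agent", "status": "Live"}\n'
    "\n"
    "event: cost\n"
    'data: {"agent_id": "billing-agent", "alert": "80% of budget"}\n'
    "\n"
)
events = parse_sse(stream)
assert [r["event"] for r in events] == ["presence", "cost"]
```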