Agent Behavioral
Governance Specification

The first open specification for AI agent behavioral governance. Defines what goes in a SOUL.md file, measures governance coverage across nine domains, and provides three conformance levels for auditing agent deployments.

terminal
# Generate a governance file
$ npx hackmyagent harden-soul

# Scan for governance coverage
$ npx hackmyagent scan-soul

  SOUL.md Governance Scan
  File: SOUL.md (2,847 chars)

  Domain 07: Trust Hierarchy ....... PASS
  Domain 08: Capability Bounds ..... PASS
  Domain 09: Injection Hardening ... PASS
  Domain 10: Data Handling ......... PASS
  Domain 11: Hardcoded Behaviors ... PARTIAL
  Domain 12: Agentic Safety ........ PASS
  Domain 13: Honesty ............... PASS
  Domain 14: Human Oversight ....... PARTIAL
  Domain 15: Harm Avoidance ........ PASS

  Conformance: Standard (35/42 controls)

The Gap ABGS Fills

The AI agent ecosystem has standards for several layers of the stack, but behavioral governance at the deployment level has been unaddressed.

LayerStandardCoverage
Foundation modelAnthropic Soul, OpenAI Model SpecHow the base model should behave
Agent personaSoulSpecPersonality, tone, and character traits
Agent capabilitiesAgent SkillsProcedural knowledge and tools
InfrastructureNIST AI Agent StandardsIdentity, interoperability, infrastructure
Coding instructionsAGENTS.mdInstructions for AI coding agents
Behavioral governanceABGS (this specification)Per-agent safety constraints, scope boundaries, oversight requirements

Nine Governance Domains

ABGS covers domains 7 through 15 of the OASB framework. Each domain contains specific, testable controls that can be scanned automatically.

Domain 7

Trust Hierarchy

Defines who the agent trusts, in what order, and how conflicts between principals are resolved.

Domain 8

Capability Boundaries

Declares what the agent can and cannot do. Explicit scope limits, denied actions, and escalation paths.

Domain 9

Injection Hardening

How the agent handles prompt injection, context manipulation, and adversarial inputs.

Domain 10

Data Handling

Rules for sensitive data: what to retain, what to redact, and how to handle PII across sessions.

Domain 11

Hardcoded Behaviors

Non-negotiable behaviors that cannot be overridden by any principal, including the developer.

Domain 12

Agentic Safety

Constraints for tool use, multi-step planning, resource consumption, and autonomous decision-making.

Domain 13

Honesty and Transparency

Requirements for truthfulness, uncertainty expression, and disclosure of agent identity and limitations.

Domain 14

Human Oversight

When the agent must pause and request human approval before proceeding with high-impact actions.

Domain 15

Harm Avoidance

Guidance for exercising judgment in gray areas where an action is technically permitted but may cause unintended harm.

Conformance Levels

Three tiers of governance maturity. Start with Essential and work toward Hardened as your agent deployment matures.

Essential~15 controls

Minimum governance for any deployed agent. Covers trust hierarchy, basic boundaries, and data handling.

Standard~35 controls

Full behavioral governance for tool-using and agentic agents. Adds injection hardening, agentic safety, and human oversight.

Hardened~50 controls

Maximum governance for multi-agent systems and high-risk deployments. All nine domains fully covered.

Relationship to OASB

OASB v1 -- Infrastructure Security

Domains 1-6: Authentication, authorization, transport security, logging, configuration, dependency management.

ABGS -- Behavioral Governance

Domains 7-15: Trust hierarchy, capability boundaries, injection hardening, data handling, hardcoded behaviors, agentic safety, honesty, human oversight, harm avoidance.

OASB v2 -- Full-Stack Assessment

Integrates both into domains 1-15 for comprehensive AI agent security assessment covering infrastructure and behavior.

Quick Start

Generate a governance file, scan it for coverage, and iterate until you reach your target conformance level.

terminal
# 1. Generate a SOUL.md with guided prompts
$ npx hackmyagent harden-soul

# 2. Scan the file for governance coverage
$ npx hackmyagent scan-soul

# 3. Or start from a template
$ npx hackmyagent harden-soul --template agentic

# 4. Validate conformance level
$ npx hackmyagent scan-soul --level standard

Define your agent's behavioral contract

Start with a template, scan for coverage, and reach conformance.