Semantic Security Intelligence
for AI Agents

On-device ML engine that understands agent artifacts semantically. Compiles skills, configs, and prompts into Abstract Security Trees. Powers HackMyAgent, ARP, and the Registry.

View on GitHub Classifier on HuggingFace

Full documentation at nanomind.dev | Open source, Apache 2.0 license

terminal

$ npx hackmyagent secure
  NanoMind runs automatically with every scan

$ npx hackmyagent secure --deep
  Layer 2: 5-probe behavioral simulation
  Layer 3: 20-probe full simulation (with --deep)

  NanoMind: 3 semantic findings
  SEM-001  Undeclared file-system capability (scope mismatch)
  SEM-002  Weak governance constraint (override-resistant: false)
  SEM-003  Credential forwarding to external endpoint

What NanoMind Does

A compiler-style semantic engine that turns raw agent artifacts into structured security intelligence.

Semantic Compiler

Compiles skills, MCP configs, SOUL.md, and system prompts into Abstract Security Trees. Analyzers query the AST instead of regex on raw text.

6 analyzers: capability, credential, governance, scope, prompt, code

28 semantic checks across the AST

HMAC-SHA256 signed AST output

Self-Securing

Verifies its own binary integrity on every startup. Tampered binaries enter QUARANTINE mode.

Integrity manifest checked at boot

Tamper detection via content hashing

QUARANTINE mode blocks compromised inference

Zero Cost Inference

Runs 100% locally on any CPU via ONNX runtime. No API keys, no cloud calls, no telemetry. Sub-millisecond inference at 8.3 MB model size (v0.5.0 ONNX + weights + tokenizer).

On-device ONNX model, no GPU required

Auto-downloads on first scan

3-tier cascade: TME classifier, Layer 2, Layer 3

Defense-in-Depth

NanoMind can upgrade findings but NEVER suppress static analysis. Even a compromised NanoMind gains the attacker nothing.

AST findings upgrade, never suppress static checks

Static analysis is the floor, not the ceiling

Merge rules enforced in scanner bridge

Continuous Learning

Every scan generates training data. The model improves from real-world usage. Claude reviews all labels before training to prevent data poisoning.

3,168 training samples across 10 classes

Claude-reviewed label quality pipeline

AgentPwn honeypot feeds real attack data

Two Tiers

NanoMind-Edge (8.3 MB ONNX) for local real-time analysis in HMA CLI. NanoMind-Server for deep Registry scanning with behavioral simulation.

Edge: sub-ms inference, ships with HMA

Server: Layer 2 (5 probes) + Layer 3 (20 probes)

Federated learning preserves privacy

Architecture

A 9-step pipeline from raw artifact ingestion to merged findings.

nanomind pipeline

Artifact (skill, MCP, SOUL, prompt)
    |
    v
Secure Ingestion
    |- Validate structure
    |- Sanitize NanoMind prompt injection
    |- Compute content hash
    |
    v
NanoMind Semantic Compiler
    |- Extract declared capabilities
    |- Extract constraints + enforceability
    |- Classify intent (NanoMind model or heuristic)
    |- Map risk surfaces
    |- Sign AST (HMAC-SHA256)
    |
    v
Abstract Security Tree (AST)
    |
    v
6 AST Analyzers (28 checks)
    |- Capability (undeclared caps, scope mismatch)
    |- Credential (exposure, forwarding, hardcoded)
    |- Governance (gaps, weak constraints, override resistance)
    |- Scope (wildcards, undeclared permissions)
    |- Prompt (jailbreak, capability creep, authority confusion)
    |- Code (injection, unsafe deser, path traversal)
    |
    v
Findings (merged with static analysis, defense-in-depth)

Trained Model

Open-source TME (Threat Mamba Engine) classifier trained on curated, Claude-reviewed security corpus. Published on HuggingFace.

98.45%

Eval Accuracy (v0.5.0)

0.978

Macro F1

Attack Classes

3,168

Training Samples

View on HuggingFace -- open-source, Apache 2.0 license.

The reasoning tier, a Qwen3-1.7B generative analyst that produces structured analysis with evidence and remediation, is published separately: nanomind-security-analyst on HuggingFace.

Both tiers, plus the Metal-stable MLX analyst build, are grouped in one place: the NanoMind model collection on HuggingFace.

Try the classifier live

Paste an agent artifact, an MCP config, a SKILL.md, or a SOUL.md, and see the raw attack-pattern read from the inline classifier tier. This tier is a high-recall pre-filter, not a verdict. It over-flags benign input by design, which is why HackMyAgent corrects the signal and escalates to the analyst tier before producing findings.

Open the demo on HuggingFace Browse the model collection

Runs on the free HuggingFace CPU tier. Sub-millisecond inference on an 8.3 MB on-device model. Apache 2.0.

What's Shipping (April 2026)

v0.5.0 remains production. Three components landed this month, moving NanoMind from a single classifier to a layered analyst stack.

v3.0.0-beta2026-04-16

Qwen3-1.7B generative analyst

Generative reasoning that produces structured analysis with evidence and remediation, not just a label. Oracle canon 10-way 0.700, binary 0.978. FP-suppression on real security code is ~77% (corrected 2026-06-03; the earlier 57% was measured on a gate later found to be 41% placeholder filler). Production; spot-check findings on dedicated security libraries.

Input gate · PR #132026-04-17

Input-classifier gate

MiniLM-L6 + sklearn LR classifier plus a byte-level BIDI/stego pre-filter, running ahead of the NLM. Ships at threshold 0.90 (raised from the initial 0.65), tuned to prioritize attack recall.

Daemon · PR #142026-04-17

NanoMind-Guard daemon

Unix socket /tmp/nanomind-guard.sock serves v3.0.0-beta (bf16 on Apple MPS) + the gate over JSON-Lines. Cold boot <30s, bypass p50 <15ms, healthz 116/116. Fail-CLOSED. No consumers wired yet.

Semantic security, built in

NanoMind ships inside HackMyAgent. Install HMA and NanoMind runs automatically on every scan. No configuration needed.

GitHub HuggingFace

Semantic Security Intelligencefor AI Agents