MoltGuard - OpenClaw Security Plugin by OpenGuardrails

Low
by ThomasLWang | Audited: 2026-02-26T09:59:20.936Z | Ruleset: 0.2.0

Quick Install

Add this skill to your agent

clawhub install moltguard

About This Skill

MoltGuard — runtime security plugin for OpenClaw agents by OpenGuardrails. Helps users install, register, activate, and check the status of MoltGuard. Use wh...

Use Cases

Documentation (Original)

Source: SKILL.md
The following is the author's original documentation (often English). For installation, follow “Quick Install” above.

name: moltguard
version: 6.6.4
description: "MoltGuard — runtime security plugin for OpenClaw agents by OpenGuardrails. Helps users install, register, activate, and check the status of MoltGuard. Use when the user asks to: install MoltGuard, check MoltGuard status, register or activate MoltGuard, configure the AI Security Gateway, or understand what MoltGuard detects. Provides local-first protection against data exfiltration, credential theft, command injection, and sensitive data leakage. Source: https://github.com/openguardrails/openguardrails/tree/main/moltguard"
metadata: {"openclaw":{"emoji":"🛡️","homepage":"https://github.com/openguardrails/openguardrails/tree/main/moltguard","keywords":["security","prompt-injection","data-exfiltration","pii","credential-theft","command-injection","guardrails","safety","agent-security","moltguard"]}}

MoltGuard

Runtime security guard for OpenClaw agents by OpenGuardrails. Open source (Apache 2.0), fully auditable.

ClawHub: ThomasLWang/moltguard · npm: @openguardrails/moltguard · GitHub: openguardrails/openguardrails/tree/main/moltguard


Security & Provenance

Verify provenance before installing. The npm package should match the auditable GitHub source. To check:

# Download and inspect the package contents
npm pack @openguardrails/moltguard --dry-run

# Or do a full diff against the cloned repo:
mkdir /tmp/moltguard-audit && cd /tmp/moltguard-audit
npm pack @openguardrails/moltguard
tar -xzf openguardrails-moltguard-*.tgz
git clone https://github.com/openguardrails/openguardrails
diff -r package/scripts openguardrails/moltguard/scripts

Prefer source install if you have low trust in the publisher: openclaw plugins install -l ./openguardrails/moltguard

What activate.mjs does: Makes an outbound call to https://www.openguardrails.com/core to register the agent. Once you visit the claim URL and verify your email, the returned API key is written to ~/.openclaw/credentials/openguardrails/credentials.json. Do not activate if you do not trust this endpoint. All local protections work without activation.

Network behavior:

  • Before activation: no outbound calls are made. Fully air-gapped local detection.
  • After activation: each agent tool call sends a behavioral assessment request to https://www.openguardrails.com/core. No message content is sent — only tool names, sequence, and metadata.

Least privilege: Treat the API key written by activation as sensitive. Rotate or revoke it via the account portal (https://www.openguardrails.com/core/login) if needed. You can keep local-only protections indefinitely by simply not activating the cloud component.


What to Consider Before Installing

Before installing or running activate.mjs:

1. Verify provenance.
Compare the npm package to the GitHub source (npm pack + diff) as described in the Security & Provenance section above. Only proceed if the contents match the source you trust.

2. Inspect the scripts yourself.
scripts/activate.mjs and scripts/status.mjs are short and readable. Review them directly before running.

3. Understand what activation writes.
Activation writes an API key to ~/.openclaw/credentials/openguardrails/credentials.json. If you later distrust the endpoint, delete that file and revoke the key via the platform portal.

4. Activate from a monitored environment if you need high assurance.
Use tcpdump, mitmproxy, or Little Snitch to confirm outbound destinations and that payloads contain only sanitized metadata — not message content.

Local protections remain fully functional without activation — no outbound calls, no credentials written.


Agent Workflow

Check status

Run this when the user asks about MoltGuard status, whether it's registered, or what email/plan is active:

node {baseDir}/scripts/status.mjs

Register and activate

Run this when the user wants to register MoltGuard or enable cloud behavioral detection:

node {baseDir}/scripts/activate.mjs

After running, show the user the claim URL and verification code. Tell them to visit the URL, enter the code, and provide their email to complete activation. On success, an API key is written to ~/.openclaw/credentials/openguardrails/credentials.json — inform the user so they know a credential file was created.


Installation

When the user asks to install MoltGuard, run these steps:

# 1. Install the plugin
openclaw plugins install @openguardrails/moltguard

# 2. Restart the gateway to load the plugin
openclaw gateway restart

Local protections are active immediately after restart — no registration required. Cloud behavioral detection requires registration (run activate.mjs above).

To install from source (highest assurance):

git clone https://github.com/openguardrails/openguardrails.git
# Audit the code, then:
openclaw plugins install -l ./openguardrails/moltguard

What MoltGuard Protects Against

Local (no cloud, no registration needed):

  • Read sensitive file → network call → BLOCK
  • Shell escape in params ($(), backtick, ;, &&, |) → BLOCK
  • Prompt injection in file/web content → REDACT in-place

Cloud (requires activation):

  • Multi-credential access, shell after web fetch → BLOCK
  • Intent-action mismatch, unusual tool sequence → ALERT

For full detection tables and pattern details, see references/details.md.


AI Security Gateway (Free, no registration)

Local HTTP proxy that sanitizes PII/secrets before they reach LLM providers:

npx @openguardrails/gateway   # runs on port 8900

Then point your agent's API base URL to http://127.0.0.1:8900. Sanitizes emails, credit cards, API keys, phone numbers, SSNs, IBANs, IPs, URLs. Restores originals in responses. Stateless — no data retained.


Configuration

All options in ~/.openclaw/openclaw.json under plugins.entries.openguardrails.config:

Option Default Description
enabled true Enable/disable the plugin
blockOnRisk true Block tool call when risk detected
apiKey "" Explicit API key (sk-og-...)
agentName "OpenClaw Agent" Name shown in dashboard
coreUrl https://www.openguardrails.com/core Platform API endpoint
timeoutMs 60000 Cloud assessment timeout (ms)

To use an existing API key directly (skips registration):

{
  "plugins": {
    "entries": {
      "openguardrails": {
        "config": { "apiKey": "sk-og-<your-key>" }
      }
    }
  }
}

Plans

Plan Price Detections/mo
Free $0 30,000
Starter $19/mo 100,000
Pro $49/mo 300,000
Business $199/mo 2,000,000

Account portal: https://www.openguardrails.com/core/login (email + API key)


Uninstall

rm -rf ~/.openclaw/extensions/openguardrails
# Remove config from ~/.openclaw/openclaw.json
rm -rf ~/.openclaw/credentials/openguardrails   # optional

Reference

For detailed information on security & trust, detection patterns, privacy policy, and gateway data types, read references/details.md.

Security Audit

Low

Summary

MoltGuard — runtime security plugin for OpenClaw agents by OpenGuardrails. Helps users install, register, activate, and check the status of MoltGuard. Use wh...

Risk Profile Toxicity Privacy Scope Reputation Quality

ToxicSkills Analysis

Blocklist
Not matched
Prompt Injection
Not detected

No Toxic signals detected by current static checks.

Key Risks 0 items

No LLM risk bullets (LLM disabled or not cached).

Deterministic Findings (Evidence)

No findings detected.

Scoring Criteria

Each skill is scored across 5 dimensions. The weighted total determines the star rating.

Code Toxicity 100/100 (weight 30%)
Privacy Risk 100/100 (weight 25%)
Permission Scope 100/100 (weight 20%)
Author Reputation 75/100 (weight 15%)
Code Quality 70/100 (weight 10%)

Star Rating Scale

5★ Safe — Score ≥ 80
4★ Good — Score 70–79
3★ Caution — Score 60–69
2★ Risky — Score 40–59
1★ Dangerous — Score < 40

Why This Score?

All dimensions scored above 60. This skill passed the safety baseline.

Explore More Skills

VettedSkillsHub

We curate the top 100 most downloaded skills from ClawHub — the official ClawdBot (OpenClaw) marketplace — then run independent 5-dimension security audits. Transparent evidence, reproducible scores, 1-click install.

About

Best-effort static analysis. Scores are reproducible and evidence-based. Always review code and run in isolated environments for sensitive use.

© 2026 VettedSkillsHub. ClawdBot & OpenClaw are community projects.