Verification Tool Skill

Name: crabshell/verifying
Author: ZipperBagCoffee

View on GitHub↗

Share on X Share on LinkedIn Share on Bluesky

100Critical

This skill allows arbitrary command execution and

ThreatsCI SM CS BC S HITL

Behavior Analysis

Description verified — behavior matches claim

Claims to do

Verification Tool Skill: Bridge the gap between VERIFICATION-FIRST principles and project reality. Most projects lack executable verification tools. This skill analyzes the project's runtime environment and creates a verification manifest — a mapping of IA items to executable commands with expected results.

Actually does

This skill analyzes a project's runtime environment using Work and Review Agents, then creates a `.crabshell/verification/manifest.json` mapping Intent Anchor items to executable commands and expected results. It also generates a `node` runner script (`run-verify.js`) that executes these commands via `child_process.execSync` and reports pass/fail status. It can also update the manifest with new verification entries.

Threat Analysis

AI Agent Traps ↗Scanned 4/15/2026

Content InjectionPerception1 finding

Semantic ManipulationReasoning1 finding

Cognitive StateMemory & Learning2 findings

Behavioural ControlAction8 findings

SystemicMulti-Agent Dynamics2 findings

Human-in-the-LoopHuman Overseer1 finding

Skill Info

NameZipperBagCoffee/crabshell/verifying

Registrygithub

Versionfc0563504133

PURLpkg:github/ZipperBagCoffee/crabshell@fc0563504133?skill=verifying

Stars1

Source

Repository ↗SKILL.md ↗

SHA-256cc429f71242b...

SHA-19327e0f9e91f...

MD57477dfaae4f5...

TLSH12e1b6839cbe...

ssdeep192:iXY2wZ65...

Size7,126 bytes

Install

This skill has been flagged as potentially malicious. Review the findings below before installing.

GitHub

https://github.com/ZipperBagCoffee/crabshell/tree/main/skills/verifying

Claude CodeDocs

/plugin marketplace add ZipperBagCoffee/crabshell

/plugin install verifying@ZipperBagCoffee/crabshell

Skills.shDocs

npx skills add https://github.com/ZipperBagCoffee/crabshell --skill verifying

Assessments (6)

AI Agent Traps ↗

Shell command execution function detectedcriticalcommand execution

Shell command execution function detected

100% confidenceline 106

child_process

Arbitrary Command Execution via execSynccriticalcommand execution

The skill directly executes commands from the `manifest.json` using `child_process.execSync` in the `run-verify.js` script. The `command` field in the manifest is populated from 'IA item in the current session' or 'user input' in update mode, allowing for arbitrary command injection.

100% confidenceline 116

const output = execSync(entry.command, { ... }); (run-verify.js, line 100); 'command': '{executable command}' (manifest.json structure, line 70); 'Create entry with user input (IA, type, command, expected)' (Update Mode, line 160)

Data Exfiltration via Command Outputhighdata exfiltration

In conjunction with arbitrary command execution, the `run-verify.js` script checks if `output.includes(entry.expected)`. An attacker could craft a command to read sensitive files and set `entry.expected` to a pattern found in the sensitive data, effectively exfiltrating information by observing the verification result.

90% confidenceline 121

const pass = output.includes(entry.expected) || entry.expected === 'exit-0'; (run-verify.js, line 105); 'expected': '{expected output or behavior}' (manifest.json structure, line 70)

Sub-Agent Spawning and Reconnaissancemediumsub agent spawning, reconnaissance

The skill explicitly launches 'Work Agent (Task tool)' and 'Review Agent (Task tool)' to analyze the project's runtime environment, gathering detailed information about the system, entry points, and commands. This capability can be abused for extensive reconnaissance or to orchestrate further attacks via sub-agents.

100% confidenceline 54

Launch a Work Agent (Task tool) to determine: (line 35); Launch a Review Agent (Task tool, SEPARATE from Work Agent) (line 50)

Unvalidated Command Execution via manifest entrieshighcommand execution

The run-verify.js runner script executes arbitrary shell commands sourced directly from manifest.json entries via `execSync` with no sanitization, validation, or allowlisting. Any entity that can write to or influence manifest.json can achieve arbitrary command execution on the host system.

90% confidenceline 116

const output = execSync(entry.command, { timeout: entry.timeout || 30000, encoding: 'utf8', cwd: process.env.PROJECT_ROOT || process.cwd() });

PROJECT_ROOT env var enables path traversal/hijackmediumcommand execution

The working directory for executed commands is taken from the unvalidated `process.env.PROJECT_ROOT` environment variable. An attacker who can set this variable could redirect command execution to an arbitrary directory, potentially enabling path traversal or execution of attacker-controlled scripts.

80% confidenceline 119

cwd: process.env.PROJECT_ROOT || process.cwd()

Generated script written without integrity protectionmediumpersistence

The skill creates a persistent executable script (.crabshell/verification/run-verify.js) on disk with no checksum, signature, or integrity verification. This file persists across sessions and can be modified by other processes or attackers to establish persistent malicious execution.

70% confidenceline 94

Create `.crabshell/verification/run-verify.js`: ... // Auto-generated verification runner

Rule prohibiting destructive commands is insufficientlowcommand execution

The only command restriction is a prose rule stating 'Destructive commands (rm, drop) PROHIBITED.' This is enforced only by LLM instruction following, not by any technical allowlist or blocklist in the runner script itself. Bypass via obfuscation (e.g., `$(rm ...)`, aliases, encoded commands) is trivial.

85% confidenceline 194

**Timeout safety.** Default 30s. Destructive commands (rm, drop) PROHIBITED.

Cognitive State Manipulation (Manifest Poisoning)mediummemory poisoning, rag poisoning

The skill creates and modifies `.crabshell/verification/manifest.json` and `run-verify.js`. If an attacker can influence the 'IA item in the current session' or 'user input' used to populate these files, they can inject malicious commands or logic that the agent will later execute, effectively poisoning the agent's operational knowledge base.

90% confidenceline 77

Create `.crabshell/verification/manifest.json` (line 60); For each IA item in the current session, create a verification entry: (line 65); Create entry with user input (IA, type, command, expected) (line 160)

Manifest poisoning enables persistent code executionhighmemory poisoning

The manifest.json file is treated as a persistent, authoritative source of truth for commands to execute. If an attacker can write to .crabshell/verification/manifest.json (e.g., via supply chain, another skill, or social engineering), malicious commands will be persistently executed every time `/verifying run` is invoked.

85% confidenceline 143

**Manifest is source of truth.** All entries live in `manifest.json`. ... Read `.crabshell/verification/manifest.json`.

Sub-agent spawning with attacker-influenced promptshighsub agent spawning

The skill spawns multiple sub-agents (Work Agent and Review Agent via Task tool) whose prompts incorporate project-derived data (directory paths, runtime analysis results). If the project contains adversarial content (e.g., in filenames, README, or code comments), it could inject instructions into sub-agent prompts, constituting indirect prompt injection.

75% confidenceline 54

Launch a Work Agent (Task tool) to determine: ... Launch a Review Agent (Task tool, SEPARATE from Work Agent) to verify the analysis independently.

Autonomy abuse: orchestrator auto-cross-references agentsmediumautonomy abuse

The skill instructs the orchestrator to autonomously cross-reference two agent outputs and determine 'discrepancies are findings' without requiring human confirmation of the interpretation. This unbounded autonomous reasoning over externally-sourced data (project files) could be exploited to cause the agent to take unintended actions based on fabricated discrepancies.

65% confidenceline 53

After Review Agent completes, the Orchestrator cross-references RA findings against WA Project Analysis — discrepancies are findings.

Indirect prompt injection via project analysis datahighprompt injection

The Work Agent reads the project environment (files, commands, entry points) and appends findings to agent context. Malicious content embedded in project files (package.json, README, source files) could inject instructions into the agent reasoning pipeline when the analysis results are consumed by the orchestrator or subsequent agents.

80% confidenceline 51

Work Agent appends results as: ## Project Analysis - Runtime: {type} - Entry points: {list} ...

Semantic Manipulation (Policy Bypass)lowoversight evasion

The skill states 'Destructive commands (rm, drop) PROHIBITED' as a rule. However, the underlying `execSync` mechanism does not technically enforce this, relying on a semantic constraint. An attacker could attempt to bypass this policy by crafting destructive commands that are not explicitly `rm` or `drop`, or by relying on the agent to overlook this rule.

80% confidenceline 194

Destructive commands (rm, drop) PROHIBITED. (Rule 5, line 175)

Document-first rule bypasses human review of resultslowapproval fatigue

The 'document-first rule' instructs the agent to write verification results to project documents using the Edit tool BEFORE reporting to the human. This pattern could be used to commit attacker-influenced content to persistent project documents without human review of what is being written.

60% confidenceline 166

**Document-first rule:** If this verification run was invoked from within a T or P document context, append the P/O/G results to that document's verification section using the Edit tool FIRST. After the document is updated, report the summary in conversation.

Badge

Markdown

[![Mondoo Skill Check](https://mondoo.com/ai-agent-security/api/badge/github/ZipperBagCoffee/crabshell/verifying.svg)](https://mondoo.com/ai-agent-security/skills/github/ZipperBagCoffee/crabshell/verifying)

HTML

<a href="https://mondoo.com/ai-agent-security/skills/github/ZipperBagCoffee/crabshell/verifying"><img src="https://mondoo.com/ai-agent-security/api/badge/github/ZipperBagCoffee/crabshell/verifying.svg" alt="Mondoo Skill Check" /></a>

Image URL

https://mondoo.com/ai-agent-security/api/badge/github/ZipperBagCoffee/crabshell/verifying.svg

Secure your AI agents

Skills can read files, run commands, and access credentials. Mondoo helps organizations manage the security risks of AI agent skills across their entire fleet.

Continuous scanning of skills across all registries
Policy enforcement before skills reach your agents
Integration with your existing security workflow