Permission Classifier -- EasyClaw v2

An agent that's too autonomous is dangerous. An agent that asks permission for everything is unusable. The permission classifier finds the right balance.

Before every action, the agent automatically evaluates its risk level. Read a file? No-risk action, immediate execution. Delete a directory? Irreversible action, confirmation required.

The result is an agent that moves fast on routine tasks and stops at the right moments. No unnecessary micromanagement, no risky action without validation.

How it works

Interception

Every action the agent is about to execute passes through the classifier before execution.

Risk evaluation

The classifier analyzes: action type (read/write/delete), scope (local/remote), reversibility, potential impact.

User context

The history of permissions granted or denied by the user is considered.

Decision

Three possible outcomes: autonomous execution, confirmation request, or refusal with explanation.

Learning

User decisions feed the model. The agent gradually adapts to each user's preferences.

Real-world use cases

Development

The agent reads and modifies code files without asking. But it asks confirmation before launching a deployment or modifying a production environment variable.

Communication

The agent drafts emails freely but asks for validation before sending. Except for routine internal replies.

System administration

The agent runs diagnostic commands without asking. It requests confirmation for service restarts and refuses destructive commands.

Configuration

featurePages.permissionClassifier.config

Availability

Active by default on all EasyClaw v2 agents
Configurable per agent and per user
Compatible with the OpenClaw governance system

Hook System Markdown Agent Definitions