Agent Integration¶

This guide explains how agentvfs should fit into an agent runtime.

Preferred Model¶

The preferred agent model is not direct shell access. It is a mediated execution boundary:

agent -> proxy boundary -> mounted forked workspace -> cli tools

That boundary should control:

which workspace the command runs in
whether the command is allowed
whether a checkpoint should be taken first
how the workspace is mounted
what changed after execution

Why This Model¶

This model gives the agent runtime a single, cheap control point for all shell work.

The proxy boundary is intended to govern the top-level command requested by the agent. It is not meant to trace every subprocess or syscall.

That trade-off keeps the model practical:

cheap enough to use per command
strong enough to prepare and isolate workspaces
clear enough to audit what the agent asked to do

Runtime Pieces¶

The boundary is built from four lower-level primitives already present in the project:

Vaults for durable state
Forks for task-scoped workspaces
Checkpoints for rollback
Mounts for tool compatibility

Python SDK (preferred for Python agents)¶

If your agent runtime is in Python, use the official SDK from PyPI:

pip install agentvfs-cli

from agentvfs import AVFS

root = AVFS()
root.create_vault("agent-workspace")
root.fork_vault("agent-workspace", "agent-workspace-task-1")

task = AVFS("agent-workspace-task-1")
task.checkpoint_save("before-change")

result = task.proxy_exec("cargo", "test", cwd="/")
print(result["execution_result"]["exit_code"])
print(result["execution_result"]["changed_files"])

The SDK lazily downloads the matching native avfs binary on first invocation and caches it under ~/.cache/agentvfs/<version>/. See the Python SDK reference for the full API and the installation guide for the resolution order.
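Because the `proxy_exec` result shown above is indexed like a plain dictionary, a runtime can reduce it to a one-line status before handing it back to the agent. The sketch below relies only on the two keys used in the example (`exit_code` and `changed_files` under `execution_result`). It assumes `changed_files` is a list, which this guide does not state, and `summarize_result` is a hypothetical helper, not part of the SDK.

```python
def summarize_result(result: dict) -> str:
    """Reduce a proxy_exec-style result to a short status line.

    Assumption: result["execution_result"] holds "exit_code" and
    "changed_files", and "changed_files" is a list of paths.
    """
    execution = result["execution_result"]
    exit_code = execution["exit_code"]
    changed = execution.get("changed_files") or []
    status = "ok" if exit_code == 0 else f"failed (exit {exit_code})"
    return f"{status}, {len(changed)} changed file(s)"


# Hand-written sample data for illustration, not real avfs output.
sample = {"execution_result": {"exit_code": 1, "changed_files": ["target/debug/app"]}}
print(summarize_result(sample))
```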

Raw subprocess integration¶

If you can't use the SDK (non-Python runtime, or you need full control over the avfs invocation), you can compose the CLI directly:

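import json
import subprocess

def avfs(*args):
    """Run `avfs --json <args...>` and return the parsed JSON output, or None if there is none."""
    result = subprocess.run(
        ["avfs", "--json"] + [str(a) for a in args],
        capture_output=True,
        text=True,
    )
    # A non-zero exit with nothing on stdout most likely means avfs itself failed,
    # so surface stderr instead of silently returning None.
    if result.returncode != 0 and not result.stdout:
        raise RuntimeError(result.stderr.strip() or f"avfs exited with {result.returncode}")
    return json.loads(result.stdout) if result.stdout else None

# Create the durable vault, fork a task workspace, then checkpoint the fork.
avfs("vault", "create", "agent-workspace")
avfs("vault", "fork", "agent-workspace", "agent-workspace-task-1")
avfs("--vault", "agent-workspace-task-1", "checkpoint", "save", "before-change")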

This works, but it is not the intended long-term interface, because the agent has to orchestrate each of these by hand:

workspace choice
fork creation
checkpoint creation
mount lifecycle

Current Proxy Contract¶

The current higher-level path is proxy exec:

avfs --json --vault agent-workspace-task-1 proxy exec -- cargo test

The output of proxy exec is versioned and shaped as a single execution envelope, so agents can reliably parse one object per invocation, even when the proxied command exits non-zero.

The envelope includes:

schema_version
kind
success
normalized request metadata
structured execution result
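A consumer of the envelope can validate the top-level fields before trusting the rest. The following is a minimal sketch, not the project's parser: `schema_version`, `kind`, and `success` are the names listed above, `execution_result` is the key used in the SDK example, and the key that carries the request metadata is not named in this guide, so the full payload is kept in `raw` rather than guessed at. `Envelope` and `parse_envelope` are hypothetical names.

```python
import json
from dataclasses import dataclass
from typing import Any


@dataclass(frozen=True)
class Envelope:
    schema_version: Any     # type is not stated in the guide; kept as received
    kind: str
    success: bool
    execution_result: dict
    raw: dict               # full payload, including the request metadata


def parse_envelope(stdout: str) -> Envelope:
    """Parse the single JSON object printed by one proxy exec invocation."""
    payload = json.loads(stdout)
    if not isinstance(payload, dict):
        raise ValueError("expected one JSON object per invocation")
    missing = [k for k in ("schema_version", "kind", "success") if k not in payload]
    if missing:
        raise ValueError(f"envelope is missing fields: {missing}")
    return Envelope(
        schema_version=payload["schema_version"],
        kind=payload["kind"],
        success=bool(payload["success"]),
        execution_result=payload.get("execution_result", {}),
        raw=payload,
    )


# Hand-written sample; the values are illustrative, not real avfs output.
sample = (
    '{"schema_version": 1, "kind": "proxy_exec", "success": false, '
    '"execution_result": {"exit_code": 101}}'
)
envelope = parse_envelope(sample)
print(envelope.success, envelope.execution_result["exit_code"])
```

Checking `success` and the exit code separately matters here: a well-formed envelope is still returned when the proxied command fails, so a parse error and a failed command are different conditions.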

Intended Proxy Responsibilities¶

The proxy boundary should eventually own this lifecycle:

receive one top-level command from the agent
resolve the target vault or fork
classify the command
allow it, deny it, or take a checkpoint before allowing it
mount the workspace
execute the command
report stdout, stderr, exit code, and changed files
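The lifecycle above can be sketched as one function with each step injected as a callable. This is an illustration of the ordering only, under stated assumptions: `handle_command` and its parameters are hypothetical, the workspace is taken to be already resolved, mounting is folded into the `execute` callable, and the outcome strings are borrowed from the Policy Scope section.

```python
from typing import Callable


def handle_command(
    command: list[str],
    workspace: str,
    classify: Callable[[list[str]], str],
    decide: Callable[[str], str],
    checkpoint: Callable[[str], None],
    execute: Callable[[str, list[str]], dict],
) -> dict:
    """Run one top-level command through the lifecycle steps, in order."""
    decision = decide(classify(command))        # classify, then pick an outcome
    if decision == "deny":
        # A denied command never reaches the checkpoint, mount, or execute steps.
        return {"decision": decision, "executed": False}
    if decision == "allow_with_checkpoint":
        checkpoint(workspace)                    # rollback point before running
    result = execute(workspace, command)         # mount, run, collect results
    return {"decision": decision, "executed": True, **result}


# Stub callables stand in for the real policy, checkpoint, and execution layers.
checkpoints: list[str] = []
report = handle_command(
    ["rm", "-rf", "build"],
    "agent-workspace-task-1",
    classify=lambda cmd: "destructive" if cmd[0] == "rm" else "read_only",
    decide=lambda label: "allow_with_checkpoint" if label == "destructive" else "allow",
    checkpoint=checkpoints.append,
    execute=lambda ws, cmd: {"exit_code": 0, "stdout": "", "stderr": "", "changed_files": []},
)
print(report["decision"], checkpoints)
```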

Policy Scope¶

The proxy policy layer should stay coarse and obvious.

Useful classifications:

read_only
mutating
destructive
networked
host_escape_risk

Useful outcomes:

allow
allow_with_checkpoint
deny
require_approval
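One way to keep the policy layer coarse is a plain lookup table from classification to outcome. The enum values below are the labels listed above; the mapping itself is an illustrative assumption, since this guide names the labels but does not prescribe which outcome each classification receives. `EXAMPLE_POLICY` and `decide` are hypothetical names.

```python
from enum import Enum


class Classification(Enum):
    READ_ONLY = "read_only"
    MUTATING = "mutating"
    DESTRUCTIVE = "destructive"
    NETWORKED = "networked"
    HOST_ESCAPE_RISK = "host_escape_risk"


class Outcome(Enum):
    ALLOW = "allow"
    ALLOW_WITH_CHECKPOINT = "allow_with_checkpoint"
    DENY = "deny"
    REQUIRE_APPROVAL = "require_approval"


# Illustrative mapping only; a real deployment would choose its own.
EXAMPLE_POLICY = {
    Classification.READ_ONLY: Outcome.ALLOW,
    Classification.MUTATING: Outcome.ALLOW_WITH_CHECKPOINT,
    Classification.DESTRUCTIVE: Outcome.REQUIRE_APPROVAL,
    Classification.NETWORKED: Outcome.REQUIRE_APPROVAL,
    Classification.HOST_ESCAPE_RISK: Outcome.DENY,
}


def decide(classification: Classification, policy: dict = EXAMPLE_POLICY) -> Outcome:
    """Look up the outcome; anything the policy does not mention is denied."""
    return policy.get(classification, Outcome.DENY)


print(decide(Classification.MUTATING).value)
```

Defaulting unmapped classifications to `deny` keeps the table safe to extend: adding a new classification without a policy entry fails closed rather than open.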

Recommended Workspace Pattern¶

For agent tasks, use this pattern:

keep a durable project root in a vault
create a fork for each task
save a checkpoint before risky mutations
run the task in the fork
inspect the resulting changes
keep or discard the fork
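The fork, checkpoint, and run steps of this pattern map onto the CLI invocations shown earlier in this guide. The sketch below only builds the argument lists, so it can be inspected or logged before anything runs. `task_commands` is a hypothetical helper, and the fork naming scheme simply mirrors the `agent-workspace-task-1` example. Inspecting changes and keeping or discarding the fork are left out because this guide does not show the commands for them.

```python
def task_commands(root_vault: str, task_id: str, command: list[str]) -> list[list[str]]:
    """Return the avfs invocations for one task, in the order they should run."""
    if not command:
        raise ValueError("command must contain at least the program name")
    fork = f"{root_vault}-{task_id}"  # e.g. "agent-workspace-task-1"
    return [
        # Create a task-scoped fork of the durable vault.
        ["avfs", "--json", "vault", "fork", root_vault, fork],
        # Save a rollback point inside the fork before any mutation.
        ["avfs", "--json", "--vault", fork, "checkpoint", "save", "before-change"],
        # Run the task command through the proxy; everything after "--" is the command.
        ["avfs", "--json", "--vault", fork, "proxy", "exec", "--", *command],
    ]


for argv in task_commands("agent-workspace", "task-1", ["cargo", "test"]):
    print(" ".join(argv))
```

Passing each list to `subprocess.run` as-is, rather than joining it into a shell string, avoids re-splitting or interpolating the agent's arguments.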

What the Proxy Should Return¶

The top-level execution result should include:

policy decision
workspace or fork used
checkpoint created
mountpoint used
exit code
stdout
stderr
changed-files summary
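As a rough picture of that result, the fields above could be carried in a single record and serialized as one JSON object. The field names and types below are assumptions chosen to match the list; they are not the actual schema, and the sample values are invented for illustration.

```python
import json
from dataclasses import asdict, dataclass, field
from typing import Optional


@dataclass
class ProxyResult:
    policy_decision: str
    workspace: str                      # vault or fork the command ran in
    checkpoint: Optional[str] = None    # None when no checkpoint was taken
    mountpoint: Optional[str] = None    # None when the command was never mounted
    exit_code: Optional[int] = None     # None when the command was denied
    stdout: str = ""
    stderr: str = ""
    changed_files: list = field(default_factory=list)

    def to_json(self) -> str:
        """Serialize the whole record as a single JSON object."""
        return json.dumps(asdict(self), sort_keys=True)


# Invented sample values, including the mountpoint path.
result = ProxyResult(
    policy_decision="allow_with_checkpoint",
    workspace="agent-workspace-task-1",
    checkpoint="before-change",
    mountpoint="/tmp/example-mount",
    exit_code=0,
    changed_files=["Cargo.lock"],
)
print(result.to_json())
```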

Non-Goals¶

The proxy boundary is not supposed to:

trace every child process launched from a script
act as a kernel sandbox
implement syscall-level enforcement

If those requirements appear later, they should be handled below the proxy boundary, not mixed into the first cheap control layer.