# AI Agent Safety Checklist

By John & Dexter / The AI Fixer

A public-safe checklist for putting AI agents near real work. This is not legal advice, security advice, or a private deployment runbook. It is the operator discipline I would want in place before an agent touches anything that matters.

## 1. Name the job

- What exact job should the agent do?
- What is explicitly out of scope?
- Who owns the final decision?
- What evidence proves the job was done?

If the job cannot be written in four boring sentences, it is not ready for autonomy.

## 2. Class the risk

Mark the workflow:

- LOW: read-only research, summaries, drafts.
- MEDIUM: file edits, public content, customer-facing drafts, internal routing.
- HIGH: payments, purchases, live deployment, customer sends, legal/warranty wording, safety-related advice.
- NEVER-AUTO: credentials, destructive deletes, irreversible infrastructure changes, direct physical-world control, anything that could harm a person, customer, vehicle, account or business.

## 3. Reduce the tool set

Before giving the agent tools, remove what it does not need.

- Read-only first.
- Draft before send.
- Propose before execute.
- Specific folders, not the whole machine.
- Specific APIs, not global credentials.
- Time-boxed access, not permanent trust.

## 4. Put gates where risk changes

Add a human approval gate before the agent can:

- send messages to customers or suppliers;
- publish public content;
- restart or deploy production systems;
- buy, bid, refund, invoice or charge;
- alter customer records;
- use sensitive private data;
- make safety-critical recommendations.

## 5. Separate worker and verifier

The same agent should not be the only judge of its own work.

Use at least one of:

- deterministic tests;
- live URL or file verification;
- second-model review;
- human review;
- known-good checklist;
- log/audit comparison.

## 6. Keep a rollback path

Before execution, write down:

- what changed;
- where the old version lives;
- how to undo it;
- who can stop it;
- how you will know the rollback worked.

No rollback, no autonomy.

## 7. Log enough to inspect

A useful log says:

- prompt/task;
- tools used;
- files/routes affected;
- approval decision;
- output/result;
- verification result;
- failure or rollback notes.

Do not log secrets or private customer detail into public/debug channels.

## 8. Public-safe publishing check

Before publishing any agent-produced public material, scan for:

- passwords, tokens, API keys, cookies;
- internal IPs, hostnames, usernames, private paths;
- customer identifiers, phone numbers, registrations, addresses;
- private family details;
- licensed data-source/vendor leakage;
- claims stronger than the evidence.

## 9. The blunt rule

If an agent failure would cost money, reputation, access, safety, legal position, or customer trust, the agent gets a gate.

If the gate feels annoying, make the workflow smaller. Do not remove the gate.
