Prompt Safety Checklist

Doc typePrimary usersSuccess metricArtifacts

Doc type: Checklist + governance

Primary users: Prompt designers, reviewers

Success metric: Consistent prompt reviews

Artifacts: Checklist, review log

0. Why this guide exists

Prompts are a policy surface. This checklist uses Bedrock Guardrails to keep prompts safe, reviewable, and consistent across teams.

Problem

Prompts change without review, causing policy drift.

Outcome

Consistent prompt quality and fewer unsafe outputs.

Goal

Reviewable prompts at scale.

1. Guardrails model (Policy -> Prompt -> Review)

Policy

Leadership layer defines limits.

Prompt

Developer layer implements safe defaults.

Review

Enablement layer validates and logs.

Prompts pass through guardrails and end in an auditable review log.

2. When to use this (governance first)

New prompt templates for production workflows.
High-risk prompts involving customers or sensitive data.
Cross-team prompt reuse.

3. Checklist (isolation and safety)

Does the prompt avoid PII or restricted data?
Is the output expectation explicit and measurable?
Are constraints visible to the model?
Is there a fallback for uncertainty or refusal?
Has this prompt been reviewed and logged?

4. Review protocol (learning before building)

Outcome: Every prompt change is auditable before release.

Assign an owner and reviewer for every prompt.
Run tests with edge cases before approval.
Log approval date and next review date.

Prompt review log with approval status. — Prompt review log.

5. Guardrails alignment (proof of access)

Ensure prompts align with Bedrock Guardrails categories and thresholds.

Prompt constraints aligned with guardrail categories. — Guardrails alignment with prompt constraints.

6. Guardrails and limits (preventing early failures)

Policy mapping

Prompt constraints align to guardrail categories.

Version pinning

Prompt changes require review and release notes.

Logging

All exceptions logged for audit.

7. Common failure modes (what breaks in real orgs)

Unbounded prompts

Constraints are missing or unclear.

No review trail

Prompts cannot be audited after incidents.

Misaligned guardrails

Prompts conflict with policy filters.

8. What "ready" actually means

Review: Prompt owners and reviewers assigned.
Policy: Guardrails aligned to org rules.
Logging: Exceptions recorded and reviewed.
Cadence: Quarterly prompt audits scheduled.

Business impact: Lower risk and higher trust in AI outputs.

Author note

A checklist is a training tool. I write it so a new team member can understand the intent in one pass.