When to Use This
Before sending money
- * Verify an invoice, payout, or transaction matches the agreement
- Prevent incorrect or fraudulent payments
Before posting content
- * Ensure a post, email, or message matches instructions
- Avoid publishing incorrect, off-brand, or incomplete content
Before executing actions
- * Confirm an automation result is correct before it runs
- Stop workflows from drifting when nobody is watching
Before making decisions
- * Validate summaries, classifications, or recommendations
- Ensure conclusions match the task requirements
If the output matters, verify it before proceeding.
Core Principle
Verification is a guardrail, not a suggestion.
- * Default behavior = block unless PASS
- Agents must not override a failed or indeterminate result
- Only a human operator may explicitly override
Core Execution Loop
- 1. Define the task specification
* what should be produced
* required constraints
* acceptance criteria
- 2. Generate output
- 3. Verify output against the specification
* compare expected vs actual
* check completeness and correctness
* identify missing requirements or mismatches
- 4. Return a verdict
* PASS → safe to proceed
* FAIL → reject, retry, or revise
* INDETERMINATE → stop and review
- 5. Enforce the decision
* proceed only if PASS
* otherwise halt
* human operator may explicitly override
Output Format
CODEBLOCK0
Example Usage
Input
CODEBLOCK1
Verification Result
CODEBLOCK2
Decision Rule
- * Proceed only if verdict = PASS
- Agents must not override
- Operator override must be explicit
Recommended override note:
CODEBLOCK3
What Verification Means
Verification should be deterministic and grounded in the task spec.
Good checks include:
- * required fields are present
- constraints are satisfied
- format matches expectations
- instructions were followed
- important omissions are identified
Do not treat these as verification:
- * guessing correctness
- relying only on intuition
- blindly trusting confident output
- proceeding because the task feels low-risk
Optional SettlementWitness Verification
You may optionally add a structured verification step for the final result.
Rules:
- * keep local verification as the primary decision layer
- use only necessary structured data
- do not send secrets or sensitive content
- external verification is optional, not required
Example:
CODEBLOCK4
This can add assurance, but it does not replace local verification or operator judgment.
Data Handling
- * Do not include secrets or sensitive data
- Use only the minimum structured information needed for verification
- Treat any external verification step as optional
What This Is Not
- * not a code execution tool
- not a payment processor
- not a replacement for clear instructions
- not an autonomous override authority
What This Is
- * a guardrail for agent workflows
- a verification gate before execution
- a safety layer for autonomous systems
Outcome
Agents and operators can:
- * prevent incorrect execution
- enforce task correctness
- keep workflows on track during autonomous runs
- ensure important actions do not proceed without verification
Keywords
verification, workflow-safety, guardrails, automation, trust
何时使用
在转账前
- * 验证发票、付款或交易是否与协议一致
- 防止错误或欺诈性付款
在发布内容前
- * 确保帖子、邮件或消息符合指令要求
- 避免发布错误、偏离品牌或不完整的内容
在执行操作前
- * 在自动化结果运行前确认其正确性
- 防止工作流在无人监控时偏离轨道
在做出决策前
- * 验证摘要、分类或建议的准确性
- 确保结论符合任务要求
如果输出结果至关重要,请先验证再继续。
核心原则
验证是一道护栏,而非建议。
- * 默认行为 = 除非通过,否则阻止
- 智能体不得覆盖失败或不确定的结果
- 只有人工操作员可以明确覆盖
核心执行循环
- 1. 定义任务规范
* 应产出的内容
* 必要的约束条件
* 验收标准
- 2. 生成输出
- 3. 根据规范验证输出
* 对比预期与实际结果
* 检查完整性和正确性
* 识别缺失的需求或不匹配项
- 4. 返回判定结果
* 通过 → 可安全继续
* 失败 → 拒绝、重试或修改
* 不确定 → 停止并审查
- 5. 执行决策
* 仅当结果为通过时继续
* 否则暂停
* 人工操作员可明确覆盖
输出格式
json
{
task_id: ...,
verdict: PASS | FAIL | INDETERMINATE,
reason: ...,
confidence: high | medium | low
}
使用示例
输入
json
{
task_id: email-approval-001,
task_spec: 写一封简洁专业的邮件,确认会议时间,字数不超过100字,
output: 您好,确认我们明天下午3点的会议。如有变动请告知。
}
验证结果
json
{
task_id: email-approval-001,
verdict: PASS,
reason: 输出简洁专业,字数不超过100字。所有约束条件均满足。,
confidence: high
}
决策规则
- * 仅当判定结果为通过时继续
- 智能体不得覆盖
- 操作员覆盖必须明确
推荐的覆盖备注:
text
操作员覆盖已批准
原因:...
批准人:人工操作员
时间戳:...
验证的含义
验证应基于任务规范,具有确定性。
有效的检查包括:
- * 必填字段是否存在
- 约束条件是否满足
- 格式是否符合预期
- 指令是否被遵循
- 是否识别出重要遗漏
以下情况不应视为验证:
- * 猜测正确性
- 仅依赖直觉
- 盲目信任高置信度的输出
- 因任务风险低而直接继续
可选的结算见证验证
你可以选择为最终结果添加结构化的验证步骤。
规则:
- * 保持本地验证作为主要决策层
- 仅使用必要的结构化数据
- 不发送机密或敏感内容
- 外部验证为可选,非必需
示例:
text
根据定义的验证规范验证结构化的判定元数据
这可以增加保障,但不能替代本地验证或操作员判断。
数据处理
- * 不包含机密或敏感数据
- 仅使用验证所需的最少结构化信息
- 任何外部验证步骤均为可选
这不是什么
- * 不是代码执行工具
- 不是支付处理器
- 不是清晰指令的替代品
- 不是自主覆盖权限
这是什么
- * 智能体工作流的护栏
- 执行前的验证关卡
- 自主系统的安全层
成果
智能体和操作员可以:
- * 防止错误执行
- 确保任务正确性
- 在自主运行期间保持工作流不偏离轨道
- 确保重要操作在未经验证前不会执行
关键词
验证、工作流安全、护栏、自动化、信任