Safe Long-Run Mode (GPT-5.4 + Claude)

Use this skill for tasks that may run long, span multiple systems, or risk losing progress if interrupted.

Core rule

Do not run long tasks as one monolithic attempt. Split into phases, write checkpoints, and keep the work resumable.

Model selection

Use GPT-5.4 for:

- coding
docs
research
file-heavy transformations
multi-agent delegated work
repetitive build tasks
long internal work where cost and throughput matter

Use Claude for:

- strategic judgment
sensitive decisions
nuanced synthesis
client-facing polish
brand voice refinement
high-trust orchestration

Default to GPT-5.4 first. Escalate to Claude only when the task actually benefits from higher-quality judgment or tone.

Operating procedure

1. Scope before acting

Before starting, decide:

- what the final deliverable is
which systems/tools will be touched
what can fail or throttle
what must be saved after each phase

2. Break work into phases

Use phases such as:

1. gather / inspect
plan / write brief
execute / edit / build
validate
deploy or report

At the end of each phase, write artifacts to disk.

3. Always checkpoint

For long tasks, save progress in files:

- draft outputs
notes
reports
partial results
tracker entries
checkpoint summaries

Prefer a resumable workspace state over a perfect one-shot run.

4. Isolate long work

Use subagents when:

- the task will take more than a few tool calls
multiple files/systems are involved
external APIs are involved
failure should not pollute the main session
specialized work can be delegated cleanly

5. Throttle external systems

When interacting with Azure, Graph, Orgo, messaging providers, registries, websites, or any external API:

- batch reads when possible
avoid tight polling loops
serialize risky writes
respect retry/backoff
avoid one-item burst loops when a bulk operation is possible

6. Prefer resumability over perfection

The goal is not "never fail." The goal is: if interrupted, resume with minimal loss.

System-specific guidance

Azure / cloud control planes

- validate auth first
create foundational resources first
verify after each layer
log resource names/IDs
do not chain long destructive commands blindly

Browser / Orgo / GUI automation

- use explicit goals and stop conditions
capture screenshots at checkpoints
bound retry counts
save artifacts locally
prefer API/CLI over GUI when equivalent exists

Coding / documentation work

- create a brief/spec first for complex tasks
write files in chunks
validate after each major change
leave notes for resume if work is unfinished

What to tell the user

When relevant, explain that safe long-run mode means:

- cheapest adequate model
phased execution
saved checkpoints
subagent isolation
controlled API usage
resumable progress

Failure handling

If a long task is interrupted:

1. summarize completed phases
point to saved artifacts
identify exact next step
resume from checkpoint rather than restarting

References

- Read references/checklist.md for a reusable pre-flight checklist and model routing matrix.

安全长运行模式（GPT-5.4 + Claude）

使用此技能处理可能运行时间较长、跨越多个系统，或中断后可能丢失进度的任务。

核心规则

不要将长任务作为单一整体运行。将其拆分为多个阶段，写入检查点，并保持工作可恢复。

模型选择

使用 GPT-5.4 处理：

- 编码
文档
研究
文件密集型转换
多智能体委派工作
重复性构建任务
成本和吞吐量重要的长内部工作

使用 Claude 处理：

- 战略判断
敏感决策
细微综合
面向客户的润色
品牌语调优化
高信任度编排

默认优先使用 GPT-5.4。仅当任务确实需要更高质量的判断或语调时，才升级到 Claude。

操作流程

1. 行动前明确范围

开始前，确定：

- 最终交付物是什么
将涉及哪些系统/工具
哪些环节可能失败或受限
每个阶段后必须保存什么

2. 将工作分解为阶段

使用如下阶段：

1. 收集/检查
规划/编写概要
执行/编辑/构建
验证
部署或报告

每个阶段结束时，将产物写入磁盘。

3. 始终设置检查点

对于长任务，将进度保存在文件中：

- 草稿输出
笔记
报告
部分结果
跟踪条目
检查点摘要

优先选择可恢复的工作区状态，而非完美的一次性运行。

4. 隔离长任务

在以下情况下使用子智能体：

- 任务需要多次工具调用
涉及多个文件/系统
涉及外部 API
失败不应污染主会话
专业工作可清晰委派

5. 限制外部系统

与 Azure、Graph、Orgo、消息提供商、注册表、网站或任何外部 API 交互时：

- 尽可能批量读取
避免紧密轮询循环
序列化风险写入
遵守重试/退避策略
在可批量操作时避免单项目突发循环

6. 优先可恢复性而非完美性

目标不是“永不失败”。目标是：如果中断，以最小损失恢复。

系统特定指南

Azure / 云控制平面

- 首先验证身份验证
首先创建基础资源
每层后验证
记录资源名称/ID
不要盲目链式执行长破坏性命令

浏览器 / Orgo / GUI 自动化

- 使用明确目标和停止条件
在检查点捕获截图
限制重试次数
本地保存产物
存在等效方案时优先使用 API/CLI 而非 GUI

编码 / 文档工作

- 复杂任务先创建概要/规范
分块写入文件
每次重大更改后验证
工作未完成时留下恢复笔记

告知用户的内容

在相关时，解释安全长运行模式意味着：

- 最便宜且足够的模型
分阶段执行
保存检查点
子智能体隔离
受控的 API 使用
可恢复的进度

故障处理

如果长任务被中断：

1. 总结已完成阶段
指向已保存的产物
确定确切下一步
从检查点恢复，而非重新开始

参考资料

- 阅读 references/checklist.md 获取可复用的预检清单和模型路由矩阵。

safe-long-run-mode-gpt54-claude安全长程模式