Safe Long-Run Mode (GPT-5.4 + Claude)
Use this skill for tasks that may run long, span multiple systems, or risk losing progress if interrupted.
Core rule
Do not run long tasks as one monolithic attempt. Split into phases, write checkpoints, and keep the work resumable.
Model selection
Use GPT-5.4 for:
- - coding
- docs
- research
- file-heavy transformations
- multi-agent delegated work
- repetitive build tasks
- long internal work where cost and throughput matter
Use Claude for:
- - strategic judgment
- sensitive decisions
- nuanced synthesis
- client-facing polish
- brand voice refinement
- high-trust orchestration
Default to GPT-5.4 first. Escalate to Claude only when the task actually benefits from higher-quality judgment or tone.
Operating procedure
1. Scope before acting
Before starting, decide:
- - what the final deliverable is
- which systems/tools will be touched
- what can fail or throttle
- what must be saved after each phase
2. Break work into phases
Use phases such as:
- 1. gather / inspect
- plan / write brief
- execute / edit / build
- validate
- deploy or report
At the end of each phase, write artifacts to disk.
3. Always checkpoint
For long tasks, save progress in files:
- - draft outputs
- notes
- reports
- partial results
- tracker entries
- checkpoint summaries
Prefer a resumable workspace state over a perfect one-shot run.
4. Isolate long work
Use subagents when:
- - the task will take more than a few tool calls
- multiple files/systems are involved
- external APIs are involved
- failure should not pollute the main session
- specialized work can be delegated cleanly
5. Throttle external systems
When interacting with Azure, Graph, Orgo, messaging providers, registries, websites, or any external API:
- - batch reads when possible
- avoid tight polling loops
- serialize risky writes
- respect retry/backoff
- avoid one-item burst loops when a bulk operation is possible
6. Prefer resumability over perfection
The goal is not "never fail." The goal is: if interrupted, resume with minimal loss.
System-specific guidance
Azure / cloud control planes
- - validate auth first
- create foundational resources first
- verify after each layer
- log resource names/IDs
- do not chain long destructive commands blindly
Browser / Orgo / GUI automation
- - use explicit goals and stop conditions
- capture screenshots at checkpoints
- bound retry counts
- save artifacts locally
- prefer API/CLI over GUI when equivalent exists
Coding / documentation work
- - create a brief/spec first for complex tasks
- write files in chunks
- validate after each major change
- leave notes for resume if work is unfinished
What to tell the user
When relevant, explain that safe long-run mode means:
- - cheapest adequate model
- phased execution
- saved checkpoints
- subagent isolation
- controlled API usage
- resumable progress
Failure handling
If a long task is interrupted:
- 1. summarize completed phases
- point to saved artifacts
- identify exact next step
- resume from checkpoint rather than restarting
References
- - Read
references/checklist.md for a reusable pre-flight checklist and model routing matrix.
安全长运行模式(GPT-5.4 + Claude)
使用此技能处理可能运行时间较长、跨越多个系统,或中断后可能丢失进度的任务。
核心规则
不要将长任务作为单一整体运行。将其拆分为多个阶段,写入检查点,并保持工作可恢复。
模型选择
使用 GPT-5.4 处理:
- - 编码
- 文档
- 研究
- 文件密集型转换
- 多智能体委派工作
- 重复性构建任务
- 成本和吞吐量重要的长内部工作
使用 Claude 处理:
- - 战略判断
- 敏感决策
- 细微综合
- 面向客户的润色
- 品牌语调优化
- 高信任度编排
默认优先使用 GPT-5.4。仅当任务确实需要更高质量的判断或语调时,才升级到 Claude。
操作流程
1. 行动前明确范围
开始前,确定:
- - 最终交付物是什么
- 将涉及哪些系统/工具
- 哪些环节可能失败或受限
- 每个阶段后必须保存什么
2. 将工作分解为阶段
使用如下阶段:
- 1. 收集/检查
- 规划/编写概要
- 执行/编辑/构建
- 验证
- 部署或报告
每个阶段结束时,将产物写入磁盘。
3. 始终设置检查点
对于长任务,将进度保存在文件中:
优先选择可恢复的工作区状态,而非完美的一次性运行。
4. 隔离长任务
在以下情况下使用子智能体:
- - 任务需要多次工具调用
- 涉及多个文件/系统
- 涉及外部 API
- 失败不应污染主会话
- 专业工作可清晰委派
5. 限制外部系统
与 Azure、Graph、Orgo、消息提供商、注册表、网站或任何外部 API 交互时:
- - 尽可能批量读取
- 避免紧密轮询循环
- 序列化风险写入
- 遵守重试/退避策略
- 在可批量操作时避免单项目突发循环
6. 优先可恢复性而非完美性
目标不是“永不失败”。目标是:如果中断,以最小损失恢复。
系统特定指南
Azure / 云控制平面
- - 首先验证身份验证
- 首先创建基础资源
- 每层后验证
- 记录资源名称/ID
- 不要盲目链式执行长破坏性命令
浏览器 / Orgo / GUI 自动化
- - 使用明确目标和停止条件
- 在检查点捕获截图
- 限制重试次数
- 本地保存产物
- 存在等效方案时优先使用 API/CLI 而非 GUI
编码 / 文档工作
- - 复杂任务先创建概要/规范
- 分块写入文件
- 每次重大更改后验证
- 工作未完成时留下恢复笔记
告知用户的内容
在相关时,解释安全长运行模式意味着:
- - 最便宜且足够的模型
- 分阶段执行
- 保存检查点
- 子智能体隔离
- 受控的 API 使用
- 可恢复的进度
故障处理
如果长任务被中断:
- 1. 总结已完成阶段
- 指向已保存的产物
- 确定确切下一步
- 从检查点恢复,而非重新开始
参考资料
- - 阅读 references/checklist.md 获取可复用的预检清单和模型路由矩阵。