SOTA Agent

Goal

Turn a vague "beat the benchmark" request into a disciplined campaign:

- fixed target metric and split
explicit literature and leaderboard snapshot
bounded reproduction plan
explicit browser, notebook, or VM execution lane
GUI evidence when notebook or browser state matters
ablations that answer one question at a time
promotion only when the claim survives review

This skill is the frontier-planning and candidate-selection layer.
For browser evidence, VM execution, and promotion artifacts, pair it with
data-science-cv-repro-lab instead of letting the campaign drift into ad hoc runs.

Use This Skill When

- the user wants a CV or DS system pushed toward state-of-the-art results
the task involves reproducing or surpassing recent papers
the workflow needs paper triage, leaderboard tracking, or claim review
the workflow includes OpenClaw, Colab, Kaggle, browser-only notebook actions, or GUI-heavy pages
the user needs experiment management across browser research, notebooks, local runs, and long GPU jobs
the user wants GPU VM or notebook watchdog logic, artifact pulls, or browser evidence for a SOTA candidate
the question is whether a candidate is a real SOTA step or only noise, leakage, or benchmark overfitting

If the campaign includes serious execution or release review, use this skill to choose and rank candidates,
then use data-science-cv-repro-lab as the execution lane.

Quick Start

1. Freeze the claim target before touching recipes.

- Name the task, dataset, metric, split, and target score. - Name the current trusted baseline. - Name the claim threshold for "match", "beat", or "not enough".

2. Initialize the campaign records immediately.

- Use python3 {baseDir}/scripts/init_sota_campaign.py --root <dir> --campaign-id <id> --title <title>. - Use

python3 {baseDir}/scripts/init_sota_leaderboard_snapshot.py --out <json> --task <task> --dataset <dataset> --metric <metric> --split <split>

. - Use python3 {baseDir}/scripts/init_sota_paper_triage.py --out <json> --campaign-id <id> --task <task>. - Use

python3 {baseDir}/scripts/init_sota_program.py --out <json> --campaign-id <id> --task <task> --dataset <dataset> --metric <metric> --split <split>

when you need one machine-readable benchmark, rerun, delegation, and auth plan. - Use

python3 {baseDir}/scripts/init_sota_candidate_card.py --out <json> --candidate-id <id> --campaign-id <id> --objective <goal>

. - If execution review depends on synced QA runs, runtime sweeps, or benchmark panels, store the paired data-science-cv-repro-lab review dashboard path in the program and candidate records before the claim review starts. - If the execution path depends on a real browser or notebook UI, use python3 {baseDir}/scripts/init_sota_browser_run_card.py --out <json> --target-url <url>. - If the browser or notebook surface needs manual or visual QA, use python3 {baseDir}/scripts/init_sota_validation_scorecard.py --out <json> --scorecard-id <id> --surface <surface>. - If a Colab, Kaggle, or notebook export bundle matters, use python3 {baseDir}/scripts/init_sota_artifact_manifest.py --out <json> --bundle-root <dir>. - If a long GPU VM run is involved, use

python3 {baseDir}/scripts/init_sota_vm_bootstrap_manifest.py --out <json> --output-root <run_root> --model-family <name> --command python train.py --epochs 40

3. Separate the campaign roles even if one agent performs all of them.

- Scout: papers, leaderboards, repos, and benchmark rules. - Reproducer: baseline and top-paper reproduction. - Ablator: controlled change sets and compute allocation. - Reviewer: contamination, metric drift, and claim integrity. - Promoter: final claim or hold decision. - Keep the benchmark definition and final claim wording fixed. - Use bounded scouting and review lanes for literature triage, repo inspection, per-paper extraction, and hard-case review. - For repeated audits, batch over a manifest or CSV instead of free-form context accumulation.

4. Pick the execution lane explicitly.

- Browser or GUI lane: OpenClaw, Colab, Kaggle, or another real browser session when notebook UI state matters. - Colab or notebook GPU lane: runtime selection, smoke validation, artifact export, and browser evidence. - GPU VM lane: long runs with heartbeats, watchdogs, sync, and auto-stop policy. - Local lane: cheap falsification, tiny reruns, and artifact review.

5. Keep file writes inside one campaign workspace.

- Create one dedicated campaign root and keep every --out, --bundle-root, and --output-root path under it. - Do not point the bundled scripts at unrelated home-directory or system paths. - Treat scripts/sota_public_safety.py as the canonical public-redaction layer for URLs, refs, and paths.

6. Work the SOTA ladder in order.

- Freeze the benchmark definition and auth rule before using more compute. - Reproduce the trusted baseline first. - Reproduce one relevant reference result or a close public checkpoint. - Build a hypothesis backlog from literature gaps, not vibes. - Run narrow ablations before broad recipe churn. - Stress the best candidate on the fixed review surfaces.

7. Claim only on full-surface wins.

- Fixed benchmark score - Reproduced baseline delta - Compute or cost context - Browser or GUI evidence if that lane mattered - Failure-case review - Exact evidence bundle - Render the final review with python3 {baseDir}/scripts/render_sota_claim_summary.py --candidate-card <json> --out <md>.

Operating Rules

Campaign rules

- One campaign has one target benchmark contract.
Do not let the target metric or split drift midstream.
Keep a short hypothesis backlog and kill low-information ideas quickly.
Record why each experiment exists before running it.

Codex multi-agent rules

- Main thread owns the benchmark contract, stop conditions, and final claim decision.
Subagents should do bounded work only: scout, reproduce, ablate, or review.
Do not let one exploratory thread silently rewrite the campaign contract.
For repeated claim checks or literature extraction, prefer manifest-driven fanout over conversational drift.

Literature rules

- Read only the papers or repos that change the candidate plan.
Extract the minimum useful fields: task, metric, split, data, compute, architecture, augmentations, training tricks, and caveats.
Prefer a reproduced strong baseline over copying five tricks from five papers without control.
Do not treat leaderboard rows as ground truth without checking task definition and split rules.

Ablation rules

- Change one meaningful variable at a time when the goal is causal understanding.
If several knobs move together, label the run as a package change, not an ablation.
Keep one canonical baseline recipe alive for comparison.
Require the first winning candidate to survive at least one rerun or adjacent-seed check before escalating the claim.

Compute rules

- Spend cheap compute on reproduction and short falsification first.
Do not push a long run unless the hypothesis would matter if it wins.
Record training cost, wall time, and hardware for every serious candidate.
Cut branches that cannot plausibly clear the target with the remaining budget.

OAuth and auth rules

- Use ChatGPT or Codex OAuth-backed sessions as the default and preferred path.
Prefer Codex multi-agent or app-server workflows over orchestrators that require paid API keys.
Do not require or recommend OPENAI_API_KEY, other vendor API keys, or paid inference APIs as the default campaign runtime path.
If a third-party framework only works through paid API keys, treat it as reference material unless it can run fully through local tools and OAuth-backed Codex sessions.

OpenClaw browser rules

- Use OpenClaw for public papers, leaderboards, docs, notebook-only steps, and GUI-heavy flows when the browser lane adds evidence.
Prefer direct public URLs over uploads or private sessions.
Capture leaderboard, notebook, or GUI evidence as notes, screenshots, and exact URLs when they are part of the claim path.
Fail hard on dead browser attach, missing notebook readiness, or unavailable requested model or runtime mode.
Treat screenshots and GUI evidence as supporting artifacts, not the claim itself.
Do not use browser-only summaries as the claim itself; claims still require benchmark artifacts.

Colab and notebook GPU rules

- Select the accelerator explicitly before running expensive cells.
Run a smoke cell that proves imports, runtime, data mounts, and export paths all work.
Keep one stable export root and pull the artifact manifest plus at least one preview back locally.
Add the browser run card and validation scorecard when the notebook GUI is part of the evaluation story.

GPU VM rules

- Create a named run root before launch.
Write a machine-readable VM bootstrap manifest before long runs.
Run long jobs under a heartbeat, session, or supervisor so liveness is explicit.
Sync metrics, summaries, and checkpoints back to a trusted store on a schedule.
Do not promote directly from live VM state; promote from synced artifacts and review evidence.

Claim safety rules

- No SOTA claim without a fixed metric, split, and baseline.
No SOTA claim on a contaminated benchmark or hidden train-on-test path.
If the execution story depends on a dashboard or synced review surface, keep the dashboard path, source audit, and leakage audit in the claim packet.
If a candidate wins only on one slice while regressing important surfaces, hold it.
Report uncertainty honestly: "best internal result so far" is not the same as "new SOTA".
Small deltas need rerun or adjacent-seed support before they become claim language.

References

Read only the reference that matches the task:

- INLINECODE18

- Full campaign structure, role separation, and stop conditions.

- INLINECODE19

- Rules for queues, stage discipline, ablations, and promotion gating.

- INLINECODE20

- What to reuse from Codex subagents, harness engineering, OpenEvolve, Symphony, Paperclip, and OptiLLM under an OAuth-only campaign rule.

- INLINECODE21

- How to avoid contamination, metric drift, and invalid comparisons.

- INLINECODE22

- How to filter papers and extract only decision-relevant details.

- INLINECODE23

- How to use OpenClaw productively for public literature and leaderboard work.

- INLINECODE24

- How to run GUI-heavy notebook, browser, and screenshot-based execution safely.

- INLINECODE25

- How to manage Colab, Kaggle, and GPU VM execution lanes with smoke tests and artifact discipline.

- INLINECODE26

- Review rules for whether a candidate deserves a SOTA claim at all.

- INLINECODE27

- Publication review rules for secrets, private refs, and raw notebook paths.

Bundled Scripts

- INLINECODE28

- Pure local helpers for path, URL, ref, env, and command redaction. No network I/O or subprocess execution.

- INLINECODE29

- Create a reusable campaign folder with benchmark, program, agent, research, leaderboard, plan, ablation, evidence, and claim files.

- INLINECODE30

- Create a machine-readable program record with the fixed benchmark, baselines, rerun policy, bounded subagent roles, and OAuth rules.

- INLINECODE31

- Create a machine-readable snapshot of the target benchmark contract and current reference scores.

- INLINECODE32

- Create a machine-readable literature queue for paper screening and extraction.

- INLINECODE33

- Create a sanitized browser evidence record for OpenClaw, Colab, Kaggle, or other notebook UI runs.

- INLINECODE34

- Create a machine-readable GUI or notebook validation scorecard when visible state matters to the campaign.

- INLINECODE35

- Create a machine-readable export-bundle manifest for notebook or VM artifact pulls with redacted public path metadata.

- INLINECODE36

- Create a machine-readable card for a serious candidate, its execution lane, auth mode, and claim state.

- INLINECODE37

- Create a machine-readable candidate record with change set, risks, and redacted public artifact refs.

- INLINECODE38

- Create a focused ablation queue for one candidate family.

- INLINECODE39

- Create a machine-readable bootstrap manifest for long GPU VM or cluster runs with public-release redaction.

- INLINECODE40

- Refresh a ranked scoreboard for a fixed metric and goal direction.

- INLINECODE41

- Join the core artifacts for a promotion, hold, or cut decision.

- INLINECODE42

- Render a concise markdown review from the machine-readable candidate card.

- INLINECODE43

- Render a concise markdown summary from the program, candidate, scoreboard, and review packet.

SOTA 智能体

目标

将模糊的超越基准需求转化为规范化的行动方案：

- 固定的目标指标与数据划分
明确的文献与排行榜快照
有边界的复现计划
明确的浏览器、笔记本或虚拟机执行通道
当笔记本或浏览器状态重要时提供图形界面证据
每次只回答一个问题的消融实验
仅当结论经得起审查时才进行晋升

该技能是前沿规划与候选方案筛选层。
对于浏览器证据、虚拟机执行和晋升产物，请配合使用
data-science-cv-repro-lab，而非让行动方案陷入临时性运行。

何时使用该技能

- 用户希望将CV或DS系统推向最先进水平
任务涉及复现或超越近期论文
工作流程需要论文筛选、排行榜追踪或结论审查
工作流程包含OpenClaw、Colab、Kaggle、仅浏览器笔记本操作或重度图形界面页面
用户需要跨浏览器研究、笔记本、本地运行和长时间GPU作业的实验管理
用户需要GPU虚拟机或笔记本看门狗逻辑、产物拉取或SOTA候选方案的浏览器证据
需要判断候选方案是真正的SOTA进步，还是仅仅是噪声、数据泄露或基准过拟合

如果行动方案包含严肃的执行或发布审查，请使用该技能选择和排序候选方案，
然后使用data-science-cv-repro-lab作为执行通道。

快速开始

1. 在接触任何方案之前，先冻结结论目标。

- 明确任务、数据集、指标、数据划分和目标分数。 - 明确当前可信的基线。 - 明确匹配、超越或不足的结论阈值。

2. 立即初始化行动记录。

- 使用 python3 {baseDir}/scripts/initsotacampaign.py --root --campaign-id --title 。 - 使用 python3 {baseDir}/scripts/init<em>sota</em>leaderboard_snapshot.py --out <json> --task <task> --dataset <dataset> --metric <metric> --split <split>。 - 使用 python3 {baseDir}/scripts/init<em>sota</em>paper_triage.py --out <json> --campaign-id <id> --task <task>。 - 当需要机器可读的基准、重跑、委派和认证计划时，使用 python3 {baseDir}/scripts/init<em>sota</em>program.py --out <json> --campaign-id <id> --task <task> --dataset <dataset> --metric <metric> --split <split>。 - 使用 python3 {baseDir}/scripts/init<em>sota</em>candidate_card.py --out <json> --candidate-id <id> --campaign-id <id> --objective <goal>。 - 如果执行审查依赖于同步的QA运行、运行时扫描或基准面板，在结论审查开始前，将配对的data-science-cv-repro-lab审查仪表板路径存储在程序和候选记录中。 - 如果执行路径依赖于真实的浏览器或笔记本界面，使用 python3 {baseDir}/scripts/init<em>sota</em>browser<em>run</em>card.py --out <json> --target-url <url>。 - 如果浏览器或笔记本界面需要手动或可视化QA，使用 python3 {baseDir}/scripts/init<em>sota</em>validation_scorecard.py --out <json> --scorecard-id <id> --surface <surface>。 - 如果Colab、Kaggle或笔记本导出包很重要，使用 python3 {baseDir}/scripts/init<em>sota</em>artifact_manifest.py --out <json> --bundle-root <dir>。 - 如果涉及长时间GPU虚拟机运行，使用 python3 {baseDir}/scripts/init<em>sota</em>vm<em>bootstrap</em>manifest.py --out <json> --output-root <run_root> --model-family <name> --command python train.py --epochs 40。 <ol><li>3. 分离行动角色，即使一个智能体执行所有角色。</li></ol> - 侦察者：论文、排行榜、代码仓库和基准规则。 - 复现者：基线和顶级论文复现。 - 消融者：受控变更集和计算资源分配。 - 审查者：污染、指标漂移和结论完整性。 - 晋升者：最终结论或搁置决策。 - 保持基准定义和最终结论措辞固定。 - 使用有边界的侦察和审查通道进行文献筛选、仓库检查、逐篇论文提取和困难案例审查。 - 对于重复审计，基于清单或CSV进行批量处理，而非自由形式的上下文积累。 <ol><li>4. 明确选择执行通道。</li></ol> - 浏览器或图形界面通道：当笔记本界面状态重要时，使用OpenClaw、Colab、Kaggle或其他真实浏览器会话。 - Colab或笔记本GPU通道：运行时选择、冒烟验证、产物导出和浏览器证据。 - GPU虚拟机通道：长时间运行，带心跳、看门狗、同步和自动停止策略。 - 本地通道：低成本证伪、小型重跑和产物审查。 <ol><li>5. 将文件写入保持在单一行动工作空间内。</li></ol> - 创建一个专用的行动根目录，并将所有--out、--bundle-root和--output-root路径保持在其下。 - 不要将捆绑脚本指向无关的家目录或系统路径。 - 将scripts/sota<em>public</em>safety.py视为URL、引用和路径的规范公共编辑层。 <ol><li>6. 按顺序攀登SOTA阶梯。</li></ol> - 在使用更多计算资源前，先冻结基准定义和认证规则。 - 首先复现可信基线。 - 复现一个相关的参考结果或接近的公开检查点。 - 基于文献空白构建假设积压，而非凭感觉。 - 在广泛调整方案之前，先运行窄范围的消融实验。 - 在固定的审查面上对最佳候选方案进行压力测试。 <ol><li>7. 仅在全面获胜时提出结论。</li></ol> - 固定的基准分数 - 复现的基线增量 - 计算或成本上下文 - 浏览器或图形界面证据（如果该通道重要） - 失败案例审查 - 精确的证据包 - 使用 python3 {baseDir}/scripts/render<em>sota</em>claim_summary.py --candidate-card <json> --out <md> 渲染最终审查。 <h2>操作规则</h2> <h3>行动规则</h3> <ul><li>- 一个行动只有一个目标基准合同。</li><li>不允许目标指标或数据划分中途漂移。</li><li>保持简短假设积压，快速淘汰低信息量的想法。</li><li>在运行每个实验前记录其存在的原因。</li></ul> <h3>Codex多智能体规则</h3> <ul><li>- 主线程拥有基准合同、停止条件和最终结论决策权。</li><li>子智能体仅执行有边界的工作：侦察、复现、消融或审查。</li><li>不允许一个探索性线程悄悄重写行动合同。</li><li>对于重复的结论检查或文献提取，优先使用清单驱动的扇出而非对话式漂移。</li></ul> <h3>文献规则</h3> <ul><li>- 只阅读那些会改变候选方案的论文或代码仓库。</li><li>提取最小有用字段：任务、指标、数据划分、数据、计算资源、架构、数据增强、训练技巧和注意事项。</li><li>优先复现一个强大的基线，而非不加控制地从五篇论文中复制五个技巧。</li><li>不检查任务定义和划分规则，不要将排行榜行视为地面真相。</li></ul> <h3>消融规则</h3> <ul><li>- 当目标是因果理解时，每次只改变一个有意义的变量。</li><li>如果多个旋钮同时变动，将该运行标记为包变更，而非消融。</li><li>保持一个规范的基线方案存活以用于比较。</li><li>要求第一个获胜候选方案在升级结论前至少通过一次重跑或相邻种子检查。</li></ul> <h3>计算规则</h3> <ul><li>- 首先将低成本计算用于复现和短期证伪。</li><li>除非假设获胜后会很重要，否则不要启动长时间运行。</li><li>为每个严肃候选方案记录训练成本、挂钟时间和硬件信息。</li><li>切断那些在剩余预算内无法合理达到目标的实验分支。</li></ul> <h3>OAuth与认证规则</h3> <ul><li>- 使用ChatGPT或Codex OAuth支持的会话作为默认和首选路径。</li><li>优先使用Codex多智能体或应用服务器工作流，而非需要付费API密钥的编排器。</li><li>不要求或推荐将OPENAI<em>API</em>KEY、其他供应商API密钥或付费推理API作为默认行动运行时路径。</li><li>如果第三方框架仅通过付费API密钥工作，将其视为参考资料，除非它能完全通过本地工具和OAuth支持的Codex会话运行。</li></ul> <h3>OpenClaw浏览器规则</h3> <ul><li>- 当浏览器通道能增加证据时，使用OpenClaw处理公开论文、排行榜、文档、仅笔记本步骤和重度图形界面流程。</li><li>优先使用直接公开URL而非上传或私有会话。</li><li>当排行榜、笔记本或图形界面证据是结论路径的一部分时，将其捕获为笔记、截图和精确URL。</li><li>在浏览器附加失败、笔记本就绪状态缺失或请求的模型/运行时模式不可用时，果断失败。</li><li>将截图和图形界面证据视为支持性产物，而非结论本身。</li></ul></div> </div> <div class="tags"> <h4>标签</h4> <span class="tag">skill</span> <span class="tag">ai</span> </div> </div> </div> <div class="tab-content" id="tab-install"> <div class="install-panel"> <h3>通过对话安装</h3> <p>该技能支持在以下平台通过对话安装：</p> <div class="platform-list"> <span class="platform-tag">OpenClaw</span> <span class="platform-tag">WorkBuddy</span> <span class="platform-tag">QClaw</span> <span class="platform-tag">Kimi</span> <span class="platform-tag">Claude</span> </div> <div class="install-method"> <h4>方式一：安装 SkillHub 和技能</h4> <div class="code-block"> <button class="copy-btn" onclick="copyToClipboard('帮我安装 SkillHub 和 sota-agent-1776112883 技能', this)">复制</button> <code>帮我安装 SkillHub 和 sota-agent-1776112883 技能</code> </div> </div> <div class="install-method"> <h4>方式二：设置 SkillHub 为优先技能安装源</h4> <div class="code-block"> <button class="copy-btn" onclick="copyToClipboard('设置 SkillHub 为我的优先技能安装源，然后帮我安装 sota-agent-1776112883 技能', this)">复制</button> <code>设置 SkillHub 为我的优先技能安装源，然后帮我安装 sota-agent-1776112883 技能</code> </div> </div> <h3 style="margin-top: 30px;">通过命令行安装</h3> <div class="install-method"> <div class="code-block"> <button class="copy-btn" onclick="copyToClipboard('skillhub install sota-agent-1776112883', this)">复制</button> <code>skillhub install sota-agent-1776112883</code> </div> </div> <h3 style="margin-top: 30px;">下载</h3> <a href="plugin.php?id=hl_skillhub&mod=download&version_id=22514&token=5f7a81291c7e382891956c89f900f7ac" class="download-btn"> ⬇ 下载 sota-agent v1.4.1（免费） </a> <p class="download-info"> 文件大小: 37.66 KB | 发布时间: 2026-4-17 16:12 </p> </div> </div> <div class="tab-content" id="tab-versions"> <div class="version-list"> <div class="version-item"> <div class="version-header"> <span class="version-number">v1.4.1</span> <span class="latest-tag">最新</span> <span class="version-date">2026-4-17 16:12</span> </div> <div class="changelog"> Polish public wording in published skill docs. </div> </div> </div> </div>  <div class="related-skills"> <h3>相关推荐</h3> <div class="skill-list-related"> <div class="skillhub-card-related"> <a href="plugin.php?id=hl_skillhub&mod=detail&slug=self-improving-agent-1776396682"> <div class="skill-icon"> <div class="default-icon">s</div> </div> <h3 class="skill-name">self-improvement</h3> <p class="skill-desc">Captures learnings, errors, and corrections to enable continuous improvement. Use when: (1) A command or operation fails unexpectedly, (2) User corrects Claude ('No, that's wrong...', 'Actually...'), (3) User requests a capability that doesn't exist, (4) An external API or tool fails, (5) Claude realizes its knowledge is outdated or incorrect, (6) A better approach is discovered for a recurring task. Also review learnings before major tasks.</p> <div class="skill-meta"> <div class="stats"> <span>⭐ 3209</span> <span>⬇ 392962</span> </div> <span class="skill-source">AI智能</span> </div> </a> </div> <div class="skillhub-card-related"> <a href="plugin.php?id=hl_skillhub&mod=detail&slug=self-improving-agent-1776382578"> <div class="skill-icon"> <div class="default-icon">s</div> </div> <h3 class="skill-name">self-improvement</h3> <p class="skill-desc">Captures learnings, errors, and corrections to enable continuous improvement. Use when: (1) A command or operation fails unexpectedly, (2) User corrects Claude ('No, that's wrong...', 'Actually...'), (3) User requests a capability that doesn't exist, (4) An external API or tool fails, (5) Claude realizes its knowledge is outdated or incorrect, (6) A better approach is discovered for a recurring task. Also review learnings before major tasks.</p> <div class="skill-meta"> <div class="stats"> <span>⭐ 3209</span> <span>⬇ 392961</span> </div> <span class="skill-source">AI智能</span> </div> </a> </div> <div class="skillhub-card-related"> <a href="plugin.php?id=hl_skillhub&mod=detail&slug=self-improving-agent-1776282374"> <div class="skill-icon"> <div class="default-icon">s</div> </div> <h3 class="skill-name">self-improvement</h3> <p class="skill-desc">Captures learnings, errors, and corrections to enable continuous improvement. Use when: (1) A command or operation fails unexpectedly, (2) User corrects Claude ('No, that's wrong...', 'Actually...'), (3) User requests a capability that doesn't exist, (4) An external API or tool fails, (5) Claude realizes its knowledge is outdated or incorrect, (6) A better approach is discovered for a recurring task. Also review learnings before major tasks.</p> <div class="skill-meta"> <div class="stats"> <span>⭐ 3195</span> <span>⬇ 389594</span> </div> <span class="skill-source">AI智能</span> </div> </a> </div> <div class="skillhub-card-related"> <a href="plugin.php?id=hl_skillhub&mod=detail&slug=self-improving-agent-1776190579"> <div class="skill-icon"> <div class="default-icon">s</div> </div> <h3 class="skill-name">self-improvement</h3> <p class="skill-desc">Captures learnings, errors, and corrections to enable continuous improvement. Use when: (1) A command or operation fails unexpectedly, (2) User corrects Claude ('No, that's wrong...', 'Actually...'), (3) User requests a capability that doesn't exist, (4) An external API or tool fails, (5) Claude realizes its knowledge is outdated or incorrect, (6) A better approach is discovered for a recurring task. Also review learnings before major tasks.</p> <div class="skill-meta"> <div class="stats"> <span>⭐ 3177</span> <span>⬇ 386649</span> </div> <span class="skill-source">AI智能</span> </div> </a> </div> </div> </div> </div> </div> <style> .skill-detail { padding: 20px 0; min-height: 100vh; } .breadcrumb { margin-bottom: 20px; color: #666; } .breadcrumb a { color: #155BD5; } .breadcrumb span { margin: 0 5px; } .skill-detail-header { display: flex; gap: 24px; padding: 30px; background: #fff; border-radius: 12px; margin-bottom: 20px; border: 1px solid #e8e8e8; } .skill-icon-large { width: 120px; height: 120px; flex-shrink: 0; } .skill-icon-large img { width: 100%; height: 100%; border-radius: 20px; object-fit: cover; } .skill-icon-large .default-icon-large { width: 100%; height: 100%; border-radius: 20px; background: #155BD5; color: #fff; display: flex; align-items: center; justify-content: center; font-size: 48px; font-weight: bold; } .skill-info { flex: 1; } .skill-header-action { flex-shrink: 0; display: flex; align-items: center; padding-left: 20px; } .header-download-btn { display: inline-flex; align-items: center; gap: 8px; padding: 12px 24px; background: #155BD5; color: #fff; border-radius: 8px; text-decoration: none; font-size: 14px; font-weight: 500; transition: all 0.2s ease; white-space: nowrap; } .header-download-btn:hover { background: #0d4bb5; transform: translateY(-1px); box-shadow: 0 4px 12px rgba(21, 91, 213, 0.3); } .header-download-btn.buy { background: #f97316; } .header-download-btn.buy:hover { background: #ea580c; box-shadow: 0 4px 12px rgba(249, 115, 22, 0.3); } .skill-info h1 { font-size: 36px; font-weight: 700; color: #1e293b; margin-bottom: 10px; display: flex; align-items: center; flex-wrap: wrap; gap: 12px; } .skill-info h1 .name-cn { font-size: 20px; color: #666; font-weight: 500; background: #f1f5f9; padding: 4px 14px; border-radius: 20px; white-space: nowrap; } .skill-info .skill-desc { font-size: 16px; color: #666; margin-bottom: 8px; } .skill-info .skill-scenarios { font-size: 14px; color: #999; margin-bottom: 10px; } .skill-info .skill-author { font-size: 13px; color: #999; } /* 统计面板 */ .stats-panel { display: flex; align-items: center; background: #fff; border-radius: 12px; padding: 24px 0; margin-bottom: 24px; border: 1px solid #e8e8e8; box-shadow: 0 1px 3px rgba(0,0,0,0.05); } .stats-panel .stats-item { flex: 1; text-align: center; padding: 0 16px; } .stats-panel .stats-divider { width: 1px; height: 48px; background: linear-gradient(180deg, transparent, #e8e8e8, transparent); } .stats-panel .stats-icon { width: 40px; height: 40px; margin: 0 auto 8px; border-radius: 50%; display: flex; align-items: center; justify-content: center; font-size: 18px; background: #f1f5f9; color: #64748b; } .stats-panel .stats-icon.source-icon { background: #e0e7ff; color: #6366f1; } .stats-panel .stats-icon.version-icon { background: #fef3c7; color: #d97706; } .stats-panel .stats-icon.security-icon { background: #fee2e2; color: #9ca3af; } .stats-panel .stats-icon.security-icon.verified { background: #d1fae5; color: #10b981; } .stats-panel .stats-value { font-size: 15px; font-weight: 600; color: #374151; margin-bottom: 4px; } .stats-panel .stats-number { font-size: 24px; font-weight: 700; color: #111827; margin-bottom: 4px; line-height: 1.2; } .stats-panel .stats-label { font-size: 13px; color: #9ca3af; } .stats-panel .stats-item.highlight .stats-label { color: #6b7280; } .skill-tabs { display: flex; gap: 40px; border-bottom: 1px solid #e8e8e8; margin-bottom: 20px; } .skill-tabs .tab-item { padding: 15px 0; color: #666; cursor: pointer; border-bottom: 2px solid transparent; font-size: 16px; } .skill-tabs .tab-item.active { color: #155BD5; border-bottom-color: #155BD5; } .tab-content { background: #fff; border-radius: 12px; padding: 30px; margin-bottom: 20px; border: 1px solid #e8e8e8; opacity: 0; transform: translateY(10px); transition: all 0.3s ease; display: none; } .tab-content.active { opacity: 1; transform: translateY(0); display: block; } .lang-switcher { display: flex; gap: 8px; margin-bottom: 16px; justify-content: flex-end; } .lang-switcher .lang-btn { padding: 4px 16px; border: 1px solid #d1d5db; background: #fff; border-radius: 20px; font-size: 13px; color: #6b7280; cursor: pointer; transition: all 0.2s; } .lang-switcher .lang-btn:hover { border-color: #155BD5; color: #155BD5; } .lang-switcher .lang-btn.active { background: #155BD5; color: #fff; border-color: #155BD5; } .overview-content h3 { margin-bottom: 15px; } /* Markdown 预览样式 */ .markdown-body { line-height: 1.8; color: #333; margin-bottom: 20px; } .markdown-body h1, .markdown-body h2, .markdown-body h3, .markdown-body h4, .markdown-body h5, .markdown-body h6 { margin-top: 24px; margin-bottom: 16px; font-weight: 600; line-height: 1.25; color: #1e293b; } .markdown-body h1 { font-size: 2em; border-bottom: 1px solid #e8e8e8; padding-bottom: 8px; } .markdown-body h2 { font-size: 1.5em; border-bottom: 1px solid #e8e8e8; padding-bottom: 8px; } .markdown-body h3 { font-size: 1.25em; } .markdown-body p { margin-bottom: 16px; } .markdown-body ul, .markdown-body ol { padding-left: 2em; margin-bottom: 16px; } .markdown-body li { margin-bottom: 4px; } .markdown-body code { padding: 2px 6px; background: #f1f5f9; border-radius: 4px; font-family: 'Consolas', monospace; font-size: 0.9em; color: #155BD5; } .markdown-body pre { padding: 16px; background: #1e1e1e; border-radius: 8px; overflow-x: auto; margin-bottom: 16px; } .markdown-body pre code { background: transparent; color: #d4d4d4; padding: 0; } .markdown-body blockquote { padding: 0 1em; border-left: 4px solid #155BD5; color: #666; margin-bottom: 16px; } .markdown-body table { width: 100%; border-collapse: collapse; margin-bottom: 16px; } .markdown-body th, .markdown-body td { padding: 8px 12px; border: 1px solid #e8e8e8; } .markdown-body th { background: #f8f9fa; font-weight: 600; } .markdown-body a { color: #155BD5; text-decoration: none; } .markdown-body a:hover { text-decoration: underline; } .markdown-body img { max-width: 100%; border-radius: 8px; } .overview-content .tags { margin-top: 20px; } .overview-content .tags h4 { margin-bottom: 10px; } .overview-content .tag { display: inline-block; padding: 4px 12px; background: #f0f0f0; border-radius: 4px; margin-right: 8px; margin-bottom: 8px; font-size: 13px; } .install-panel h3 { margin-bottom: 15px; } .platform-list { margin-bottom: 20px; } .platform-tag { display: inline-block; padding: 4px 12px; background: #e3f2fd; color: #1976d2; border-radius: 4px; margin-right: 8px; font-size: 13px; } .install-method { margin-bottom: 25px; } .install-method h4 { margin-bottom: 10px; font-size: 14px; color: #666; } .code-block { position: relative; background: #1e1e1e; border-radius: 8px; padding: 16px 60px 16px 16px; } .code-block code { color: #d4d4d4; font-family: 'Consolas', monospace; font-size: 14px; } .copy-btn { position: absolute; right: 12px; top: 50%; transform: translateY(-50%); padding: 4px 12px; background: #333; border: none; border-radius: 4px; color: #fff; cursor: pointer; font-size: 12px; } .copy-btn:hover { background: #444; } .copy-btn.copied { background: #10b981 !important; animation: copyPulse 0.3s ease; } @keyframes copyPulse { 0%, 100% { transform: translateY(-50%) scale(1); } 50% { transform: translateY(-50%) scale(1.1); } } .download-btn { display: inline-block; padding: 12px 24px; background: #155BD5; color: #fff; border-radius: 8px; text-decoration: none; font-size: 14px; } .download-btn:hover { background: #0d4bb5; } .download-info { margin-top: 10px; font-size: 13px; color: #999; } .version-list .version-item { padding: 20px 0; border-bottom: 1px solid #e8e8e8; } .version-list .version-item:last-child { border-bottom: none; } .version-header { display: flex; align-items: center; gap: 10px; margin-bottom: 10px; } .version-header .version-number { font-size: 18px; font-weight: 600; } .version-header .latest-tag { padding: 2px 8px; background: #e8f5e9; color: #4caf50; border-radius: 4px; font-size: 12px; } .version-header .version-date { margin-left: auto; color: #999; font-size: 13px; } .changelog { color: #666; line-height: 1.6; } /* 相关推荐 */ .related-skills { margin-top: 40px; padding: 24px; background: #fff; border-radius: 12px; border: 1px solid #e8e8e8; } .related-skills h3 { margin-bottom: 20px; font-size: 18px; font-weight: 600; color: #1e293b; display: flex; align-items: center; gap: 8px; } .related-skills h3:before { content: ''; width: 4px; height: 20px; background: #155BD5; border-radius: 2px; } .skill-list-related { display: grid; grid-template-columns: repeat(auto-fill, minmax(240px, 1fr)); gap: 20px; } .skillhub-card-related { border-radius: 16px; padding: 20px; transition: all 0.3s ease; background: rgba(255, 255, 255, 0.95); backdrop-filter: blur(10px); border: 1px solid rgba(0, 0, 0, 0.08); box-shadow: 0 1px 3px rgba(0,0,0,0.05), 0 4px 12px rgba(0,0,0,0.03); } .skillhub-card-related:hover { box-shadow: 0 8px 24px rgba(21, 91, 213, 0.15); transform: translateY(-4px); } .skillhub-card-related:active { transform: scale(0.98); } .skillhub-card-related a { color: inherit; text-decoration: none; } .skillhub-card-related .skill-icon { width: 56px; height: 56px; margin-bottom: 12px; } .skillhub-card-related .skill-icon .default-icon { width: 100%; height: 100%; border-radius: 12px; background: #155BD5; color: #fff; display: flex; align-items: center; justify-content: center; font-size: 24px; font-weight: bold; } .skillhub-card-related .skill-name { font-size: 16px; font-weight: 600; margin-bottom: 8px; color: #333; } .skillhub-card-related .skill-desc { font-size: 13px; color: #666; line-height: 1.5; overflow: hidden; text-overflow: ellipsis; display: -webkit-box; -webkit-line-clamp: 2; -webkit-box-orient: vertical; margin-bottom: 12px; min-height: 39px; } .skillhub-card-related .skill-meta { display: flex; justify-content: space-between; align-items: center; font-size: 12px; color: #999; } .skillhub-card-related .skill-meta .stats { display: flex; gap: 12px; } .skillhub-card-related .skill-source { background: #f0f0f0; padding: 2px 10px; border-radius: 12px; font-size: 12px; } @media screen and (max-width: 768px) { .skill-detail-header { flex-direction: column; text-align: center; padding: 24px 20px; } .skill-icon-large { width: 80px; height: 80px; } .skill-icon-large .default-icon-large { font-size: 32px; } .skill-info h1 { font-size: 24px; } .skill-info h1 .name-cn { font-size: 14px; padding: 2px 10px; margin-left: 0; margin-top: 6px; } .skill-header-action { padding-left: 0; padding-top: 16px; width: 100%; justify-content: center; } .header-download-btn { width: 100%; justify-content: center; max-width: 280px; } .stats-panel { flex-wrap: wrap; padding: 16px 0; } .stats-panel .stats-item { flex: 1 1 30%; min-width: 80px; padding: 12px 8px; } .stats-panel .stats-divider { width: 1px; height: 36px; } .stats-panel .stats-icon { width: 32px; height: 32px; font-size: 14px; } .stats-panel .stats-number { font-size: 18px; } .stats-panel .stats-value { font-size: 13px; } .stats-panel .stats-label { font-size: 11px; } .skill-tabs { overflow-x: auto; gap: 24px; -webkit-overflow-scrolling: touch; } .skill-tabs .tab-item { white-space: nowrap; } .code-block { padding: 16px 50px 16px 16px; } .code-block code { font-size: 12px; word-break: break-all; } } /* Paid skill notice */ .paid-notice { text-align: center; padding: 40px 24px; background: linear-gradient(135deg, #fff7ed, #ffedd5); border-radius: 12px; border: 1px solid #fed7aa; } .paid-notice-icon { width: 56px; height: 56px; margin: 0 auto 16px; background: #fff; border-radius: 50%; display: flex; align-items: center; justify-content: center; font-size: 28px; color: #f97316; box-shadow: 0 2px 8px rgba(249,115,22,0.15); } .paid-notice h3 { font-size: 20px; color: #9a3412; margin-bottom: 8px; } .paid-notice p { font-size: 14px; color: #c2410c; margin: 4px 0; } .paid-buy-box { padding: 20px; background: #fff7ed; border-radius: 8px; border: 1px solid #fed7aa; margin-bottom: 12px; } .paid-buy-box p { color: #7c2d12; font-size: 15px; margin: 0; }</style> <script> // 复制功能 function copyToClipboard(text, btn) { if (navigator.clipboard && navigator.clipboard.writeText) { navigator.clipboard.writeText(text).then(function() { showCopied(btn); }).catch(function() { fallbackCopy(text, btn); }); } else { fallbackCopy(text, btn); } } function fallbackCopy(text, btn) { var textarea = document.createElement('textarea'); textarea.value = text; textarea.style.position = 'fixed'; textarea.style.opacity = '0'; document.body.appendChild(textarea); textarea.select(); try { document.execCommand('copy'); showCopied(btn); } catch (err) { btn.textContent = '失败'; } document.body.removeChild(textarea); } function showCopied(btn) { var originalText = btn.textContent; btn.textContent = '已复制!'; btn.classList.add('copied'); setTimeout(function() { btn.textContent = originalText; btn.classList.remove('copied'); }, 2000); } // Tab 切换 document.addEventListener('DOMContentLoaded', function() { var tabItems = document.querySelectorAll('.skill-tabs .tab-item'); var tabContents = document.querySelectorAll('.tab-content'); tabItems.forEach(function(item) { item.addEventListener('click', function() { var target = this.getAttribute('data-tab'); var targetContent = document.getElementById('tab-' + target); // 移除所有 active tabItems.forEach(function(t) { t.classList.remove('active'); }); tabContents.forEach(function(c) { c.classList.remove('active'); }); // 添加 active this.classList.add('active'); // 延迟添加动画类，确保过渡效果 setTimeout(function() { targetContent.classList.add('active'); }, 50); window.location.hash = target; }); }); var hash = window.location.hash.replace('#', ''); if (hash) { var targetTab = document.querySelector('.skill-tabs .tab-item[data-tab="' + hash + '"]'); if (targetTab) { targetTab.click(); } } // 中英文描述切换 var langSwitcher = document.getElementById('langSwitcher'); if (langSwitcher) { var descEn = document.querySelector('#markdownContent .desc-en'); var descCn = document.querySelector('#markdownContent .desc-cn'); var toggleBtn = document.getElementById('langToggleBtn'); var currentLang = 'cn'; toggleBtn.addEventListener('click', function() { if (currentLang === 'en') { currentLang = 'cn'; toggleBtn.textContent = 'English'; if (descEn) descEn.style.display = 'none'; if (descCn) descCn.style.display = 'block'; } else { currentLang = 'en'; toggleBtn.textContent = '中文'; if (descCn) descCn.style.display = 'none'; if (descEn) descEn.style.display = 'block'; } }); } }); </script></div> <script src="source/plugin/hl_skillhub/static/js/main.js"></script><div style="height:299px; overflow:hidden" class="cl"> <div class="hl_footer cl"> <div class="hl_fttop cl"> <div class="w1180 cl"> <div class="hl_ftl" style=" position: relative;"> <ul> <li> <div class="hl_h5">闲社论坛</div> <a href="plugin.php?id=hl_techmarket&mod=list" target="_blank">定制服务</a> <a href="plugin.php?id=keke_video_base&ac=list" target="_blank">闲社视频</a> <a href="portal.php?mod=topic&topicid=1">会员介绍</a> <a href="plugin.php?id=keke_group" target="_blank">开通会员</a> </li> <li> <div class="hl_h5">闲社论坛</div> <a href="forum.php" target="_blank">智能体论坛</a> <a href="plugin.php?id=hl_skillhub&mod=bundle&ac=list" target="_blank">技能自动化</a> <a href="plugin.php?id=hl_techmarket">AI服务市场</a> <a href="forum-9-1.html" target="_blank">大模型社区</a> </li> <li> <div class="hl_h5">网站服务</div> <a href="https://wpa.qq.com/msgrd?v=3&uin=515151560&site=qq&menu=yes" target="_blank" rel="nofollow">会员咨询：515151560</a> <a href="https://wpa.qq.com/msgrd?v=3&uin=515151570&site=qq&menu=yes" target="_blank" rel="nofollow">广告合作：515151570</a> <a href="https://wpa.qq.com/msgrd?v=3&uin=515151580&site=qq&menu=yes" target="_blank" rel="nofollow">投诉建议：515151580</a> <a href="https://wpa.qq.com/msgrd?v=3&uin=515151590&site=qq&menu=yes" target="_blank" rel="nofollow">售后指导：515151590</a> </li> <div class="clear"></div> </ul> <p style="font-size: 16px;color: #fff;width: 370px;text-align: center;margin-top: 5px;position: absolute; left:0px; bottom:0px"> 多链集团旗下-闲社网</p> </div> <div class="hl_ftm"> <div class="hl_h5">闲社网热线 </div> <div class="hl_h2">免费联系电话 </div> <div class="hl_fttel"> 0527-80111111 </div> <h6 class="cl mtm hl_time"><a href="http://wpa.b.qq.com/cgi/wpa.php?ln=1&key=XzkzODA1MTM1NF80NjQ5NDZfNDAwMDgwOTkxMV8yXw" target="_blank" rel="nofollow"><img src="template/xianshe/neoconex/qqline.png"></a> <a href="http://wpa.b.qq.com/cgi/wpa.php?ln=1&key=XzkzODA1MTM1NF80NjQ5NDZfNDAwMDgwOTkxMV8yXw" target="_blank" rel="nofollow"><img alt="qqline" title="qqline" src="template/xianshe/neoconex/qqline.png"></a></h6> <p class="hl_time">服务时间：周一到周日 8:00-24:00</p> </div> <div class="hl_ftl"> <ul> <li> <div class="hl_h5">公众号</div> <span style="font-size:18px;color: #fff;display: block;">闲社</span> <span style="font-size:18px;color: #fff;">APP下载闲社</span> </li> <div class="clear"></div> </ul> </div> <div class="hl_ftr"> <div class="hl_h5">关注闲社网</div> <div class="hl_guznzhuxx"> <ul> <li> <img alt="wxkf" title="wxkf" src="template/xianshe/neoconex/wxkf.jpg"> <p>闲社在线客服</p> </li> <li> <img alt="wx" title="wx" src="template/xianshe/neoconex/wx.jpg"> <p>关注闲社网微信</p> </li> <li style=" margin-right: 0;"> <img alt="app" title="app" src="template/xianshe/neoconex/app.png"> <p>闲社网APP</p> </li> <div class="clear"></div> </ul> </div> </div> <div class="clear"></div> </div> </div> </div> <div class="hl_ftbottom"> <div class="w1180 cl"> <div class="hl_ftblt"> <p> <a href="archiver/" rel="nofollow">Archiver</a><span>·</span><a href="forum.php?mobile=yes" rel="nofollow">手机版</a><span>·</span><span>闲社网·闲社论坛·智能体自动化市场· 多链控股集团有限公司</span> <span>·</span> <a href="http://beian.miit.gov.cn/" target="_blank" rel="nofollow">苏ICP备2025199260号-1</a></p> <p> Powered by <a href="http://www.discuz.net" target="_blank" rel="nofollow">Discuz!</a> <em>X5.0</em> © 2024-2026 <a href="https://www.xianshe.com/" target="_blank">闲社网·AI智能体论坛·AI自动化解决方案·http://xianshe.com</a> </p> </div> <div class="hl_ftrgh"> <a id="_pingansec_bottomimagelarge_p2p" rel="nofollow" href="https://kashen.com/d/?z6"><img alt="p2p_official_large" title="p2p_official_large" src="template/xianshe/neoconex/p2p_official_large.jpg" /></a> </div> <div class="clear"></div> </div> </div> </div> <div id="scrolltop"> <span hidefocus="true"><a title="返回顶部" onclick="window.scrollTo('0','0')" class="scrolltopa"><b>返回顶部</b></a></span> </div> <script type="text/javascript">_attachEvent(window, 'scroll', function () { showTopLink(); }); checkBlind();</script> </body> </html>

sota-agentSOTA代理技能

sota-agent