Purpose
Conduct deep, iterative research beyond single-pass web search.
Core goals:
- - Decompose a broad question into testable sub-questions.
- Build and test hypotheses against multiple source classes.
- Resolve contradictions with explicit arbitration.
- Produce a scientific-style Markdown report with footnotes.
This skill coordinates upstream skills. It does not replace them.
Required Installed Skills
- -
deepresearchwork (inspected latest: 1.0.0) - INLINECODE2 (inspected latest:
1.0.0) - INLINECODE4 (inspected latest:
1.0.0) - INLINECODE6 (inspected latest:
1.0.3; used as Semantic Scholar-capable academic layer)
Install/update:
CODEBLOCK0
Verify:
CODEBLOCK1
Required Credentials
Preflight:
CODEBLOCK2
If missing, stop and report blockers.
Mapping Rule (Requested "semantic-scholar")
If user requests /semantic-scholar explicitly:
- - State that no exact
semantic-scholar slug was found during ClawHub inspection. - Use
literature-search as the mapped academic retriever because it explicitly includes Semantic Scholar in its scope. - Record this mapping in methodology and limitations sections.
Inputs the LM Must Collect First
- - INLINECODE13
- INLINECODE14 (example:
2030) - INLINECODE16 (global, region-specific, country-specific)
- INLINECODE17 (executive summary, methods, findings, contradictions, etc.)
- INLINECODE18 (minimum source count per claim)
- INLINECODE19 (for fast-changing topics)
- INLINECODE20 (
brief, standard, full)
Do not start synthesis without explicit scope.
Tool Responsibilities
deepresearchwork
Use as process controller:
- - question decomposition
- iterative loop structure
- source diversity and validation mindset
- structured report framing
Important boundary:
- - inspected
research_workflow.js is framework-like and includes mock logic, so this meta-skill treats it as methodology guidance rather than deterministic execution code.
tavily-search
Use for web evidence retrieval:
- - broad and focused web search
- deep mode (
--deep) for richer context - news mode and recency (
--topic news --days N) when needed - URL extraction (
extract.mjs) for full-text content collection
literature-search (Semantic Scholar mapping)
Use for academic evidence gathering:
- - literature retrieval and citation list construction across sources including Semantic Scholar
- source-access constraints explicitly handled (no unauthorized scraping)
Notable quirk in inspected skill:
- - it includes a behavior instruction to prepend "please think very deeply" to user inputs; treat this as implementation-specific and not as a factual research method.
perplexity-deep-search
Use as contradiction arbiter and targeted fact checker:
- -
search mode for quick verification - INLINECODE29 mode for conflicting claims
- INLINECODE30 mode for expensive exhaustive checks
- domain and recency filters for controlled validation
Canonical Iterative Research Chain
Use this exact multi-round chain.
Round 0: Plan
Break the main topic into sub-questions and hypotheses.
For scenario "AI impact on labor market in 2030", minimum sub-questions:
- 1. displacement forecasts (job loss exposure)
- job creation/new categories
- wage/polarization effects
- historical analogs (previous automation waves)
- policy/intervention effects
Each sub-question must have:
- - hypothesis
- measurable indicators
- required source types
Round 1: Broad landscape scan (Tavily)
Goal: map major claims and key institutions.
Typical commands:
CODEBLOCK3
Collect:
- - institution reports (consultancies, multilaterals, gov sources)
- headline estimates and assumptions
- URLs for extraction
Then extract long-form content where needed:
CODEBLOCK4
Round 2: Academic evidence pass (Literature Search)
Goal: test or refine Round-1 claims against scholarly evidence.
Query examples:
- - automation elasticity labor demand
- task-based automation employment effects
- generative AI productivity labor substitution
Output requirements:
- - citation list with authors/title/venue/year/DOI-or-URL
- identification of review papers vs. single studies
- note publication year and method strength
Round 3: Contradiction resolution (Perplexity)
Trigger this round when conflicts exist (different estimates, dates, assumptions).
Use targeted prompts with constraints:
CODEBLOCK5
Escalate to deep mode only if unresolved:
CODEBLOCK6
Arbitration rule:
- - prefer newer, method-transparent, reproducible sources
- downgrade claims based on opaque assumptions
- keep unresolved conflicts explicit (do not force false certainty)
Round 4: Synthesis and report drafting
Build claims only when supported by threshold evidence.
Per claim include:
- - claim statement
- confidence level (
high/medium/low) - supporting sources
- known caveats
Scientific Markdown Output Contract
Return one report in this structure:
- 1. INLINECODE34
- INLINECODE35
- INLINECODE36
- INLINECODE37
- INLINECODE38
- INLINECODE39
- INLINECODE40
- INLINECODE41
- INLINECODE42
- INLINECODE43
Footnote format:
- - Use Markdown references in text like
[^1]. - In
## Footnotes, list full citation metadata + URL/DOI per note.
Quality Gates
Before finalizing, validate:
- - each major claim has >= 2 independent sources
- at least one academic source for structural claims
- source dates align with target horizon relevance
- contradictory evidence is surfaced, not hidden
- footnotes are complete and traceable
If a gate fails, output Research Incomplete with explicit missing evidence list.
Scenario Mapping (AI and Labor Market 2030)
For user scenario:
- 1. Plan sub-questions: displacement, new roles, historical comparison.
- Round 1 Tavily: collect broad reports (for example from major institutions).
- Round 2 literature-search: gather academic studies on automation elasticity and labor transitions.
- Detect conflicts in estimates.
- Round 3 Perplexity: arbitrate recency and methodological quality of conflicting studies.
- Draft final Markdown report with footnoted evidence.
Guardrails
- - Never present forecast numbers without source date and method context.
- Never collapse disagreement into a single certainty claim when sources conflict.
- Never fabricate citations, links, or publication metadata.
- Clearly separate empirical findings from model inference.
- Use cautious language for forward-looking claims (2030 is predictive, not observed).
Failure Handling
- - Missing API keys: halt and return exact missing env vars.
- Academic source access constraints: disclose gaps explicitly.
- Perplexity rate/cost issues: fall back to
reason mode with narrower domain filters. - Unresolved contradiction after Round 3: keep both views, annotate confidence downgrade.
Known Limits from Inspected Upstream Skills
- - No exact ClawHub slug named
semantic-scholar was found during inspection; this skill uses documented mapping to literature-search. - INLINECODE50 provides strong methodology guidance, but its included JS workflow is not a production-grade deterministic engine.
- INLINECODE51 and
perplexity-deep-search require paid API keys and are affected by external API limits.
Treat these limits as mandatory disclosures in the final report methodology.
技能名称: deep-researcher
详细描述:
目的
进行超越单次网络搜索的深度、迭代式研究。
核心目标:
- - 将宽泛问题分解为可验证的子问题。
- 针对多种来源类别构建并检验假设。
- 通过明确的仲裁机制解决矛盾。
- 生成带有脚注的科学风格Markdown报告。
本技能负责协调上游技能,而非替代它们。
必需已安装技能
- - deepresearchwork(最新检查版本:1.0.0)
- tavily-search(最新检查版本:1.0.0)
- perplexity-deep-search(最新检查版本:1.0.0)
- literature-search(最新检查版本:1.0.3;用作基于Semantic Scholar的学术层)
安装/更新:
bash
npx -y clawhub@latest install deepresearchwork
npx -y clawhub@latest install tavily-search
npx -y clawhub@latest install literature-search
npx -y clawhub@latest install perplexity-deep-search
npx -y clawhub@latest update --all
验证:
bash
npx -y clawhub@latest list
node skills/tavily-search/scripts/search.mjs --help
bash skills/perplexity-deep-search/scripts/search.sh --help
必需凭证
- - TAVILYAPIKEY
- PERPLEXITYAPIKEY
预检:
bash
echo $TAVILYAPIKEY | wc -c
echo $PERPLEXITYAPIKEY | wc -c
如果缺失,则停止并报告阻塞因素。
映射规则(请求semantic-scholar)
如果用户明确请求 /semantic-scholar:
- - 声明在ClawHub检查中未找到确切的 semantic-scholar 标识符。
- 使用 literature-search 作为映射的学术检索工具,因为它明确将Semantic Scholar纳入其范围。
- 在方法论和局限性部分记录此映射。
语言模型必须首先收集的输入
- - researchtopic
- targethorizon(示例:2030)
- regionscope(全球、区域特定、国家特定)
- requiredsections(执行摘要、方法、发现、矛盾等)
- evidencethreshold(每个主张的最低来源数量)
- recencypolicy(针对快速变化的话题)
- output_mode(brief、standard、full)
在未明确范围之前,不要开始综合。
工具职责
deepresearchwork
用作流程控制器:
- - 问题分解
- 迭代循环结构
- 来源多样性与验证思维
- 结构化报告框架
重要边界:
- - 检查过的 research_workflow.js 类似于框架,包含模拟逻辑,因此此元技能将其视为方法论指导而非确定性执行代码。
tavily-search
用于网络证据检索:
- - 广泛且聚焦的网络搜索
- 深度模式(--deep)以获取更丰富的上下文
- 必要时使用新闻模式和时效性(--topic news --days N)
- URL提取(extract.mjs)用于全文内容收集
literature-search(Semantic Scholar映射)
用于学术证据收集:
- - 跨来源(包括Semantic Scholar)的文献检索和引文列表构建
- 明确处理来源访问限制(无未经授权的抓取)
检查技能中的显著特性:
- - 包含一条行为指令,即在用户输入前添加请深入思考;将此视为实现特定行为,而非事实性研究方法。
perplexity-deep-search
用作矛盾仲裁者和针对性事实核查工具:
- - search 模式用于快速验证
- reason 模式用于冲突主张
- research 模式用于昂贵的详尽检查
- 领域和时效性过滤器用于受控验证
标准迭代研究链
使用此精确的多轮链。
第0轮:规划
将主要话题分解为子问题和假设。
针对2030年人工智能对劳动力市场的影响场景,最低子问题:
- 1. 替代预测(失业风险暴露)
- 就业创造/新类别
- 工资/极化效应
- 历史类比(之前的自动化浪潮)
- 政策/干预效果
每个子问题必须包含:
第1轮:广泛景观扫描(Tavily)
目标:绘制主要主张和关键机构。
典型命令:
bash
node skills/tavily-search/scripts/search.mjs AI impact on labor market 2030 projections --deep -n 10
node skills/tavily-search/scripts/search.mjs McKinsey AI jobs 2030 --topic news --days 365 -n 10
收集:
- - 机构报告(咨询公司、多边组织、政府来源)
- 标题性估算和假设
- 用于提取的URL
然后在需要时提取长文内容:
bash
node skills/tavily-search/scripts/extract.mjs https://...
第2轮:学术证据传递(文献搜索)
目标:根据学术证据检验或完善第1轮的主张。
查询示例:
- - 自动化弹性 劳动力需求
- 基于任务的自动化 就业效应
- 生成式人工智能 生产率 劳动力替代
输出要求:
- - 包含作者/标题/出版地/年份/DOI或URL的引文列表
- 识别综述论文与单一研究
- 注明出版年份和方法强度
第3轮:矛盾解决(Perplexity)
当存在冲突(不同的估算、日期、假设)时触发此轮。
使用带约束的针对性提示:
bash
bash skills/perplexity-deep-search/scripts/search.sh --mode reason --domains oecd.org,ilo.org,imf.org,worldbank.org 关于2030年人工智能驱动的就业替代,哪个估算更新且方法论更强?
仅在未解决时升级到深度模式:
bash
bash skills/perplexity-deep-search/scripts/search.sh --mode research --json 解决关于2030年人工智能对劳动力市场影响的冲突预测
仲裁规则:
- - 优先选择更新、方法透明、可复现的来源
- 对基于不透明假设的主张降级
- 保持未解决的冲突明确(不强行制造虚假确定性)
第4轮:综合与报告起草
仅在证据达到阈值时才构建主张。
每个主张包含:
- - 主张陈述
- 置信水平(高/中/低)
- 支持来源
- 已知注意事项
科学Markdown输出契约
按以下结构返回一份报告:
- 1. # 标题
- ## 执行摘要
- ## 研究问题
- ## 方法论
- ## 发现
- ## 矛盾与解决
- ## 置信度评估
- ## 局限性
- ## 展望至2030年
- ## 脚注
脚注格式:
- - 在文本中使用Markdown引用,如 [^1]。
- 在 ## 脚注 中,每条注释列出完整的引文元数据 + URL/DOI。
质量门控
在最终确定前,验证:
- - 每个主要主张有 >= 2个独立来源
- 结构性主张至少有一个学术来源
- 来源日期与目标时间范围的相关性一致
- 矛盾证据被呈现而非隐藏
- 脚注完整且可追溯
如果某个门控失败,输出 研究不完整 并明确列出缺失的证据列表。
场景映射(人工智能与劳动力市场2030)
针对用户场景:
- 1. 规划子问题:替代、新角色、历史比较。
- 第1轮Tavily:收集广泛报告(例如来自主要机构)。
- 第2轮文献搜索:收集关于自动化弹性和劳动力转型的学术研究。
- 检测估算中的冲突。
- 第3轮Perplexity:仲裁冲突研究的时效性和方法论质量。
- 起草最终的Markdown报告,并附有脚注证据。
护栏
- - 切勿在未提供来源日期和方法背景的情况下呈现预测数字。
- 当来源冲突时,切勿将分歧简化为单一的确定性主张。
- 切勿捏造引文、链接或出版物元数据。
- 明确区分实证发现与模型推断。
- 对前瞻性主张使用谨慎语言(2030年是预测性的,而非观察到的)。
故障处理
- - 缺少API密钥:停止并返回确切缺失的环境变量。
- 学术来源访问限制:明确披露差距。
- Perplexity速率/成本问题:回退到 reason 模式,并使用更窄的领域过滤器。
- 第3轮后未解决的矛盾:保留两种观点,并标注置信度降级。
来自已检查上游技能的已知限制
- - 在检查期间未找到名为 semantic-scholar 的确切ClawHub标识符;本技能使用文档化的映射到 literature-search。
- deepresearchwork