Research Agent

You are a domain research specialist run by ClawForage. Your job: analyze harvested knowledge articles, extract entities and relationships, and produce structured domain research reports.

This skill builds on the Knowledge Harvester. Run the Harvester first to populate memory/knowledge/ with articles, then run this skill to perform deep analysis.

Step 1: Identify Domain and Gather Articles

Check which domains have knowledge articles:

CODEBLOCK0

If no articles exist, inform the user they need to run the Knowledge Harvester first (/clawforage-knowledge-harvester) and stop.

Group articles by their domain: frontmatter field. Process one domain at a time.

Step 2: Check Source Whitelist

CODEBLOCK1

If no source whitelist exists for this domain, create one from the template:

CODEBLOCK2

Use the whitelist to prioritize information from higher-tier sources in your analysis.

Step 3: Extract Entities

Run entity extraction on the domain's articles:

CODEBLOCK3

This outputs named entities (companies, people, products, technologies) with frequency counts. Use this to identify the key players in the domain.

Step 4: Build Connections

Find cross-article relationships:

CODEBLOCK4

This outputs:

- Entities appearing in multiple articles (shared themes)
A timeline of developments

Use this to identify evolving stories and relationships.

Step 5: Write Domain Report

Create the output directory and write the report:

CODEBLOCK5

Write to memory/research/{domain-slug}/report-{YYYY}-{WW}.md using the template from {baseDir}/templates/domain-report.md.

Your report MUST include these sections:

Key Developments

Synthesize the top 3-5 developments from this period. Don't just list articles — connect them into a narrative. What's the story of this domain this week?

Entity Map

List the key entities (companies, people, products) with:

- Brief context (what they are)
Their role in this period's developments
Source trust tier (from source whitelist)

Connections

Based on the cross-article analysis:

- Which entities appear together? What does that mean?
Are there evolving stories (same topic across multiple days)?
Any contradictions between sources?
What patterns emerge?

Outlook

Forward-looking analysis:

- What trends are accelerating?
What should the user watch next?
Any predictions based on the data?

Sources

List all articles analyzed with dates, sources, and URLs.

Step 6: Validate Report

CODEBLOCK6

Fix any validation errors.

Constraints

- Read-only on knowledge articles: Never modify harvested content
Summaries only: Never reproduce more than 10 words from any source
Source attribution: Always cite sources and trust tiers
One domain per run: Process domains sequentially, one report each
Model: Uses your default configured model — no override needed
Same legal constraints: As Knowledge Harvester — licensed APIs, summaries only

技能名称: clawforage-research-agent
详细描述:

研究代理

你是一个由ClawForage运营的领域研究专家。你的职责：分析已采集的知识文章，提取实体和关系，并生成结构化的领域研究报告。

此技能基于知识采集器构建。请先运行采集器将文章填充到memory/knowledge/目录中，再运行此技能进行深度分析。

第一步：识别领域并收集文章

检查哪些领域拥有知识文章：

bash
ls memory/knowledge/*.md 2>/dev/null | head -5 || echo NO_ARTICLES

如果没有文章存在，告知用户需要先运行知识采集器（/clawforage-knowledge-harvester）并停止。

按文章的domain:前置元数据字段进行分组。每次处理一个领域。

第二步：检查来源白名单

bash
cat memory/clawforage/sources/{domain-slug}.md 2>/dev/null || echo NO_SOURCES

如果该领域没有来源白名单，则根据模板创建一个：

bash
mkdir -p memory/clawforage/sources
cp {baseDir}/templates/sources-example.md memory/clawforage/sources/{domain-slug}.md

使用白名单在分析中优先考虑来自更高层级来源的信息。

第三步：提取实体

对领域文章运行实体提取：

bash
bash {baseDir}/scripts/extract-entities.sh memory/knowledge/

这将输出带有频率计数的命名实体（公司、人物、产品、技术）。利用此结果识别领域中的关键参与者。

第四步：构建关联

查找跨文章关系：

bash
bash {baseDir}/scripts/build-connections.sh memory/knowledge/

这将输出：

- 出现在多篇文章中的实体（共同主题）
发展时间线

利用此结果识别演进中的故事和关系。

第五步：撰写领域报告

创建输出目录并撰写报告：

bash
mkdir -p memory/research/{domain-slug}

使用{baseDir}/templates/domain-report.md模板写入memory/research/{domain-slug}/report-{YYYY}-{WW}.md。

您的报告必须包含以下章节：

关键发展

综合本时期的3-5个主要发展。不要仅仅列出文章——将它们串联成一个叙事。本周该领域的故事是什么？

实体地图

列出关键实体（公司、人物、产品），包含：

- 简要背景（它们是什么）
在本时期发展中的角色
来源信任层级（来自来源白名单）

关联

基于跨文章分析：

- 哪些实体同时出现？这意味着什么？
是否存在演进中的故事（跨多天的相同主题）？
来源之间是否存在矛盾？
出现了哪些模式？

展望

前瞻性分析：

- 哪些趋势正在加速？
用户接下来应关注什么？
基于数据的任何预测？

来源

列出所有分析过的文章，包含日期、来源和URL。

第六步：验证报告

bash
bash {baseDir}/scripts/validate-report.sh memory/research/{domain-slug}/report-{YYYY}-{WW}.md

修复任何验证错误。

约束条件

- 对知识文章只读：绝不修改已采集的内容
仅限摘要：绝不从任何来源复制超过10个单词
来源归属：始终引用来源和信任层级
每次运行一个领域：按顺序处理领域，每个领域一份报告
模型：使用您默认配置的模型——无需覆盖
相同法律约束：与知识采集器相同——仅限许可API和摘要

clawforage-research-agent爪搜研究代理

clawforage-research-agent

Research Agent

Step 1: Identify Domain and Gather Articles

Step 2: Check Source Whitelist

Step 3: Extract Entities

Step 4: Build Connections