Appendix Table Writer (publishable survey tables)

Why this exists

The pipeline can produce index tables that are useful for planning/debugging, but read like internal artifacts.

This skill writes publishable, reader-facing tables that can live in an Appendix:

- cleaner layout
higher information density
survey-style organization (methods/benchmarks/risks), not intermediate state

Index tables remain in outline/tables_index.md and should not be copied verbatim into the paper.

Inputs

- outline/table_schema.md (table intent + evidence mapping)
INLINECODE2 (internal index; optional but recommended)
INLINECODE3
INLINECODE4
INLINECODE5
INLINECODE6
Optional: INLINECODE7

Read as needed:

- references/table_cell_hygiene.md when Appendix table cells still copy raw paper self-narration or generic result wrappers

Machine-readable assets:

- INLINECODE9

Output

- INLINECODE10

Roles (use explicitly)

Survey Table Curator (reader lens)

Mission: choose tables a reader actually wants in a survey Appendix.

Do:

- prefer 2-3 tables that answer big questions (methods, evaluation, risks)
make rows comparable (same row unit across the table)
make the table legible without reading the whole paper

Avoid:

- one-row-per-H3 index dumps
columns named like internal axes ("axes", "blocking_missing", "evidence readiness")

Production Editor (layout)

Mission: make the table look publishable in LaTeX.

Do:

- keep columns <= 4
keep cells short (phrases, not sentences)
use <br> sparingly (0-1 per cell; never a list dump)

Avoid:

- 6-8 columns with tiny unreadable text
cells that look like notes (semicolon chains + slash lists + long parentheticals)
slash-separated axis markers (A/B/C) in captions/headers/cells (post-merge voice gate will flag them); use commas or 'and' instead
internal axis jargon that reads like an intermediate artifact once printed (e.g., calling table columns "tokens"); prefer "protocol details/metadata/assumptions"

Evidence Steward (verifiability)

Mission: prevent hallucinations.

Do:

- every row must include citations in a dedicated column (e.g., "Key refs")
only restate what appears in evidence packs / anchor sheet
when evidence is thin, prefer fewer rows with stronger grounding

Avoid:

- "representative works" with no supporting claim in packs/anchors
adding benchmark/method details not present upstream

Table contract (publishable, Appendix-ready)

INLINECODE12 must:

- contain >=2 Markdown tables
use a caption line before each table, e.g. INLINECODE13
contain no headings (#, ##, ###) inside the file (the merger adds an Appendix heading)
contain no placeholders (TODO, TBD, FIXME, ..., unicode ellipsis)
contain citations in rows using [@BibKey] (keys must exist in citations/ref.bib)
avoid pipeline jargon and index-like column names

Workflow (explicit inputs)

- Start from GOAL.md (scope) and outline/table_schema.md (what each table must answer).
Use outline/tables_index.md as a shortlist source, but do not paste it verbatim.
Fill rows/cells using outline/subsection_briefs.jsonl, outline/evidence_drafts.jsonl, and outline/anchor_sheet.jsonl (no guessing).
Validate every cited key against citations/ref.bib.

Recommended Appendix tables (default set)

If you are unsure what to build, start with these two:

1) Method/architecture map (representative works)

- Row unit: work/system line (not H3 id)
Columns (example):

- Work (short name)
- Core idea (1 short phrase)
- Loop + interface assumptions (1 short phrase; reader-facing)
- Key refs (2-4 cite keys)

2) Evaluation protocol / benchmark map

- Row unit: benchmark / evaluation setting (or a canonical protocol dimension if benchmarks are thin)
Columns (example):

- Benchmark / setting
- Task + metric (phrases, not definitions)
- Key protocol constraints (budget/cost/latency/steps/tool access/threat model)
- Key refs (2-4 cite keys)

Optional third (only if it stays clean):
3) Risk / threat-surface map

- Row unit: threat/failure mode category
Columns: surface; why it matters; mitigation pattern; key refs

Positive / negative examples (style)

Bad (index table / internal notes):

- Column: "Axes"
Cell: planning / memory / tools / eval / safety (slash dump)
Rows: every H3 id with 5+ <br> lines

Good (survey table):

- Column labels are reader-facing ("Core idea", "Task + metric", "Constraint")
Cells are short phrases (no narration)
A reader can scan and compare rows quickly

Also good (avoid intermediate-artifact tells):

- Don't label columns as "token(s)". If you need the idea, rewrite as "protocol details/metadata/assumptions".
Avoid ASCII arrows like -> inside cells; prefer natural phrasing (e.g., "interleaves reasoning traces with tool actions").

When to stop / route upstream

If you cannot fill a row without guessing:

- remove the row (prefer fewer, solid rows), and
route upstream: strengthen evidence-draft / anchor-sheet for that area.

Script (generator + validator)

Quick Start

- INLINECODE35
INLINECODE36

All Options

- --workspace <workspace_dir> (required)
INLINECODE38 (optional; used only for runner bookkeeping)
INLINECODE39 (optional; ignored by the validator; kept for runner compatibility)
INLINECODE40 (optional; defaults to outline/tables_appendix.md)
INLINECODE42 (optional; ignored by the validator)

Examples

- Validate the default appendix tables file:

INLINECODE43

- Validate a workspace that writes appendix tables to a non-standard path:

INLINECODE44

Notes:

- This script writes outline/tables_appendix.md from the existing evidence artifacts and then validates the result.
It always writes a short report to output/TABLES_APPENDIX_REPORT.md.

附录表格撰写器（可发表的调查表格）

为何存在

流水线可生成对规划/调试有用的索引表格，但这些表格读起来像内部产物。

本技能撰写可发表、面向读者的表格，适用于附录：

- 更清晰的布局
更高的信息密度
调查式组织（方法/基准/风险），而非中间状态

索引表格保留在 outline/tables_index.md 中，不应逐字复制到论文中。

输入

- outline/tableschema.md（表格意图 + 证据映射）
outline/tablesindex.md（内部索引；可选但推荐）
outline/subsectionbriefs.jsonl
outline/evidencedrafts.jsonl
outline/anchor_sheet.jsonl
citations/ref.bib
可选：GOAL.md

按需读取：

- 当附录表格单元格仍复制原始论文自我叙述或通用结果包装时，参考 references/tablecellhygiene.md

机器可读资产：

- assets/tablecellhygiene.json

输出

- outline/tables_appendix.md

角色（明确使用）

调查表格策展人（读者视角）

使命：选择读者真正希望在调查附录中看到的表格。

应做：

- 优先选择2-3个回答重大问题的表格（方法、评估、风险）
使行具有可比性（整张表格使用相同的行单位）
使表格无需阅读整篇论文即可理解

避免：

- 每个H3对应一行的索引转储
列名类似内部轴（轴、阻塞_缺失、证据就绪度）

制作编辑（布局）

使命：使表格看起来可在LaTeX中发表。

应做：

- 保持列数 <= 4
保持单元格简短（短语，而非句子）
谨慎使用
（每个单元格0-1个；绝不要列表转储）

避免：

- 6-8列导致文本微小不可读
单元格看起来像注释（分号链 + 斜杠列表 + 长括号内容）
标题/表头/单元格中使用斜杠分隔的轴标记（A/B/C）（合并后的语音门控会标记它们）；改用逗号或和
内部轴术语，打印后读起来像中间产物（例如，将表格列称为标记）；优先使用协议细节/元数据/假设

证据管理员（可验证性）

使命：防止幻觉。

应做：

- 每行必须在专用列中包含引用（例如，关键参考文献）
仅重述证据包/锚点表中出现的内容
当证据薄弱时，优先选择行数较少但依据更强的行

避免：

- 在包/锚点中没有支持性主张的代表性作品
添加上游不存在的基准/方法细节

表格契约（可发表，适用于附录）

outline/tables_appendix.md 必须：

- 包含 >=2 个Markdown表格
每个表格前使用标题行，例如 附录表A1. ...
文件内不包含标题（#、##、###）（合并器会添加附录标题）
不包含占位符（TODO、TBD、FIXME、...、Unicode省略号）
行中使用 [@BibKey] 格式的引用（键必须存在于 citations/ref.bib 中）
避免流水线术语和类似索引的列名

工作流程（明确输入）

- 从 GOAL.md（范围）和 outline/tableschema.md（每个表格必须回答的内容）开始。
将 outline/tablesindex.md 作为候选来源，但不要逐字粘贴。
使用 outline/subsectionbriefs.jsonl、outline/evidencedrafts.jsonl 和 outline/anchor_sheet.jsonl 填充行/单元格（不猜测）。
对照 citations/ref.bib 验证每个引用的键。

正面/负面示例（风格）

差（索引表格/内部笔记）：

- 列：轴
单元格：规划 / 记忆 / 工具 / 评估 / 安全（斜杠转储）
行：每个H3 ID带有5个以上
行

好（调查表格）：

- 列标签面向读者（核心思想、任务 + 指标、约束）
单元格为简短短语（无叙述）
读者可以快速扫描并比较各行

同样好（避免中间产物痕迹）：

- 不要将列标记为标记。如果需要这个概念，改写为协议细节/元数据/假设。
避免单元格内使用ASCII箭头如->；优先使用自然措辞（例如，将推理轨迹与工具操作交错）。

何时停止/路由上游

如果无法在不猜测的情况下填充某行：

- 移除该行（优先选择较少但扎实的行），并且
路由上游：加强该区域的evidence-draft/anchor-sheet。

脚本（生成器 + 验证器）

快速开始

- python scripts/run.py --help
python scripts/run.py --workspace workspaces/

所有选项

- --workspace dir>（必需）
--unit-id （可选；仅用于运行器记账）
--inputs （可选；验证器忽略；为运行器兼容性保留）
--outputs （可选；默认为 outline/tablesappendix.md）
--checkpoint （可选；验证器忽略）

示例

- 验证默认附录表格文件：

python scripts/run.py --workspace workspaces/e2e-agent-survey-latex-verify-YYYYMMDD-HHMMSS

- 验证将附录表格写入非标准路径的工作空间：

python scripts/run.py --workspace workspaces/ --outputs outline/tables_appendix.md

注意：

- 此脚本从现有证据产物写入 outline/tablesappendix.md，然后验证结果。
它始终将简短报告写入 output/TABLESAPPENDIX_REPORT.md。

appendix-table-writer附录表格生成器