Appendix Table Writer (publishable survey tables)
Why this exists
The pipeline can produce index tables that are useful for planning/debugging, but read like internal artifacts.
This skill writes publishable, reader-facing tables that can live in an Appendix:
- - cleaner layout
- higher information density
- survey-style organization (methods/benchmarks/risks), not intermediate state
Index tables remain in outline/tables_index.md and should not be copied verbatim into the paper.
Inputs
- -
outline/table_schema.md (table intent + evidence mapping) - INLINECODE2 (internal index; optional but recommended)
- INLINECODE3
- INLINECODE4
- INLINECODE5
- INLINECODE6
- Optional: INLINECODE7
Read as needed:
- -
references/table_cell_hygiene.md when Appendix table cells still copy raw paper self-narration or generic result wrappers
Machine-readable assets:
Output
Roles (use explicitly)
Survey Table Curator (reader lens)
Mission: choose tables a reader actually wants in a survey Appendix.
Do:
- - prefer 2-3 tables that answer big questions (methods, evaluation, risks)
- make rows comparable (same row unit across the table)
- make the table legible without reading the whole paper
Avoid:
- - one-row-per-H3 index dumps
- columns named like internal axes ("axes", "blocking_missing", "evidence readiness")
Production Editor (layout)
Mission: make the table look publishable in LaTeX.
Do:
- - keep columns <= 4
- keep cells short (phrases, not sentences)
- use
<br> sparingly (0-1 per cell; never a list dump)
Avoid:
- - 6-8 columns with tiny unreadable text
- cells that look like notes (semicolon chains + slash lists + long parentheticals)
- slash-separated axis markers (A/B/C) in captions/headers/cells (post-merge voice gate will flag them); use commas or 'and' instead
- internal axis jargon that reads like an intermediate artifact once printed (e.g., calling table columns "tokens"); prefer "protocol details/metadata/assumptions"
Evidence Steward (verifiability)
Mission: prevent hallucinations.
Do:
- - every row must include citations in a dedicated column (e.g., "Key refs")
- only restate what appears in evidence packs / anchor sheet
- when evidence is thin, prefer fewer rows with stronger grounding
Avoid:
- - "representative works" with no supporting claim in packs/anchors
- adding benchmark/method details not present upstream
Table contract (publishable, Appendix-ready)
INLINECODE12 must:
- - contain >=2 Markdown tables
- use a caption line before each table, e.g. INLINECODE13
- contain no headings (
#, ##, ###) inside the file (the merger adds an Appendix heading) - contain no placeholders (
TODO, TBD, FIXME, ..., unicode ellipsis) - contain citations in rows using
[@BibKey] (keys must exist in citations/ref.bib) - avoid pipeline jargon and index-like column names
Workflow (explicit inputs)
- - Start from
GOAL.md (scope) and outline/table_schema.md (what each table must answer). - Use
outline/tables_index.md as a shortlist source, but do not paste it verbatim. - Fill rows/cells using
outline/subsection_briefs.jsonl, outline/evidence_drafts.jsonl, and outline/anchor_sheet.jsonl (no guessing). - Validate every cited key against
citations/ref.bib.
Recommended Appendix tables (default set)
If you are unsure what to build, start with these two:
1) Method/architecture map (representative works)
- - Row unit: work/system line (not H3 id)
- Columns (example):
- Work (short name)
- Core idea (1 short phrase)
- Loop + interface assumptions (1 short phrase; reader-facing)
- Key refs (2-4 cite keys)
2) Evaluation protocol / benchmark map
- - Row unit: benchmark / evaluation setting (or a canonical protocol dimension if benchmarks are thin)
- Columns (example):
- Benchmark / setting
- Task + metric (phrases, not definitions)
- Key protocol constraints (budget/cost/latency/steps/tool access/threat model)
- Key refs (2-4 cite keys)
Optional third (only if it stays clean):
3) Risk / threat-surface map
- - Row unit: threat/failure mode category
- Columns: surface; why it matters; mitigation pattern; key refs
Positive / negative examples (style)
Bad (index table / internal notes):
- - Column: "Axes"
- Cell:
planning / memory / tools / eval / safety (slash dump) - Rows: every H3 id with 5+
<br> lines
Good (survey table):
- - Column labels are reader-facing ("Core idea", "Task + metric", "Constraint")
- Cells are short phrases (no narration)
- A reader can scan and compare rows quickly
Also good (avoid intermediate-artifact tells):
- - Don't label columns as "token(s)". If you need the idea, rewrite as "protocol details/metadata/assumptions".
- Avoid ASCII arrows like
-> inside cells; prefer natural phrasing (e.g., "interleaves reasoning traces with tool actions").
When to stop / route upstream
If you cannot fill a row without guessing:
- - remove the row (prefer fewer, solid rows), and
- route upstream: strengthen
evidence-draft / anchor-sheet for that area.
Script (generator + validator)
Quick Start
- - INLINECODE35
- INLINECODE36
All Options
- -
--workspace <workspace_dir> (required) - INLINECODE38 (optional; used only for runner bookkeeping)
- INLINECODE39 (optional; ignored by the validator; kept for runner compatibility)
- INLINECODE40 (optional; defaults to
outline/tables_appendix.md) - INLINECODE42 (optional; ignored by the validator)
Examples
- - Validate the default appendix tables file:
INLINECODE43
- - Validate a workspace that writes appendix tables to a non-standard path:
INLINECODE44
Notes:
- - This script writes
outline/tables_appendix.md from the existing evidence artifacts and then validates the result. - It always writes a short report to
output/TABLES_APPENDIX_REPORT.md.
附录表格撰写器(可发表的调查表格)
为何存在
流水线可生成对规划/调试有用的索引表格,但这些表格读起来像内部产物。
本技能撰写可发表、面向读者的表格,适用于附录:
- - 更清晰的布局
- 更高的信息密度
- 调查式组织(方法/基准/风险),而非中间状态
索引表格保留在 outline/tables_index.md 中,不应逐字复制到论文中。
输入
- - outline/tableschema.md(表格意图 + 证据映射)
- outline/tablesindex.md(内部索引;可选但推荐)
- outline/subsectionbriefs.jsonl
- outline/evidencedrafts.jsonl
- outline/anchor_sheet.jsonl
- citations/ref.bib
- 可选:GOAL.md
按需读取:
- - 当附录表格单元格仍复制原始论文自我叙述或通用结果包装时,参考 references/tablecellhygiene.md
机器可读资产:
- - assets/tablecellhygiene.json
输出
- - outline/tables_appendix.md
角色(明确使用)
调查表格策展人(读者视角)
使命:选择读者真正希望在调查附录中看到的表格。
应做:
- - 优先选择2-3个回答重大问题的表格(方法、评估、风险)
- 使行具有可比性(整张表格使用相同的行单位)
- 使表格无需阅读整篇论文即可理解
避免:
- - 每个H3对应一行的索引转储
- 列名类似内部轴(轴、阻塞_缺失、证据就绪度)
制作编辑(布局)
使命:使表格看起来可在LaTeX中发表。
应做:
- - 保持列数 <= 4
- 保持单元格简短(短语,而非句子)
- 谨慎使用
(每个单元格0-1个;绝不要列表转储)
避免:
- - 6-8列导致文本微小不可读
- 单元格看起来像注释(分号链 + 斜杠列表 + 长括号内容)
- 标题/表头/单元格中使用斜杠分隔的轴标记(A/B/C)(合并后的语音门控会标记它们);改用逗号或和
- 内部轴术语,打印后读起来像中间产物(例如,将表格列称为标记);优先使用协议细节/元数据/假设
证据管理员(可验证性)
使命:防止幻觉。
应做:
- - 每行必须在专用列中包含引用(例如,关键参考文献)
- 仅重述证据包/锚点表中出现的内容
- 当证据薄弱时,优先选择行数较少但依据更强的行
避免:
- - 在包/锚点中没有支持性主张的代表性作品
- 添加上游不存在的基准/方法细节
表格契约(可发表,适用于附录)
outline/tables_appendix.md 必须:
- - 包含 >=2 个Markdown表格
- 每个表格前使用标题行,例如 附录表A1. ...
- 文件内不包含标题(#、##、###)(合并器会添加附录标题)
- 不包含占位符(TODO、TBD、FIXME、...、Unicode省略号)
- 行中使用 [@BibKey] 格式的引用(键必须存在于 citations/ref.bib 中)
- 避免流水线术语和类似索引的列名
工作流程(明确输入)
- - 从 GOAL.md(范围)和 outline/tableschema.md(每个表格必须回答的内容)开始。
- 将 outline/tablesindex.md 作为候选来源,但不要逐字粘贴。
- 使用 outline/subsectionbriefs.jsonl、outline/evidencedrafts.jsonl 和 outline/anchor_sheet.jsonl 填充行/单元格(不猜测)。
- 对照 citations/ref.bib 验证每个引用的键。
推荐的附录表格(默认集合)
如果不确定要构建什么,从以下两个开始:
1) 方法/架构图(代表性作品)
- - 行单位:作品/系统行(而非H3 ID)
- 列(示例):
- 作品(简称)
- 核心思想(1个简短短语)
- 循环 + 接口假设(1个简短短语;面向读者)
- 关键参考文献(2-4个引用键)
2) 评估协议/基准图
- - 行单位:基准/评估设置(如果基准较少,则为规范协议维度)
- 列(示例):
- 基准/设置
- 任务 + 指标(短语,而非定义)
- 关键协议约束(预算/成本/延迟/步骤/工具访问/威胁模型)
- 关键参考文献(2-4个引用键)
可选的第三个(仅当保持清晰时):
3) 风险/威胁面图
- - 行单位:威胁/故障模式类别
- 列:面;为何重要;缓解模式;关键参考文献
正面/负面示例(风格)
差(索引表格/内部笔记):
- - 列:轴
- 单元格:规划 / 记忆 / 工具 / 评估 / 安全(斜杠转储)
- 行:每个H3 ID带有5个以上
行
好(调查表格):
- - 列标签面向读者(核心思想、任务 + 指标、约束)
- 单元格为简短短语(无叙述)
- 读者可以快速扫描并比较各行
同样好(避免中间产物痕迹):
- - 不要将列标记为标记。如果需要这个概念,改写为协议细节/元数据/假设。
- 避免单元格内使用ASCII箭头如->;优先使用自然措辞(例如,将推理轨迹与工具操作交错)。
何时停止/路由上游
如果无法在不猜测的情况下填充某行:
- - 移除该行(优先选择较少但扎实的行),并且
- 路由上游:加强该区域的evidence-draft/anchor-sheet。
脚本(生成器 + 验证器)
快速开始
- - python scripts/run.py --help
- python scripts/run.py --workspace workspaces/
所有选项
- - --workspace dir>(必需)
- --unit-id (可选;仅用于运行器记账)
- --inputs (可选;验证器忽略;为运行器兼容性保留)
- --outputs (可选;默认为 outline/tablesappendix.md)
- --checkpoint (可选;验证器忽略)
示例
python scripts/run.py --workspace workspaces/e2e-agent-survey-latex-verify-YYYYMMDD-HHMMSS
python scripts/run.py --workspace workspaces/ --outputs outline/tables_appendix.md
注意:
- - 此脚本从现有证据产物写入 outline/tablesappendix.md,然后验证结果。
- 它始终将简短报告写入 output/TABLESAPPENDIX_REPORT.md。