Automatically run a lightweight diary study and turn it into weekly product signals and experiment-ready recommendations.
When to use
- - The team must decide what to build, fix, or prioritize and needs behavior evidence over time, not one-off opinions.
- A founder or PM asks for scoped weekly learning (for example, onboarding drop-off causes this week).
- You need to compare what people report with what they actually do across days or touchpoints.
- You need low-overhead, repeatable diary loops tied to one explicit research goal.
- You need actionable weekly outputs with direct experiment plans.
When not to use
- - The request is only broad brand sentiment with no product decision attached.
- The team needs statistically representative market sizing instead of qualitative behavior understanding.
- The study context is high-risk regulated or sensitive (medical, legal, minors, government) without dedicated compliance design.
- The required data can be answered with existing telemetry alone and no user context is needed.
- You cannot secure participant consent, secure storage, and redaction safeguards.
Security & Permissions
- - Minimize access: only access participant contact data, diary entries, and tool scopes required for this study.
- Redact by default: remove names, emails, phone numbers, handles, account IDs, exact addresses, and direct identifiers from all synthesized outputs.
- Keep raw and reports separate: store raw unredacted entries in a restricted location; publish only redacted report artifacts.
- No secret exfiltration: never copy API keys, tokens, credentials, or hidden configs into prompts, reports, or tickets.
- No unnecessary filesystem/shell actions: prefer explicit API-based tool calls (Slack/Email/Notion/Linear/GitHub APIs) and avoid opaque shell operations.
- Multimedia controls: store attachments in restricted buckets/folders, enforce signed URL expiry, and reference attachment IDs in reports instead of raw files.
- Retention discipline: define and enforce retention windows per study; delete or anonymize raw data after the retention window.
- Auditability: log consent status, prompt schedule events, reminder events, and report generation events.
Credential contract:
- - Always required (all deployments):
- INLINECODE0 : runtime profile selector used to choose enabled integrations and routing mode.
- INLINECODE1 : restricted location for raw entries.
- INLINECODE2 : redacted report output location.
- INLINECODE3 : deterministic redaction tokenization salt.
- Conditional (only when integration enabled):
- Notion:
RESEARCH_NOTION_TOKEN, RESEARCH_NOTION_DATABASE_ID. - Slack:
RESEARCH_SLACK_BOT_TOKEN, RESEARCH_SLACK_SIGNING_SECRET. - Email:
RESEARCH_EMAIL_PROVIDER, RESEARCH_EMAIL_API_KEY, RESEARCH_EMAIL_FROM. - Optional ticket sync:
RESEARCH_LINEAR_TOKEN, RESEARCH_LINEAR_TEAM_ID, RESEARCH_GITHUB_TOKEN, RESEARCH_GITHUB_REPO.
Profile selector note:
- - Example values:
core, slack, email, notion, mixed. - INLINECODE20 is a non-secret selector and does not grant external API access.
Strict rule:
- - Unset tokens for disabled integrations; do not preload unused credentials.
Scope anchor + supported study types
Primary scope anchor:
- - Founder/PM request: "Track onboarding drop-offs this week and tell me what to fix."
Support these study types in the same workflow:
- - Habits/frequency: understand repeated usage cadence for feature X.
- Motivations/emotions: understand why users choose or avoid task Y and how that changes over time.
- Journey progression: map research -> decision -> purchase touchpoints and friction points.
- Channel/device comparisons: compare behavior by channel, device, or context across a week.
- General behavior tracking: broad behavior shifts inside a defined product scope.
Rule:
- - Use the primary user story as a scope anchor only.
- Keep method flexible so one skill run can execute any supported study type above.
Diary study timeline
A) Plan
- 1. Define the decision to inform and one clear study goal.
- Write 3-7 research questions tied to that decision.
- Choose focus type:
- -
broad_behavior: wide behavior context. - INLINECODE22 : product feature/flow usage.
- INLINECODE23 : one behavior or task.
- INLINECODE24 : multi-stage touchpoint progression.
- 4. Choose timing mode:
- -
signal default for lightweight studies. - INLINECODE26 for rare/high-value moments.
- INLINECODE27 for high-frequency habits.
- INLINECODE28 when one mode is insufficient.
- 5. Set duration using expected event frequency:
- - Daily/high frequency: 5-10 days.
- Medium frequency: 10-14 days.
- Rare events: 14-28 days.
- 6. Select tools (Notion/Airtable/Google Sheets optional) and storage boundaries.
- Define participant profile and segmentation tags.
- Set sample target, overrecruit factor, incentives, min entries, and max entry caps.
- Finalize templates and follow-up rules.
- Prepare pilot mode (24h internal test).
B) Recruit + consent
- 1. Import participant candidates from CSV or JSON export.
- Screen and segment against the participant profile.
- Overrecruit to protect against drop-off.
- Send consent template with plain-language participation rules.
- Activate participants only after explicit consent confirmation.
C) Pre-study brief
- 1. Run a short onboarding call or send equivalent written onboarding.
- Explain what to report, how often, and expected entry duration.
- Explain incentive and payout requirements (minimums and caps).
- Share good entry examples and poor entry examples.
- Confirm participant timezone and preferred prompt window.
D) Run & monitor
- 1. Send prompts on schedule by selected timing mode.
- Track compliance daily (sent, completed, missed, late, fallback-used).
- Send reminders for missed entries.
- If missed, send reduced-effort fallback prompt before marking non-compliant.
- Handle drop-off with rescue messaging and simplified restart instructions.
- Allow mid-study adjustments only when they reduce confusion without steering outcomes.
Mid-study adjustment rules (bias-safe):
- - Allowed: clarify wording, shorten entry length, shift send window, simplify format.
- Not allowed: inject hypotheses, add leading examples, redefine success criteria midstream.
- Always log change timestamp, rationale, and affected participants.
E) Optional post-study interview
- 1. Schedule 20-30 minute debrief with a subset of participants.
- Validate strongest themes and probe contradictions.
- Investigate surprising behaviors and unresolved gaps.
- Keep interview prompts neutral and evidence-grounded.
F) Analysis
- 1. Revisit original research questions and decision context.
- Code/tag entries by theme, stage, trigger, blocker, and sentiment.
- Analyze behavior shifts over time and key influencing factors.
- For journey studies, map touchpoints, transitions, delays, and abandonment points.
- Apply saturation logic before finalizing claims.
- Produce only evidence-linked recommendations with testable experiments.
Timing mode selection guide
Use this decision sequence:
- 1. Is the behavior rare but high-value (for example, activation failure, checkout abandonment)?
- 2. Is the behavior frequent and routine (for example, daily usage habit)?
- 3. Is this a lightweight weekly pulse where events may not self-trigger reliably?
- 4. Do you need baseline routine + specific event captures?
Mode guidance:
- -
event mode: - Prompt immediately after target event; require short contextual details.
- Best for low-frequency, high-insight moments.
- INLINECODE34 mode:
- Prompt on fixed cadence (daily or every X days).
- Best for repeated routines and trend tracking.
- INLINECODE35 mode (default):
- Prompt at planned check-in windows with contextual cueing.
- Best for low-friction weekly decision loops.
- INLINECODE36 mode:
- Combine interval baseline + event trigger + signal rescue prompts.
- Best for onboarding flows and journeys with uneven activity.
Mixed-mode patterns:
- - Onboarding diagnostics:
- INLINECODE37 once daily +
event when user exits onboarding unfinished. - Habit studies:
- INLINECODE39 daily +
signal at end-of-week reflection. - Journey mapping:
- INLINECODE41 per stage transition +
signal every 2-3 days for missing stages.
Sample size and overrecruit
Qualitative target ranges (completed participants):
- - Focused product question: 8-12 completes.
- Segment comparison (2 segments): 12-20 completes total.
- Journey mapping across multiple stages: 15-24 completes.
- High variability context: 20-30 completes.
Overrecruit rule:
- - Formula:
recruit_n = ceil(sample_size_target * overrecruit_factor). - Default
overrecruit_factor: 1.4. - Typical range:
1.3 to 1.8 depending on expected attrition.
Attrition planning:
- - Anticipated drop-off <20%: factor 1.3-1.4.
- Anticipated drop-off 20-35%: factor 1.4-1.6.
- Anticipated drop-off >35%: factor 1.6-1.8.
Saturation reminder:
- - Increase sample only when new entries still introduce decision-relevant themes.
- Stop expansion when additional participants mostly confirm existing patterns.
Incentive and response requirement model
Core rules:
- - Pay for consistent participation quality, not raw message volume.
- Set clear minimum entry threshold for completion payout.
- Set max counted entries to prevent incentive gaming.
Recommended structure:
- 1. Completion base payout:
- - Paid if participant meets
min_entries_required and consent/compliance rules.
- 2. Optional per-entry bonus:
- - Small amount per valid entry, counted only up to
max_entries_allowed.
- 3. Quality gate:
- - Entry must include required closed + open components; low-information spam is non-qualifying.
Default calculation model:
- -
min_entries_required = ceil(planned_prompt_count * 0.7). - INLINECODE51 .
- INLINECODE52 .
Anti-gaming controls:
- - Ignore duplicate entries submitted within a short lockout window unless marked correction.
- Cap bonus at
max_entries_allowed. - Reject entries with copied repetitive text across multiple days unless justified.
Pilot mode
Run pilot before live recruitment:
- - Duration: 24 hours.
- Participants: 1-2 internal testers.
- Mode: use intended production timing mode (or mixed if planned).
Pilot checks:
- 1. Entry completion time median is 5-10 minutes.
- No entry exceeds 15 minutes unless intentionally long-form.
- Questions are clear without facilitator intervention.
- Closed + open responses together produce usable evidence.
- Reminder + fallback sequence works as expected.
Pilot pass criteria:
- - At least 80% of pilot entries meet quality requirements.
- No major ambiguity in prompt wording.
- Report synthesis can produce at least one evidence-backed signal.
If pilot fails:
- - Simplify prompts.
- Remove ambiguous wording.
- Reduce entry burden before launching live study.
Compliance monitoring
Track per participant daily:
- - promptssent
- entriescompleted
- entriesmissed
- reminderssent
- fallbackentriescompleted
- consecutivemisseddays
Reminder policy:
- - First miss: gentle reminder.
- Second miss in a row: fallback prompt (reduced effort).
- Third miss in a row: dropout rescue message and optional pause.
Fallback prompt requirements:
- - 1 closed question + 1 short open response.
- Estimated effort: <=3 minutes.
- No penalty for using fallback when participant discloses constraints.
Dropout rescue flow:
- 1. Acknowledge friction without blame.
- Offer simplified participation option.
- Confirm whether participant wants to continue or exit.
- Preserve data rights and payout rules clearly.
Optional post-study interview
Use only when:
- - Weekly signals contain unresolved contradictions.
- Behavior shifts need causal explanation.
- Journey transitions are unclear from diary logs.
Interview constraints:
- - Duration 20-30 minutes.
- Use neutral prompts that reference participant's own prior entries.
- Do not force confirmation of existing hypotheses.
Outputs:
- - Add debrief notes as supplemental evidence.
- Mark evidence provenance as
debrief vs diary.
Analysis and saturation logic
Analysis sequence:
- 1. Re-state study goal and research questions.
- Normalize entries and redact identifiers.
- Tag entries with codebook fields (behavior, trigger, barrier, outcome, stage, emotion).
- Create participant-by-day timeline to inspect change over time.
- Identify influencing factors (context, device/channel, constraints, external events).
- For journey studies, map touchpoints and failure handoffs.
- Calculate theme emergence trend by period.
- Apply saturation test before final recommendations.
Saturation logic:
- - Compute percentage of new decision-relevant themes in the latest window.
- Default window: last 20% of entries.
- If new decision-relevant themes <10% and no high-risk unknowns, treat as approaching saturation.
- If >=10% or contradictions remain unresolved, extend study or run targeted debrief.
Recommendation requirements:
- - Every recommendation maps to a specific signal.
- Every signal maps to at least two evidence points when possible.
- When confidence is low, recommend risk-reduction experiments, not major irreversible changes.
Weekly User Signals + Recommendations quality bar
Report constraints:
- - Max length: 1 page.
- Include exactly top 3 signals.
- Each signal must include:
- Evidence snippets (redacted).
- Confidence (
Low/Med/High) and explicit reason. - One testable recommendation as an experiment plan.
Experiment plan minimum:
- - Hypothesis.
- Experiment method.
- Primary metric.
- Decision rule.
- Owner and timebox.
Writing rules:
- - No generic advice.
- Tie every claim to observed pattern + evidence.
- Distinguish observed behavior from participant interpretation.
- Explicitly note change-over-time where relevant.
Communication templates index
Use templates in
/prompts:
- - Consent: INLINECODE60
- Onboarding brief: INLINECODE61
- Diary entry template bank: INLINECODE62
- Adaptive follow-up rules: INLINECODE63
- Optional debrief guide: INLINECODE64
Standard operational snippets:
"We are running a short diary study to understand how this product decision should be improved. You will share brief, in-context entries over [X days]. Typical entry time is 5-10 minutes."
"Quick reminder for today's diary check-in. A short entry is enough; if time is tight, use the short-form option."
"Thanks for today's entry. Your details on what happened and why are directly helping this week's product decisions."
"No pressure. If the current format is heavy, we can switch to a lighter 2-minute check-in so your perspective is still represented."
Examples
Example Study Brief 1: Onboarding drop-off
CODEBLOCK0
Example Study Brief 2: Habit/frequency for feature X
CODEBLOCK1
Example Study Brief 3: Multi-touchpoint journey
CODEBLOCK2
Example Weekly Report 1: Onboarding drop-off (1 page format)
Period: 2026-03-02 to 2026-03-08
Decision focus: Reduce onboarding abandonment in first session
Top 3 User Signals + Recommendations
CODEBLOCK3
Example Weekly Report 2: Habit/frequency for feature X (1 page format)
Period: 2026-03-02 to 2026-03-08
Decision focus: Increase weekly repeat use of Feature X
Top 3 User Signals + Recommendations
CODEBLOCK4
Example Weekly Report 3: Journey mapping (1 page format)
Period: 2026-03-02 to 2026-03-08
Decision focus: Reduce stalls from research to purchase decision
Top 3 User Signals + Recommendations
CODEBLOCK5
技能名称:持续用户研究
详细描述:
自动运行轻量级日记研究,并将其转化为每周产品信号和可实验的建议。
何时使用
- - 团队必须决定构建、修复或优先处理什么,并且需要基于时间的行为证据,而非一次性意见。
- 创始人或产品经理要求有范围的每周学习(例如,本周的新用户引导流失原因)。
- 你需要比较用户在不同日子或接触点上报告的内容与实际行为。
- 你需要与一个明确的研究目标相关联的低开销、可重复的日记循环。
- 你需要带有直接实验计划的、可操作的每周产出。
何时不使用
- - 需求仅为广泛的品牌情感,没有附带产品决策。
- 团队需要统计上具有代表性的市场规模估算,而非定性行为理解。
- 研究背景属于高风险监管或敏感领域(医疗、法律、未成年人、政府),且没有专门的合规设计。
- 所需数据仅凭现有遥测技术即可回答,无需用户背景信息。
- 你无法确保获得参与者同意、安全存储和编辑保护措施。
安全与权限
- - 最小化访问:仅访问本研究所需的参与者联系数据、日记条目和工具范围。
- 默认编辑:从所有综合输出中移除姓名、电子邮件、电话号码、用户名、账户ID、确切地址和直接标识符。
- 原始数据和报告分离:将未经编辑的原始条目存储在受限位置;仅发布经过编辑的报告产物。
- 无秘密泄露:切勿将API密钥、令牌、凭据或隐藏配置复制到提示、报告或工单中。
- 无不必要的文件系统/Shell操作:优先使用基于API的显式工具调用(Slack/Email/Notion/Linear/GitHub API),避免不透明的Shell操作。
- 多媒体控制:将附件存储在受限的存储桶/文件夹中,强制使用签名URL过期,并在报告中引用附件ID而非原始文件。
- 保留纪律:为每项研究定义并执行保留窗口;在保留窗口后删除或匿名化原始数据。
- 可审计性:记录同意状态、提示计划事件、提醒事件和报告生成事件。
凭据契约:
- - 始终需要(所有部署):
- CONTINUOUSUSERRESEARCHPROFILE:运行时配置文件选择器,用于选择启用的集成和路由模式。
- RESEARCHSTUDYSTORAGERAWPATH:原始条目的受限存储位置。
- RESEARCHSTUDYSTORAGEREPORTSPATH:编辑后报告的输出位置。
- RESEARCHREDACTIONSALT:确定性编辑令牌化盐值。
- 条件性(仅在集成启用时):
- Notion:RESEARCHNOTIONTOKEN, RESEARCHNOTIONDATABASEID。
- Slack:RESEARCHSLACKBOTTOKEN, RESEARCHSLACKSIGNINGSECRET。
- 电子邮件:RESEARCHEMAILPROVIDER, RESEARCHEMAILAPIKEY, RESEARCHEMAILFROM。
- 可选工单同步:RESEARCHLINEARTOKEN, RESEARCHLINEARTEAMID, RESEARCHGITHUBTOKEN, RESEARCHGITHUBREPO。
配置文件选择器说明:
- - 示例值:core, slack, email, notion, mixed。
- CONTINUOUSUSERRESEARCH_PROFILE 是一个非秘密选择器,不授予外部API访问权限。
严格规则:
- - 为禁用的集成取消设置令牌;不要预加载未使用的凭据。
范围锚点 + 支持的研究类型
主要范围锚点:
- - 创始人/产品经理请求:跟踪本周的新用户引导流失情况,并告诉我该修复什么。
在同一工作流程中支持以下研究类型:
- - 习惯/频率:了解功能X的重复使用节奏。
- 动机/情感:了解用户为何选择或避免任务Y,以及这如何随时间变化。
- 旅程进展:绘制研究 -> 决策 -> 购买的接触点和摩擦点。
- 渠道/设备比较:比较一周内不同渠道、设备或情境下的行为。
- 一般行为追踪:在定义的产品范围内追踪广泛的行为变化。
规则:
- - 仅将主要用户故事作为范围锚点。
- 保持方法灵活,以便一次技能运行可以执行上述任何支持的研究类型。
日记研究时间线
A) 计划
- 1. 定义要告知的决策和一个清晰的研究目标。
- 写出与该决策相关的3-7个研究问题。
- 选择焦点类型:
- - broadbehavior:广泛的行为背景。
- targetedproduct:产品功能/流程使用。
- targeted_activity:一种行为或任务。
- journey:多阶段接触点进展。
- 4. 选择时机模式:
- - signal 默认用于轻量级研究。
- event 用于罕见/高价值时刻。
- interval 用于高频习惯。
- mixed 当单一模式不足时。
- 5. 使用预期事件频率设置持续时间:
- - 每日/高频:5-10天。
- 中频:10-14天。
- 罕见事件:14-28天。
- 6. 选择工具(可选Notion/Airtable/Google Sheets)和存储边界。
- 定义参与者画像和细分标签。
- 设置样本目标、过度招募因子、激励措施、最少条目数和最大条目上限。
- 最终确定模板和跟进规则。
- 准备试点模式(24小时内部测试)。
B) 招募 + 同意
- 1. 从CSV或JSON导出导入候选参与者。
- 根据参与者画像进行筛选和细分。
- 过度招募以防止流失。
- 发送包含通俗易懂参与规则的同意模板。
- 仅在获得明确同意确认后激活参与者。
C) 研究前简报
- 1. 进行一次简短的上线通话或发送等效的书面上线说明。
- 解释要报告什么、报告频率以及预期的条目时长。
- 解释激励和支付要求(最低和上限)。
- 分享好的条目示例和差的条目示例。
- 确认参与者的时区和偏好的提示窗口。
D) 运行与监控
- 1. 根据选定的时机模式按计划发送提示。
- 每日跟踪合规性(已发送、已完成、已错过、已延迟、已使用备用方案)。
- 为错过的条目发送提醒。
- 如果错过,在标记为不合规之前发送简化努力的备用提示。
- 使用挽回消息和简化的重启说明处理流失情况。
- 允许在研究中期进行调整,但仅限于减少混淆且不引导结果的情况。
研究中期调整规则(偏差安全):
- - 允许:澄清措辞、缩短条目长度、调整发送窗口、简化格式。
- 不允许:注入假设、添加引导性示例、中途重新定义成功标准。
- 始终记录更改时间戳、理由和受影响的参与者。
E) 可选研究后访谈
- 1. 与部分参与者安排20-30分钟的汇报。
- 验证最强的主题并探究矛盾之处。
- 调查令人惊讶的行为和未解决的差距。
- 保持访谈提示的中立性和基于证据。
F) 分析
- 1. 重新审视原始研究问题和决策背景。
- 按主题、阶段、触发因素、障碍和情感对条目进行编码/标记。
- 分析行为随时间的变化及关键影响因素。
- 对于旅程研究,绘制接触点、过渡、延迟和放弃点。
- 在最终确定结论前应用饱和逻辑。
- 仅产出与证据关联的建议和可测试的实验。
时机模式选择指南
使用此决策序列:
- 1. 行为是否罕见但高价值(例如,激活失败、结账放弃)?
- 2. 行为是否频繁且常规(例如,日常使用习惯)?
- 3. 这是否是一个轻量级的每周脉搏,事件可能无法可靠地自我触发?
- 4. 你是否需要基线常规 + 特定事件捕获?
模式指南:
- - event 模式:
- 在目标事件发生后立即提示;需要简短的上下文细节。
- 最适合低频、高洞察力的时刻。
- interval 模式:
- 按固定节奏提示(每日或每X天)。
- 最适合重复的常规和趋势追踪。
- signal 模式(默认):
- 在计划的签到窗口提示,并带有上下文提示。
- 最适合低摩擦的每周决策循环。
- mixed 模式:
- 结合间隔基线 + 事件触发 + 信号备用提示。
- 最适合活动不均衡的新用户引导流程和旅程。
混合模式示例:
- - 新用户引导诊断:
- 每日一次 signal + 用户未完成引导退出时触发 event。
- 习惯研究:
- 每日 interval + 周末反思时触发 signal。
- 旅程映射:
- 每个阶段过渡触发 event + 每2-3天为缺失阶段触发 signal。
样本量与过度招募
定性目标范围(完成参与者):