Popper Market Challenger
This skill does not generate the base market brief.
Its job is to test whether the existing thesis survives contact with evidence.
Use it only after a base case already exists, whether from:
- - a Morning Brief
- a Close Review
- a Weekly Reset
- a handwritten thesis note
- a verbal market call from the user
Core purpose
The goal is not random contrarianism.
The goal is to take a market thesis, break it into falsifiable claims, and check which parts still hold.
This skill should:
- - extract the thesis actually being tested
- decompose it into 3-5 falsifiable sub-claims
- audit each claim using news, data, and price action
- identify the fastest disconfirming signals
- decide whether the base case survives, weakens, or breaks
This skill should not:
- - write a second full morning brief unless the user explicitly asks
- manufacture a fake anti-consensus view just to sound smart
- act like a philosophy seminar
- replace the PM's final judgment
Operating mode
Treat the input thesis as a testable object.
You are not trying to sound bearish or clever.
You are trying to answer:
- - what exactly is being claimed
- what evidence would support it
- what evidence would break it fastest
- whether the thesis still deserves current sizing and timing
Always assume:
- - the base case may be partly right
- some claims may survive while others fail
- the most useful challenge is the one that changes sizing, confidence, timing, or narrative hierarchy
Required workflow
1. Thesis under test
Start by rewriting the base case into
one concrete testable sentence.
Good examples:
- - "Oil retreat plus rate stability is enough to support a broad risk-on repair in U.S. equities."
- "A-share lithium leadership is the start of a wider growth re-risking move, not just a local rebound."
Bad examples:
- - "Markets are complicated."
- "There are risks on both sides."
2. Claim decomposition
Break the thesis into
3-5 falsifiable sub-claims.
Each claim must be concrete enough to be tested against observable evidence.
Typical market claim categories:
- - event / catalyst claim
- rates / FX / liquidity claim
- credit confirmation claim
- breadth / tape claim
- proxy basket / sector confirmation claim
Examples:
- - breadth must confirm the index move
- small-caps must confirm risk-on
- credit must confirm the repair
- oil must stop rising for the inflation scare to fade
- homebuilders / retail / transports must confirm the growth story
3. Evidence audit
For each claim, check evidence in three buckets:
Do not leave claims ungrounded.
If one bucket is unavailable, say so.
If all three are absent, mark the claim as effectively untested.
4. Survival status
Assign each claim one status only:
- - INLINECODE0
- INLINECODE1
- INLINECODE2
- INLINECODE3
Use Under active contradiction when observable evidence is already moving against the claim.
Do not overuse it.
5. Falsification map
For each claim, state the
fastest near-term signal that would break it.
These must be concrete and monitorable.
Good examples:
- - "This claim breaks if IWM / SPY rolls over while SPY holds up."
- "This claim breaks if HYG and KRE fail to confirm within 1-2 sessions."
- "This claim breaks if oil re-accelerates and 2Y yields rise together."
Bad examples:
- - "This claim breaks if sentiment worsens."
- "This claim breaks if markets don't like it."
6. Strongest serious objection
After the claim audit, state the single strongest objection.
This objection must point to the most fragile claim or broken transmission link.
One paragraph only.
7. Historical analog filter
Use at most 1-2 analogs.
Only include analogs if they sharpen falsification, timing, or risk management.
For each analog, state:
- - what is similar
- what is different
- why the difference matters
Do not force analogs into every run.
8. Decision impact
End with a PM-facing conclusion.
Answer:
- - does the base case survive?
- should sizing stay, shrink, wait for confirmation, or be re-ranked?
Use one of these only:
- - INLINECODE5
- INLINECODE6
- INLINECODE7
Then add one sentence on sizing / timing impact.
Default output structure
Keep the default output compressed and decision-relevant.
1. Thesis Under Test
One sentence.
2. Claim Audit
For each claim, use this mini-format:
- -
Claim: ... - INLINECODE9 ...
- INLINECODE10 ...
- INLINECODE11 ...
- INLINECODE12 Supported / Weakly supported / Untested / Under active contradiction
- INLINECODE13 ...
Use 3-5 claims total.
3. Strongest Serious Objection
One paragraph.
4. Historical Analog Filter
1-2 analogs max.
5. Final Decision Impact
- - Verdict: ...
- Sizing implication: ...
- What must be re-checked next: ...
Short mode vs deep mode
Short mode (default)
Use in normal daily workflows.
Compress each claim to 1-2 lines of evidence.
Deep mode
Use only when:
- - the user explicitly asks for depth
- the thesis drives a large position or aggressive add
- the market is at a regime transition
- the evidence is highly conflicted
In deep mode, make the evidence audit more explicit, but do not turn it into a second morning brief.
Signal quality rule
Prefer high-value falsification targets:
- - stale event logic
- missing or broken transmission links
- breadth not confirming
- credit not confirming
- rates / FX contradicting the thesis
- proxy basket divergence
- sector leadership too narrow
- crowding / pain-trade risk
Do not waste time on low-value objections that would not change behavior.
Style
Voice should be direct, skeptical, and useful.
Not theatrical.
Not snarky.
Not academic for its own sake.
Good tone:
- - "The thesis under test is..."
- "This claim is only weakly supported because..."
- "The fastest falsifier is..."
- "The main transmission break is..."
- "The base case survives, but only if..."
Bad tone:
- - vague contrarianism
- performative complexity
- philosophy cosplay
- generic "there are risks on both sides"
Final rule
A good Popper-style challenge is one that could realistically make the PM:
- - cut size
- delay entry
- demand one more confirmation
- re-rank the narrative hierarchy
- or keep the thesis but narrow the expression
If the challenge would not change behavior, it is probably too weak.
波普尔市场挑战者
此技能不生成基础市场简报。
其职责是检验现有论点能否经得起证据的考验。
仅在基础情形已存在时使用,无论该情形来自:
- - 晨间简报
- 收盘回顾
- 周度重置
- 手写论点笔记
- 用户口头提出的市场判断
核心目的
目标不是随意的逆向思维。
目标是拿一个市场论点,将其拆解为可证伪的主张,并核查哪些部分仍然成立。
此技能应:
- - 提取实际被检验的论点
- 将其分解为3-5个可证伪的子主张
- 使用新闻、数据和价格行为审计每个主张
- 识别最快的证伪信号
- 判断基础情形是存活、弱化还是被打破
此技能不应:
- - 除非用户明确要求,否则撰写第二份完整的晨间简报
- 为了显得聪明而编造虚假的反共识观点
- 表现得像哲学研讨会
- 取代投资经理的最终判断
运行模式
将输入的论点视为可检验的对象。
你不是要显得看空或聪明。
你要回答的是:
- - 具体在主张什么
- 什么证据会支持它
- 什么证据会最快打破它
- 该论点是否仍值得当前的仓位规模和时机
始终假设:
- - 基础情形可能部分正确
- 某些主张可能成立,而另一些可能失败
- 最有用的挑战是能改变仓位规模、信心、时机或叙事层级的挑战
必要工作流程
1. 被检验的论点
首先将基础情形重写为
一个具体的可检验句子。
好的例子:
- - 油价回落加上利率稳定足以支持美国股市广泛的避险修复。
- A股锂电龙头是更广泛成长股风险重估的开始,而不仅仅是局部反弹。
坏的例子:
2. 主张分解
将论点分解为
3-5个可证伪的子主张。
每个主张必须足够具体,能够通过可观察的证据进行检验。
典型的市场主张类别:
- - 事件/催化剂主张
- 利率/外汇/流动性主张
- 信用确认主张
- 广度/盘面主张
- 代理篮子/板块确认主张
例子:
- - 广度必须确认指数走势
- 小盘股必须确认风险偏好
- 信用必须确认修复
- 油价必须停止上涨,通胀恐慌才能消退
- 房地产/零售/运输必须确认增长故事
3. 证据审计
对每个主张,从三个维度检查证据:
不要让主张缺乏依据。
如果某个维度不可用,请说明。
如果三个维度都缺失,将该主张标记为未经检验。
4. 存活状态
为每个主张分配唯一状态:
当可观察证据已与该主张相悖时,使用正被积极反驳。
不要过度使用。
5. 证伪地图
对每个主张,说明
最快的近期信号会打破它。
这些信号必须具体且可监控。
好的例子:
- - 如果IWM/SPY在SPY持稳时掉头向下,该主张即被打破。
- 如果HYG和KRE在1-2个交易日内未能确认,该主张即被打破。
- 如果油价重新加速且2年期收益率同步上升,该主张即被打破。
坏的例子:
- - 如果市场情绪恶化,该主张即被打破。
- 如果市场不喜欢,该主张即被打破。
6. 最强严肃反对意见
在主张审计之后,陈述单一最强的反对意见。
该反对意见必须指向最脆弱的主张或断裂的传导链条。
仅限一段。
7. 历史类比过滤器
最多使用1-2个类比。
仅当类比能强化证伪、时机或风险管理时才纳入。
对每个类比,说明:
不要强行在每个运行中都加入类比。
8. 决策影响
以面向投资经理的结论结束。
回答:
- - 基础情形是否存活?
- 仓位规模应保持不变、缩减、等待确认还是重新排序?
仅使用以下之一:
- - 基础情形存活
- 基础情形弱化
- 基础情形受到实质性挑战
然后添加一句关于仓位规模/时机影响的话。
默认输出结构
保持默认输出简洁且与决策相关。
1. 被检验的论点
一句话。
2. 主张审计
对每个主张,使用此迷你格式:
- - 主张: ...
- 新闻: ...
- 数据: ...
- 价格行为: ...
- 状态: 得到支持 / 弱支持 / 未经检验 / 正被积极反驳
- 最快证伪信号: ...
总共使用3-5个主张。
3. 最强严肃反对意见
一段。
4. 历史类比过滤器
最多1-2个类比。
5. 最终决策影响
- - 裁决:...
- 仓位规模影响:...
- 下一步必须重新检查:...
短模式 vs 深度模式
短模式(默认)
在正常日常工作中使用。
将每个主张压缩为1-2行证据。
深度模式
仅在以下情况使用:
- - 用户明确要求深度
- 该论点驱动大仓位或激进加仓
- 市场处于制度转换期
- 证据高度冲突
在深度模式下,使证据审计更明确,但不要变成第二份晨间简报。
信号质量规则
优先选择高价值的证伪目标:
- - 过时的事件逻辑
- 缺失或断裂的传导链条
- 广度未确认
- 信用未确认
- 利率/外汇与论点相悖
- 代理篮子背离
- 板块领导力过于狭窄
- 拥挤/痛苦交易风险
不要浪费时间在不会改变行为的低价值反对意见上。
风格
语气应直接、怀疑且有用。
不要戏剧化。
不要尖刻。
不要为学术而学术。
好的语气:
- - 被检验的论点是...
- 该主张仅得到弱支持,因为...
- 最快的证伪信号是...
- 主要的传导断裂是...
- 基础情形存活,但前提是...
坏的语气:
- - 模糊的逆向思维
- 表演性的复杂性
- 哲学角色扮演
- 泛泛的两边都有风险
最终规则
一个好的波普式挑战是能够实际让投资经理:
- - 削减仓位
- 延迟入场
- 要求再确认一次
- 重新排序叙事层级
- 或保留论点但缩小表达范围
如果该挑战不会改变行为,那它可能太弱了。