Blocked Page Fallback
Use this skill when normal web fetch/search is not enough, but the goal may still be reachable through alternate lawful paths.
Do Not Do
- - do not bypass login
- do not evade anti-bot or access controls
- do not brute-force endpoints
Fallback Ladder
1. Broaden discovery
- - search multiple engines
- use site-specific search
- try alternate titles, aliases, slugs, and locale variants
2. Switch transport
- - if plain fetch is thin, use a browser-rendered path
- if browser path is noisy, pivot back to targeted fetch on discovered links
3. Pivot source types
Try allowed alternatives:
- - official docs or help centers
- official API or export surfaces
- feeds, sitemaps, changelogs, or release notes
- search-engine cached snippets where available
- public mirrors or archive copies that are openly reachable
- reputable secondary databases
4. Use structural clues
If the exact page is blocked, search by:
- - page title fragments
- quoted snippets
- IDs, handles, usernames, product codes, or canonical names
- internal link labels and breadcrumb terms
5. Keep going until confidence is earned
Do not stop after:
- - one blocked fetch
- one empty browser render
- one weak search pass
Stop when:
- - authoritative or converging sources answer the question
- the remaining blocker is concrete and real
- additional paths are now duplicative
Output Pattern
Return:
- 1. primary path that failed
- fallback paths attempted
- which fallback produced signal
- best answer now available
- what would require user-authorized login or a first-party API
技能名称:被屏蔽页面回退策略
详细描述:
被屏蔽页面回退策略
当常规网页抓取/搜索无法满足需求,但目标仍可能通过其他合法途径实现时,使用此技能。
禁止事项
- - 不得绕过登录验证
- 不得规避反爬虫或访问控制机制
- 不得对端点进行暴力破解
回退阶梯
1. 扩大发现范围
- - 使用多个搜索引擎进行搜索
- 采用特定站点搜索
- 尝试替代标题、别名、短链接及地区变体
2. 切换传输方式
- - 若纯抓取效果不佳,改用浏览器渲染路径
- 若浏览器路径干扰过多,则转向对已发现链接进行定向抓取
3. 转换来源类型
尝试允许的替代方案:
- - 官方文档或帮助中心
- 官方API或导出接口
- 订阅源、站点地图、更新日志或发布说明
- 可获取的搜索引擎缓存片段
- 可公开访问的公共镜像或存档副本
- 信誉良好的二级数据库
4. 利用结构线索
若目标页面被屏蔽,可通过以下方式搜索:
- - 页面标题片段
- 引用文本片段
- ID、用户名、产品代码或规范名称
- 内部链接标签和面包屑导航术语
5. 持续尝试直至获得可靠结果
遇到以下情况时不要停止:
- - 一次抓取被屏蔽
- 一次浏览器渲染无结果
- 一次弱搜索尝试
遇到以下情况时停止:
- - 权威或趋同来源回答了问题
- 剩余障碍具体且真实存在
- 额外路径已无新意
输出模式
返回:
- 1. 失败的主要路径
- 尝试过的回退路径
- 产生有效信号的回退方式
- 当前可获得的最佳答案
- 需要用户授权登录或第一方API才能获取的内容