Authorized Session Scrape
Use this skill when the user has legitimate access and the work should continue inside their own logged-in browser session.
Hard Rule
Do not bypass login or session controls.
This skill begins only after:
- - the user confirms they want to proceed
- the login flow is opened locally
- the user completes sign-in themselves
Best Use Cases
- - account-only pages
- post-login search or filtering
- paginated dashboards or content libraries
- collections where one page is not enough
- workflows where plain fetch misses the authenticated state
Workflow
1. Open the user session
- - use the local browser path
- navigate to the target area
- if needed, prompt the user to finish login in the browser
2. Verify real access
Before scraping deeply, confirm:
- - account home or target page is visible
- search box, filters, or result list are actually present
- content is not still hidden behind a modal or loading shell
3. Expand within the site
Once logged in:
- - use the site's own search
- apply filters, sorting, and date ranges when helpful
- open multiple relevant items or tabs
- continue through pagination until results become repetitive or low-value
4. Extract systematically
Track internally:
- - which sections were searched
- what filters were applied
- which pages or tabs produced useful signal
- where the session still limits access
5. Summarize with provenance
Distinguish:
- - facts seen in public pages
- facts seen only after authenticated login
- what still requires manual action by the user
Output Pattern
Return:
- 1. where you searched inside the logged-in session
- what filters or navigation paths were used
- what the strongest results were
- what remains partial or unavailable
- what next click path you would use if continuing
技能名称:授权会话抓取
详细描述:
授权会话抓取
当用户拥有合法访问权限,且工作应在用户自身已登录的浏览器会话中继续时,使用此技能。
硬性规则
不得绕过登录或会话控制。
此技能仅在以下条件满足后开始:
- - 用户确认希望继续操作
- 登录流程已在本地打开
- 用户自行完成登录
最佳使用场景
- - 仅限账户访问的页面
- 登录后的搜索或筛选操作
- 分页仪表盘或内容库
- 单页无法满足需求的集合
- 普通抓取无法获取认证状态的工作流程
工作流程
1. 打开用户会话
- - 使用本地浏览器路径
- 导航至目标区域
- 如有需要,提示用户在浏览器中完成登录
2. 验证实际访问权限
在深入抓取前,确认:
- - 账户主页或目标页面可见
- 搜索框、筛选器或结果列表实际存在
- 内容未被弹窗或加载界面遮挡
3. 在网站内扩展操作
登录后:
- - 使用网站自身的搜索功能
- 在必要时应用筛选、排序和日期范围
- 打开多个相关项目或标签页
- 持续翻页,直至结果重复或价值降低
4. 系统化提取
内部记录:
- - 搜索了哪些板块
- 应用了哪些筛选条件
- 哪些页面或标签页产生了有效信息
- 会话在哪些方面仍存在访问限制
5. 附带来源进行总结
区分:
- - 在公开页面中看到的事实
- 仅在登录认证后看到的事实
- 仍需用户手动操作的内容
输出格式
返回:
- 1. 在登录会话中搜索了哪些位置
- 使用了哪些筛选条件或导航路径
- 最有力的结果是什么
- 哪些内容仍不完整或无法获取
- 若继续操作,下一步的点击路径是什么