Universal Web Scraper
AI-driven data extraction from 55+ Actors across all major platforms. This skill automatically selects the best Actor for your task.
Prerequisites
- -
APIFY_TOKEN configured in OpenClaw settings - Node.js 20.6+
- INLINECODE1 CLI (auto-installed via skill metadata)
Input Sanitization Rules
Before substituting any value into a bash command:
- - ACTORID: Must be either a technical name (
owner/actor-name — alphanumeric, hyphens, dots, one slash) or a raw ID (exactly 17 alphanumeric characters, e.g., oeiQgfg5fsmIJB7Cn). Reject values containing shell metacharacters (` ; | & $ ( ) { } < > ! \n `).
- **SEARCH_KEYWORDS**: Plain text words only. Reject shell metacharacters.
- **JSON_INPUT**: Must be valid JSON. Must not contain single quotes (use escaped double quotes). Validate structure before use.
- **Output filenames**: Must match YYYY-MM-DDdescriptive-name.{csv,json}. No path separators (/, ..), no spaces, no metacharacters.
## Workflow
Copy this checklist and track progress:
CODEBLOCK0
### Step 1: Understand User Goal and Select Actor
First, understand what the user wants to achieve. Then select the best Actor from the options below.
#### Instagram Actors (12)
| Actor ID | Best For |
|----------|----------|
| apify/instagram-profile-scraper | Profile data, follower counts, bio info |
| apify/instagram-post-scraper | Individual post details, engagement metrics |
| apify/instagram-comment-scraper | Comment extraction, sentiment analysis |
| apify/instagram-hashtag-scraper | Hashtag content, trending topics |
| apify/instagram-hashtag-stats | Hashtag performance metrics |
| apify/instagram-reel-scraper | Reels content and metrics |
| apify/instagram-search-scraper | Search users, places, hashtags |
| apify/instagram-tagged-scraper | Posts tagged with specific accounts |
| apify/instagram-followers-count-scraper | Follower count tracking |
| apify/instagram-scraper | Comprehensive Instagram data |
| apify/instagram-api-scraper | API-based Instagram access |
| apify/export-instagram-comments-posts | Bulk comment/post export |
#### Facebook Actors (14)
| Actor ID | Best For |
|----------|----------|
| apify/facebook-pages-scraper | Page data, metrics, contact info |
| apify/facebook-page-contact-information | Emails, phones, addresses from pages |
| apify/facebook-posts-scraper | Post content and engagement |
| apify/facebook-comments-scraper | Comment extraction |
| apify/facebook-likes-scraper | Reaction analysis |
| apify/facebook-reviews-scraper | Page reviews |
| apify/facebook-groups-scraper | Group content and members |
| apify/facebook-events-scraper | Event data |
| apify/facebook-ads-scraper | Ad creative and targeting |
| apify/facebook-search-scraper | Search results |
| apify/facebook-reels-scraper | Reels content |
| apify/facebook-photos-scraper | Photo extraction |
| apify/facebook-marketplace-scraper | Marketplace listings |
| apify/facebook-followers-following-scraper | Follower/following lists |
#### TikTok Actors (14)
| Actor ID | Best For |
|----------|----------|
| clockworks/tiktok-scraper | Comprehensive TikTok data |
| clockworks/free-tiktok-scraper | Free TikTok extraction |
| clockworks/tiktok-profile-scraper | Profile data |
| clockworks/tiktok-video-scraper | Video details and metrics |
| clockworks/tiktok-comments-scraper | Comment extraction |
| clockworks/tiktok-followers-scraper | Follower lists |
| clockworks/tiktok-user-search-scraper | Find users by keywords |
| clockworks/tiktok-hashtag-scraper | Hashtag content |
| clockworks/tiktok-sound-scraper | Trending sounds |
| clockworks/tiktok-ads-scraper | Ad content |
| clockworks/tiktok-discover-scraper | Discover page content |
| clockworks/tiktok-explore-scraper | Explore content |
| clockworks/tiktok-trends-scraper | Trending content |
| clockworks/tiktok-live-scraper | Live stream data |
#### YouTube Actors (5)
| Actor ID | Best For |
|----------|----------|
| streamers/youtube-scraper | Video data and metrics |
| streamers/youtube-channel-scraper | Channel information |
| streamers/youtube-comments-scraper | Comment extraction |
| streamers/youtube-shorts-scraper | Shorts content |
| streamers/youtube-video-scraper-by-hashtag | Videos by hashtag |
#### Google Maps Actors (4)
| Actor ID | Best For |
|----------|----------|
| compass/crawler-google-places | Business listings, ratings, contact info |
| compass/google-maps-extractor | Detailed business data |
| compass/Google-Maps-Reviews-Scraper | Review extraction |
| poidata/google-maps-email-extractor | Email discovery from listings |
#### Other Actors (6)
| Actor ID | Best For |
|----------|----------|
| apify/google-search-scraper | Google search results |
| apify/google-trends-scraper | Google Trends data |
| voyager/booking-scraper | Booking.com hotel data |
| voyager/booking-reviews-scraper | Booking.com reviews |
| maxcopell/tripadvisor-reviews | TripAdvisor reviews |
| vdrmota/contact-info-scraper | Contact enrichment from URLs |
---
#### Actor Selection by Use Case
| Use Case | Primary Actors |
|----------|---------------|
| **Lead Generation** | compass/crawler-google-places, poidata/google-maps-email-extractor, vdrmota/contact-info-scraper |
| **Influencer Discovery** | apify/instagram-profile-scraper, clockworks/tiktok-profile-scraper, streamers/youtube-channel-scraper |
| **Brand Monitoring** | apify/instagram-tagged-scraper, apify/instagram-hashtag-scraper, compass/Google-Maps-Reviews-Scraper |
| **Competitor Analysis** | apify/facebook-pages-scraper, apify/facebook-ads-scraper, apify/instagram-profile-scraper |
| **Content Analytics** | apify/instagram-post-scraper, clockworks/tiktok-scraper, streamers/youtube-scraper |
| **Trend Research** | apify/google-trends-scraper, clockworks/tiktok-trends-scraper, apify/instagram-hashtag-stats |
| **Review Analysis** | compass/Google-Maps-Reviews-Scraper, voyager/booking-reviews-scraper, maxcopell/tripadvisor-reviews |
| **Audience Analysis** | apify/instagram-followers-count-scraper, clockworks/tiktok-followers-scraper, apify/facebook-followers-following-scraper |
---
#### Multi-Actor Workflows
For complex tasks, chain multiple Actors:
| Workflow | Step 1 | Step 2 |
|----------|--------|--------|
| **Lead enrichment** | compass/crawler-google-places → | vdrmota/contact-info-scraper |
| **Influencer vetting** | apify/instagram-profile-scraper → | apify/instagram-comment-scraper |
| **Competitor deep-dive** | apify/facebook-pages-scraper → | apify/facebook-posts-scraper |
| **Local business analysis** | compass/crawler-google-places → | compass/Google-Maps-Reviews-Scraper |
#### Can't Find a Suitable Actor?
If none of the Actors above match the user's request, search the Apify Store directly:
CODEBLOCK1
Replace SEARCHKEYWORDS with 1-3 simple terms (e.g., "LinkedIn profiles", "Amazon products", "Twitter").
### Step 2: Fetch Actor Schema
Fetch the Actor's input schema and details dynamically using mcpc:
CODEBLOCK2
Replace ACTORID with the selected Actor (e.g., compass/crawler-google-places).
This returns:
- Actor description and README
- Required and optional input parameters
- Output fields (if available)
### Step 3: Ask User Preferences
Before running, ask:
1. **Output format**:
- **Quick answer** - Display top few results in chat (no file saved)
- **CSV** - Full export with all fields
- **JSON** - Full export in JSON format
2. **Number of results**: Based on character of use case
### Step 4: Run the Script
**Quick answer (display in chat, no file):**
CODEBLOCK3
**CSV:**
CODEBLOCK4
**JSON:**
CODEBLOCK5
### Step 5: Summarize Results and Offer Follow-ups
After completion, report:
- Number of results found
- File location and name
- Key fields available
- **Suggested follow-up workflows** based on results:
| If User Got | Suggest Next |
|-------------|--------------|
| Business listings | Enrich with vdrmota/contact-info-scraper or get reviews |
| Influencer profiles | Analyze engagement with comment scrapers |
| Competitor pages | Deep-dive with post/ad scrapers |
| Trend data | Validate with platform-specific hashtag scrapers |
## Security & Data Privacy
This skill instructs the agent to select an Apify Actor, fetch its schema (via mcpc), and run scrapers. The included script communicates only with api.apify.com and writes outputs to files under the current working directory; it does not access unrelated system files or other environment variables.
Apify Actors only scrape publicly available data and do not collect private or personally identifiable information beyond what is openly accessible on the target platforms. For additional security assurance, you can check an Actor's permission level by querying https://api.apify.com/v2/acts/:actorId — an Actor with LIMITEDPERMISSIONS operates in a restricted sandbox, while FULLPERMISSIONS indicates broader system access. For full details, see [Apify's General Terms and Conditions](https://docs.apify.com/legal/general-terms-and-conditions).
## Error Handling
APIFYTOKEN not found - Ask user to configure APIFYTOKEN in OpenClaw settings
mcpc not found - Run npm install -g @apify/mcpc
Actor not found - Check Actor ID spelling
Run FAILED - Ask user to check Apify console link in error output
Timeout - Reduce input size or increase --timeout`
通用网页抓取工具
基于AI的数据提取,覆盖55+个跨主流平台的Actor。此技能会自动为您的任务选择最佳Actor。
前置条件
- - 在OpenClaw设置中配置APIFY_TOKEN
- Node.js 20.6+
- mcpc CLI(通过技能元数据自动安装)
输入净化规则
在将任何值替换到bash命令之前:
- - ACTORID:必须是技术名称(owner/actor-name — 字母数字、连字符、点号、一个斜杠)或原始ID(恰好17个字母数字字符,例如oeiQgfg5fsmIJB7Cn)。拒绝包含shell元字符( ; | & $ ( ) { } < > ! \n )的值。
- SEARCHKEYWORDS:仅限纯文本单词。拒绝shell元字符。
- JSONINPUT:必须是有效的JSON。不得包含单引号(使用转义双引号)。使用前验证结构。
- 输出文件名:必须匹配YYYY-MM-DDdescriptive-name.{csv,json}。无路径分隔符(/、..)、无空格、无元字符。
工作流程
复制此清单并跟踪进度:
任务进度:
- - [ ] 步骤1:理解用户目标并选择Actor
- [ ] 步骤2:通过mcpc获取Actor模式
- [ ] 步骤3:询问用户偏好(格式、文件名)
- [ ] 步骤4:运行抓取脚本
- [ ] 步骤5:总结结果并提供后续建议
步骤1:理解用户目标并选择Actor
首先,理解用户想要实现的目标。然后从以下选项中选择最佳Actor。
Instagram Actors(12个)
| Actor ID | 最佳用途 |
|---|
| apify/instagram-profile-scraper | 个人资料数据、粉丝数、简介信息 |
| apify/instagram-post-scraper |
单个帖子详情、互动指标 |
| apify/instagram-comment-scraper | 评论提取、情感分析 |
| apify/instagram-hashtag-scraper | 标签内容、热门话题 |
| apify/instagram-hashtag-stats | 标签表现指标 |
| apify/instagram-reel-scraper | Reels内容和指标 |
| apify/instagram-search-scraper | 搜索用户、地点、标签 |
| apify/instagram-tagged-scraper | 标记特定账号的帖子 |
| apify/instagram-followers-count-scraper | 粉丝数追踪 |
| apify/instagram-scraper | 全面的Instagram数据 |
| apify/instagram-api-scraper | 基于API的Instagram访问 |
| apify/export-instagram-comments-posts | 批量评论/帖子导出 |
Facebook Actors(14个)
| Actor ID | 最佳用途 |
|---|
| apify/facebook-pages-scraper | 页面数据、指标、联系信息 |
| apify/facebook-page-contact-information |
从页面提取邮箱、电话、地址 |
| apify/facebook-posts-scraper | 帖子内容和互动 |
| apify/facebook-comments-scraper | 评论提取 |
| apify/facebook-likes-scraper | 反应分析 |
| apify/facebook-reviews-scraper | 页面评论 |
| apify/facebook-groups-scraper | 群组内容和成员 |
| apify/facebook-events-scraper | 活动数据 |
| apify/facebook-ads-scraper | 广告创意和定向 |
| apify/facebook-search-scraper | 搜索结果 |
| apify/facebook-reels-scraper | Reels内容 |
| apify/facebook-photos-scraper | 照片提取 |
| apify/facebook-marketplace-scraper | 市场列表 |
| apify/facebook-followers-following-scraper | 粉丝/关注列表 |
TikTok Actors(14个)
| Actor ID | 最佳用途 |
|---|
| clockworks/tiktok-scraper | 全面的TikTok数据 |
| clockworks/free-tiktok-scraper |
免费TikTok提取 |
| clockworks/tiktok-profile-scraper | 个人资料数据 |
| clockworks/tiktok-video-scraper | 视频详情和指标 |
| clockworks/tiktok-comments-scraper | 评论提取 |
| clockworks/tiktok-followers-scraper | 粉丝列表 |
| clockworks/tiktok-user-search-scraper | 通过关键词查找用户 |
| clockworks/tiktok-hashtag-scraper | 标签内容 |
| clockworks/tiktok-sound-scraper | 热门音效 |
| clockworks/tiktok-ads-scraper | 广告内容 |
| clockworks/tiktok-discover-scraper | 发现页面内容 |
| clockworks/tiktok-explore-scraper | 探索内容 |
| clockworks/tiktok-trends-scraper | 热门内容 |
| clockworks/tiktok-live-scraper | 直播数据 |
YouTube Actors(5个)
| Actor ID | 最佳用途 |
|---|
| streamers/youtube-scraper | 视频数据和指标 |
| streamers/youtube-channel-scraper |
频道信息 |
| streamers/youtube-comments-scraper | 评论提取 |
| streamers/youtube-shorts-scraper | Shorts内容 |
| streamers/youtube-video-scraper-by-hashtag | 按标签搜索视频 |
Google Maps Actors(4个)
| Actor ID | 最佳用途 |
|---|
| compass/crawler-google-places | 商家列表、评分、联系信息 |
| compass/google-maps-extractor |
详细商家数据 |
| compass/Google-Maps-Reviews-Scraper | 评论提取 |
| poidata/google-maps-email-extractor | 从列表中提取邮箱 |
其他Actors(6个)
| Actor ID | 最佳用途 |
|---|
| apify/google-search-scraper | Google搜索结果 |
| apify/google-trends-scraper |
Google趋势数据 |
| voyager/booking-scraper | Booking.com酒店数据 |
| voyager/booking-reviews-scraper | Booking.com评论 |
| maxcopell/tripadvisor-reviews | TripAdvisor评论 |
| vdrmota/contact-info-scraper | 从URL提取联系信息 |
按用例选择Actor
| 用例 | 主要Actors |
|---|
| 潜在客户生成 | compass/crawler-google-places、poidata/google-maps-email-extractor、vdrmota/contact-info-scraper |
| 网红发现 |
apify/instagram-profile-scraper、clockworks/tiktok-profile-scraper、streamers/youtube-channel-scraper |
|
品牌监控 | apify/instagram-tagged-scraper、apify/instagram-hashtag-scraper、compass/Google-Maps-Reviews-Scraper |
|
竞争对手分析 | apify/facebook-pages-scraper、apify/facebook-ads-scraper、apify/instagram-profile-scraper |
|
内容分析 | apify/instagram-post-scraper、clockworks/tiktok-scraper、streamers/youtube-scraper |
|
趋势研究 | apify/google-trends-scraper、clockworks/tiktok-trends-scraper、apify/instagram-hashtag-stats |
|
评论分析 | compass/Google-Maps-Reviews-Scraper、voyager/booking-reviews-scraper、maxcopell/tripadvisor-reviews |
|
受众分析 | apify/instagram-followers-count-scraper、clockworks/tiktok-followers-scraper、apify/facebook-followers-following-scraper |
多Actor工作流
对于复杂任务,可串联多个Actor:
| 工作流 | 步骤1 | 步骤2 |
|---|
| 潜在客户丰富 | compass/crawler-google-places → | vdrmota/contact-info-scraper |
| 网红审核 |
apify/instagram-profile-scraper → | apify/instagram-comment-scraper |
|
竞争对手