Meta Ad Spy — Competitor Ad Intelligence Skill

A two-phase skill for extracting and analyzing competitor ads from Meta platforms.

Architecture Overview

CODEBLOCK0

PHASE 1: Playwright Scraper

When to use: Always as the first step, or when user has no API token.
What it gets: Ad creatives (image/video URLs), ad copy, CTA text, page name, start date, active status, platforms (Facebook/Instagram), ad format (carousel, video, static).
What it can't get: Spend ranges, impressions, demographic breakdown (those need Phase 2).

Setup

CODEBLOCK1

Core Playwright Script

Write this to /tmp/meta_ad_scraper.py:

CODEBLOCK2

How to Run Phase 1

CODEBLOCK3

Or from within Python (for page ID lookups):
CODEBLOCK4

Filters Available in Phase 1

Filter	Values	Notes
INLINECODE1	INLINECODE2, `inactive`, INLINECODE4	INLINECODE5 = currently running
INLINECODE6

PHASE 2: Meta Graph API

When to use: After Phase 1, or when user wants spend/impression/demographic data.
Requirements: Meta developer account + access token (see setup below).
What it gets: Spend ranges, impression ranges, demographic distribution (EU/political), delivery by region, ad creative details, estimated audience size.

Setup Instructions (tell the user)

1. Go to Meta for Developers → Create App
Go to facebook.com/ID → Confirm identity (required for spend data)
Generate a User Access Token with ads_read permission from Graph API Explorer
Set as env var: INLINECODE29

Core API Script

Write this to /tmp/meta_ad_api.py:

CODEBLOCK5

API Filter Reference

Parameter	Values	Notes
INLINECODE31	Any string	Keyword search in ad content
INLINECODE32

Data Fields Available from API

Always available (all ads):

- ad_archive_id — Unique ad ID
INLINECODE60, page_name — Advertiser page
INLINECODE62 — Ad copy text(s)
INLINECODE63, ad_creative_link_descriptions — Headlines
INLINECODE65, ad_delivery_stop_time — Run dates
INLINECODE67 — FB/Instagram/Messenger/Audience Network
INLINECODE68 — Link to view the actual ad

EU/UK/Political ads only:

- spend — {lower_bound, upper_bound, currency} — Spend RANGE, not exact
INLINECODE71 — {lower_bound, upper_bound} — Impression RANGE
INLINECODE73 — INLINECODE74
INLINECODE75 — [{age, gender, percentage}] array
INLINECODE77 — Geographic breakdown
INLINECODE78 — "Paid for by" disclaimer

⚠️ Important: Spend and impressions are RANGES, not exact numbers. For non-EU/non-political ads in most countries including US and India, spend/impression data will NOT be returned. The official API is primarily a transparency tool. For richer commercial ad data, see the third-party alternatives in references/alternatives.md.

ANALYSIS WORKFLOW

When a user wants competitor ad intelligence, follow this flow:

Step 1 — Clarify the Target

Ask (or infer from context):

- Who — brand name OR Facebook Page ID (better)
Where — country/region (US, IN, ALL, etc.)
What — active only, or historical too?
Goal — creative inspiration, spend monitoring, format analysis, copy patterns?

Step 2 — Find the Page ID (if only brand name given)

CODEBLOCK6

Step 3 — Run Phase 1 (Playwright)

Always run Phase 1 first. Write and execute /tmp/meta_ad_scraper.py.

Step 4 — Run Phase 2 (API), if token available

Check for META_ACCESS_TOKEN env var. If present, run Phase 2.
If missing, tell user what Phase 2 would add, and give setup instructions.

Step 5 — Synthesize & Report

Produce a structured competitive intelligence report covering:

CODEBLOCK7

COMMON WORKFLOWS

"What ads is [Brand] running right now?"

CODEBLOCK8

"Show me competitor video ads in India"

CODEBLOCK9

"How much is [Brand] spending on ads?" (EU/political only)

CODEBLOCK10

"Show me ads that have been running the longest" (= likely winners)

CODEBLOCK11

"Find ads about [topic/product keyword]"

CODEBLOCK12

LIMITATIONS & WORKAROUNDS

Limitation	Workaround
Spend data only for EU/political	Target EU countries in API query
No CTR/conversion data

NOTES ON LEGALITY & ETHICS

- The Meta Ad Library is public data — no login required for commercial ads
Using it for competitive research is explicitly Meta's stated purpose for the tool
The official API is a transparency tool — use it as intended
Playwright scraping of public pages is generally legal (ref: hiQ v. LinkedIn, 2022)
Do NOT attempt to scrape user data, private profiles, or non-public content
Always respect rate limits and avoid aggressive scraping

Meta Ad Spy — 竞争对手广告情报技能

一个用于从Meta平台提取和分析竞争对手广告的两阶段技能。

架构概览

阶段 1: Playwright 爬虫（无需 API 密钥）
└── facebook.com/ads/library → 广告创意、文案、状态、平台、日期

阶段 2: Meta Graph API（需要访问令牌）
└── graph.facebook.com/v23.0/ads_archive → 支出范围、展示次数、人口统计数据

分析层：Claude 综合两个来源的洞察

阶段 1: Playwright 爬虫

使用时机：始终作为第一步，或当用户没有 API 令牌时。
获取内容：广告创意（图片/视频 URL）、广告文案、CTA 文本、页面名称、开始日期、活跃状态、平台（Facebook/Instagram）、广告格式（轮播、视频、静态）。
无法获取：支出范围、展示次数、人口统计细分（这些需要阶段 2）。

环境设置

bash
pip install playwright --break-system-packages
playwright install chromium
pip install asyncio --break-system-packages

核心 Playwright 脚本

将此写入 /tmp/metaadscraper.py：

python
import asyncio
import json
import re
import sys
from playwright.asyncapi import asyncplaywright

抓取 Meta 广告库以获取竞争对手广告。
必须提供 searchquery 或 pageid 之一。

results = []

# 构建 URL
base = https://www.facebook.com/ads/library/?
params = {
activestatus: activestatus,
adtype: adtype,
country: country,
mediatype: mediatype,
}
if search_query:
params[q] = search_query
params[searchtype] = keywordunordered
elif page_id:
params[viewallpageid] = pageid
params[search_type] = page

url = base + &.join(f{k}={v} for k, v in params.items())

async with async_playwright() as p:
browser = await p.chromium.launch(
headless=True,
args=[
--no-sandbox,
--disable-blink-features=AutomationControlled,
--disable-dev-shm-usage,
]
)
context = await browser.new_context(
viewport={width: 1440, height: 900},
useragent=Mozilla/5.0 (Macintosh; Intel Mac OS X 1015_7) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/122.0.0.0 Safari/537.36,
locale=en-US,
)
# 隐身：隐藏 webdriver
await context.addinitscript(
Object.defineProperty(navigator, webdriver, { get: () => undefined });
)

page = await context.new_page()
print(f[阶段 1] 导航至: {url})
await page.goto(url, wait_until=networkidle, timeout=30000)
await page.waitfortimeout(3000)

# 滚动以加载更多广告
ads_loaded = 0
scroll_attempts = 0
while adsloaded < maxads and scroll_attempts < 20:
await page.evaluate(window.scrollTo(0, document.body.scrollHeight))
await page.waitfortimeout(2000)
# 统计广告卡片数量
adcards = await page.queryselectorall([data-testid=ad-card], .7jvw, [class*=x8t9es0])
adsloaded = len(adcards)
scroll_attempts += 1
if scroll_attempts % 5 == 0:
print(f[阶段 1] 已加载 {ads_loaded} 个广告...)

# 通过 JavaScript 提取广告数据
ads_data = await page.evaluate(
() => {
const ads = [];
// Meta 广告库在 div 中渲染广告；提取所有可见文本/图片数据
// 查找包含库 ID 的广告存档链接
const links = document.querySelectorAll(a[href*=ads/archive]);
const seen_ids = new Set();
links.forEach(link => {
const href = link.href;
const id_match = href.match(/id=(\d+)/);
if (idmatch && !seenids.has(id_match[1])) {
seenids.add(idmatch[1]);
// 向上查找广告容器
let container = link;
for (let i = 0; i < 8; i++) {
container = container.parentElement;
if (!container) break;
}
const getText = (el, fallback=) => el ? el.innerText.trim() : fallback;
const getAttr = (el, attr, fallback=) => el ? el.getAttribute(attr) || fallback : fallback;

ads.push({
adarchiveid: id_match[1],
adsnapshoturl: href,
page_name: getText(container?.querySelector([class*=page-name], strong)),
ad_body: getText(container?.querySelector([data-ad-preview=message], [class*=body])),
ad_title: getText(container?.querySelector([class*=title])),
cta_text: getText(container?.querySelector([class*=cta], button)),
image_url: getAttr(container?.querySelector(img[src*=fbcdn]), src),
started_running: getText(container?.querySelector([class*=started-running])),
platforms: Array.from(container?.querySelectorAll([class*=platform]) || []).map(el => el.innerText.trim()).filter(Boolean),
raw_text: container?.innerText?.substring(0, 500) || ,
});
}
});
return ads;
}
)

# 同时捕获网络请求以获取更丰富的数据
print(f[阶段 1] 从 DOM 中提取了 {len(ads_data)} 个广告)
results = adsdata[:maxads]

await browser.close()

return results

async def main():
query = sys.argv[1] if len(sys.argv) > 1 else Nike shoes
ads = await scrapeadlibrary(searchquery=query, maxads=20)
print(json.dumps(ads, indent=2, ensure_ascii=False))

if name == main:
asyncio.run(main())

如何运行阶段 1

bash
python /tmp/metaadscraper.py 竞争对手品牌名称

或者从 Python 内部运行（用于页面 ID 查找）：
python
ads = await scrapeadlibrary(pageid=434174436675167, activestatus=active)

阶段 1 可用的筛选条件

筛选条件	值	说明
activestatus	active, inactive, all	active = 当前正在运行
adtype

阶段 2: Meta Graph API

使用时机：阶段 1 之后，或当用户需要支出/展示次数/人口统计数据时。
要求：Meta 开发者账户 + 访问令牌（参见下方设置）。
获取内容：支出范围、展示次数范围、人口统计分布（欧盟/政治类）、按地区投放、广告创意详情、预估受众规模。

设置说明（告知用户）

1. 前往 Meta for Developers → 创建应用
前往 facebook.com/ID → 确认身份

meta-ad-spy元广告侦察

meta-ad-spy

Meta Ad Spy — Competitor Ad Intelligence Skill

Architecture Overview

PHASE 1: Playwright Scraper

Setup

Core Playwright Script

How to Run Phase 1

Filters Available in Phase 1

PHASE 2: Meta Graph API

Setup Instructions (tell the user)

Core API Script

API Filter Reference

Data Fields Available from API

ANALYSIS WORKFLOW

Step 1 — Clarify the Target

Step 2 — Find the Page ID (if only brand name given)

Step 3 — Run Phase 1 (Playwright)

Step 4 — Run Phase 2 (API), if token available

Step 5 — Synthesize & Report

COMMON WORKFLOWS

"What ads is [Brand] running right now?"

"Show me competitor video ads in India"

"How much is [Brand] spending on ads?" (EU/political only)

"Show me ads that have been running the longest" (= likely winners)

"Find ads about [topic/product keyword]"

LIMITATIONS & WORKAROUNDS

NOTES ON LEGALITY & ETHICS

SEE ALSO

Meta Ad Spy — 竞争对手广告情报技能

架构概览

阶段 1: Playwright 爬虫

环境设置

核心 Playwright 脚本

如何运行阶段 1

阶段 1 可用的筛选条件

阶段 2: Meta Graph API

设置说明（告知用户）

标签

通过对话安装

方式一：安装 SkillHub 和技能

方式二：设置 SkillHub 为优先技能安装源

通过命令行安装

下载

相关推荐

self-improvement

self-improvement

self-improvement

self-improvement