Podcast Transcript Mining & Authority Positioning
Overview
This advanced content intelligence skill transforms raw podcast transcripts into strategic assets for thought leadership positioning. Whether you're a solopreneur, consultant, or agency, this skill automatically:
- Extracts guest appearances from uploaded transcripts or RSS feeds
- Identifies speaking topics and expertise areas mentioned in conversations
- Harvests quote-worthy soundbites for social media and content repurposing
- Maps target audiences by analyzing which podcasts your ideal customers listen to
- Generates guest appearance portfolios showcasing your media presence
- Creates personalized podcast pitch templates for outreach campaigns
- Tracks competitor appearances to identify untapped speaking opportunities
The skill integrates with Slack for notifications, Google Search for podcast discovery, WordPress for blog syndication of guest posts, and Zapier for workflow automation. It uses OpenAI's GPT-4 for semantic analysis and natural language extraction, ensuring intelligent identification of context-relevant content rather than simple keyword matching.
Why this matters: Manual transcript review takes 2-4 hours per episode. This skill processes an entire podcast season in minutes, surfacing your best moments and identifying 10-15 qualified podcast outreach targets automatically.
Quick Start
Example 1: Extract Soundbites from a Single Transcript
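Upload my podcast transcript (PDF or text file) and extract the 5 most quote-worthy moments about digital marketing strategy.
Format them as social media posts (280 characters) and LinkedIn posts (1,300 characters).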
Example 2: Build a Guest Appearance Portfolio
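I've appeared on 8 podcasts in the past 6 months. Here are the transcript links: [list].
Create a professional guest appearance portfolio page in Markdown that I can add to my website.
Include the episode title, host, key topics discussed, and a 2-sentence blurb for each appearance.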
Example 3: Generate Podcast Pitch Templates
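Analyze these 3 competitor transcripts and identify 15 podcasts in the B2B SaaS and startup founder categories where I would be a good fit as a guest.
For each podcast, generate a personalized 150-word pitch email that references something the host said in a recent episode.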
Example 4: Discover Untapped Speaking Opportunities
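Search for podcasts in my niche (executive coaching, tech founders) with 50K-500K monthly listeners.
Identify which ones have never hosted a guest with my background (10+ years in tech, 5 exits).
Create a prioritized list with contact information and suggested episode angles.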
Example 5: Repurpose Content Across Channels
Extract all key insights from this podcast transcript and generate:
- 10 LinkedIn carousel slide ideas
- 1 blog post outline (1,500 words)
- 5 Twitter threads (5 tweets each)
- 3 TikTok/Shorts scripts (60 seconds)
- 1 email newsletter edition
Capabilities
1. Transcript Ingestion & Processing
- Supported formats: MP3/WAV audio files (auto-transcribed via Whisper API), PDF transcripts, plain text, Google Docs links, Notion databases
- Batch processing: Upload up to 50 transcripts simultaneously
- RSS feed integration: Automatically pull and process new episodes from podcast RSS feeds (Apple Podcasts, Spotify, Anchor, Buzzsprout)
- Speaker identification: Automatically detect and label host vs. guest speech patterns
- Timestamp preservation: Maintain accurate timestamps for easy audio reference
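The RSS step above can be sketched roughly as follows. This is a minimal illustration, not the skill's actual implementation: it assumes a standard RSS 2.0 feed in which each `<item>` carries a `<title>`, a `<pubDate>`, and an `<enclosure>` pointing at the audio file. The function name `list_episodes` and the sample feed are hypothetical.

```python
import xml.etree.ElementTree as ET

# Hypothetical two-item feed used only for illustration.
SAMPLE_FEED = """<rss version="2.0">
  <channel>
    <title>Example Show</title>
    <item>
      <title>Episode 1</title>
      <pubDate>Mon, 01 Jan 2024 09:00:00 GMT</pubDate>
      <enclosure url="https://feeds.example.com/ep1.mp3" type="audio/mpeg" length="123"/>
    </item>
    <item>
      <title>Episode 2 (no audio)</title>
    </item>
  </channel>
</rss>"""


def list_episodes(feed_xml: str) -> list[dict]:
    """Return title, publish date, and audio URL for each item that has audio."""
    root = ET.fromstring(feed_xml)
    episodes = []
    for item in root.iter("item"):
        enclosure = item.find("enclosure")
        if enclosure is None:
            continue  # no audio attached, so there is nothing to transcribe
        episodes.append({
            "title": item.findtext("title", default="").strip(),
            "published": item.findtext("pubDate", default="").strip(),
            "audio_url": enclosure.get("url", ""),
        })
    return episodes


if __name__ == "__main__":
    for episode in list_episodes(SAMPLE_FEED):
        print(episode["title"], episode["audio_url"])
```

Each returned `audio_url` would then be handed to whatever transcription step is configured; that hand-off is outside this sketch.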
2. Soundbite Extraction Engine
- Semantic analysis: Uses GPT-4 to identify contextually relevant, quote-worthy moments (not just keyword matches)
- Customizable extraction criteria: Extract by topic, sentiment, length, or expertise area
- Multi-format output: Social posts, LinkedIn content, email subject lines, blog pull-quotes
- Source attribution: Automatically includes episode title, host name, publication date, and audio timestamp
- Sentiment scoring: Flag emotional moments (laughs, surprising revelations, passionate declarations) for maximum engagement
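To make the extraction criteria above concrete, here is a minimal sketch of a post-filter over candidate quotes. It assumes an upstream model has already produced candidates with a 0-1 sentiment score and a timestamp. The `Candidate` class, the `select_soundbites` function, and the choice of words for the minimum and characters for the maximum are assumptions for illustration, not the skill's documented behavior.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    text: str
    sentiment: float  # assumed 0-1 score supplied by an upstream model
    timestamp: str    # position in the audio, e.g. "00:14:32"


def select_soundbites(candidates, min_words=15, max_chars=280,
                      sentiment_threshold=0.7, limit=5):
    """Keep quotes that pass the length and sentiment filters, strongest first."""
    kept = [
        c for c in candidates
        if len(c.text.split()) >= min_words       # long enough to carry an idea
        and len(c.text) <= max_chars              # short enough for a social post
        and c.sentiment >= sentiment_threshold    # emotionally notable
    ]
    kept.sort(key=lambda c: c.sentiment, reverse=True)
    return kept[:limit]
```

Keeping the timestamp on each candidate means every selected quote can still be attributed to its exact position in the episode.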
3. Authority Portfolio Builder
- Automated portfolio generation: Creates professional guest appearance pages with:
  - Episode metadata (title, host, publish date, listener count)
  - Topic summaries (auto-generated or custom)
  - Key quotes and moments
  - Episode links and call-to-action buttons
  - Professional headshots and bios
- Export formats: HTML, Markdown, WordPress shortcodes, JSON
- Integration: Push directly to WordPress sites via REST API
- Analytics-ready: Include UTM parameters for tracking guest appearance traffic
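As an illustration of the UTM tagging mentioned above, the sketch below appends tracking parameters to a link while preserving any query string it already has. The function name `add_utm` and the default `utm_medium` and `utm_campaign` values are hypothetical choices, not values defined by the skill.

```python
from urllib.parse import parse_qsl, urlencode, urlparse, urlunparse


def add_utm(url: str, source: str, medium: str = "podcast",
            campaign: str = "guest_appearance") -> str:
    """Return `url` with utm_source/utm_medium/utm_campaign appended."""
    parts = urlparse(url)
    query = dict(parse_qsl(parts.query))  # keep existing parameters
    query.update({
        "utm_source": source,
        "utm_medium": medium,
        "utm_campaign": campaign,
    })
    return urlunparse(parts._replace(query=urlencode(query)))


if __name__ == "__main__":
    print(add_utm("https://yoursite.com/podcasts?ref=1", "example-show"))
```

Using one `utm_source` value per show makes it possible to compare traffic from individual appearances in an analytics tool.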
4. Podcast Pitch Generation
- Personalized templates: AI-generated pitch emails that reference specific host episodes or recent news
- Competitor analysis: Identify gaps where similar guests haven't appeared
- Subject line A/B variants: Generate 3 subject line options with predicted open rates
- Follow-up sequences: Auto-generate 3-email follow-up templates (7-day, 14-day, 30-day)
- Bulk outreach prep: Export all pitches to CSV for email automation tools (Lemlist, Outreach, Mailchimp)
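The follow-up sequence and CSV export above might look something like this in outline. The sketch assumes the 7-, 14-, and 30-day marks are counted from the day the first pitch is sent. The column names are hypothetical, since each outreach tool defines its own import format.

```python
import csv
import io
from datetime import date, timedelta

FOLLOW_UP_DAYS = (7, 14, 30)  # assumed to be offsets from the initial send date


def follow_up_dates(sent_on: date) -> list[date]:
    """Dates on which the three follow-up emails would go out."""
    return [sent_on + timedelta(days=d) for d in FOLLOW_UP_DAYS]


def pitches_to_csv(pitches: list[dict], sent_on: date) -> str:
    """Serialize pitches plus their follow-up dates as CSV text."""
    fields = ["podcast", "host_email", "subject",
              "follow_up_1", "follow_up_2", "follow_up_3"]
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=fields)
    writer.writeheader()
    first, second, third = follow_up_dates(sent_on)
    for pitch in pitches:
        writer.writerow({
            "podcast": pitch["podcast"],
            "host_email": pitch["host_email"],
            "subject": pitch["subject"],
            "follow_up_1": first.isoformat(),
            "follow_up_2": second.isoformat(),
            "follow_up_3": third.isoformat(),
        })
    return buf.getvalue()
```

Before importing the file into a real tool, the column names would need to be mapped to that tool's expected headers.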
5. Audience Intelligence
- Listener demographic mapping: Analyze podcast descriptions and guest bios to infer audience composition
- Niche alignment scoring: Rate podcasts 1-10 for fit with your target audience
- Growth tracking: Monitor listener count trends over time
- Competitor appearance tracking: See which podcasts have hosted your competitors
- Seasonal patterns: Identify which topics are trending in your industry
6. Content Repurposing Automation
- Multi-channel output: Generate content tailored for:
  - LinkedIn (carousel slides, articles, native videos)
  - Twitter/X (threads, quote tweets, engagement hooks)
  - TikTok/Reels/Shorts (script outlines with timing)
  - Email newsletters (curated insights, call-to-action variants)
  - Blog posts (long-form articles with SEO optimization)
  - Slack updates (notifications for team members)
- Brand voice consistency: Maintain tone and style across all outputs
- SEO optimization: Auto-generate meta descriptions, H1/H2 tags, keyword suggestions
Configuration
Required Environment Variables
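OPENAI_API_KEY=sk-...                      # GPT-4 access for semantic analysis
GOOGLE_SEARCH_API_KEY=AIza...              # Podcast discovery and research
SLACK_WEBHOOK_URL=https://hooks.slack...   # Notifications and team updates
WHISPER_API_KEY=sk-...                     # Audio transcription (optional, for MP3)
WORDPRESS_API_KEY=...                      # WordPress integration (optional)
WORDPRESS_SITE_URL=https://yoursite.com    # Your WordPress domain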
Optional Configuration
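# config.yml
extraction:
  min_quote_length: 15        # Minimum soundbite length
  max_quote_length: 280       # Maximum length for social media posts
  sentiment_threshold: 0.7    # 0-1 score for emotional moments
podcast_discovery:
  listener_min: 10000         # Minimum monthly listeners
  listener_max: 1000000       # Maximum monthly listeners
  language: en
output_formats:
  - linkedin_carousel
  - twitter_thread
  - blog_post
  - email_newsletter
  - slack_message
repurposing:
  blog_word_count: 1500
  email_subject_variants: 3
  twitter_thread_length: 5
  tiktok_duration: 60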
Setup Instructions
Step 1: Connect Your Data Sources
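# Link your podcast RSS feed
openclaw podcast-mining add-feed https://feeds.example.com/podcast.xml
# Or upload transcripts directly
openclaw podcast-mining upload ./transcripts/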
Step 2: Configure Slack Notifications
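# Get your Slack webhook URL from your Slack app settings
# Paste it into the SLACK_WEBHOOK_URL environment variable
# Test the connection:
openclaw podcast-mining test-slack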
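For readers who want to check a webhook independently of the skill, the sketch below shows what a notification to a Slack incoming webhook generally looks like: a JSON body with a `text` field sent by HTTP POST. The helper names are hypothetical, and the environment variable name `SLACK_WEBHOOK_URL` is assumed from this document's configuration section.

```python
import json
import os
import urllib.request


def build_slack_request(webhook_url: str, message: str) -> urllib.request.Request:
    """Build (but do not send) a POST request carrying a plain-text message."""
    payload = json.dumps({"text": message}).encode("utf-8")
    return urllib.request.Request(
        webhook_url,
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def notify(message: str) -> None:
    """Send `message` to the webhook named in SLACK_WEBHOOK_URL (assumed variable name)."""
    url = os.environ["SLACK_WEBHOOK_URL"]
    with urllib.request.urlopen(build_slack_request(url, message), timeout=10) as resp:
        resp.read()  # drain the response; urlopen raises on HTTP errors
```

Separating request construction from sending keeps the payload easy to inspect without making a network call.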
Step 3: Set Up WordPress Integration (Optional)
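# Generate a WordPress API token under Settings > REST API
openclaw podcast-mining configure-wordpress \
  --site-url https://yoursite.com \
  --api-key YOUR_KEY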
Step 4: Customize Extraction Preferences
# Edit extraction rules for your niche
openclaw podcast-mining configure \
--min-quote-length 20 \
--sentiment-threshold 0.8 \
--output-formats linkedin_carousel,blog_post
Example Outputs
Output 1: Extracted Soundbites (JSON)
CODEBLOCK11
Output 2: Guest Appearance Portfolio (HTML)
CODEBLOCK12
Output 3: Podcast Pitch Template
CODEBLOCK13
Output 4: Content Repurposing Bundle (LinkedIn Carousel)
# LinkedIn Carousel: "5 Lessons from 8 Podcast Guest Appearances"
Slide 1: "I've appeared on 8 podcasts in 6 months. Here are the 5 biggest
lessons that surprised me about audience building, positioning, and growth."
Slide 2: "Lesson 1: Your best content isn't what you think it is. The moments
that got the most engagement were unscripted, vulnerable, and messy—not polished
talking points."
Slide 3: "Lesson 2: Podcast listeners are HUNGRY for specific, tactical advice.
Generic frameworks don't work. They want the exact playbook you used."
Slide 4: "Lesson 3: Host chemistry matters more than topic. The best episodes
were with hosts who challenged me, asked follow-ups, and weren't afraid to
disagree."
Slide 5: "Lesson 4: Repurposing is non-negotiable. One 45-min episode generated
12 pieces of content, 50K+ impressions, and 3 qualified leads."
Slide 6: "Lesson 5: Guest appearances compound. Each appearance makes the next
one easier. Hosts see your previous appearances and trust you more."
Tips & Best Practices
1. Optimize for Soundbite Quality
- Focus on specificity: Extract quotes that include numbers, frameworks, or surprising insights
- Avoid generic wisdom: "Success takes hard work" won't perform. "We grew 300% by cutting our feature set in half" will.
- Timestamp everything: Include the exact moment in the podcast so listeners can jump to that section
- Test on your audience: Share extracted soundbites with your email list to see which resonate most
2. Build a Strategic Podcast List
- Start with 50 target podcasts: Use this skill to identify 50 podcasts with 10K-500K monthly listeners in your niche
- Tier by difficulty: Separate "easy wins" (small, growing podcasts) from "stretch goals" (top-tier shows)
- Track competitor appearances: See which shows have hosted your competitors and prioritize those
- Monitor growth trends: Focus on podcasts with 20%+ monthly growth (you'll reach a bigger audience in 6 months)
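One way to apply the listener-range and growth criteria above is a simple shortlist filter. This is only a sketch: the dictionary keys, the function names, and the idea of comparing two consecutive monthly listener counts are assumptions, and real listener data may not be available in this form.

```python
def monthly_growth(previous: int, current: int) -> float:
    """Fractional month-over-month change; 0.0 when there is no baseline."""
    if previous <= 0:
        return 0.0
    return (current - previous) / previous


def shortlist(shows, min_listeners=10_000, max_listeners=500_000, min_growth=0.20):
    """Keep shows inside the listener range that are growing fast enough."""
    picked = []
    for show in shows:
        growth = monthly_growth(show["listeners_last_month"], show["listeners_now"])
        in_range = min_listeners <= show["listeners_now"] <= max_listeners
        if in_range and growth >= min_growth:
            picked.append({**show, "growth": round(growth, 2)})
    # Fastest-growing shows first
    return sorted(picked, key=lambda s: s["growth"], reverse=True)
```

The thresholds default to the figures used in this section (10K-500K listeners, 20% monthly growth) and can be loosened if the shortlist comes back empty.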
3. Personalize Every Pitch
- Reference a specific episode: "I loved your episode on [Topic] with [Guest]" performs 5x better than generic pitches
- Mention a specific quote: "You mentioned that [quote]—I have a framework that builds on that idea"
- Show you're a real listener: Mention a recent episode, not one from 2 years ago
- Lead with their audience, not your credentials: "Your listeners are asking about X, and I have specific answers"
4. Repurpose Strategically
- Create a content calendar: Map one podcast appearance to 12+ pieces of content across channels
- Batch record: Record your TikTok/Reel scripts all at once after the podcast episode airs
- Sequence your promotion: Release blog post first (SEO), then LinkedIn carousel, then Twitter thread, then email
- Tag the host: Always mention the host and podcast in your repurposed content (they'll often amplify it)
5. Measure What Matters
- Track traffic from guest appearances: Use UTM parameters to measure podcast-sourced visitors
- Monitor lead quality: Are podcast listeners higher-quality leads than other sources?
- Measure brand lift: Track Google search volume for your name after major appearances
- Calculate ROI: If one guest appearance stands in for roughly 40 hours of sales conversations, that is worth about $5K-10K in value
6. Scale Your Outreach
- Use automation tools: Export pitch templates to Lemlist or Outreach for automated follow-ups
- Batch your pitching:
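Podcast Transcript Mining & Authority Positioning
Overview
This advanced content intelligence skill transforms raw podcast transcripts into strategic assets for thought leadership positioning. Whether you're a solopreneur, consultant, or agency, this skill automatically:
- Extracts guest appearances: from uploaded transcripts or RSS feeds
- Identifies speaking topics: the expertise areas mentioned in conversations
- Harvests quote-worthy soundbites: for social media and content repurposing
- Maps target audiences: by analyzing which podcasts your ideal customers listen to
- Generates guest appearance portfolios: showcasing your media presence
- Creates personalized podcast pitch templates: for outreach campaigns
- Tracks competitor appearances: to identify untapped speaking opportunities
The skill integrates with Slack (notifications), Google Search (podcast discovery), WordPress (blog syndication of guest posts), and Zapier (workflow automation). It uses OpenAI's GPT-4 for semantic analysis and natural language extraction, so it identifies context-relevant content rather than relying on simple keyword matching.
Why this matters: Manual transcript review takes 2-4 hours per episode. This skill can process an entire podcast season in minutes, automatically surfacing your best moments and identifying 10-15 qualified podcast outreach targets.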
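Quick Start
Example 1: Extract Soundbites from a Single Transcript
Upload my podcast transcript (PDF or text file) and extract the 5 most quote-worthy moments about digital marketing strategy.
Format them as social media posts (280 characters) and LinkedIn posts (1,300 characters).
Example 2: Build a Guest Appearance Portfolio
I've appeared on 8 podcasts in the past 6 months. Here are the transcript links: [list].
Create a professional guest appearance portfolio page in Markdown that I can add to my website.
Include the episode title, host, key topics discussed, and a 2-sentence blurb for each appearance.
Example 3: Generate Podcast Pitch Templates
Analyze these 3 competitor transcripts and identify 15 podcasts in the B2B SaaS and startup founder categories where I would be a good fit as a guest.
For each podcast, generate a personalized 150-word pitch email that references something the host said in a recent episode.
Example 4: Discover Untapped Speaking Opportunities
Search for podcasts in my niche (executive coaching, tech founders) with 50K-500K monthly listeners.
Identify which ones have never hosted a guest with my background (10+ years in tech, 5 exits).
Create a prioritized list with contact information and suggested episode angles.
Example 5: Repurpose Content Across Channels
Extract all key insights from this podcast transcript and generate:
- 10 LinkedIn carousel slide ideas
- 1 blog post outline (1,500 words)
- 5 Twitter threads (5 tweets each)
- 3 TikTok/Shorts scripts (60 seconds)
- 1 email newsletter edition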
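Capabilities
1. Transcript Ingestion & Processing
- Supported formats: MP3/WAV audio files (auto-transcribed via Whisper API), PDF transcripts, plain text, Google Docs links, Notion databases
- Batch processing: Upload up to 50 transcripts simultaneously
- RSS feed integration: Automatically pull and process new episodes from podcast RSS feeds (Apple Podcasts, Spotify, Anchor, Buzzsprout)
- Speaker identification: Automatically detect and label host vs. guest speech patterns
- Timestamp preservation: Maintain accurate timestamps for easy audio reference
2. Soundbite Extraction Engine
- Semantic analysis: Uses GPT-4 to identify contextually relevant, quote-worthy moments (not just keyword matches)
- Customizable extraction criteria: Extract by topic, sentiment, length, or expertise area
- Multi-format output: Social media posts, LinkedIn content, email subject lines, blog pull-quotes
- Source attribution: Automatically includes episode title, host name, publication date, and audio timestamp
- Sentiment scoring: Flag emotional moments (laughs, surprising revelations, passionate declarations) for maximum engagement
3. Authority Portfolio Builder
- Automated portfolio generation: Creates professional guest appearance pages with:
  - Episode metadata (title, host, publish date, listener count)
  - Topic summaries (auto-generated or custom)
  - Key quotes and moments
  - Episode links and call-to-action buttons
  - Professional headshots and bios
- Export formats: HTML, Markdown, WordPress shortcodes, JSON
- Integration: Push directly to WordPress sites via REST API
- Analytics-ready: Include UTM parameters for tracking guest appearance traffic
4. Podcast Pitch Generation
- Personalized templates: AI-generated pitch emails that reference specific host episodes or recent news
- Competitor analysis: Identify gaps where similar guests haven't appeared
- Subject line A/B variants: Generate 3 subject line options with predicted open rates
- Follow-up sequences: Auto-generate 3-email follow-up templates (7-day, 14-day, 30-day)
- Bulk outreach prep: Export all pitches to CSV for email automation tools (Lemlist, Outreach, Mailchimp)
5. Audience Intelligence
- Listener demographic mapping: Analyze podcast descriptions and guest bios to infer audience composition
- Niche alignment scoring: Rate podcasts 1-10 for fit with your target audience
- Growth tracking: Monitor listener count trends over time
- Competitor appearance tracking: See which podcasts have hosted your competitors
- Seasonal patterns: Identify which topics are trending in your industry
6. Content Repurposing Automation
- Multi-channel output: Generate content tailored for:
  - LinkedIn (carousel slides, articles, native videos)
  - Twitter/X (threads, quote tweets, engagement hooks)
  - TikTok/Reels/Shorts (script outlines with timing)
  - Email newsletters (curated insights, call-to-action variants)
  - Blog posts (long-form articles with SEO optimization)
  - Slack updates (notifications for team members)
- Brand voice consistency: Maintain tone and style across all outputs
- SEO optimization: Auto-generate meta descriptions, H1/H2 tags, keyword suggestions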
Configuration
Required Environment Variables
OPENAI_API_KEY=sk-...                      # GPT-4 access for semantic analysis
GOOGLE_SEARCH_API_KEY=AIza...              # Podcast discovery and research
SLACK_WEBHOOK_URL=https://hooks.slack...   # Notifications and team updates
WHISPER_API_KEY=sk-...                     # Audio transcription (optional, for MP3)
WORDPRESS_API_KEY=...                      # WordPress integration (optional)
WORDPRESS_SITE_URL=https://yoursite.com    # Your WordPress domain
Optional Configuration
# config.yml
extraction:
  min_quote_length: 15        # Minimum soundbite length
  max_quote_length: 280       # Maximum length for social media posts
  sentiment_threshold: 0.7    # 0-1 score for emotional moments
podcast_discovery:
  listener_min: 10000         # Minimum monthly listeners
  listener_max: 1000000       # Maximum monthly listeners
  language: en
output_formats:
  - linkedin_carousel
  - twitter_thread
  - blog_post
  - email_newsletter
  - slack_message
repurposing:
  blog_word_count: 1500
  email_subject_variants: 3
  twitter_thread_length: 5
  tiktok_duration: 60
Setup Instructions
Step 1: Connect Your Data Sources
# Link your podcast RSS feed
openclaw podcast-mining add-feed https://feeds.example.com/podcast.xml
# Or upload transcripts directly
openclaw podcast-mining upload ./transcripts/
Step 2: Configure Slack Notifications
# Get your Slack webhook URL from your Slack app settings
# Paste it into the SLACK_WEBHOOK_URL environment variable
# Test the connection:
openclaw podcast-mining test-slack
Step 3: Set Up WordPress Integration (Optional)
# Generate a WordPress API token under Settings > REST API
openclaw podcast-mining configure-wordpress \
  --site-url https://yoursite.com \
  --api-key YOUR_KEY
Step 4: Customize Extraction Preferences
# Edit extraction rules for your niche
openclaw podcast-mining configure \
  --min-quote-length 20 \
  --sentiment-threshold 0.8 \
  --output-formats linkedin_carousel,blog_post
Example Outputs
Output 1: Extracted Soundbites (JSON)
{
  "episode": {
    "title": "From Zero to $10M ARR: Scaling Your SaaS",
    "host": "Sarah Chen",
    "podcast":