Home/多媒体与音视频/ai-podcast-creation
A

ai-podcast-creation

by @inferen-shv
4.7(130)

Automatically generates podcast content using AI technology, including scriptwriting, voice synthesis, and post-production, significantly simplifying the podcast creation process and lowering production barriers.

AI PodcastGenerative AudioContent CreationAudio ProductionAI VoiceoversGitHub
Installation
npx skills add inferen-sh/skills --skill ai-podcast-creation
compare_arrows

Before / After Comparison

1
Before

Traditional podcast production involves multiple steps such as scriptwriting, recording, editing, and post-production, which are time-consuming and labor-intensive, requiring professional equipment and skills, leading to high costs.

After

Utilizing AI tools for podcast creation can automate script generation, synthesize natural speech, add background music and sound effects, and even perform preliminary editing. This significantly simplifies the production process, lowers technical barriers and costs, enabling content creators to update podcasts more frequently.

description SKILL.md

ai-podcast-creation

AI Podcast Creation

Create AI-powered podcasts and audio content via inference.sh CLI.

Quick Start

Requires inference.sh CLI (infsh). Install instructions

infsh login

# Generate podcast segment
infsh app run infsh/kokoro-tts --input '{
  "prompt": "Welcome to the AI Frontiers podcast. Today we explore the latest developments in generative AI.",
  "voice": "am_michael"
}'

Available Voices

Kokoro TTS

Voice ID Description Best For

af_sarah American female, warm Host, narrator

af_nicole American female, professional News, business

am_michael American male, authoritative Documentary, tech

am_adam American male, conversational Casual podcast

bf_emma British female, refined Audiobooks

bm_george British male, classic Formal content

DIA TTS (Conversational)

Voice ID Description Best For

dia-conversational Natural conversation Dialogue, interviews

Chatterbox

Voice ID Description Best For

chatterbox-default Expressive Casual, entertainment

Podcast Workflows

Simple Narration

# Single voice podcast segment
infsh app run infsh/kokoro-tts --input '{
  "prompt": "Your podcast script here. Make it conversational and engaging. Add natural pauses with punctuation.",
  "voice": "am_michael"
}'

Multi-Voice Conversation

# Host introduction
infsh app run infsh/kokoro-tts --input '{
  "prompt": "Welcome back to Tech Talk. Today I have a special guest to discuss AI developments.",
  "voice": "am_michael"
}' > host_intro.json

# Guest response
infsh app run infsh/kokoro-tts --input '{
  "prompt": "Thanks for having me. I am excited to share what we have been working on.",
  "voice": "af_sarah"
}' > guest_response.json

# Merge into conversation
infsh app run infsh/media-merger --input '{
  "audio_files": ["<host-url>", "<guest-url>"],
  "crossfade_ms": 500
}'

Full Episode Pipeline

# 1. Generate script with Claude
infsh app run openrouter/claude-sonnet-45 --input '{
  "prompt": "Write a 5-minute podcast script about the impact of AI on creative work. Format as a two-person dialogue between HOST and GUEST. Include natural conversation, questions, and insights."
}' > script.json

# 2. Generate intro music
infsh app run infsh/ai-music --input '{
  "prompt": "Podcast intro music, upbeat, modern, tech feel, 15 seconds"
}' > intro_music.json

# 3. Generate host segments
infsh app run infsh/kokoro-tts --input '{
  "prompt": "<host-lines>",
  "voice": "am_michael"
}' > host.json

# 4. Generate guest segments
infsh app run infsh/kokoro-tts --input '{
  "prompt": "<guest-lines>",
  "voice": "af_sarah"
}' > guest.json

# 5. Generate outro music
infsh app run infsh/ai-music --input '{
  "prompt": "Podcast outro music, matching intro style, fade out, 10 seconds"
}' > outro_music.json

# 6. Merge everything
infsh app run infsh/media-merger --input '{
  "audio_files": [
    "<intro-music>",
    "<host>",
    "<guest>",
    "<outro-music>"
  ],
  "crossfade_ms": 1000
}'

NotebookLM-Style Content

Generate podcast-style discussions from documents.

# 1. Extract key points
infsh app run openrouter/claude-sonnet-45 --input '{
  "prompt": "Read this document and create a podcast script where two hosts discuss the key points in an engaging, conversational way. Include questions, insights, and natural dialogue.\n\nDocument:\n<your-document-content>"
}' > discussion_script.json

# 2. Generate Host A
infsh app run infsh/kokoro-tts --input '{
  "prompt": "<host-a-lines>",
  "voice": "am_michael"
}' > host_a.json

# 3. Generate Host B
infsh app run infsh/kokoro-tts --input '{
  "prompt": "<host-b-lines>",
  "voice": "af_sarah"
}' > host_b.json

# 4. Interleave and merge
infsh app run infsh/media-merger --input '{
  "audio_files": ["<host-a-1>", "<host-b-1>", "<host-a-2>", "<host-b-2>"],
  "crossfade_ms": 300
}'

Audiobook Chapter

# Long-form narration
infsh app run infsh/kokoro-tts --input '{
  "prompt": "Chapter One. It was a dark and stormy night when the first AI achieved consciousness...",
  "voice": "bf_emma",
  "speed": 0.9
}'

Audio Enhancement

Add Background Music

# 1. Generate podcast audio
infsh app run infsh/kokoro-tts --input '{
  "prompt": "<podcast-script>",
  "voice": "am_michael"
}' > podcast.json

# 2. Generate ambient music
infsh app run infsh/ai-music --input '{
  "prompt": "Soft ambient background music for podcast, subtle, non-distracting, loopable"
}' > background.json

# 3. Mix with lower background volume
infsh app run infsh/media-merger --input '{
  "audio_files": ["<podcast-url>"],
  "background_audio": "<background-url>",
  "background_volume": 0.15
}'

Add Sound Effects

# Transition sounds between segments
infsh app run infsh/ai-music --input '{
  "prompt": "Short podcast transition sound, whoosh, 2 seconds"
}' > transition.json

Script Writing Tips

Prompt for Claude

infsh app run openrouter/claude-sonnet-45 --input '{
  "prompt": "Write a podcast script with these requirements:
  - Topic: [YOUR TOPIC]
  - Duration: 5 minutes (about 750 words)
  - Format: Two hosts (HOST_A and HOST_B)
  - Tone: Conversational, informative, engaging
  - Include: Hook intro, 3 main points, call to action
  - Mark speaker changes clearly

  Make it sound natural, not scripted. Add verbal fillers like \"you know\" and \"I mean\" occasionally."
}'

Podcast Templates

Interview Format

HOST: Introduction and welcome
GUEST: Thank you, happy to be here
HOST: First question about background
GUEST: Response with story
HOST: Follow-up question
GUEST: Deeper insight
... continue pattern ...
HOST: Closing question
GUEST: Final thoughts
HOST: Thank you and outro

Solo Episode

Introduction with hook
Topic overview
Point 1 with examples
Point 2 with examples
Point 3 with examples
Summary and takeaways
Call to action
Outro

News Roundup

Intro music
Welcome and date
Story 1: headline + details
Story 2: headline + details
Story 3: headline + details
Analysis/opinion segment
Outro

Best Practices

  • Natural punctuation - Use commas and periods for pacing

  • Short sentences - Easier to speak and listen

  • Varied voices - Different speakers prevent monotony

  • Background music - Subtle, at 10-15% volume

  • Crossfades - Smooth transitions between segments

  • Edit scripts - Remove filler before generating

Related Skills

# Text-to-speech models
npx skills add inference-sh/skills@text-to-speech

# AI music generation
npx skills add inference-sh/skills@ai-music-generation

# LLM for scripts
npx skills add inference-sh/skills@llm-models

# Content pipelines
npx skills add inference-sh/skills@ai-content-pipeline

# Full platform skill
npx skills add inference-sh/skills@infsh-cli

Browse all apps: infsh app list --category audio Weekly Installs4.4KRepositoryinferen-sh/skillsGitHub Stars159First Seen6 days agoSecurity AuditsGen Agent Trust HubPassSocketPassSnykPassInstalled onclaude-code3.5Kgemini-cli3.2Kcodex3.1Kamp3.1Kkimi-cli3.1Kgithub-copilot3.1K

forumUser Reviews (0)

Write a Review

Effect
Usability
Docs
Compatibility

No reviews yet

Statistics

Installs6.5K
Rating4.7 / 5.0
Version
Updated2026年3月17日
Comparisons1

User Rating

4.7(130)
5
0%
4
0%
3
0%
2
0%
1
0%

Rate this Skill

0.0

Compatible Platforms

🔧Claude Code
🔧OpenClaw
🔧OpenCode
🔧Codex
🔧Gemini CLI
🔧GitHub Copilot
🔧Amp
🔧Kimi CLI

Timeline

Created2026年3月17日
Last Updated2026年3月17日