elevenlabs-tts
Provides text-to-speech (TTS) with 22+ high-quality voices through ElevenLabs, generating natural, fluent audio content.
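npx skills add inferen-sh/skills --skill elevenlabs-tts
Before / After comparison
Before: Conventional TTS output sounded stiff and unnatural, carried little emotional expression, and was unpleasant to listen to.
After: Advanced TTS technology delivers natural, smooth speech with rich emotional expression and a better listening experience.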
ElevenLabs Text-to-Speech
Premium text-to-speech with 22+ voices via inference.sh CLI.
Quick Start
Requires the inference.sh CLI (infsh). See the install instructions.
infsh login
# Generate speech with ElevenLabs
infsh app run elevenlabs/tts --input '{"text": "Hello, welcome to our product demo.", "voice": "aria"}'
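The single-quoted JSON in the command above breaks as soon as the text contains an apostrophe. One way around that is to build the argument list in a script and let a JSON library do the escaping. The following is a minimal sketch, not part of the skill itself: `build_tts_command` is a hypothetical helper, and it assumes only the `text` and `voice` fields plus whatever optional fields this document shows.

```python
import json
import shlex


def build_tts_command(text, voice, **options):
    """Return the argv list for `infsh app run elevenlabs/tts`.

    Hypothetical helper: only the input fields shown in this document
    are assumed to exist; extra keyword arguments are copied into the
    JSON payload unchanged.
    """
    payload = {"text": text, "voice": voice, **options}
    # json.dumps escapes quotes and newlines; ensure_ascii=False keeps
    # non-English text readable in the resulting command.
    encoded = json.dumps(payload, ensure_ascii=False)
    return ["infsh", "app", "run", "elevenlabs/tts", "--input", encoded]


if __name__ == "__main__":
    argv = build_tts_command("It's a demo.", "aria")
    # shlex.join renders a shell line with safe quoting for copy and paste.
    print(shlex.join(argv))
```

Passing the list to `subprocess.run(argv)` avoids shell quoting entirely, provided infsh is installed and you are logged in.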
Available Models
| Model | ID | Best For | Latency |
| --- | --- | --- | --- |
| Multilingual v2 | eleven_multilingual_v2 | Highest quality, 32 languages | ~250ms |
| Turbo v2.5 | eleven_turbo_v2_5 | Balance of speed & quality | ~150ms |
| Flash v2.5 | eleven_flash_v2_5 | Ultra-low latency | ~75ms |
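The model list above trades quality against latency. A small sketch of how a script might choose a model from a latency budget follows. The latencies are the approximate figures listed above, the quality ordering is assumed to follow the order of that list, and `pick_model` is a hypothetical helper rather than anything the CLI provides.

```python
# Approximate latencies in milliseconds, copied from the model list above.
# Assumption: entries are ordered from highest to lowest quality.
MODELS = [
    ("eleven_multilingual_v2", 250),
    ("eleven_turbo_v2_5", 150),
    ("eleven_flash_v2_5", 75),
]


def pick_model(latency_budget_ms):
    """Return the highest-quality model whose listed latency fits the budget.

    Returns None when even the fastest listed model is too slow.
    """
    for model_id, latency_ms in MODELS:
        if latency_ms <= latency_budget_ms:
            return model_id
    return None


if __name__ == "__main__":
    for budget in (300, 200, 100, 50):
        print(budget, pick_model(budget))
```

The listed latencies are rough figures, so treat the result as a starting point and measure in your own environment.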
Voice Library
Female Voices
| Voice | Style |
| --- | --- |
| aria | American, conversational |
| alice | British, confident |
| bella | American, warm |
| jessica | American, expressive |
| laura | American, professional |
| lily | British, soft |
| sarah | American, friendly |
Male Voices
| Voice | Style |
| --- | --- |
| george | British, authoritative |
| adam | American, deep |
| bill | American, mature |
| brian | American, conversational |
| callum | Transatlantic, intense |
| charlie | Australian, natural |
| chris | American, casual |
| daniel | British, commanding |
| eric | American, friendly |
| harry | American, young |
| liam | American, articulate |
| matilda | American, warm |
| river | American, confident |
| roger | American, authoritative |
| will | American, bright |
Examples
Basic Speech
infsh app run elevenlabs/tts --input '{"text": "Welcome to our quarterly earnings presentation.", "voice": "george"}'
Choose a Model
# Highest quality
infsh app run elevenlabs/tts --input '{
  "text": "This is our premium multilingual model with the best quality.",
  "voice": "aria",
  "model": "eleven_multilingual_v2"
}'
# Ultra-fast for real-time applications
infsh app run elevenlabs/tts --input '{
  "text": "Flash model for low-latency applications.",
  "voice": "brian",
  "model": "eleven_flash_v2_5"
}'
Voice Tuning
infsh app run elevenlabs/tts --input '{
  "text": "Fine-tune the voice characteristics for your use case.",
  "voice": "bella",
  "stability": 0.3,
  "similarity_boost": 0.9,
  "style": 0.4
}'
| Parameter | Range | Effect |
| --- | --- | --- |
| stability | 0-1 | Higher = more consistent, lower = more expressive |
| similarity_boost | 0-1 | Higher = closer to original voice character |
| style | 0-1 | Higher = more style exaggeration |
| use_speaker_boost | true/false | Enhances speaker clarity |
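Because three of the tuning parameters share the 0-1 range listed above, a script can catch out-of-range values before spending a request. The sketch below checks only the ranges given in this document; `validate_voice_settings` is a hypothetical helper, and how the service itself handles invalid values is not covered here.

```python
# Parameters that the list above documents with a 0-1 range.
RANGED_PARAMETERS = ("stability", "similarity_boost", "style")


def validate_voice_settings(settings):
    """Return a list of human-readable problems; empty means the settings pass."""
    problems = []
    for name in RANGED_PARAMETERS:
        if name not in settings:
            continue
        value = settings[name]
        # bool is a subclass of int in Python, so reject it explicitly.
        is_number = isinstance(value, (int, float)) and not isinstance(value, bool)
        if not is_number or not 0 <= value <= 1:
            problems.append(f"{name} must be a number between 0 and 1, got {value!r}")
    boost = settings.get("use_speaker_boost")
    if boost is not None and not isinstance(boost, bool):
        problems.append(f"use_speaker_boost must be true or false, got {boost!r}")
    return problems


if __name__ == "__main__":
    print(validate_voice_settings({"stability": 0.3, "similarity_boost": 0.9, "style": 0.4}))
    print(validate_voice_settings({"stability": 1.5, "use_speaker_boost": "yes"}))
```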
Output Formats
# High-quality MP3
infsh app run elevenlabs/tts --input '{
  "text": "High quality audio output.",
  "voice": "daniel",
  "output_format": "mp3_44100_192"
}'
| Format | Description |
| --- | --- |
| mp3_44100_128 | MP3 at 44.1kHz, 128kbps (default) |
| mp3_44100_192 | MP3 at 44.1kHz, 192kbps |
| pcm_16000 | Raw PCM at 16kHz |
| pcm_22050 | Raw PCM at 22.05kHz |
| pcm_24000 | Raw PCM at 24kHz |
| pcm_44100 | Raw PCM at 44.1kHz |
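Raw PCM carries no header, so most players cannot open it until it is wrapped in a container such as WAV. The sketch below reads the sample rate out of the format name and wraps the samples using Python's standard `wave` module. It assumes the PCM data is 16-bit signed little-endian mono, which this document does not state, so verify that against the actual output. Both function names are hypothetical.

```python
import io
import wave


def parse_output_format(output_format):
    """Split a name such as 'mp3_44100_192' into (codec, sample_rate, bitrate_kbps).

    bitrate_kbps is None for formats without a bitrate part, e.g. 'pcm_16000'.
    """
    parts = output_format.split("_")
    codec = parts[0]
    sample_rate = int(parts[1])
    bitrate_kbps = int(parts[2]) if len(parts) > 2 else None
    return codec, sample_rate, bitrate_kbps


def pcm_to_wav_bytes(pcm, sample_rate, channels=1, sample_width=2):
    """Wrap raw PCM samples in a WAV container and return the file bytes.

    Assumption: 16-bit (2-byte) mono samples unless told otherwise.
    """
    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as writer:
        writer.setnchannels(channels)
        writer.setsampwidth(sample_width)
        writer.setframerate(sample_rate)
        writer.writeframes(pcm)
    return buffer.getvalue()


if __name__ == "__main__":
    _, rate, _ = parse_output_format("pcm_16000")
    one_second_of_silence = b"\x00\x00" * rate
    wav_bytes = pcm_to_wav_bytes(one_second_of_silence, rate)
    print(len(wav_bytes), "bytes of WAV data")
```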
Multilingual
ElevenLabs supports 32 languages including English, Spanish, French, German, Italian, Portuguese, Chinese, Japanese, Korean, Arabic, Hindi, Russian, and more.
# Spanish
infsh app run elevenlabs/tts --input '{
  "text": "Hola, bienvenidos a nuestra presentación.",
  "voice": "aria",
  "model": "eleven_multilingual_v2"
}'
# French
infsh app run elevenlabs/tts --input '{
  "text": "Bonjour, bienvenue à notre démonstration.",
  "voice": "alice",
  "model": "eleven_multilingual_v2"
}'
Voice + Video Workflow
# 1. Generate voiceover
infsh app run elevenlabs/tts --input '{
  "text": "Introducing the future of AI-powered content creation.",
  "voice": "george"
}' > voiceover.json
# 2. Create talking head video
infsh app run bytedance/omnihuman-1-5 --input '{
  "image_url": "https://portrait.jpg",
  "audio_url": "<audio-url-from-step-1>"
}'
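Step 2 needs the audio URL from step 1, but this document does not show the shape of the TTS output. The sketch below therefore avoids guessing a field name. It assumes voiceover.json contains only JSON and that the first http(s) string in it is the audio URL; both assumptions should be checked against a real run. The function names are hypothetical.

```python
import json


def find_first_url(node):
    """Depth-first search of parsed JSON for the first http(s) string."""
    if isinstance(node, str):
        return node if node.startswith(("http://", "https://")) else None
    if isinstance(node, dict):
        children = node.values()
    elif isinstance(node, list):
        children = node
    else:
        return None
    for child in children:
        found = find_first_url(child)
        if found:
            return found
    return None


def build_video_input(voiceover, image_url):
    """Build the step 2 input from the parsed step 1 output."""
    audio_url = find_first_url(voiceover)
    if audio_url is None:
        raise ValueError("no http(s) URL found in the TTS output")
    return {"image_url": image_url, "audio_url": audio_url}


if __name__ == "__main__":
    with open("voiceover.json", encoding="utf-8") as handle:
        voiceover = json.load(handle)
    # Print the JSON to pass to: infsh app run bytedance/omnihuman-1-5 --input
    print(json.dumps(build_video_input(voiceover, "https://portrait.jpg")))
```

If the output turns out to contain several URLs, replace the search with a lookup of the specific field once you have inspected a real response.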
Use Cases
- Voiceovers: Product demos, explainer videos, commercials
- Audiobooks: Long-form narration with consistent voices
- Podcasts: AI hosts with natural delivery
- E-learning: Course narration in multiple languages
- Accessibility: High-quality screen reader content
- IVR: Professional phone system messages
- Video Narration: Documentary and social media content
Related Skills
# ElevenLabs multi-speaker dialogue
npx skills add inference-sh/skills@elevenlabs-dialogue
# ElevenLabs voice changer
npx skills add inference-sh/skills@elevenlabs-voice-changer
# ElevenLabs sound effects
npx skills add inference-sh/skills@elevenlabs-sound-effects
# All TTS models (Kokoro, DIA, Chatterbox, and more)
npx skills add inference-sh/skills@text-to-speech
# Full platform skill (all 150+ apps)
npx skills add inference-sh/skills@infsh-cli
Browse all audio apps: infsh app list --category audio