elevenlabs-tts

by @inferen-shv
4.7 (2,000 ratings)

Text-to-speech through ElevenLabs with 22+ premium voices, producing natural, fluent speech.

Tags: text-to-speech, voice-synthesis, ai-narration, elevenlabs-tts

Installation
npx skills add inferen-sh/skills --skill elevenlabs-tts
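Before / After Comparison

Before

Traditional TTS sounds stiff and unnatural, lacks emotional expression, and is unpleasant to listen to.

After

Advanced TTS produces natural, fluent speech with rich emotional expression and a better listening experience.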

SKILL.md

elevenlabs-tts

ElevenLabs Text-to-Speech

Premium text-to-speech with 22+ voices via the inference.sh CLI.

Quick Start

Requires the inference.sh CLI (infsh); see its install instructions.

infsh login

# Generate speech with ElevenLabs
infsh app run elevenlabs/tts --input '{"text": "Hello, welcome to our product demo.", "voice": "aria"}'

Available Models

| Model | ID | Best For | Latency |
|---|---|---|---|
| Multilingual v2 | eleven_multilingual_v2 | Highest quality, 32 languages | ~250ms |
| Turbo v2.5 | eleven_turbo_v2_5 | Balance of speed & quality | ~150ms |
| Flash v2.5 | eleven_flash_v2_5 | Ultra-low latency | ~75ms |
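The trade-off in the table above can be turned into a small selection rule. The following is a minimal sketch, assuming the listed latencies are rough guides rather than guarantees; `pick_model` is a hypothetical helper and not part of infsh.

```shell
# Sketch: choose a model ID from a latency budget given in milliseconds.
# Thresholds mirror the approximate latencies listed in the table.
# pick_model is a hypothetical helper, not an infsh command.
pick_model() {
  budget_ms=$1
  if [ "$budget_ms" -ge 250 ]; then
    echo "eleven_multilingual_v2"   # highest quality, ~250ms
  elif [ "$budget_ms" -ge 150 ]; then
    echo "eleven_turbo_v2_5"        # balance of speed and quality, ~150ms
  else
    echo "eleven_flash_v2_5"        # fastest option, also the fallback
  fi
}

# Example: a 100ms budget only leaves room for the Flash model.
pick_model 100
```

The result can then be placed in the "model" field of the --input JSON.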

Voice Library

Female Voices

| Voice | Style |
|---|---|
| aria | American, conversational |
| alice | British, confident |
| bella | American, warm |
| jessica | American, expressive |
| laura | American, professional |
| lily | British, soft |
| sarah | American, friendly |

Male Voices

| Voice | Style |
|---|---|
| george | British, authoritative |
| adam | American, deep |
| bill | American, mature |
| brian | American, conversational |
| callum | Transatlantic, intense |
| charlie | Australian, natural |
| chris | American, casual |
| daniel | British, commanding |
| eric | American, friendly |
| harry | American, young |
| liam | American, articulate |
| matilda | American, warm |
| river | American, confident |
| roger | American, authoritative |
| will | American, bright |

Examples

Basic Speech

infsh app run elevenlabs/tts --input '{"text": "Welcome to our quarterly earnings presentation.", "voice": "george"}'
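Inline single-quoted JSON works for simple sentences, but text containing double quotes or backslashes needs escaping before it goes into the payload. The following is a minimal sketch of one way to build the payload; `json_escape` and `tts_payload` are hypothetical helpers, and the escaping covers only backslashes and double quotes (not newlines or other control characters).

```shell
# Hypothetical helpers, not part of infsh.
# json_escape: escape backslashes first, then double quotes.
json_escape() {
  printf '%s' "$1" | sed 's/\\/\\\\/g; s/"/\\"/g'
}

# tts_payload TEXT VOICE: print a JSON object with "text" and "voice" fields.
tts_payload() {
  printf '{"text": "%s", "voice": "%s"}\n' "$(json_escape "$1")" "$(json_escape "$2")"
}

payload=$(tts_payload 'She said "hello" to the team.' george)
echo "$payload"

# The payload could then be passed on, for example:
# infsh app run elevenlabs/tts --input "$payload"
```

Building the string in a variable also avoids the problem of apostrophes ending a single-quoted shell argument early.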

Choose a Model

# Highest quality
infsh app run elevenlabs/tts --input '{
  "text": "This is our premium multilingual model with the best quality.",
  "voice": "aria",
  "model": "eleven_multilingual_v2"
}'

# Ultra-fast for real-time applications
infsh app run elevenlabs/tts --input '{
  "text": "Flash model for low-latency applications.",
  "voice": "brian",
  "model": "eleven_flash_v2_5"
}'

Voice Tuning

infsh app run elevenlabs/tts --input '{
  "text": "Fine-tune the voice characteristics for your use case.",
  "voice": "bella",
  "stability": 0.3,
  "similarity_boost": 0.9,
  "style": 0.4
}'

| Parameter | Range | Effect |
|---|---|---|
| stability | 0-1 | Higher = more consistent, lower = more expressive |
| similarity_boost | 0-1 | Higher = closer to original voice character |
| style | 0-1 | Higher = more style exaggeration |
| use_speaker_boost | true/false | Enhances speaker clarity |
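Since stability, similarity_boost, and style all share the 0-1 range, it may help to check values before sending a request. This is a minimal sketch, assuming plain decimal input; `in_unit_range` is a hypothetical helper, and how the app itself treats out-of-range values is not specified here.

```shell
# in_unit_range VALUE: succeeds when VALUE is a plain decimal in [0, 1].
# Hypothetical helper, not part of infsh.
in_unit_range() {
  awk -v v="$1" 'BEGIN {
    # Reject anything that is not an unsigned decimal number.
    if (v !~ /^[0-9]*\.?[0-9]+$/) exit 1
    # Exit status 0 when within bounds, 1 otherwise.
    exit !(v + 0 >= 0 && v + 0 <= 1)
  }'
}

stability=0.3
similarity_boost=0.9
style=0.4

for value in "$stability" "$similarity_boost" "$style"; do
  in_unit_range "$value" || { echo "out of range: $value" >&2; exit 1; }
done
echo "tuning values OK"
```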

Output Formats

# High-quality MP3
infsh app run elevenlabs/tts --input '{
  "text": "High quality audio output.",
  "voice": "daniel",
  "output_format": "mp3_44100_192"
}'

| Format | Description |
|---|---|
| mp3_44100_128 | MP3 at 44.1kHz, 128kbps (default) |
| mp3_44100_192 | MP3 at 44.1kHz, 192kbps |
| pcm_16000 | Raw PCM at 16kHz |
| pcm_22050 | Raw PCM at 22.05kHz |
| pcm_24000 | Raw PCM at 24kHz |
| pcm_44100 | Raw PCM at 44.1kHz |
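The format names listed above follow a codec_samplerate[_bitrate] pattern, which a script can split apart. The sketch below assumes that pattern holds for every format; `describe_format` is a hypothetical helper, not part of infsh.

```shell
# describe_format NAME: split a format name such as mp3_44100_192 into
# codec, sample rate (Hz), and bitrate (kbps, present for MP3 names only).
# Hypothetical helper, not part of infsh.
describe_format() {
  codec=${1%%_*}      # text before the first underscore
  rest=${1#*_}        # text after the first underscore
  rate=${rest%%_*}    # sample rate in Hz
  case $rest in
    *_*) bitrate=${rest#*_} ;;   # a second underscore means a bitrate follows
    *)   bitrate= ;;
  esac
  if [ -n "$bitrate" ]; then
    echo "$codec ${rate}Hz ${bitrate}kbps"
  else
    echo "$codec ${rate}Hz"
  fi
}

describe_format mp3_44100_192   # prints: mp3 44100Hz 192kbps
describe_format pcm_24000       # prints: pcm 24000Hz
```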

Multilingual

ElevenLabs supports 32 languages including English, Spanish, French, German, Italian, Portuguese, Chinese, Japanese, Korean, Arabic, Hindi, Russian, and more.

# Spanish
infsh app run elevenlabs/tts --input '{
  "text": "Hola, bienvenidos a nuestra presentación.",
  "voice": "aria",
  "model": "eleven_multilingual_v2"
}'

# French
infsh app run elevenlabs/tts --input '{
  "text": "Bonjour, bienvenue à notre démonstration.",
  "voice": "alice",
  "model": "eleven_multilingual_v2"
}'

Voice + Video Workflow

# 1. Generate voiceover
infsh app run elevenlabs/tts --input '{
  "text": "Introducing the future of AI-powered content creation.",
  "voice": "george"
}' > voiceover.json

# 2. Create talking head video
infsh app run bytedance/omnihuman-1-5 --input '{
  "image_url": "https://portrait.jpg",
  "audio_url": "<audio-url-from-step-1>"
}'
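Step 2 needs the audio URL from step 1, but the structure of voiceover.json is not shown here. The following is a minimal sketch that assumes the file contains the audio link as an http(s) URL somewhere in the JSON and simply takes the first one found; `first_url` is a hypothetical helper and the sample file below is invented for illustration.

```shell
# first_url FILE: print the first http(s) URL found in FILE.
# Assumption: the step-1 output includes the audio link as a URL string;
# the real field name is not documented in this skill.
first_url() {
  grep -oE 'https?://[^"[:space:]]+' "$1" | head -n 1
}

# Hypothetical sample standing in for the real voiceover.json.
sample=$(mktemp)
printf '%s\n' '{"audio": "https://example.com/voiceover.mp3"}' > "$sample"

audio_url=$(first_url "$sample")
echo "$audio_url"

rm -f "$sample"
```

If the real output contains several URLs, a JSON-aware tool that selects the exact field would be the safer choice.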

Use Cases

  • Voiceovers: Product demos, explainer videos, commercials

  • Audiobooks: Long-form narration with consistent voices

  • Podcasts: AI hosts with natural delivery

  • E-learning: Course narration in multiple languages

  • Accessibility: High-quality screen reader content

  • IVR: Professional phone system messages

  • Video Narration: Documentary and social media content

Related Skills

# ElevenLabs multi-speaker dialogue
npx skills add inference-sh/skills@elevenlabs-dialogue

# ElevenLabs voice changer
npx skills add inference-sh/skills@elevenlabs-voice-changer

# ElevenLabs sound effects
npx skills add inference-sh/skills@elevenlabs-sound-effects

# All TTS models (Kokoro, DIA, Chatterbox, and more)
npx skills add inference-sh/skills@text-to-speech

# Full platform skill (all 150+ apps)
npx skills add inference-sh/skills@infsh-cli

Browse all audio apps: infsh app list --category audio

Statistics

Installs: 52.0K
Rating: 4.7 / 5.0
Last updated: May 9, 2026
Comparison cases: 1

Compatible Platforms

Claude Code

Timeline

Created: March 18, 2026
Last updated: May 9, 2026