---
id: sm3-elevenlabs-tts
name: "elevenlabs-tts"
url: https://skills.yangsir.net/skill/sm3-elevenlabs-tts
author: inferen-sh
domain: multimedia
tags: ["text-to-speech", "voice-synthesis", "ai-narration", "elevenlabs-tts"]
install_count: 52000
rating: 4.70 (2000 reviews)
github: https://github.com/inferen-sh/skills
---

# elevenlabs-tts

> 通过ElevenLabs提供22种以上优质语音的文本转语音服务，生成自然流畅的语音内容。

**Stats**: 52,000 installs · 4.7/5 (2000 reviews)

## Before / After 对比

### ElevenLabs文本转语音

## Readme

# elevenlabs-tts

# ElevenLabs Text-to-Speech

Premium text-to-speech with 22+ voices via [inference.sh](https://inference.sh) CLI.

## Quick Start

Requires inference.sh CLI (`infsh`). [Install instructions](https://raw.githubusercontent.com/inference-sh/skills/refs/heads/main/cli-install.md)

```
infsh login

# Generate speech with ElevenLabs
infsh app run elevenlabs/tts --input '{"text": "Hello, welcome to our product demo.", "voice": "aria"}'

```

## Available Models

Model
ID
Best For
Latency

Multilingual v2
`eleven_multilingual_v2`
Highest quality, 32 languages
~250ms

Turbo v2.5
`eleven_turbo_v2_5`
Balance of speed & quality
~150ms

Flash v2.5
`eleven_flash_v2_5`
Ultra-low latency
~75ms

## Voice Library

### Female Voices

Voice
Style

`aria`
American, conversational

`alice`
British, confident

`bella`
American, warm

`jessica`
American, expressive

`laura`
American, professional

`lily`
British, soft

`sarah`
American, friendly

### Male Voices

Voice
Style

`george`
British, authoritative

`adam`
American, deep

`bill`
American, mature

`brian`
American, conversational

`callum`
Transatlantic, intense

`charlie`
Australian, natural

`chris`
American, casual

`daniel`
British, commanding

`eric`
American, friendly

`harry`
American, young

`liam`
American, articulate

`matilda`
American, warm

`river`
American, confident

`roger`
American, authoritative

`will`
American, bright

## Examples

### Basic Speech

```
infsh app run elevenlabs/tts --input '{"text": "Welcome to our quarterly earnings presentation.", "voice": "george"}'

```

### Choose a Model

```
# Highest quality
infsh app run elevenlabs/tts --input '{
  "text": "This is our premium multilingual model with the best quality.",
  "voice": "aria",
  "model": "eleven_multilingual_v2"
}'

# Ultra-fast for real-time applications
infsh app run elevenlabs/tts --input '{
  "text": "Flash model for low-latency applications.",
  "voice": "brian",
  "model": "eleven_flash_v2_5"
}'

```

### Voice Tuning

```
infsh app run elevenlabs/tts --input '{
  "text": "Fine-tune the voice characteristics for your use case.",
  "voice": "bella",
  "stability": 0.3,
  "similarity_boost": 0.9,
  "style": 0.4
}'

```

Parameter
Range
Effect

`stability`
0-1
Higher = more consistent, lower = more expressive

`similarity_boost`
0-1
Higher = closer to original voice character

`style`
0-1
Higher = more style exaggeration

`use_speaker_boost`
true/false
Enhances speaker clarity

### Output Formats

```
# High-quality MP3
infsh app run elevenlabs/tts --input '{
  "text": "High quality audio output.",
  "voice": "daniel",
  "output_format": "mp3_44100_192"
}'

```

Format
Description

`mp3_44100_128`
MP3 at 44.1kHz, 128kbps (default)

`mp3_44100_192`
MP3 at 44.1kHz, 192kbps

`pcm_16000`
Raw PCM at 16kHz

`pcm_22050`
Raw PCM at 22.05kHz

`pcm_24000`
Raw PCM at 24kHz

`pcm_44100`
Raw PCM at 44.1kHz

### Multilingual

ElevenLabs supports 32 languages including English, Spanish, French, German, Italian, Portuguese, Chinese, Japanese, Korean, Arabic, Hindi, Russian, and more.

```
# Spanish
infsh app run elevenlabs/tts --input '{
  "text": "Hola, bienvenidos a nuestra presentación.",
  "voice": "aria",
  "model": "eleven_multilingual_v2"
}'

# French
infsh app run elevenlabs/tts --input '{
  "text": "Bonjour, bienvenue à notre démonstration.",
  "voice": "alice",
  "model": "eleven_multilingual_v2"
}'

```

## Voice + Video Workflow

```
# 1. Generate voiceover
infsh app run elevenlabs/tts --input '{
  "text": "Introducing the future of AI-powered content creation.",
  "voice": "george"
}' > voiceover.json

# 2. Create talking head video
infsh app run bytedance/omnihuman-1-5 --input '{
  "image_url": "https://portrait.jpg",
  "audio_url": "<audio-url-from-step-1>"
}'

```

## Use Cases

- **Voiceovers**: Product demos, explainer videos, commercials

- **Audiobooks**: Long-form narration with consistent voices

- **Podcasts**: AI hosts with natural delivery

- **E-learning**: Course narration in multiple languages

- **Accessibility**: High-quality screen reader content

- **IVR**: Professional phone system messages

- **Video Narration**: Documentary and social media content

## Related Skills

```
# ElevenLabs multi-speaker dialogue
npx skills add inference-sh/skills@elevenlabs-dialogue

# ElevenLabs voice changer
npx skills add inference-sh/skills@elevenlabs-voice-changer

# ElevenLabs sound effects
npx skills add inference-sh/skills@elevenlabs-sound-effects

# All TTS models (Kokoro, DIA, Chatterbox, and more)
npx skills add inference-sh/skills@text-to-speech

# Full platform skill (all 150+ apps)
npx skills add inference-sh/skills@infsh-cli

```

Browse all audio apps: `infsh app list --category audio`
Weekly Installs11.6KRepository[inferen-sh/skills](https://github.com/inferen-sh/skills)GitHub Stars159First Seen1 day agoSecurity Audits[Gen Agent Trust HubPass](/inferen-sh/skills/elevenlabs-tts/security/agent-trust-hub)[SocketWarn](/inferen-sh/skills/elevenlabs-tts/security/socket)[SnykPass](/inferen-sh/skills/elevenlabs-tts/security/snyk)Installed onclaude-code9.4Kgithub-copilot8.1Kgemini-cli8.1Kcodex8.1Kkimi-cli8.1Kamp8.1K

---
*Source: https://skills.yangsir.net/skill/sm3-elevenlabs-tts*
*Markdown mirror: https://skills.yangsir.net/api/skill/sm3-elevenlabs-tts/markdown*