
alicloud-ai-audio-tts-voice-design

by @cinience
4.6(125)

Provides the Qwen TTS voice design service from Alibaba Cloud Model Studio, which creates controllable synthetic voices from natural language descriptions.

Tags: Alibaba Cloud, AI, TTS, Text-to-Speech, Voice Synthesis, Audio Design, GitHub
Installation
npx skills add cinience/alicloud-skills --skill alicloud-ai-audio-tts-voice-design

Before / After Comparison

Before

Traditional text-to-speech (TTS) services produce voices with little personality or emotional expression, and they are hard to fine-tune for specific needs. The output often sounds stiff, which limits where it can be used.

After

With the Alibaba Cloud Model Studio Qwen TTS voice design model, you describe the target voice in natural language and get a controllable synthetic voice. This allows personalized voice customization and emotional expression, and it widens the range of applications the voice can serve.

SKILL.md

alicloud-ai-audio-tts-voice-design

Category: provider

Model Studio Qwen TTS Voice Design

Use voice design models to create controllable synthetic voices from natural language descriptions.

Critical model names

Use one of these exact model strings:

  • qwen3-tts-vd-2026-01-26

  • qwen3-tts-vd-realtime-2026-01-15
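
Because the model strings must match exactly, a small guard can catch typos before any request is sent. This is a minimal sketch: `VOICE_DESIGN_MODELS` and `check_model` are hypothetical names introduced for illustration, not part of the skill or the DashScope SDK.

```python
# The two model strings listed above, copied verbatim.
VOICE_DESIGN_MODELS = (
    "qwen3-tts-vd-2026-01-26",
    "qwen3-tts-vd-realtime-2026-01-15",
)


def check_model(name: str) -> str:
    """Return the name unchanged if it is an exact match, otherwise raise."""
    if name not in VOICE_DESIGN_MODELS:
        raise ValueError(f"unknown voice design model: {name!r}")
    return name
```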

Prerequisites

  • Install SDK in a virtual environment:
python3 -m venv .venv
. .venv/bin/activate
python -m pip install dashscope

  • Set DASHSCOPE_API_KEY in your environment, or add dashscope_api_key to ~/.alibabacloud/credentials.
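
The key lookup described above could be sketched as follows. Two things here are assumptions rather than documented behavior: that the environment variable takes precedence over the file, and that `~/.alibabacloud/credentials` is an INI file with a `[default]` profile section. `resolve_api_key` is a hypothetical helper name.

```python
import configparser
import os
from pathlib import Path
from typing import Mapping, Optional


def resolve_api_key(
    env: Mapping[str, str] = os.environ,
    credentials_path: Optional[Path] = None,
    profile: str = "default",
) -> Optional[str]:
    """Look up the DashScope API key; return None if it is not configured."""
    # Assumption: the environment variable wins over the credentials file.
    key = env.get("DASHSCOPE_API_KEY")
    if key:
        return key

    path = credentials_path or Path.home() / ".alibabacloud" / "credentials"
    if not path.is_file():
        return None

    # Assumption: INI layout with the key stored under a profile section.
    parser = configparser.ConfigParser()
    parser.read(path)
    if parser.has_option(profile, "dashscope_api_key"):
        return parser.get(profile, "dashscope_api_key") or None
    return None
```

Passing `env` and `credentials_path` explicitly keeps the lookup easy to test without touching the real home directory.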

Normalized interface (tts.voice_design)

Request

  • voice_prompt (string, required): target voice description

  • text (string, required): text to synthesize

  • stream (bool, optional): return streaming PCM chunks instead of an audio_url

Response

  • audio_url (string) or streaming PCM chunks

  • voice_id (string)

  • request_id (string)
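
The normalized interface above can be modeled with two small dataclasses. This is a sketch under assumptions: the `False` default for `stream`, the treatment of `voice_id` and `request_id` as always present, and every class and function name here are illustrative and not taken from the skill's scripts.

```python
from dataclasses import dataclass
from typing import Any, Dict, Optional


@dataclass
class VoiceDesignRequest:
    voice_prompt: str      # required: target voice description
    text: str              # required: text to synthesize
    stream: bool = False   # optional; the False default is an assumption

    def to_payload(self) -> Dict[str, Any]:
        """Validate the required fields and return a plain dict."""
        for name in ("voice_prompt", "text"):
            if not getattr(self, name).strip():
                raise ValueError(f"{name} is required and must not be empty")
        return {
            "voice_prompt": self.voice_prompt,
            "text": self.text,
            "stream": self.stream,
        }


@dataclass
class VoiceDesignResponse:
    voice_id: str
    request_id: str
    audio_url: Optional[str] = None  # None when audio arrives as PCM chunks


def parse_response(payload: Dict[str, Any]) -> VoiceDesignResponse:
    """Check the response shape and convert it to a dataclass."""
    missing = [
        key for key in ("voice_id", "request_id")
        if not isinstance(payload.get(key), str)
    ]
    if missing:
        raise ValueError(f"response is missing string fields: {', '.join(missing)}")
    audio_url = payload.get("audio_url")
    if audio_url is not None and not isinstance(audio_url, str):
        raise ValueError("audio_url must be a string when present")
    return VoiceDesignResponse(payload["voice_id"], payload["request_id"], audio_url)
```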

Operational guidance

  • Write voice prompts with tone, pace, emotion, and timbre constraints.

  • Build a reusable voice prompt library for product consistency.

  • Validate generated voice in short utterances before long scripts.
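
One way to keep prompts consistent is to assemble them from the four constraints named above and store the results in a small library. The sketch below is illustrative only: `build_voice_prompt`, `VOICE_LIBRARY`, and the phrasing template are hypothetical choices, and the service does not require this wording.

```python
def build_voice_prompt(timbre: str, tone: str, pace: str, emotion: str) -> str:
    """Combine the four constraints into one comma-separated description."""
    fields = {"timbre": timbre, "tone": tone, "pace": pace, "emotion": emotion}
    empty = [name for name, value in fields.items() if not value.strip()]
    if empty:
        raise ValueError(f"missing voice prompt fields: {', '.join(empty)}")
    return (
        f"{timbre.strip()}, {tone.strip()}, "
        f"{pace.strip()} pace, {emotion.strip()} delivery"
    )


# A reusable library keyed by a product-level voice name (hypothetical keys).
VOICE_LIBRARY = {
    "host_warm": build_voice_prompt(
        timbre="A warm female host voice",
        tone="clear articulation",
        pace="medium",
        emotion="friendly",
    ),
}
```

Each library entry can then be tried on a short utterance before it is used for a long script.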

Local helper script

Prepare a normalized request JSON and validate the response schema:

.venv/bin/python skills/ai/audio/alicloud-ai-audio-tts-voice-design/scripts/prepare_voice_design_request.py \
  --voice-prompt "A warm female host voice, clear articulation, medium pace" \
  --text "This is a voice-design demo"

Output location

  • Default output: output/ai-audio-tts-voice-design/audio/

  • Override base dir with OUTPUT_DIR.
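
A sketch of how the output directory might be resolved. It assumes that `OUTPUT_DIR` replaces only the leading `output` component of the default path; the source does not spell out this behavior, and `resolve_audio_dir` is a hypothetical helper name.

```python
import os
from pathlib import Path
from typing import Mapping


def resolve_audio_dir(env: Mapping[str, str] = os.environ) -> Path:
    """Return the audio output directory, honoring OUTPUT_DIR if it is set."""
    # Assumption: OUTPUT_DIR swaps out the base "output" directory only.
    base = Path(env.get("OUTPUT_DIR") or "output")
    return base / "ai-audio-tts-voice-design" / "audio"
```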

Validation

mkdir -p output/alicloud-ai-audio-tts-voice-design
for f in skills/ai/audio/alicloud-ai-audio-tts-voice-design/scripts/*.py; do
  python3 -m py_compile "$f"
done
echo "py_compile_ok" > output/alicloud-ai-audio-tts-voice-design/validate.txt

Pass criteria: command exits 0 and output/alicloud-ai-audio-tts-voice-design/validate.txt is generated.
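
The same check can be expressed in Python with the standard `py_compile` module. This is a sketch, not the skill's own tooling, and `validate_scripts` is a hypothetical name. Unlike the shell loop, it raises on the first script that fails to compile, so the marker file is written only when every script passes.

```python
import py_compile
from pathlib import Path


def validate_scripts(scripts_dir: Path, out_dir: Path) -> Path:
    """Byte-compile every script, then write the marker file on success."""
    out_dir.mkdir(parents=True, exist_ok=True)
    for script in sorted(scripts_dir.glob("*.py")):
        # doraise=True turns a syntax error into py_compile.PyCompileError.
        py_compile.compile(str(script), doraise=True)
    marker = out_dir / "validate.txt"
    marker.write_text("py_compile_ok\n")
    return marker
```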

Output And Evidence

  • Save artifacts, command outputs, and API response summaries under output/alicloud-ai-audio-tts-voice-design/.

  • Include key parameters (region, resource ID, time range) in evidence files for reproducibility.
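
An evidence file could be as simple as one JSON record per run. The sketch below is one possible layout: the field names `recorded_at`, `params`, and `summary`, and the helper `write_evidence`, are assumptions made for illustration.

```python
import json
from datetime import datetime, timezone
from pathlib import Path
from typing import Any, Dict


def write_evidence(
    out_dir: Path, name: str, params: Dict[str, Any], summary: Dict[str, Any]
) -> Path:
    """Write the key parameters and a response summary to <name>.json."""
    out_dir.mkdir(parents=True, exist_ok=True)
    record = {
        "recorded_at": datetime.now(timezone.utc).isoformat(),
        "params": params,    # e.g. model, region, voice_prompt
        "summary": summary,  # e.g. request_id, voice_id
    }
    path = out_dir / f"{name}.json"
    path.write_text(json.dumps(record, indent=2, ensure_ascii=False), encoding="utf-8")
    return path
```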

Workflow

  • Confirm user intent, region, identifiers, and whether the operation is read-only or mutating.

  • Run one minimal read-only query first to verify connectivity and permissions.

  • Execute the target operation with explicit parameters and bounded scope.

  • Verify results and save output/evidence files.

References

  • references/sources.md

Weekly Installs: 226
Repository: cinience/alicloud-skills
GitHub Stars: 357
First Seen: Feb 26, 2026
Security Audits: Gen Agent Trust Hub (Pass), Socket (Pass), Snyk (Pass)
Installed on: gemini-cli (224), github-copilot (224), codex (224), kimi-cli (224), amp (224), cursor (224)

Statistics

Installs: 3.3K
Rating: 4.6 / 5.0
Version
Updated: March 17, 2026
Comparisons: 1

Compatible Platforms

🔧Claude Code
🔧OpenClaw
🔧OpenCode
🔧Codex
🔧Gemini CLI
🔧GitHub Copilot
🔧Amp
🔧Kimi CLI

Timeline

Created: March 17, 2026
Last Updated: March 17, 2026