alicloud-ai-audio-tts-voice-design
Provides Qwen TTS voice design service from Alibaba Cloud Model Studio, creating controllable synthetic voices through natural language descriptions.
npx skills add cinience/alicloud-skills --skill alicloud-ai-audio-tts-voice-design
Before / After Comparison
Traditional text-to-speech (TTS) services produce voices with little personality or emotional expression, and they are hard to fine-tune for specific needs. The output often sounds stiff, which limits where it can be used.
With the Alibaba Cloud Model Studio Qwen TTS voice design model, controllable synthetic voices are created from natural language descriptions, allowing personalized voices with emotional expression and more flexible use across applications.
SKILL.md
alicloud-ai-audio-tts-voice-design
Category: provider
Model Studio Qwen TTS Voice Design
Use voice design models to create controllable synthetic voices from natural language descriptions.
Critical model names
Use one of these exact model strings:
- qwen3-tts-vd-2026-01-26
- qwen3-tts-vd-realtime-2026-01-15
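Because the model strings must match exactly, a small guard can catch typos before any request is sent. The following is a minimal sketch; `VOICE_DESIGN_MODELS` and `check_model` are hypothetical names used for illustration and are not part of the DashScope SDK.

```python
# Model strings copied verbatim from the list above.
VOICE_DESIGN_MODELS = (
    "qwen3-tts-vd-2026-01-26",
    "qwen3-tts-vd-realtime-2026-01-15",
)


def check_model(name: str) -> str:
    """Return the name unchanged if it is an allowed model string, else raise."""
    if name not in VOICE_DESIGN_MODELS:
        allowed = ", ".join(VOICE_DESIGN_MODELS)
        raise ValueError(f"unknown model {name!r}; expected one of: {allowed}")
    return name
```

Calling `check_model` at the point where configuration is loaded keeps a misspelled model name from surfacing later as an opaque API error.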
Prerequisites
- Install SDK in a virtual environment:
python3 -m venv .venv
. .venv/bin/activate
python -m pip install dashscope
- Set DASHSCOPE_API_KEY in your environment, or add dashscope_api_key to ~/.alibabacloud/credentials.
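The two credential sources can be resolved in order, environment variable first. The sketch below assumes the credentials file uses an INI layout (for example a `[default]` section); the source does not specify the file format, so treat that as an assumption. `load_api_key` is a hypothetical helper, not an SDK function.

```python
import configparser
import os
from pathlib import Path

# Path named in the prerequisites above.
DEFAULT_CREDENTIALS = Path.home() / ".alibabacloud" / "credentials"


def load_api_key(credentials_path=None, env=None):
    """Return the DashScope API key, or None if neither source provides one."""
    env = os.environ if env is None else env
    key = env.get("DASHSCOPE_API_KEY")
    if key:
        return key  # the environment variable takes precedence

    path = Path(credentials_path) if credentials_path else DEFAULT_CREDENTIALS
    if not path.is_file():
        return None

    # interpolation=None keeps "%" characters in keys from being interpreted.
    parser = configparser.ConfigParser(interpolation=None)
    try:
        parser.read(path, encoding="utf-8")
    except configparser.Error:
        return None  # file is not in the assumed INI layout

    # Assumption: the key may live in any section, e.g. [default].
    for section in (parser.default_section, *parser.sections()):
        value = parser.get(section, "dashscope_api_key", fallback=None)
        if value:
            return value
    return None
```

Passing `env` and `credentials_path` explicitly makes the lookup easy to exercise in tests without touching the real home directory.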
Normalized interface (tts.voice_design)
Request
- voice_prompt (string, required): target voice description
- text (string, required)
- stream (bool, optional)
Response
- audio_url (string) or streaming PCM chunks
- voice_id (string)
- request_id (string)
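The normalized interface can be mirrored in code so that malformed requests and responses are rejected early. This is a sketch under stated assumptions: the class and function names are hypothetical, `stream` defaulting to `False` is assumed, and a non-streamed response is assumed to carry `audio_url` while a streamed one carries PCM chunks instead.

```python
from dataclasses import dataclass, asdict


@dataclass(frozen=True)
class VoiceDesignRequest:
    voice_prompt: str      # required: target voice description
    text: str              # required
    stream: bool = False   # optional; defaulting to False is an assumption

    def __post_init__(self):
        for name in ("voice_prompt", "text"):
            value = getattr(self, name)
            if not isinstance(value, str) or not value.strip():
                raise ValueError(f"{name} must be a non-empty string")
        if not isinstance(self.stream, bool):
            raise TypeError("stream must be a bool")


def response_problems(payload: dict, streamed: bool = False) -> list:
    """Return schema problems; an empty list means the payload looks valid."""
    problems = []
    required = ["voice_id", "request_id"]
    if not streamed:
        required.append("audio_url")  # streamed responses carry PCM chunks instead
    for name in required:
        if not isinstance(payload.get(name), str):
            problems.append(f"{name} missing or not a string")
    return problems
```

`asdict(VoiceDesignRequest(...))` yields a plain dictionary that can be serialized to the normalized request JSON.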
Operational guidance
- Write voice prompts with tone, pace, emotion, and timbre constraints.
- Build a reusable voice prompt library for product consistency.
- Validate generated voice in short utterances before long scripts.
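One way to keep prompts consistent is to store the four constraints as structured fields and render the prompt text from them. The sketch below is illustrative only: `VoiceSpec`, `VOICE_LIBRARY`, the `host_warm` entry, and the rendered wording are all hypothetical, and the model does not require this particular phrasing.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class VoiceSpec:
    timbre: str
    tone: str
    pace: str
    emotion: str

    def to_prompt(self) -> str:
        """Render the four constraints as one natural language description."""
        return (
            f"{self.timbre}, {self.tone} tone, "
            f"{self.pace} pace, {self.emotion} delivery"
        )


# A small reusable library keyed by product-level voice names (hypothetical).
VOICE_LIBRARY = {
    "host_warm": VoiceSpec(
        timbre="A warm female host voice",
        tone="friendly",
        pace="medium",
        emotion="calm",
    ),
}

# A short utterance for a quick check before synthesizing a long script.
SMOKE_TEXT = "This is a voice-design demo"
```

Looking up `VOICE_LIBRARY["host_warm"].to_prompt()` gives every caller the same prompt string, so a change to a voice is made in one place.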
Local helper script
Prepare a normalized request JSON and validate response schema:
.venv/bin/python skills/ai/audio/alicloud-ai-audio-tts-voice-design/scripts/prepare_voice_design_request.py \
--voice-prompt "A warm female host voice, clear articulation, medium pace" \
--text "This is a voice-design demo"
Output location
- Default output: output/ai-audio-tts-voice-design/audio/
- Override base dir with OUTPUT_DIR.
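The output path can be resolved as in the sketch below. It assumes that OUTPUT_DIR replaces only the leading `output` component and that the `ai-audio-tts-voice-design/audio` suffix stays fixed; the source does not spell this out, so check it against the helper scripts. `resolve_audio_dir` is a hypothetical name.

```python
import os
from pathlib import Path


def resolve_audio_dir(env=None) -> Path:
    """Return the audio output directory, honoring an OUTPUT_DIR override."""
    env = os.environ if env is None else env
    # Assumption: OUTPUT_DIR replaces the default "output" base directory.
    base = Path(env.get("OUTPUT_DIR") or "output")
    return base / "ai-audio-tts-voice-design" / "audio"
```

With no override this returns `output/ai-audio-tts-voice-design/audio`; callers still need to create the directory, for example with `mkdir(parents=True, exist_ok=True)`.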
Validation
mkdir -p output/alicloud-ai-audio-tts-voice-design
for f in skills/ai/audio/alicloud-ai-audio-tts-voice-design/scripts/*.py; do
python3 -m py_compile "$f"
done
echo "py_compile_ok" > output/alicloud-ai-audio-tts-voice-design/validate.txt
Pass criteria: command exits 0 and output/alicloud-ai-audio-tts-voice-design/validate.txt is generated.
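Where a POSIX shell is not available, the same check can be written with the standard library. This is a sketch of an equivalent, not part of the skill itself; `validate_scripts` is a hypothetical name, and the directories are passed in rather than hard-coded.

```python
import py_compile
from pathlib import Path


def validate_scripts(scripts_dir, out_dir) -> Path:
    """Byte-compile every *.py file in scripts_dir, then write validate.txt."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    for script in sorted(Path(scripts_dir).glob("*.py")):
        # doraise=True turns a syntax error into py_compile.PyCompileError.
        py_compile.compile(str(script), doraise=True)
    marker = out / "validate.txt"
    marker.write_text("py_compile_ok\n", encoding="utf-8")
    return marker
```

The marker file is written only after every script compiles, which matches the pass criteria: any compile failure raises before `validate.txt` exists.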
Output And Evidence
- Save artifacts, command outputs, and API response summaries under output/alicloud-ai-audio-tts-voice-design/.
- Include key parameters (region/resource id/time range) in evidence files for reproducibility.
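An evidence file can be as simple as a timestamped JSON record of the parameters used and the identifiers returned. The sketch below is one possible shape; `save_evidence`, the `evidence.json` file name, and the `recorded_at` field are hypothetical choices, not a format the skill prescribes.

```python
import json
from datetime import datetime, timezone
from pathlib import Path


def save_evidence(
    summary: dict,
    out_dir="output/alicloud-ai-audio-tts-voice-design",
    name="evidence.json",
) -> Path:
    """Write a JSON evidence record with a UTC timestamp and return its path."""
    target = Path(out_dir)
    target.mkdir(parents=True, exist_ok=True)
    # recorded_at is added first so caller-supplied keys are kept as given.
    record = {"recorded_at": datetime.now(timezone.utc).isoformat(), **summary}
    path = target / name
    path.write_text(
        json.dumps(record, indent=2, ensure_ascii=False), encoding="utf-8"
    )
    return path
```

A typical `summary` would hold the model string, the voice prompt, and the `voice_id` and `request_id` from the response, so a run can be traced later.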
Workflow
- Confirm user intent, region, identifiers, and whether the operation is read-only or mutating.
- Run one minimal read-only query first to verify connectivity and permissions.
- Execute the target operation with explicit parameters and bounded scope.
- Verify results and save output/evidence files.
References
references/sources.md
Repository: cinience/alicloud-skills
First seen: Feb 26, 2026