alicloud-ai-audio-tts-voice-design
提供阿里云模型工作室的Qwen TTS语音设计服务,通过自然语言描述创建可控的合成语音。
npx skills add cinience/alicloud-skills --skill alicloud-ai-audio-tts-voice-designBefore / After 效果对比
1 组传统的文本转语音(TTS)服务生成的声音缺乏个性和情感表达,难以根据特定需求进行精细化调整,听感生硬,限制了应用场景。
采用阿里云Model Studio Qwen TTS语音设计模型,通过自然语言描述即可创建高度可控的合成声音,实现声音的个性化定制和情感表达,显著提升用户体验和应用灵活性。
description SKILL.md
alicloud-ai-audio-tts-voice-design
Category: provider
Model Studio Qwen TTS Voice Design
Use voice design models to create controllable synthetic voices from natural language descriptions.
Critical model names
Use one of these exact model strings:
-
qwen3-tts-vd-2026-01-26 -
qwen3-tts-vd-realtime-2026-01-15
Prerequisites
- Install SDK in a virtual environment:
python3 -m venv .venv
. .venv/bin/activate
python -m pip install dashscope
- Set
DASHSCOPE_API_KEYin your environment, or adddashscope_api_keyto~/.alibabacloud/credentials.
Normalized interface (tts.voice_design)
Request
-
voice_prompt(string, required) target voice description -
text(string, required) -
stream(bool, optional)
Response
-
audio_url(string) or streaming PCM chunks -
voice_id(string) -
request_id(string)
Operational guidance
-
Write voice prompts with tone, pace, emotion, and timbre constraints.
-
Build a reusable voice prompt library for product consistency.
-
Validate generated voice in short utterances before long scripts.
Local helper script
Prepare a normalized request JSON and validate response schema:
.venv/bin/python skills/ai/audio/alicloud-ai-audio-tts-voice-design/scripts/prepare_voice_design_request.py \
--voice-prompt "A warm female host voice, clear articulation, medium pace" \
--text "This is a voice-design demo"
Output location
-
Default output:
output/ai-audio-tts-voice-design/audio/ -
Override base dir with
OUTPUT_DIR.
Validation
mkdir -p output/alicloud-ai-audio-tts-voice-design
for f in skills/ai/audio/alicloud-ai-audio-tts-voice-design/scripts/*.py; do
python3 -m py_compile "$f"
done
echo "py_compile_ok" > output/alicloud-ai-audio-tts-voice-design/validate.txt
Pass criteria: command exits 0 and output/alicloud-ai-audio-tts-voice-design/validate.txt is generated.
Output And Evidence
-
Save artifacts, command outputs, and API response summaries under
output/alicloud-ai-audio-tts-voice-design/. -
Include key parameters (region/resource id/time range) in evidence files for reproducibility.
Workflow
-
Confirm user intent, region, identifiers, and whether the operation is read-only or mutating.
-
Run one minimal read-only query first to verify connectivity and permissions.
-
Execute the target operation with explicit parameters and bounded scope.
-
Verify results and save output/evidence files.
References
references/sources.md
Weekly Installs226Repositorycinience/alicloud-skillsGitHub Stars357First SeenFeb 26, 2026Security AuditsGen Agent Trust HubPassSocketPassSnykPassInstalled ongemini-cli224github-copilot224codex224kimi-cli224amp224cursor224
forum用户评价 (0)
发表评价
暂无评价
统计数据
用户评分
为此 Skill 评分