首页/数据 & AI/alicloud-ai-audio-tts-voice-design
A

alicloud-ai-audio-tts-voice-design

by @ciniencev
4.4(131)

提供阿里云模型工作室的Qwen TTS语音设计服务,通过自然语言描述创建可控的合成语音。

Alibaba Cloud AITTSText-to-SpeechVoice SynthesisAudio DesignGitHub
安装方式
npx skills add cinience/alicloud-skills --skill alicloud-ai-audio-tts-voice-design
compare_arrows

Before / After 效果对比

1
使用前

传统的文本转语音(TTS)服务生成的声音缺乏个性和情感表达,难以根据特定需求进行精细化调整,听感生硬,限制了应用场景。

使用后

采用阿里云Model Studio Qwen TTS语音设计模型,通过自然语言描述即可创建高度可控的合成声音,实现声音的个性化定制和情感表达,显著提升用户体验和应用灵活性。

description SKILL.md

alicloud-ai-audio-tts-voice-design

Category: provider

Model Studio Qwen TTS Voice Design

Use voice design models to create controllable synthetic voices from natural language descriptions.

Critical model names

Use one of these exact model strings:

  • qwen3-tts-vd-2026-01-26

  • qwen3-tts-vd-realtime-2026-01-15

Prerequisites

  • Install SDK in a virtual environment:
python3 -m venv .venv
. .venv/bin/activate
python -m pip install dashscope

  • Set DASHSCOPE_API_KEY in your environment, or add dashscope_api_key to ~/.alibabacloud/credentials.

Normalized interface (tts.voice_design)

Request

  • voice_prompt (string, required) target voice description

  • text (string, required)

  • stream (bool, optional)

Response

  • audio_url (string) or streaming PCM chunks

  • voice_id (string)

  • request_id (string)

Operational guidance

  • Write voice prompts with tone, pace, emotion, and timbre constraints.

  • Build a reusable voice prompt library for product consistency.

  • Validate generated voice in short utterances before long scripts.

Local helper script

Prepare a normalized request JSON and validate response schema:

.venv/bin/python skills/ai/audio/alicloud-ai-audio-tts-voice-design/scripts/prepare_voice_design_request.py \
  --voice-prompt "A warm female host voice, clear articulation, medium pace" \
  --text "This is a voice-design demo"

Output location

  • Default output: output/ai-audio-tts-voice-design/audio/

  • Override base dir with OUTPUT_DIR.

Validation

mkdir -p output/alicloud-ai-audio-tts-voice-design
for f in skills/ai/audio/alicloud-ai-audio-tts-voice-design/scripts/*.py; do
  python3 -m py_compile "$f"
done
echo "py_compile_ok" > output/alicloud-ai-audio-tts-voice-design/validate.txt

Pass criteria: command exits 0 and output/alicloud-ai-audio-tts-voice-design/validate.txt is generated.

Output And Evidence

  • Save artifacts, command outputs, and API response summaries under output/alicloud-ai-audio-tts-voice-design/.

  • Include key parameters (region/resource id/time range) in evidence files for reproducibility.

Workflow

  • Confirm user intent, region, identifiers, and whether the operation is read-only or mutating.

  • Run one minimal read-only query first to verify connectivity and permissions.

  • Execute the target operation with explicit parameters and bounded scope.

  • Verify results and save output/evidence files.

References

  • references/sources.md

Weekly Installs226Repositorycinience/alicloud-skillsGitHub Stars357First SeenFeb 26, 2026Security AuditsGen Agent Trust HubPassSocketPassSnykPassInstalled ongemini-cli224github-copilot224codex224kimi-cli224amp224cursor224

forum用户评价 (0)

发表评价

效果
易用性
文档
兼容性

暂无评价

统计数据

安装量3.3K
评分4.4 / 5.0
版本
更新日期2026年4月27日
对比案例1 组

用户评分

4.4(131)
5
60%
4
40%
3
1%
2
0%
1
0%

为此 Skill 评分

0.0

兼容平台

🔧Claude Code
🔧OpenClaw
🔧OpenCode
🔧Codex
🔧Gemini CLI
🔧GitHub Copilot
🔧Amp
🔧Kimi CLI

时间线

创建2026年3月17日
最后更新2026年4月27日
🎁 Agent 知识卡片