← 返回
开发者工具 Key 中文

IMA AI Text To Speech — seed-tts, DouBao

Convert text, scripts, and captions into natural voiceovers for videos, explainers, product demos, and social posts.
将文本、脚本和字幕转换为自然配音,用于视频、解说、产品演示和社交帖子。
allenfancy-gan allenfancy-gan 来源
开发者工具 clawhub v1.0.8 5 版本 99899.9 Key: 需要
★ 0
Stars
📥 998
下载
💾 28
安装
5
版本
#latest

概述

IMA TTS AI — Text-to-Speech Generator

For complete API documentation, security details, all parameters, speaker list, and Python examples, read SKILL-DETAIL.md.

Model ID Reference (CRITICAL)

Friendly Namemodel_idNotes
--------------------------------
Seed TTS 2.0seed-tts-2.0✅ Default and only supported model

Sub-models (via extra-params):

  • seed-tts-2.0-expressive — More expressive, emotional (default)
  • seed-tts-2.0-standard — More stable, neutral

When User Says "帮我制作旁白/配音"

Must ask first:

QuestionParameterRequired
-------------------------------
要朗读的内容/文案prompt✅ Yes

Recommend asking:

QuestionParameterOptions
------------------------------
音色/发音人speaker魅力苏菲、Vivi、云舟、大壹 等 (see SKILL-DETAIL.md)

Optional:

QuestionParameterRange
----------------------------
情感/情绪audio_params.emotionneutral, sad, angry
语速audio_params.speech_rate[-50, 100], 0=normal
音量audio_params.loudness_rate[-50, 100], 0=normal

User Input Parsing

User saysParameterValue
-----------------------------
旁白/配音/朗读prompt + speakerAsk for content first
女声/femalespeakere.g. zh_female_vv_uranus_bigtts
男声/malespeakere.g. zh_male_sophie_uranus_bigtts
语速快/slowaudio_params.speech_ratePositive/negative value
expressive/standardmodelSub-model selection

Script Usage

# List available TTS models
python3 {baseDir}/scripts/ima_tts_create.py --api-key $IMA_API_KEY --list-models

# Generate speech (default model: seed-tts-2.0)
python3 {baseDir}/scripts/ima_tts_create.py \
  --api-key $IMA_API_KEY \
  --model-id seed-tts-2.0 \
  --prompt "Text to be spoken here." \
  --user-id {user_id} \
  --output-json

# With speaker and emotion
python3 {baseDir}/scripts/ima_tts_create.py \
  --api-key $IMA_API_KEY \
  --model-id seed-tts-2.0 \
  --prompt "阳光青年音色测试,你好世界。" \
  --extra-params '{"model":"seed-tts-2.0-expressive","speaker":"zh_male_sophie_uranus_bigtts","audio_params":{"emotion":"neutral"}}' \
  --user-id {user_id} \
  --output-json

Sending Results to User

# ✅ CORRECT: Use remote URL directly
message(action="send", media=audio_url, caption="✅ 语音合成成功!\n• 模型:[Name]\n• 耗时:[X]s\n• 积分:[N pts]\n\n🔗 原始链接:[url]")

# ❌ WRONG: Never download to local file

UX Protocol (Brief)

  1. Pre-generation: "🔊 开始语音合成… 模型:[Name],预计[X~Y]秒,消耗[N]积分"
  2. Progress: Every 10-15s: "⏳ 语音合成中… [P]%"
  3. Success: Send audio via media=audio_url + include link in caption
  4. Failure: Natural language error + suggest retry. See SKILL-DETAIL.md for error translation.

Never say to users: script names, API endpoints, attribute_id, technical parameter names.

Environment

Base URL: https://api.imastudio.com

Headers: Authorization: Bearer $IMA_API_KEY · x-app-source: ima_skills · x_app_language: en

Core Flow

  1. GET /open/v1/product/list?app=ima&platform=web&category=text_to_speech → get attribute_id, credit
  2. POST /open/v1/tasks/create → get task_id
  3. POST /open/v1/tasks/detail → poll every 2-5s until resource_status==1

MANDATORY: Always query product list first. attribute_id is required.

Estimated Generation Time

ModelEstimated TimePoll Every
----------------------------------
seed-tts-2.05~30s3s

User Preference Memory

Storage: ~/.openclaw/memory/ima_prefs.json

  • Save when user explicitly says "用XXX音色" / "默认用XXX"
  • Clear when user says "换个音色" / "推荐一个"

Popular Speakers (Quick Reference)

CategorySpeaker Namespeaker ID
-----------------------------------
通用魅力苏菲zh_male_sophie_uranus_bigtts
通用Vivizh_female_vv_uranus_bigtts
通用云舟zh_male_m191_uranus_bigtts
视频配音大壹zh_male_dayi_uranus_bigtts
角色扮演知性灿灿zh_female_cancan_uranus_bigtts

Full speaker list: See volcengine_tts_timbre_list.json in project or SKILL-DETAIL.md.

⚠️ Important: Use native format (_uranus_bigtts), NOT BV_streaming format.

版本历史

共 5 个版本

  • v1.0.8 当前
    2026-05-03 03:18 安全 安全
  • v1.0.7
    2026-03-29 11:39 安全
  • v1.0.1
    2026-03-26 21:58
  • v1.0.5
    2026-03-18 19:26
  • v1.0.0
    2026-03-11 16:40

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

design-media

Video Frames

steipete
使用 ffmpeg 从视频中提取帧或短片。
★ 131 📥 52,547
design-media

UI/UX Pro Max

xobi667
提供 UI/UX 设计智能与实现指导,帮助打造精美界面。适用于 UI 设计、UX 流程、信息架构、视觉风格、设计系统/标记、组件规格、文案/微文案、无障碍及前端 UI(HTML/CSS/JS、React、Next.js、Vue、Svelte
★ 214 📥 46,216
design-media

Nano Banana Pro

steipete
使用 Nano Banana Pro (Gemini 3 Pro Image) 生成或编辑图像。支持文生图、图生图及 1K/2K/4K 分辨率,适用于图像创建、修改及编辑请求,使用 --input-image 指定输入图像。
★ 422 📥 115,951