SOTA Zero-shot Voice Cloning TTS

Voice-first OpenClaw skill powered by MOSS APIs. Use when a user wants spoken replies in a preferred timbre, either from an existing voice_id or from a refer...

xiami2019

Developer tools · clawhub · v1.0.0 · 1 version · 100000 · Key: required

Stars: 1 · Downloads: 734 · Installs: 7 · Versions: 1

Tags: latest, speech, tts, voice clone

Overview

EchoForge Moss Voice

Use this skill to run voice interactions in the user's preferred timbre.

Required runtime config

MOSI_API_KEY (required)
MOSI_BASE_URL (optional, default https://studio.mosi.cn)

Always send:

Authorization: Bearer <MOSI_API_KEY>
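
A minimal sketch of loading the runtime config above and building the Authorization header. The function name load_config and the choice to raise RuntimeError on a missing key are illustrative assumptions, not part of the skill.

```python
import os

DEFAULT_BASE_URL = "https://studio.mosi.cn"


def load_config(env=None):
    """Return (base_url, headers) built from the MOSI_* variables.

    `env` defaults to os.environ; passing a dict makes the function testable.
    """
    env = os.environ if env is None else env
    api_key = (env.get("MOSI_API_KEY") or "").strip()
    if not api_key:
        # Do not echo the key (or any fragment of it) in the error message.
        raise RuntimeError("MOSI_API_KEY is not set")
    base_url = (env.get("MOSI_BASE_URL") or DEFAULT_BASE_URL).rstrip("/")
    headers = {"Authorization": f"Bearer {api_key}"}
    return base_url, headers
```

Passing a plain dict instead of os.environ keeps the helper easy to exercise without touching the real environment.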

Inputs

Collect:

text (required, what to speak)
Voice source (one of):
    voice_id (preferred when available), or
    reference_audio (public URL), or
    local audio path (upload first, then clone voice)

Optional:

expected_duration_sec
sampling_params:
    max_new_tokens (default 512)
    temperature (default 1.7)
    top_p (default 0.8)
    top_k (default 25)
meta_info (default false)
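
One way to apply the documented sampling defaults is to merge user overrides onto a defaults table. This is a sketch only: the helper name resolve_sampling_params is hypothetical, and rejecting unknown keys is a design choice made here, not documented API behavior.

```python
DEFAULT_SAMPLING_PARAMS = {
    "max_new_tokens": 512,
    "temperature": 1.7,
    "top_p": 0.8,
    "top_k": 25,
}


def resolve_sampling_params(overrides=None):
    """Merge caller overrides onto the documented defaults."""
    overrides = overrides or {}
    unknown = set(overrides) - set(DEFAULT_SAMPLING_PARAMS)
    if unknown:
        # Fail early on typos such as "topk" instead of sending them along.
        raise ValueError(f"unknown sampling params: {sorted(unknown)}")
    return {**DEFAULT_SAMPLING_PARAMS, **overrides}
```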

Workflow

Resolve voice source.

If voice_id is available, use it directly.
If only local audio path is available:
    Upload file: POST /api/v1/files/upload with multipart field file.
    Clone voice: POST /api/v1/voice/clone with file_id (or url).
    If the returned voice status is not ACTIVE, poll GET /api/v1/voices/{voice_id} until it is ACTIVE or the poll times out.
If reference_audio URL is available, use it directly in TTS.
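
The resolution order above can be sketched as follows. The HTTP calls are injected as callables (upload, clone, get_voice) so the control flow is visible without network access; those callables, the assumed response fields voice_id and status, and the timeout values are all assumptions for illustration.

```python
import time


def wait_until_active(get_voice, voice_id, timeout_s=60.0, interval_s=2.0,
                      sleep=time.sleep, clock=time.monotonic):
    """Poll get_voice(voice_id) until its status is ACTIVE or time runs out."""
    deadline = clock() + timeout_s
    while True:
        voice = get_voice(voice_id)  # stands in for GET /api/v1/voices/{voice_id}
        if str(voice.get("status", "")).upper() == "ACTIVE":
            return voice
        if clock() >= deadline:
            raise TimeoutError(f"voice {voice_id} did not become ACTIVE")
        sleep(interval_s)


def resolve_voice_source(voice_id=None, reference_audio=None, local_path=None,
                         *, upload, clone, get_voice, **wait_kwargs):
    """Return the voice field to send with the TTS request."""
    if voice_id:
        return {"voice_id": voice_id}
    if reference_audio:
        return {"reference_audio": reference_audio}
    if local_path:
        file_id = upload(local_path)  # stands in for POST /api/v1/files/upload
        voice = clone(file_id)        # stands in for POST /api/v1/voice/clone
        new_id = voice["voice_id"]    # assumed response field
        if str(voice.get("status", "")).upper() != "ACTIVE":
            wait_until_active(get_voice, new_id, **wait_kwargs)
        return {"voice_id": new_id}
    raise ValueError("need voice_id, reference_audio, or a local audio path")
```

Checking reference_audio before the local path mirrors the "only local audio path" wording: upload and clone run only when nothing cheaper is available.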

Run TTS: POST /v1/audio/tts.

Required payload:
    model: "moss-tts"
    text
    one of voice_id or reference_audio
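
A sketch of assembling the TTS request with the standard library. It only builds the request object and does not send it. The JSON body encoding and the helper name build_tts_request are assumptions; the endpoint path and required fields are taken from the text above.

```python
import json
import urllib.request


def build_tts_request(base_url, api_key, text, voice_id=None, reference_audio=None):
    """Build (but do not send) a POST /v1/audio/tts request."""
    if not text or not text.strip():
        raise ValueError("text is required")
    payload = {"model": "moss-tts", "text": text}
    if voice_id:
        payload["voice_id"] = voice_id  # preferred when available
    elif reference_audio:
        payload["reference_audio"] = reference_audio
    else:
        raise ValueError("provide voice_id or reference_audio")
    return urllib.request.Request(
        base_url.rstrip("/") + "/v1/audio/tts",
        data=json.dumps(payload).encode("utf-8"),  # JSON body is an assumption
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
```

The returned object can be passed to urllib.request.urlopen with an explicit timeout when the call is actually made.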

Parse response:

Decode audio_data (base64) to WAV.
Read duration_s and usage when present.
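
Decoding the response might look like the sketch below. It assumes the response has already been parsed into a dict; the helper name save_tts_audio and the shape of the returned summary are illustrative.

```python
import base64
from pathlib import Path


def save_tts_audio(response, out_path):
    """Decode base64 audio_data to a WAV file and collect optional metadata."""
    audio_b64 = response.get("audio_data")
    if not audio_b64:
        raise ValueError("response has no audio_data")
    wav_bytes = base64.b64decode(audio_b64)
    Path(out_path).write_bytes(wav_bytes)
    return {
        "path": str(out_path),
        # Both fields are optional, so missing values come back as None.
        "duration_s": response.get("duration_s"),
        "usage": response.get("usage"),
    }
```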

Return a concise result:

voice_id used
output file path
duration
brief status message
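
The four result fields can be carried in a small record. TTSResult and its summary format are hypothetical; the skill only lists which fields to report.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class TTSResult:
    voice_id: Optional[str]  # None when a reference_audio URL was used instead
    output_path: str
    duration_s: Optional[float]
    status: str = "ok"

    def summary(self) -> str:
        """One-line, user-facing description of the finished request."""
        voice = self.voice_id or "reference audio"
        duration = "unknown" if self.duration_s is None else f"{self.duration_s:.1f}s"
        return f"{self.status}: voice={voice}, file={self.output_path}, duration={duration}"
```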

Error handling

If 4010 or 4011: API key missing/invalid, ask user to fix MOSI_API_KEY.
If 4020: insufficient credits, ask user to recharge.
If 4029: rate limited, retry with exponential backoff.
If 5002: invalid audio URL or decode failed, ask user for another clip.
If 5004: timeout, shorten text and retry.
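
The error table above can be sketched as a lookup plus a retry wrapper. The MossApiError exception, the way a code is extracted from a response, and the retry limits are assumptions; only 4029 is retried automatically here, because 5004 calls for shortening the text first.

```python
import time

USER_ACTIONS = {
    4010: "API key missing or invalid: fix MOSI_API_KEY.",
    4011: "API key missing or invalid: fix MOSI_API_KEY.",
    4020: "Insufficient credits: recharge the account.",
    5002: "Invalid audio URL or decode failure: provide another clip.",
    5004: "Timeout: shorten the text and retry.",
}


class MossApiError(Exception):
    """Hypothetical exception carrying the API error code."""

    def __init__(self, code, message=""):
        super().__init__(f"{code}: {message}")
        self.code = code


def advice_for(code):
    """Return the user-facing action for a known error code."""
    return USER_ACTIONS.get(code, f"Unexpected error code {code}.")


def call_with_retry(call, max_attempts=4, base_delay_s=1.0, sleep=time.sleep):
    """Retry call() on 4029 with exponential backoff: 1s, 2s, 4s, ..."""
    for attempt in range(max_attempts):
        try:
            return call()
        except MossApiError as err:
            if err.code != 4029 or attempt == max_attempts - 1:
                raise
            sleep(base_delay_s * 2 ** attempt)
```

In production use, adding random jitter to each delay would help avoid synchronized retries; it is left out here to keep the delays easy to follow.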

Operational constraints

Keep request rate <= 5 RPM.
Keep single request text short enough to avoid timeout.
Never print or log raw API keys.
Prefer reusing stable voice_id for multi-turn voice chat to reduce latency.
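
Two of the constraints above lend themselves to small helpers: a sliding-window limiter for the 5 RPM cap and a masking function for log output. Both RpmLimiter and mask_key are illustrative names, and the masking format is an arbitrary choice.

```python
import time
from collections import deque


class RpmLimiter:
    """Block until another request fits inside a sliding 60-second window."""

    def __init__(self, max_requests=5, window_s=60.0,
                 clock=time.monotonic, sleep=time.sleep):
        self.max_requests = max_requests
        self.window_s = window_s
        self._clock = clock
        self._sleep = sleep
        self._sent = deque()  # timestamps of requests inside the window

    def acquire(self):
        while True:
            now = self._clock()
            # Drop timestamps that have aged out of the window.
            while self._sent and now - self._sent[0] >= self.window_s:
                self._sent.popleft()
            if len(self._sent) < self.max_requests:
                self._sent.append(now)
                return
            # Wait until the oldest request leaves the window, then re-check.
            self._sleep(self.window_s - (now - self._sent[0]))


def mask_key(key):
    """Return a log-safe form of an API key."""
    return "***" if len(key) <= 8 else f"{key[:4]}...{key[-2:]}"
```

Calling acquire() before each request keeps a multi-turn session under the cap without the caller tracking timestamps.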

Version history

1 version in total

v1.0.0 (current)

2026-03-30 04:20 · Safe

Security check

Tencent Cloud Security (Keen): safe, no risk

Tencent Cloud Security (Sanbu): safe, no risk
