← 返回
开发者工具 Key 中文

SOTA Zero-shot Voice Cloning TTS

Voice-first OpenClaw skill powered by MOSS APIs. Use when a user wants spoken replies in a preferred timbre, either from an existing voice_id or from a refer...
基于MOSS API的语音优先OpenClaw技能,适用于用户希望以偏好的音色获得语音回复的场景,可使用已有的voice_id或参照...
xiami2019
开发者工具 clawhub v1.0.0 1 版本 100000 Key: 需要
★ 1
Stars
📥 734
下载
💾 7
安装
1
版本
#latest#speech#tts#voice clone

概述

EchoForge Moss Voice

Use this skill to run voice interaction with user-preferred timbre.

Required runtime config

  • MOSI_API_KEY (required)
  • MOSI_BASE_URL (optional, default https://studio.mosi.cn)

Always send:

  • Authorization: Bearer

Inputs

Collect:

  • text (required, what to speak)
  • Voice source (one of):
  • voice_id (preferred when available), or
  • reference_audio (public URL), or
  • local audio path (upload first, then clone voice)

Optional:

  • expected_duration_sec
  • sampling_params:
  • max_new_tokens (default 512)
  • temperature (default 1.7)
  • top_p (default 0.8)
  • top_k (default 25)
  • meta_info (default false)

Workflow

  1. Resolve voice source.
    • If voice_id is available, use it directly.
    • If only local audio path is available:
    • Upload file: POST /api/v1/files/upload with multipart field file.
    • Clone voice: POST /api/v1/voice/clone with file_id (or url).
    • If returned voice status is not active, poll GET /api/v1/voices/{voice_id} until ACTIVE or timeout.
    • If reference_audio URL is available, use it directly in TTS.
  2. Run TTS: POST /v1/audio/tts.
    • Required payload:
    • model: "moss-tts"
    • text
    • one of voice_id or reference_audio
  3. Parse response:
    • Decode audio_data (base64) to WAV.
    • Read duration_s and usage when present.
  4. Return a concise result:
    • voice_id used
    • output file path
    • duration
    • brief status message

Error handling

  • If 4010 or 4011: API key missing/invalid, ask user to fix MOSI_API_KEY.
  • If 4020: insufficient credits, ask user to recharge.
  • If 4029: rate limited, retry with exponential backoff.
  • If 5002: invalid audio URL or decode failed, ask user for another clip.
  • If 5004: timeout, shorten text and retry.

Operational constraints

  • Keep request rate <= 5 RPM.
  • Keep single request text short enough to avoid timeout.
  • Never print or log raw API keys.
  • Prefer reusing stable voice_id for multi-turn voice chat to reduce latency.

版本历史

共 1 个版本

  • v1.0.0 当前
    2026-03-30 04:20 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

developer-tools

CodeConductor.ai

larsonreever
AI驱动平台,提供快速全栈开发、智能体、工作流自动化及低代码AI集成的可扩展产品创建。
★ 65 📥 179,827
developer-tools

Github

steipete
使用 `gh` CLI 与 GitHub 交互,通过 `gh issue`、`gh pr`、`gh run` 和 `gh api` 管理议题、PR、CI 运行及高级查询。
★ 666 📥 323,779
developer-tools

Gog

steipete
Google Workspace 命令行工具,支持 Gmail、日历、云端硬盘、通讯录、表格和文档。
★ 920 📥 185,724