← 返回
开发者工具 Key 中文

mmEasyVoice

Simple text-to-speech skill using MiniMax Voice API. Converts text to audio with customizable voice selection. Use for generating speech audio from text.
{"answer":"基于 MiniMax Voice API 的简单文本转语音技能。支持自定义声音选择,将文本转换为音频。用于从文本生成语音音频。"}
blue-coconut
开发者工具 clawhub v1.0.2 2 版本 100000 Key: 需要
★ 0
Stars
📥 904
下载
💾 20
安装
2
版本
#latest

概述

MiniMax Easy Voice

Simple text-to-speech skill powered by MiniMax Voice API. Converts any text into natural-sounding audio with customizable voice selection.

Usage

[Step 1] Preparation

IMPORTANT: Before using this skill, always verify the environment:

python check_environment.py

If API key is not set:

export MINIMAX_VOICE_API_KEY="your-api-key-here"

Requirements:

  • Python 3.8+
  • MINIMAX_VOICE_API_KEY environment variable (required)
  • FFmpeg (optional, for audio merging/conversion)

[Step 2] Generate speech audio from text

Convert any text to speech in one command:

# Default voice (male-qn-qingse)
python mmvoice.py tts "Hello, this is a test." -o test.mp3

# Chinese text
python mmvoice.py tts "你好,这是一个测试。" -o test_cn.mp3

# Specify a different voice by "-v voice_id"
python mmvoice.py tts "Hello world" -v female-shaonv -o hello.mp3

# Specify output path
python mmvoice.py tts "Your text" -o /path/to/output.mp3

Required parameters:

  • text: The text you want to convert to speech
  • -o OUTPUT: Output audio file path (required)

Optional parameters:

  • -v VOICE_ID: Voice to use (default: male-qn-qingse)

Usage Details

Text length limits:

  • Up to 10,000 characters per request
  • For longer text, split into multiple requests and merge later

Pause insertion: Use <#x#> where x = pause duration in seconds

  • Example: "Hello<#1.5#>world" = 1.5 second pause between words
  • Range: 0.01 to 99.99 seconds

Emotion matching: speech-2.8 models automatically match emotions to your text content

Voice Selection

Choose the right voice for your content by consulting the voice catalog:

# List all available voices
python mmvoice.py list-voices

Voice catalog: reference/voice_catalog.md

Contains:

  • All available system voices (male, female)
  • Voice characteristics and recommended use cases
  • How to select the right voice for your content

Advanced Options

Voice Management

List available voices:

python mmvoice.py list-voices

Clone a voice from audio sample:

python mmvoice.py clone audio_file.mp3 --voice-id my-custom-voice

Design a voice from description:

python mmvoice.py design "A warm, gentle female voice" --voice-id designed-voice

Audio Processing

Merge multiple audio files:

python mmvoice.py merge file1.mp3 file2.mp3 file3.mp3 -o combined.mp3

Convert audio format:

python mmvoice.py convert input.wav -o output.mp3 --format mp3

Reference Documents

Open these when needed for more details:

DocumentWhen to Use
-----------------------
reference/voice_catalog.mdChoosing a voice_id
reference/getting-started.mdEnvironment setup
reference/audio-guide.mdAudio processing
reference/voice-guide.mdVoice cloning and design
reference/troubleshooting.mdCommon issues and solutions

Troubleshooting

Common issues:

  1. API key not set: Run export MINIMAX_VOICE_API_KEY="your-key"
  2. FFmpeg missing: Install with brew install ffmpeg (macOS) or sudo apt install ffmpeg (Ubuntu)
  3. Voice not found: Use python mmvoice.py list-voices to see available voices

Run environment check:

python check_environment.py

See reference/troubleshooting.md for more solutions.

版本历史

共 2 个版本

  • v1.0.2 当前
    2026-03-29 16:49 安全 安全
  • v1.0.1
    2026-03-07 01:57

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

content-creation

mmVoiceMaker

blue-coconut
支持使用 MiniMax Voice API 与 FFmpeg 实现语音合成、语音克隆、语音设计及音频后期处理,用于文字转语音及创作。
★ 3 📥 1,105
developer-tools

Github

steipete
使用 `gh` CLI 与 GitHub 交互,通过 `gh issue`、`gh pr`、`gh run` 和 `gh api` 管理议题、PR、CI 运行及高级查询。
★ 666 📥 323,791
developer-tools

Gog

steipete
Google Workspace 命令行工具,支持 Gmail、日历、云端硬盘、通讯录、表格和文档。
★ 920 📥 185,727