← 返回
开发者工具 中文

ComfyUI TTS

Convert text to speech audio via ComfyUI's Qwen-TTS API, supporting customizable voice, style, model, and output options.
通过 ComfyUI 的 Qwen-TTS API 将文本转换为语音音频,支持自定义语音、风格、模型和输出选项。
yhsi5358 yhsi5358 来源
开发者工具 clawhub v1.0.0 1 版本 100000 Key: 无需
★ 0
Stars
📥 1,434
下载
💾 70
安装
1
版本
#latest

概述

ComfyUI TTS Skill

Generate speech audio using ComfyUI's Qwen-TTS service. This skill allows you to convert text to speech through ComfyUI's API.

Configuration

Environment Variables

Set these environment variables to configure the ComfyUI connection:

export COMFYUI_HOST="localhost"      # ComfyUI server host
export COMFYUI_PORT="8188"           # ComfyUI server port
export COMFYUI_OUTPUT_DIR=""         # Optional: Custom output directory

Usage

Basic Text-to-Speech

Generate audio from text using default settings:

scripts/tts.sh "你好,世界"

Advanced Options

Customize voice characteristics:

# Specify character and style
scripts/tts.sh "你好" --character "Girl" --style "Emotional"

# Change model size
scripts/tts.sh "你好" --model "3B"

# Specify output file
scripts/tts.sh "你好" --output "/path/to/output.wav"

# Combine options
scripts/tts.sh "你好,这是测试" \
  --character "Girl" \
  --style "Emotional" \
  --model "1.7B" \
  --output "~/audio/test.wav"

Available Options

OptionDescriptionDefault
------------------------------
--characterVoice character (Girl/Boy/etc.)"Girl"
--styleSpeaking style (Emotional/Neutral/etc.)"Emotional"
--modelModel size (0.5B/1.7B/3B)"1.7B"
--outputOutput file pathAuto-generated
--temperatureGeneration temperature (0-1)0.9
--top-pTop-p sampling0.9
--top-kTop-k sampling50

Workflow

The skill performs these steps:

  1. Construct Workflow: Builds a ComfyUI workflow JSON with your text and settings
  2. Submit Job: Sends the workflow to ComfyUI's /prompt endpoint
  3. Poll Status: Monitors job completion via /history endpoint
  4. Retrieve Audio: Returns the path to the generated audio file

Troubleshooting

Connection Refused

  • Verify ComfyUI is running: curl http://$COMFYUI_HOST:$COMFYUI_PORT/system_stats
  • Check host and port settings

Job Timeout

  • Large models (3B) take longer to generate
  • Try smaller models (0.5B, 1.7B) for faster results

Output Not Found

  • Check ComfyUI's output directory configuration
  • Verify file permissions

API Reference

The skill uses ComfyUI's native API endpoints:

  • POST /prompt - Submit workflow
  • GET /history - Check job status
  • Output files are saved to ComfyUI's configured output directory

版本历史

共 1 个版本

  • v1.0.0 当前
    2026-03-29 05:05 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

design-media

Openai Whisper

steipete
使用 Whisper CLI 进行本地语音转文字(无需 API 密钥)
★ 329 📥 92,498
design-media

Nano Banana Pro

steipete
使用 Nano Banana Pro (Gemini 3 Pro Image) 生成或编辑图像。支持文生图、图生图及 1K/2K/4K 分辨率,适用于图像创建、修改及编辑请求,使用 --input-image 指定输入图像。
★ 422 📥 115,885
design-media

Video Frames

steipete
使用 ffmpeg 从视频中提取帧或短片。
★ 131 📥 52,525