Volcengine STT

Transcribe audio to text using Volcano Engine (Volcengine/ARK) speech-to-text APIs. Use when the user wants to replace Whisper/OpenAI STT with Volcengine, tr...

利用火山引擎语音转文字API将音频转录为文本。适用于用户希望用火山引擎替代Whisper/OpenAI STT的场景。

开发者工具 clawhub v0.2.1 1 版本 99883.6 Key: 需要

★ 2

Stars

📥 818

下载

💾 26

安装

版本

#latest

概述

Volcengine STT

Use this skill to run speech-to-text through Volcengine.

Quick start

{baseDir}/scripts/transcribe.sh /path/to/audio.ogg

Default behavior:

Endpoint: ${ARK_BASE_URL:-https://ark.cn-beijing.volces.com/api/v3}/audio/transcriptions
Model: ${ARK_STT_MODEL:-doubao-seed-asr-1-0}
Auth header: Authorization: Bearer $ARK_API_KEY
Output file: .txt

Required env

ARK_API_KEY (required)

Optional:

ARK_BASE_URL (default: https://ark.cn-beijing.volces.com/api/v3)
ARK_STT_MODEL (default: doubao-seed-asr-1-0)

Useful flags

# Save plain text to custom path
{baseDir}/scripts/transcribe.sh ./voice.ogg --out /tmp/voice.txt

# Force model
{baseDir}/scripts/transcribe.sh ./voice.ogg --model doubao-seed-asr-1-0

# Return raw JSON (for debugging/integration)
{baseDir}/scripts/transcribe.sh ./voice.ogg --json --out /tmp/voice.json

# Hint language/prompt when needed
{baseDir}/scripts/transcribe.sh ./voice.ogg --language zh --prompt "中英混合，保留术语"

Integration notes

For OpenClaw voice-message handling, call this script instead of Whisper script.
Keep keys in machine-local config or env, never commit secrets.
If your Volcengine account uses a different model name, pass --model or set ARK_STT_MODEL.

版本历史

共 1 个版本

v0.2.1 当前

2026-03-30 00:24 安全安全

安全检测

腾讯云安全 (Keen)

安全，无风险

查看报告

腾讯云安全 (Sanbu)

安全，无风险

查看报告

🔗 相关推荐

ai-agent

Reminder

reed1898

捕获自然语言事件，保存至工作区，安排 Telegram 提醒，自动 cron 通知并查询即将到来的事件。

★ 9 📥 10,446

dev-programming

Mcporter

steipete

使用 mcporter CLI 直接列出、配置、认证及调用 MCP 服务器/工具（支持 HTTP 或 stdio），涵盖临时服务器、配置编辑及 CLI/类型生成功能。

★ 194 📥 67,407

dev-programming

CodeConductor.ai

larsonreever

AI驱动平台，提供快速全栈开发、智能体、工作流自动化及低代码AI集成的可扩展产品创建。

★ 72 📥 181,412