Use this skill to run speech-to-text through Volcengine.
{baseDir}/scripts/transcribe.sh /path/to/audio.ogg
Default behavior:
- Endpoint: ${ARK_BASE_URL:-https://ark.cn-beijing.volces.com/api/v3}/audio/transcriptions
- Model: ${ARK_STT_MODEL:-doubao-seed-asr-1-0}
- Auth header: Authorization: Bearer $ARK_API_KEY
- Output: plain text (.txt)

Environment variables:
- ARK_API_KEY (required)

Optional:
- ARK_BASE_URL (default: https://ark.cn-beijing.volces.com/api/v3)
- ARK_STT_MODEL (default: doubao-seed-asr-1-0)

# Save plain text to custom path
{baseDir}/scripts/transcribe.sh ./voice.ogg --out /tmp/voice.txt
# Force model
{baseDir}/scripts/transcribe.sh ./voice.ogg --model doubao-seed-asr-1-0
# Return raw JSON (for debugging/integration)
{baseDir}/scripts/transcribe.sh ./voice.ogg --json --out /tmp/voice.json
# Hint language/prompt when needed (the prompt below reads "mixed Chinese and English, keep terminology")
{baseDir}/scripts/transcribe.sh ./voice.ogg --language zh --prompt "中英混合,保留术语"
To use a different model, pass --model or set ARK_STT_MODEL.
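
The default resolution described above can be sketched with plain shell parameter expansion. This is a minimal illustration, not the actual contents of transcribe.sh: the function names resolve_model and resolve_endpoint are hypothetical, and the precedence shown (a --model value first, then ARK_STT_MODEL, then the documented default) is an assumption about how the script behaves.

```shell
#!/bin/sh
# Hypothetical sketch of how the documented defaults could be resolved.
# Not the real transcribe.sh; function names are illustrative only.

# Assumed precedence: explicit --model value ($1), then ARK_STT_MODEL,
# then the documented default model.
resolve_model() {
  printf '%s\n' "${1:-${ARK_STT_MODEL:-doubao-seed-asr-1-0}}"
}

# The endpoint is the base URL (overridable via ARK_BASE_URL) plus the
# /audio/transcriptions path listed under "Default behavior".
resolve_endpoint() {
  printf '%s\n' "${ARK_BASE_URL:-https://ark.cn-beijing.volces.com/api/v3}/audio/transcriptions"
}

# Show what would be used with the current environment and no --model flag.
echo "model:    $(resolve_model)"
echo "endpoint: $(resolve_endpoint)"
```

The ${VAR:-default} form falls back when the variable is unset or empty, so exporting an empty ARK_STT_MODEL still yields the default model in this sketch.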