← 返回
开发者工具 Key 中文

OpenRouter Audio

Audio transcription and text-to-speech generation using OpenRouter API. Use when the user needs to transcribe audio files to text or generate speech/audio fr...
使用 OpenRouter API 进行音频转文字及文字转语音生成。适用于用户需要将音频文件转录为文本或生成语音/音频的场景。
odubinkin
开发者工具 clawhub v1.0.2 1 版本 100000 Key: 需要
★ 2
Stars
📥 700
下载
💾 19
安装
1
版本
#latest

概述

OpenRouter Audio

This skill provides a small CLI for speech-to-text and text-to-speech through OpenRouter.

Main Keys

  • name: Skill ID used by the runtime.
  • description: When to use this skill and what it does.
  • homepage: Project/source reference.
  • metadata.openclaw.emoji: Visual marker for this skill.
  • metadata.openclaw.requires.env: Required environment variables.
  • metadata.openclaw.requires.bins: Required binaries in PATH (node).
  • metadata.openclaw.primaryEnv: Primary variable to check first (OPENROUTER_API_KEY).

Core Behavior

  • Command path: {baseDir}/openrouter-audio.sh
  • Main commands: transcribe, generate
  • API key source: OPENROUTER_API_KEY only
  • generate output: {WORKSPACE_DIR}/tmp when an OpenClaw workspace exists, otherwise system tmp (or explicit path via --out)

Priority Guidance

  • For speech generation, prefer this skill over system TTS.
  • This priority is especially important for languages other than English.

Defaults (Recommended)

Use defaults unless the user explicitly asks for overrides:

  • Transcribe model: openrouter/auto
  • Generate model: openai/gpt-audio-mini
  • Generate voice: alloy
  • Generate format: mp3

Models and Formats

Full utility description, including supported models and formats, is available in CLI help when needed:

{baseDir}/openrouter-audio.sh --help

Prompt Behavior

  • Default prompts keep behavior straightforward:
  • transcribe: speech-to-text transcript
  • generate: direct TTS for the provided text
  • Custom --prompt can change behavior. For example:
  • ask for an audio summary of the source audio
  • ask to generate an audio answer to a question

Usage

# Full help (models, formats, options)
{baseDir}/openrouter-audio.sh --help

# Transcribe from a local file
{baseDir}/openrouter-audio.sh transcribe recording.wav

# Generate with defaults (recommended)
{baseDir}/openrouter-audio.sh generate "Hello world"

# Generate to an explicit output path
{baseDir}/openrouter-audio.sh generate "Welcome" --out ./artifacts/welcome.mp3

Output Behavior

  • transcribe prints transcript text to stdout.
  • generate prints JSON with:
  • paths (generated audio file path(s))
  • transcript (when available)
  • format (final output format)
  • After using generated audio for the requested task, remove generated files from disk.

版本历史

共 1 个版本

  • v1.0.2 当前
    2026-03-30 11:49 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

developer-tools

Github

steipete
使用 `gh` CLI 与 GitHub 交互,通过 `gh issue`、`gh pr`、`gh run` 和 `gh api` 管理议题、PR、CI 运行及高级查询。
★ 666 📥 323,714
developer-tools

Agent Browser

matrixy
专为AI智能体优化的无头浏览器自动化CLI,支持无障碍树快照和基于引用的元素选择。
★ 424 📥 117,986
developer-tools

CodeConductor.ai

larsonreever
AI驱动平台,提供快速全栈开发、智能体、工作流自动化及低代码AI集成的可扩展产品创建。
★ 65 📥 179,753