← 返回
开发者工具 Key 中文

Groq Voice Transcribe

Transcribe audio files via Groq's OpenAI-compatible speech-to-text API. Use when the user sends voice messages or audio files and you need fast cloud speech-...
通过 Groq 的 OpenAI 兼容语音转文字 API 转录音频文件。当用户发送语音消息或音频文件且需要快速云端语音识别时使用。
timing-up
开发者工具 clawhub v1.0.1 1 版本 100000 Key: 需要
★ 0
Stars
📥 668
下载
💾 8
安装
1
版本
#latest

概述

Groq Voice Transcribe

Fast speech-to-text for voice notes and audio files through Groq's OpenAI-compatible transcription endpoint.

Use it when you want cloud transcription via Groq instead of running Whisper locally.

Best for:

  • Telegram / Signal voice notes
  • short audio clips
  • Chinese, English, or mixed daily speech
  • fast transcript generation for follow-up summarization or chat replies

What you need

You need a Groq API key.

Groq often provides a free developer tier / trial credits for new users.

Get one from:

  • https://console.groq.com/
  • Sign in, open API Keys, then create a key

Easiest setup in OpenClaw

If OpenClaw is already running and configured, you can simply ask your assistant:

  • "Configure Groq Voice Transcribe for me"
  • "Here is my Groq API key, set up Groq Voice Transcribe"

The assistant can place the key into ~/.openclaw/openclaw.json for you.

Manual setup

Set GROQ_API_KEY, or configure it in ~/.openclaw/openclaw.json under:

{
  "skills": {
    "entries": {
      "groq-voice-transcribe": {
        "apiKey": "GROQ_KEY_HERE"
      }
    }
  }
}

Quick start

{baseDir}/scripts/transcribe.sh /path/to/audio.ogg

Defaults:

  • Model: whisper-large-v3-turbo
  • Output: .txt
  • Format: plain text

Common examples

# Basic transcript
{baseDir}/scripts/transcribe.sh /path/to/audio.ogg

# Chinese voice message
{baseDir}/scripts/transcribe.sh /path/to/audio.ogg --language zh --prompt "中文普通话,日常聊天"

# Save to a custom file
{baseDir}/scripts/transcribe.sh /path/to/audio.ogg --out /tmp/transcript.txt

# Verbose JSON output
{baseDir}/scripts/transcribe.sh /path/to/audio.ogg --json --out /tmp/transcript.json

Flags

  • --model : transcription model (default whisper-large-v3-turbo)
  • --out : output file path
  • --language : hint the spoken language, for example zh, en, ja
  • --prompt : optional context or spelling hint
  • --json: write verbose JSON instead of plain text

Notes

  • Audio is sent to Groq for transcription.
  • This skill is meant for transcription, not text-to-speech.
  • If language is omitted, Groq can usually auto-detect it, but passing --language zh often helps for Chinese voice notes.

版本历史

共 1 个版本

  • v1.0.1 当前
    2026-03-30 23:47 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

developer-tools

Gog

steipete
Google Workspace 命令行工具,支持 Gmail、日历、云端硬盘、通讯录、表格和文档。
★ 921 📥 185,767
developer-tools

CodeConductor.ai

larsonreever
AI驱动平台,提供快速全栈开发、智能体、工作流自动化及低代码AI集成的可扩展产品创建。
★ 66 📥 179,998
developer-tools

Agent Browser

matrixy
专为AI智能体优化的无头浏览器自动化CLI,支持无障碍树快照和基于引用的元素选择。
★ 426 📥 118,107