
Voice Recognition

Local speech-to-text using the OpenAI Whisper CLI. Supports 100+ languages, including Chinese and English, with optional translation to English and summarization.
Author: gykdly
Category: Developer Tools · Source: clawhub · Version: v1.0.0 · API key: not required

Overview

Voice Recognition (Whisper)

Local speech-to-text with OpenAI Whisper CLI.

Features

  • Local processing - Runs on your machine; no API key needed, free
  • Multi-language - Chinese, English, and 100+ other languages
  • Translation - Translates speech to English
  • Summarization - Generates a quick summary of the transcript

Usage

Basic

# Chinese recognition
python3 /Users/liyi/.openclaw/workspace/scripts/voice识别_升级版.py audio.m4a

# Force Chinese
python3 /Users/liyi/.openclaw/workspace/scripts/voice识别_升级版.py audio.m4a --zh

# English recognition
python3 /Users/liyi/.openclaw/workspace/scripts/voice识别_升级版.py audio.m4a --en

# Translate to English
python3 /Users/liyi/.openclaw/workspace/scripts/voice识别_升级版.py audio.m4a --translate

# With summary
python3 /Users/liyi/.openclaw/workspace/scripts/voice识别_升级版.py audio.m4a --summarize
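The script's source is not shown here, so how its flags reach Whisper is undocumented. As a rough sketch only, assuming the script shells out to the `whisper` command, the flags above could map onto the CLI's `--model`, `--language`, and `--task` options as follows. The function name `build_whisper_command` and the mapping itself are hypothetical; `--summarize` is left out because its mechanism is not described.

```python
from typing import List, Optional


def build_whisper_command(
    audio_path: str,
    language: Optional[str] = None,  # e.g. "zh" for --zh, "en" for --en
    translate: bool = False,         # True for --translate
    model: str = "medium",           # default model named in the Notes
) -> List[str]:
    """Build an argument list for the `whisper` CLI (hypothetical mapping)."""
    cmd = ["whisper", audio_path, "--model", model]
    if language is not None:
        # Without --language, Whisper detects the spoken language itself.
        cmd += ["--language", language]
    # The "translate" task produces English text; "transcribe" keeps the
    # spoken language.
    cmd += ["--task", "translate" if translate else "transcribe"]
    return cmd


if __name__ == "__main__":
    # Print the command instead of running it, so this works without Whisper.
    print(" ".join(build_whisper_command("audio.m4a", language="zh")))
```

Returning a list rather than a single string lets the result be passed directly to `subprocess.run` without shell quoting issues, which matters for file names containing spaces or non-ASCII characters.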

Quick Command (add to ~/.zshrc)

alias voice="python3 /Users/liyi/.openclaw/workspace/scripts/voice识别_升级版.py"

Then use:

voice ~/Downloads/audio.m4a --zh

Requirements

  • OpenAI Whisper CLI: brew install openai-whisper
  • Python 3.10+
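Both requirements can be checked before running the script. The following is a minimal sketch, not part of the skill itself; the function name `missing_requirements` is hypothetical, and it assumes the Homebrew package puts an executable named `whisper` on the PATH.

```python
import shutil
import sys
from typing import List, Tuple


def missing_requirements(min_python: Tuple[int, int] = (3, 10)) -> List[str]:
    """Return human-readable problems; an empty list means all checks passed."""
    problems = []
    if sys.version_info < min_python:
        wanted = ".".join(str(part) for part in min_python)
        problems.append(f"Python {wanted}+ is required")
    # shutil.which returns None when the executable is not on the PATH.
    if shutil.which("whisper") is None:
        problems.append(
            "whisper CLI not found on PATH (try: brew install openai-whisper)"
        )
    return problems


if __name__ == "__main__":
    for problem in missing_requirements():
        print("Missing:", problem)
```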

Files

  • scripts/voice识别_升级版.py - Main script
  • scripts/voice_tool_README.md - Documentation

Supported Formats

  • MP3, M4A, WAV, OGG, FLAC, WebM
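A caller can reject unsupported files early by checking the extension against this list. This is an illustrative sketch; `SUPPORTED_EXTENSIONS` and `is_supported_audio` are hypothetical names, and the check looks only at the file name, not the actual audio encoding.

```python
from pathlib import Path

# Extensions taken from the list above, lowercased for comparison.
SUPPORTED_EXTENSIONS = {".mp3", ".m4a", ".wav", ".ogg", ".flac", ".webm"}


def is_supported_audio(path: str) -> bool:
    """Return True if the file extension is in the supported list."""
    return Path(path).suffix.lower() in SUPPORTED_EXTENSIONS


if __name__ == "__main__":
    for name in ["audio.m4a", "talk.MP3", "notes.txt"]:
        print(name, is_supported_audio(name))
```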

Language Support

100+ languages including:

  • Chinese (zh)
  • English (en)
  • Japanese (ja)
  • Korean (ko)
  • And more...

Notes

  • Default model: medium (a balance of speed and accuracy)
  • The first run downloads the model to ~/.cache/whisper
  • Processing time varies with audio length and model size
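To tell whether the first-run download has already happened, one can look for the model file in the cache directory. The sketch below assumes the checkpoint is stored as `<model>.pt` (for example `medium.pt`), which is how the openai-whisper package names its downloads; the helper names are hypothetical.

```python
from pathlib import Path
from typing import Optional


def model_cache_path(model: str = "medium", cache_dir: Optional[Path] = None) -> Path:
    """Return the expected location of a downloaded model checkpoint."""
    if cache_dir is None:
        # Cache location stated in the notes above.
        cache_dir = Path.home() / ".cache" / "whisper"
    # Assumption: checkpoints are saved as "<model>.pt".
    return cache_dir / f"{model}.pt"


def is_model_cached(model: str = "medium", cache_dir: Optional[Path] = None) -> bool:
    """Return True if the checkpoint file already exists on disk."""
    return model_cache_path(model, cache_dir).is_file()


if __name__ == "__main__":
    if is_model_cached("medium"):
        print("Model already downloaded:", model_cache_path("medium"))
    else:
        print("Model not cached yet; the first run will download it.")
```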

Version History

1 version in total

  • v1.0.0 (current)
    2026-03-28 19:30, security status: safe

Security Checks

Tencent Cloud Security (Keen)

Safe, no risk

Tencent Cloud Security (Sanbu)

Safe, no risk
