Transcribe audio files via OpenRouter using audio-capable models (Gemini, GPT-4o-audio, etc.).
clawhub install openrouter-transcribe

Transcribe audio files using ElevenLabs Speech-to-Text (Scribe v2).
# Install Skill (downloads SKILL.md to .claude/skills/)
clawhub install elevenlabs-stt
# Then just tell Claude: "use ElevenLabs Speech-to-Text to help me..."
# Same install command works with all SKILL.md-compatible AI coding tools
clawhub install elevenlabs-stt
This Skill is compatible with the OpenClaw standard. After installation, a SKILL.md file is auto-generated, usable by any OpenClaw-compatible AI Agent (Claude Code, Cursor, Windsurf, etc.).
# Basic transcription
{baseDir}/scripts/transcribe.sh /path/to/audio.mp3
# With speaker diarization
{baseDir}/scripts/transcribe.sh /path/to/audio.mp3 --diarize
# Specify language (improves accuracy)
{baseDir}/scripts/transcribe.sh /path/to/audio.mp3 --lang en
# Full JSON output with timestamps
{baseDir}/scripts/transcribe.sh /path/to/audio.mp3 --json
| Option | Description |
|--------|-------------|
| --diarize | Identify different speakers |
| --lang CODE | ISO language code (e.g., en, pt, es) |
| --json | Output full JSON with word timestamps |
| --events | Tag audio events (laughter, music, etc.) |

Set the ELEVENLABS_API_KEY environment variable, or configure the key in clawdbot.json:

{
  skills: {
    entries: {
      "elevenlabs-stt": {
        apiKey: "sk_..."
      }
    }
  }
}
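If you rely on the environment variable rather than clawdbot.json, a small preflight check can catch a missing key before any audio is uploaded. The sketch below is only an illustration: `require_api_key` is a hypothetical helper that is not part of the skill, and it checks only the environment variable, so it reports a failure even when the key is configured in clawdbot.json.

```shell
# Hypothetical preflight helper (not shipped with the skill).
# Succeeds only when ELEVENLABS_API_KEY is set to a non-empty value.
require_api_key() {
  if [ -z "${ELEVENLABS_API_KEY:-}" ]; then
    echo "ELEVENLABS_API_KEY is not set; export it or set apiKey in clawdbot.json" >&2
    return 1
  fi
}
```

One way to use it is to chain it ahead of the transcription command, for example `require_api_key && {baseDir}/scripts/transcribe.sh /path/to/audio.mp3`.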
# Transcribe a WhatsApp voice note
{baseDir}/scripts/transcribe.sh ~/Downloads/voice_note.ogg
# Meeting recording with multiple speakers
{baseDir}/scripts/transcribe.sh meeting.mp3 --diarize --lang en
# Get JSON for processing
{baseDir}/scripts/transcribe.sh podcast.mp3 --json > transcript.json
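The single-file examples above can be extended to a whole folder of voice notes. The following is a minimal sketch, assuming the script path is passed in explicitly (the resolved form of the {baseDir}/scripts/transcribe.sh placeholder); `transcribe_dir` is a hypothetical wrapper, and only the documented --json flag is used.

```shell
# Hypothetical batch wrapper (not part of the skill).
# Usage: transcribe_dir DIRECTORY PATH_TO_TRANSCRIBE_SH
# Writes one .json transcript next to each .ogg file in DIRECTORY.
transcribe_dir() {
  dir="$1"
  script="$2"
  for f in "$dir"/*.ogg; do
    # Skip the literal pattern when the directory has no .ogg files
    [ -e "$f" ] || continue
    # --json is the documented flag for full JSON output
    "$script" "$f" --json > "${f%.ogg}.json" || echo "failed: $f" >&2
  done
}
```

A failed file is reported on stderr and the loop continues, so one bad recording does not stop the rest of the batch.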
clawhub run elevenlabs-stt --input meeting.mp3 --output meeting_transcript.json --language zh --timestamps true