Voice Transcribe
Transcribe audio files using OpenAI's gpt-4o-mini-transcribe model with vocabulary hints and text replacements. Requires uv (https://docs.astral.sh/uv/).
clawhub install voice-transcribeHandle audio file processing, editing, and conversion tasks
# Install Skill (downloads SKILL.md to .claude/skills/) clawhub install audio # Then just tell Claude: "use Audio to help me..."
# Same install command — works with all SKILL.md-compatible AI coding tools clawhub install audio
This Skill is compatible with the OpenClaw standard. After installation, a SKILL.md file is auto-generated, usable by any OpenClaw-compatible AI Agent (Claude Code, Cursor, Windsurf, etc.).
ffmpeg / ffprobe — core audio processingsox — additional noise reductionwhisper — local transcription (or use API)demucs — stem separationcommands.md |
| Loudness standards by platform | loudness.md |
| Podcast production workflow | podcast.md |
| Transcription workflow | transcription.md |-acodec) |
| Remove noise | FFmpeg filters or SoX |
| Normalize loudness | ffmpeg-normalize or -af loudnorm |
| Transcribe | Whisper → text, SRT, VTT |
| Separate stems | Demucs (vocals, drums, bass, other) |ffprobe for codec, sample rate, channels, duration-acodec libmp3lame -q:a 2 |
| "Remove background noise" | Apply highpass/lowpass or dedicated denoiser |
| "Normalize for podcast" | -af loudnorm=I=-16:TP=-1.5:LRA=11 |
| "Transcribe this" | Whisper → output SRT/VTT/TXT |
| "Extract audio from video" | -vn -acodec copy or re-encode |
| "Make it smaller" | Lower bitrate: -b:a 128k or -b:a 96k |
| "Speed up 1.5x" | -af atempo=1.5 |-q:a 2 (~190 kbps) or CBR -b:a 192kclawhub audio convert input.wav --format mp3 --bitrate 256k output.mp3Create chapters, highlights, and show notes from podcast audio or transcripts. Use when a user wants chapter markers, highlight clips, or show-note drafts without publishing or distribution actions.
clawhub install podcast-chaptering-highlights