Voice Transcribe
Transcribe audio files using OpenAI's gpt-4o-mini-transcribe model with vocabulary hints and text replacements. Requires uv (https://docs.astral.sh/uv/).
clawhub install voice-transcribe
Create and customize voice characters with a distinctive style.
# Install Skill (downloads SKILL.md to .claude/skills/)
clawhub install noizai-characteristic-voice
# Then just tell Claude: "use Characteristic Voice to help me..."
# Same install command; works with all SKILL.md-compatible AI coding tools
clawhub install noizai-characteristic-voice
This Skill is compatible with the OpenClaw standard. After installation, a SKILL.md file is auto-generated, usable by any OpenClaw-compatible AI Agent (Claude Code, Cursor, Windsurf, etc.).
| Variable | Required | Description |
| --- | --- | --- |
| NOIZ_API_KEY | Yes if using Noiz backend | API key from [developers.noiz.ai](https://developers.noiz.ai/api-keys). Not needed if using the local Kokoro backend. |

The key can be saved to ~/.noiz_api_key (mode 600) for convenience. To set it:
bash skills/characteristic-voice/scripts/speak.sh config --set-api-key YOUR_KEY
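To make the "mode 600" detail concrete, here is a minimal sketch of what storing and reading an owner-only key file can look like. The helper names `save_api_key` and `load_api_key` are hypothetical and are not part of speak.sh; the script's own `config --set-api-key` command may work differently.

```shell
# Hypothetical helpers for illustration; speak.sh may handle the key differently.

# Write a key to a file that only the current user can read or write.
save_api_key() {
  local key_file="$1" key="$2"
  (
    umask 077                          # a newly created file starts owner-only
    printf '%s\n' "$key" > "$key_file"
  ) || return 1
  chmod 600 "$key_file"                # also tighten a file that already existed
}

# Print the stored key, refusing files whose mode is anything other than 600.
load_api_key() {
  local key_file="$1" perms
  [ -f "$key_file" ] || return 1
  perms="$(ls -l "$key_file" | cut -c1-10)"
  if [ "$perms" != "-rw-------" ]; then
    echo "refusing $key_file: expected mode 600" >&2
    return 1
  fi
  cat "$key_file"
}
```

Checking the mode before reading is a design choice borrowed from tools such as ssh, which decline to use credential files that other users can read.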
The speak.sh script requires curl and python3 at runtime. Depending on which backend and features you use, you may also need:

| Tool | When needed | Install |
| --- | --- | --- |
| curl, python3 | Always (core script) | Usually pre-installed |
| kokoro-tts | Kokoro (local/offline) backend | uv tool install kokoro-tts |
| yt-dlp | Downloading reference audio for voice cloning | [github.com/yt-dlp/yt-dlp](https://github.com/yt-dlp/yt-dlp) |
| ffmpeg | Trimming reference audio clips | [ffmpeg.org](https://ffmpeg.org) |
| rg (ripgrep) | Searching subtitle files | [github.com/BurntSushi/ripgrep](https://github.com/BurntSushi/ripgrep) |

The Noiz backend sends requests to https://noiz.ai/v1. If you supply --ref-audio, that audio file is uploaded for voice cloning. Use the local Kokoro backend (--backend kokoro) if you want fully offline processing.

Use ... after fillers for a beat of silence, and ~ at word endings for warmth.

Check whether a reference audio file already exists under skills/characteristic-voice/. If one does, use it directly with --ref-audio.

yt-dlp and ffmpeg can download and trim audio. Example workflow:
yt-dlp "URL" --write-auto-sub --sub-lang en --skip-download -o tmp/clip
rg -n "target line" tmp/clip.en.vtt
yt-dlp "URL" -x --audio-format wav --download-sections "*00:00:00-00:00:25" -o tmp/clip
ffmpeg -i tmp/clip.wav -ss 00:00:02 -to 00:00:20 skills/characteristic-voice/character.wav
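Since the workflow above depends on three optional tools, a small preflight check can save a failed run partway through. The following is a sketch only: `missing_tools` is a hypothetical helper, not something shipped with the skill, and it relies on nothing beyond the POSIX `command -v` builtin.

```shell
# Hypothetical helper: print each named tool that is not on PATH, one per line.
missing_tools() {
  local tool
  for tool in "$@"; do
    command -v "$tool" >/dev/null 2>&1 || printf '%s\n' "$tool"
  done
}

# Usage: check the optional tools before starting the cloning workflow.
# missing="$(missing_tools yt-dlp ffmpeg rg)"
# [ -z "$missing" ] || echo "install first: $missing" >&2
```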
Reference audio is uploaded to https://noiz.ai/v1 for voice cloning when using the Noiz backend. If this is a concern, consider using the local Kokoro backend instead.
bash skills/characteristic-voice/scripts/speak.sh \
--preset goodnight -t "Hmm... rest well~ Sweet dreams." \
--ref-audio skills/characteristic-voice/character.wav -o night.wav
The --ref-audio flag uploads the file to the Noiz backend for voice cloning (requires NOIZ_API_KEY).
Generate speech with speak.sh, a wrapper around the tts skill with companion-friendly presets.
# Use a preset (auto-sets emotion + speed)
bash skills/characteristic-voice/scripts/speak.sh \
--preset goodnight -t "Hmm... rest well~ Sweet dreams." -o night.wav
# Custom emotion override
bash skills/characteristic-voice/scripts/speak.sh \
-t "Aww... I'm right here." --emo '{"Tenderness":0.9}' --speed 0.75 -o comfort.wav
# With specific backend and voice
bash skills/characteristic-voice/scripts/speak.sh \
--preset morning -t "Good morning~" --voice-id voice_abc --backend noiz -o morning.mp3 --format mp3
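The examples above pass --backend by hand. As a sketch of how a wrapper might choose one automatically, the hypothetical `pick_backend` helper below prints `noiz` when a key is available (in NOIZ_API_KEY or in the key file) and otherwise `kokoro`. It is an illustration of the documented requirement, not behavior that speak.sh is known to implement.

```shell
# Hypothetical helper: choose a value for --backend.
# Prints "noiz" when an API key is available, otherwise "kokoro" (local/offline).
pick_backend() {
  local key_file="${1:-$HOME/.noiz_api_key}"
  if [ -n "${NOIZ_API_KEY:-}" ] || [ -s "$key_file" ]; then
    echo noiz
  else
    echo kokoro
  fi
}

# Usage (assumes speak.sh is installed, plus kokoro-tts for the kokoro case):
# bash skills/characteristic-voice/scripts/speak.sh \
#   --preset morning -t "Good morning~" --backend "$(pick_backend)" -o morning.wav
```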
Run bash skills/characteristic-voice/scripts/speak.sh --help for all options.
clawhub install noizai-characteristic-voice && clawhub run noizai-characteristic-voice --text "欢迎使用本产品" --voice "friendly" --speed 1.0 --output "greeting.mp3"
Create chapters, highlights, and show notes from podcast audio or transcripts. Use when a user wants chapter markers, highlight clips, or show-note drafts without publishing or distribution actions.
clawhub install podcast-chaptering-highlights