Voice Transcribe
Transcribe audio files using OpenAI's gpt-4o-mini-transcribe model with vocabulary hints and text replacements. Requires uv (https://docs.astral.sh/uv/).
clawhub install voice-transcribe

Generate high-quality Chinese speech synthesis audio using the Aliyun TTS service.
# Install Skill (downloads SKILL.md to .claude/skills/)
clawhub install aliyun-tts
# Then just tell Claude: "use Aliyun TTS to help me..."

# Same install command; works with all SKILL.md-compatible AI coding tools
clawhub install aliyun-tts
This Skill is compatible with the OpenClaw standard. After installation, a SKILL.md file is auto-generated, usable by any OpenClaw-compatible AI Agent (Claude Code, Cursor, Windsurf, etc.).
ALIYUN_APP_KEY - Application Key
ALIYUN_ACCESS_KEY_ID - Access Key ID
ALIYUN_ACCESS_KEY_SECRET - Access Key Secret (sensitive)

# Configure App Key
clawdbot skills config aliyun-tts ALIYUN_APP_KEY "your-app-key"
# Configure Access Key ID
clawdbot skills config aliyun-tts ALIYUN_ACCESS_KEY_ID "your-access-key-id"
# Configure Access Key Secret (sensitive)
clawdbot skills config aliyun-tts ALIYUN_ACCESS_KEY_SECRET "your-access-key-secret"
~/.clawdbot/clawdbot.json:
{
  skills: {
    entries: {
      "aliyun-tts": {
        env: {
          ALIYUN_APP_KEY: "your-app-key",
          ALIYUN_ACCESS_KEY_ID: "your-access-key-id",
          ALIYUN_ACCESS_KEY_SECRET: "your-access-key-secret"
        }
      }
    }
  }
}
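Before invoking the CLI, it can help to confirm that all three variables are actually set. The following is a minimal Python sketch, assuming the values are exported into the process environment under the names listed above; `missing_aliyun_vars` is a hypothetical helper and not part of the Skill.

```python
import os

# Variable names as listed in the configuration above.
REQUIRED_VARS = (
    "ALIYUN_APP_KEY",
    "ALIYUN_ACCESS_KEY_ID",
    "ALIYUN_ACCESS_KEY_SECRET",
)


def missing_aliyun_vars(env=None):
    """Return the required variable names that are unset or empty (hypothetical helper)."""
    env = os.environ if env is None else env
    return [name for name in REQUIRED_VARS if not env.get(name)]


if __name__ == "__main__":
    missing = missing_aliyun_vars()
    if missing:
        print("Missing configuration:", ", ".join(missing))
    else:
        print("All Aliyun TTS variables are set.")
```

Passing a plain dictionary instead of relying on `os.environ` keeps the check easy to exercise without touching real credentials.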
# Basic usage
{baseDir}/bin/aliyun-tts "Hello, this is Aliyun TTS"
# Specify output file
{baseDir}/bin/aliyun-tts -o /tmp/voice.mp3 "Hello"
# Specify voice
{baseDir}/bin/aliyun-tts -v siyue "Use siyue voice"
# Specify format and sample rate
{baseDir}/bin/aliyun-tts -f mp3 -r 16000 "Audio parameters"
| Option | Description | Default |
| --- | --- | --- |
| -o, --output | Output file path | tts.mp3 |
| -v, --voice | Voice name | siyue |
| -f, --format | Audio format | mp3 |
| -r, --sample-rate | Sample rate (Hz) | 16000 |

Available voices: siyue, xiaoxuan, xiaoyun, etc. See Alibaba Cloud documentation for the full list.

# Generate audio
{baseDir}/bin/aliyun-tts -o /tmp/voice-reply.mp3 "Your reply content"
# Include in your response:
# MEDIA:/tmp/voice-reply.mp3
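
The same workflow can be scripted. The sketch below only assembles the command line and the `MEDIA:` line from the options documented above. `build_tts_command`, `media_line`, and `synthesize` are hypothetical helper names, the default values are assumed from the options table, and the binary path is passed in because `{baseDir}` is a placeholder. Combining all four flags in one call is also an assumption, since the examples above show them separately.

```python
import subprocess


def build_tts_command(binary, text, output="tts.mp3", voice="siyue",
                      fmt="mp3", sample_rate=16000):
    """Assemble the argument list for the aliyun-tts CLI (hypothetical helper)."""
    return [
        binary,
        "-o", output,
        "-v", voice,
        "-f", fmt,
        "-r", str(sample_rate),
        text,  # the text to synthesize comes last, as in the examples above
    ]


def media_line(output):
    """Format the line to include in a response so the audio file is referenced."""
    return f"MEDIA:{output}"


def synthesize(binary, text, output):
    """Run the CLI and return the MEDIA line; raises CalledProcessError on failure."""
    subprocess.run(build_tts_command(binary, text, output=output), check=True)
    return media_line(output)
```

Passing the arguments as a list rather than a single shell string means the reply text needs no quoting or escaping.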
#!/usr/bin/env python3
# Aliyun TTS batch Chinese voice-over example
import os

from aliyunsdkcore.client import AcsClient
from aliyunsdktts.request.v20190627 import SynthesizeSpeechRequest

# Configure Aliyun credentials
ACCESS_KEY = os.getenv('ALIYUN_ACCESS_KEY')
SECRET_KEY = os.getenv('ALIYUN_SECRET_KEY')
REGION = 'cn-shanghai'
client = AcsClient(ACCESS_KEY, SECRET_KEY, REGION)

# Text content and settings. The sample texts stay in Chinese because the
# script synthesizes Chinese speech: a welcome line, a lesson intro, and a closing line.
contents = [
    {"text": "欢迎来到我们的在线课堂", "voice": "Siqi", "output": "welcome.mp3"},
    {"text": "今天我们学习Python编程基础", "voice": "Xiaoyun", "output": "lesson.mp3"},
    {"text": "感谢大家的认真听讲,下课", "voice": "Siqi", "output": "closing.mp3"},
]

# Human-readable voice descriptions, used only in the log output
voices = {
    'Siqi': 'Female - Siqi (warm)',
    'Xiaoyun': 'Female - Xiaoyun (standard)',
    'Yajie': 'Female - Yajie (gentle)',
    'Zhiyu': 'Male - Zhiyu (steady)'
}

for item in contents:
    request = SynthesizeSpeechRequest()
    request.set_Text(item['text'])
    request.set_Voice(item['voice'])
    request.set_Format('mp3')
    request.set_SampleRate(16000)
    request.set_Volume(50)
    request.set_SpeechRate(1.0)  # 1x speech rate
    try:
        response = client.do_action_with_exception(request)
        # Assumes the response exposes the audio bytes under 'AudioContent'
        with open(item['output'], 'wb') as f:
            f.write(response['AudioContent'])
        print(f"✓ {item['output']} generated successfully")
        print(f"  Text: {item['text'][:30]}...")
        print(f"  Voice: {voices[item['voice']]}\n")
    except Exception as e:
        print(f"✗ {item['output']} failed: {e}\n")

print("All audio generation tasks finished!")

# Rough cost estimate based on the total number of characters synthesized
print("\nCost estimate:")
total_chars = sum(len(item['text']) for item in contents)
print(f"Total characters: {total_chars}")
print(f"Estimated cost: ¥{total_chars/10000:.2f} (at ¥1 per 10,000 characters)")

Create chapters, highlights, and show notes from podcast audio or transcripts. Use when a user wants chapter markers, highlight clips, or show-note drafts without publishing or distribution actions.
clawhub install podcast-chaptering-highlights