🎵

OpenAI TTS

Medium

Convert text into natural and fluent speech audio using OpenAI TTS

Install

Use in AI Agents

Claude Code

# Install Skill (downloads SKILL.md to .claude/skills/)
clawhub install openai-tts

# Then just tell Claude: "use OpenAI TTS to help me..."

OpenAI Codex / Cursor / Windsurf

# Same install command — works with all SKILL.md-compatible AI coding tools
clawhub install openai-tts

OpenClaw Ecosystem

This Skill is compatible with the OpenClaw standard. After installation, a SKILL.md file is auto-generated, usable by any OpenClaw-compatible AI Agent (Claude Code, Cursor, Windsurf, etc.).

Environment & Dependencies

🟡

Medium

Paid API required or GPU helps significantly

API Dependencies: OpenAI API Key

Requires third-party API keys; some services need stable network access

SKILL.md

Generate speech from text via OpenAI's /v1/audio/speech endpoint.

Quick start

{baseDir}/scripts/speak.sh "Hello, world!"
{baseDir}/scripts/speak.sh "Hello, world!" --out /tmp/hello.mp3

Defaults:

Model: tts-1 (fast) or tts-1-hd (quality)
Voice: alloy (neutral), also: echo, fable, onyx, nova, shimmer
Format: mp3

Voices

| Voice | Description | |-------|-------------| | alloy | Neutral, balanced | | echo | Male, warm | | fable | British, expressive | | onyx | Deep, authoritative | | nova | Female, friendly | | shimmer | Female, soft |

Flags

{baseDir}/scripts/speak.sh "Text" --voice nova --model tts-1-hd --out speech.mp3
{baseDir}/scripts/speak.sh "Text" --format opus --speed 1.2

Options:

--voice <name>: alloy|echo|fable|onyx|nova|shimmer (default: alloy)
--model <name>: tts-1|tts-1-hd (default: tts-1)
--format <fmt>: mp3|opus|aac|flac|wav|pcm (default: mp3)
--speed <n>: 0.25-4.0 (default: 1.0)
--out <path>: output file (default: stdout or auto-named)

API key

Set OPENAI_API_KEY, or configure in ~/.clawdbot/clawdbot.json:

{
  skills: {
    entries: {
      "openai-tts": {
        apiKey: "sk-..."
      }
    }
  }
}

Pricing

tts-1: ~$0.015 per 1K characters
tts-1-hd: ~$0.030 per 1K characters

Very affordable for short responses!

Code Example

#!/usr/bin/env python3
# OpenAI TTS 完整示例
from openai import OpenAI
import os

client = OpenAI(api_key=os.getenv('OPENAI_API_KEY'))

# 转换文本列表为语音
texts = [
    "这是第一段文案",
    "这是第二段文案",
    "欢迎订阅我们的频道"
]

voices = ['alloy', 'echo', 'fable']  # 不同声音选项

for i, text in enumerate(texts):
    response = client.audio.speech.create(
        model="tts-1-hd",  # 高质量模型
        input=text,
        voice=voices[i % len(voices)],
        response_format="mp3"
    )
    
    output_file = f"audio_{i+1}.mp3"
    response.stream_to_file(output_file)
    print(f"✓ 已生成: {output_file}")

print("\n所有音频生成完成！")

Also popular in Audio & Voice

View all

Voice Transcribe

clawhubAudio & Voice Medium

3.6

Transcribe audio files using OpenAI's gpt-4o-mini-transcribe model with vocabulary hints and text replacements. Requires uv (https://docs.astral.sh/uv/).

clawhub install voice-transcribe

Podcast

clawhubAudio & Voice Easy Tested

3.6

Create and grow podcasts by planning episodes, producing audio or video, generating clips, and building audience across formats.

297

clawhub install podcast

Podcast Chaptering Highlights

clawhubAudio & Voice Easy

3.5

Create chapters, highlights, and show notes from podcast audio or transcripts. Use when a user wants chapter markers, highlight clips, or show-note drafts without publishing or distribution actions.

clawhub install podcast-chaptering-highlights

Text to Speech

clawhubAudio & Voice Medium

3.5

Generate speech audio from text using HeyGen's Starfish TTS model. Use when: (1) Generating standalone speech audio files from text, (2) Converting text to s...

5.4K

clawhub install text-to-speech-heygen

Podcast Production Pipeline

clawhubAudio & Voice Medium

3.4

端到端播客制作流水线 - 从选题到发布的完整自动化。支持录制前调研、大纲生成、节目笔记、社交媒体宣发。含国内平台适配（小宇宙/喜马拉雅/B站/小红书）。

clawhub install podcast-production-pipeline

Gemini Image Remix

clawhubAudio & Voice Medium

3.4

Generate or remix images using Gemini models with text prompts and multiple input images, supporting various styles, resolutions, and advanced model options.

clawhub install gemini-image-remix

OpenAI TTS

Install

🤖Use in AI Agents

Claude Code

OpenAI Codex / Cursor / Windsurf

OpenClaw Ecosystem

Environment & Dependencies

SKILL.md

Quick start

Voices

Flags

API key

Pricing

Code Example

Also popular in Audio & Voice

Voice Transcribe

Podcast

Podcast Chaptering Highlights

Text to Speech

Podcast Production Pipeline

Gemini Image Remix

Use in AI Agents