🎬

moss-transcribe-diarize

Medium

Transcribe videos and perform speaker diarization on audio content

Install

Use in AI Agents

Claude Code

# Install Skill (downloads SKILL.md to .claude/skills/)
clawhub install moss-transcribe-diarize

# Then just tell Claude: "use moss-transcribe-diarize to help me..."

OpenAI Codex / Cursor / Windsurf

# Same install command — works with all SKILL.md-compatible AI coding tools
clawhub install moss-transcribe-diarize

OpenClaw Ecosystem

This Skill follows the OpenClaw standard. Installation generates a SKILL.md file that any OpenClaw-compatible AI agent (Claude Code, Cursor, Windsurf, etc.) can use.

Environment & Dependencies

🟡

Medium

Paid API required or GPU helps significantly

API Dependencies: speech_recognition_api

Requires third-party API keys; some services need stable network access

SKILL.md

You are a speech transcription assistant. Call scripts/transcribe.py directly according to the user's request.

Common commands

Transcribe audio from a URL:

python scripts/transcribe.py --audio-url "https://example.com/audio.mp3" --out "result.json"

Transcribe a local audio or video file (converted to a data URL automatically):

python scripts/transcribe.py --file "/path/to/meeting.mp4" --out "result.json"

Pass a data URL directly:

python scripts/transcribe.py --audio-data "data:audio/wav;base64,..." --out "result.json"
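The local-file command above is described as converting the file to a data URL automatically. The source does not show how scripts/transcribe.py does this, so the following is only a minimal sketch of one plausible approach. The helper name `file_to_data_url` and the fallback MIME type are assumptions for illustration, not the script's actual code.

```python
import base64
import mimetypes
from pathlib import Path


def file_to_data_url(path: str) -> str:
    """Encode a local file as a base64 data URL (hypothetical helper)."""
    # Guess the MIME type from the file extension.
    mime, _ = mimetypes.guess_type(path)
    if mime is None:
        # Assumption: fall back to a generic binary type for unknown extensions.
        mime = "application/octet-stream"
    # Read the whole file and base64-encode it as ASCII text.
    payload = base64.b64encode(Path(path).read_bytes()).decode("ascii")
    return f"data:{mime};base64,{payload}"
```

The resulting string has the same shape as the value passed to --audio-data. Base64 encoding grows the payload by roughly a third, so passing a URL with --audio-url may be preferable for large recordings.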

Choose the segment output format the user asks for:

- Readable text: --segments-format text
- JSON array (recommended, includes speaker): --segments-format json
- Compact JSON string: --segments-format compact

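Constraints

The script reads the API key from a shared set of environment variables, in this order of priority: MOSS_API_KEY → MOSI_TTS_API_KEY → MOSI_API_KEY. If none of them is set, remind the user.
Default model: moss-transcribe-diarize.
Fixed endpoint: https://studio.mosi.cn/v1/audio/transcriptions (a custom endpoint parameter is no longer exposed).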
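The environment-variable priority above could be resolved roughly as follows. This is a sketch only: the function name `resolve_api_key` is hypothetical, and treating an empty or whitespace-only value as missing is an assumption the source does not state.

```python
import os
from typing import Mapping, Optional

# Variable names in the priority order given in the constraints.
API_KEY_VARS = ("MOSS_API_KEY", "MOSI_TTS_API_KEY", "MOSI_API_KEY")


def resolve_api_key(env: Optional[Mapping[str, str]] = None) -> Optional[str]:
    """Return the first non-empty key in priority order, or None if all are missing."""
    env = os.environ if env is None else env
    for name in API_KEY_VARS:
        # Assumption: blank values count as "not set".
        value = env.get(name, "").strip()
        if value:
            return value
    return None
```

A caller would check for `None` and, as the constraint requires, remind the user to set one of the three variables before making a request.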
Three output files are written:

- *.json: the raw response
- *.segments.*: the segmented result (format set by --segments-format, includes speaker)
- *.by_speaker.txt: the transcript grouped by speaker
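To make the per-speaker output more concrete, here is a minimal sketch of how diarized segments might be grouped by speaker. The source does not document the response schema or the layout of the by_speaker file, so the `speaker` and `text` field names, the `summarize_by_speaker` helper, and the text layout are all assumptions.

```python
from typing import Dict, List


def summarize_by_speaker(segments: List[dict]) -> str:
    """Group segment texts under each speaker, in order of first appearance."""
    grouped: Dict[str, List[str]] = {}
    for seg in segments:
        # Assumed field names: "speaker" and "text".
        speaker = str(seg.get("speaker", "unknown"))
        grouped.setdefault(speaker, []).append(str(seg.get("text", "")).strip())
    # One block per speaker: a header line followed by that speaker's utterances.
    return "\n\n".join(
        f"{speaker}:\n" + "\n".join(lines) for speaker, lines in grouped.items()
    )
```

For example, three segments spoken by S1, S2, and S1 again would yield two blocks, with both of S1's utterances under the first.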

Code Example

python scripts/transcribe.py --file "conference.mp4" --out "result.json" --segments-format json
