---
name: openai-whisper-api
description: Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
homepage: https://platform.openai.com/docs/guides/speech-to-text
metadata:
  {
    "openclaw":
      {
        "emoji": "🌐",
        "requires": { "bins": ["curl"], "env": ["OPENAI_API_KEY"] },
        "primaryEnv": "OPENAI_API_KEY",
      },
  }
---
# OpenAI Whisper API (curl)
Transcribe an audio file via OpenAI’s `/v1/audio/transcriptions` endpoint.
## Quick start
```bash
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a
```
Defaults:
- Model: `whisper-1`
- Output: `<input>.txt`
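
For orientation, the request behind the quick start can be sketched with plain `curl`. This is a minimal sketch, not the contents of `transcribe.sh`: the function name `transcribe_sketch` is hypothetical, and it assumes a multipart upload to the endpoint named above with the default `whisper-1` model and plain-text output.

```bash
# Hypothetical helper, not part of the skill: sends one audio file to the
# Audio Transcriptions endpoint and prints the transcript to stdout.
transcribe_sketch() {
  local audio_file="$1"
  local model="${2:-whisper-1}"   # falls back to the default model listed above

  # ${VAR:?msg} aborts with an error message when the key is unset or empty.
  curl -sS --fail https://api.openai.com/v1/audio/transcriptions \
    -H "Authorization: Bearer ${OPENAI_API_KEY:?OPENAI_API_KEY is not set}" \
    -F "file=@${audio_file}" \
    -F "model=${model}" \
    -F "response_format=text"
}
```

With the function loaded, `transcribe_sketch /path/to/audio.m4a > /tmp/transcript.txt` would write the transcript to a file of your choosing.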
## Useful flags
```bash
{baseDir}/scripts/transcribe.sh /path/to/audio.ogg --model whisper-1 --out /tmp/transcript.txt
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --language en
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --prompt "Speaker names: Peter, Daniel"
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --json --out /tmp/transcript.json
```
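
These flags plausibly correspond to form fields on the transcription request. The sketch below assumes that mapping (`--language` to `language`, `--prompt` to `prompt`, `--json` to `response_format=json`); the function name and its positional arguments are hypothetical, and the real script may behave differently.

```bash
# Hypothetical sketch: optional form fields are added only when a value is given.
# Arguments: <audio-file> [language] [prompt] [response_format]
transcribe_with_options() {
  local audio_file="$1"
  local language="${2:-}"
  local prompt="${3:-}"
  local format="${4:-text}"   # pass "json" for a JSON response body

  # Reuse the positional parameters as a portable argument list for curl.
  set -- -F "file=@${audio_file}" -F "model=whisper-1" -F "response_format=${format}"
  if [ -n "$language" ]; then set -- "$@" -F "language=${language}"; fi
  if [ -n "$prompt" ]; then set -- "$@" -F "prompt=${prompt}"; fi

  curl -sS --fail https://api.openai.com/v1/audio/transcriptions \
    -H "Authorization: Bearer ${OPENAI_API_KEY:?OPENAI_API_KEY is not set}" \
    "$@"
}
```

For example, `transcribe_with_options /path/to/audio.m4a en "Speaker names: Peter, Daniel" json > /tmp/transcript.json` would request a JSON response with a language hint and a prompt.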
## API key
Set `OPENAI_API_KEY`, or configure it in `~/.openclaw/openclaw.json`:
```json5
{
  skills: {
    "openai-whisper-api": {
      apiKey: "OPENAI_KEY_HERE",
    },
  },
}
```
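
For the environment-variable route, a shell session might look like the following. The guard function `require_api_key` is a hypothetical helper for illustration only, and `OPENAI_KEY_HERE` is the same placeholder used in the config above.

```bash
# Placeholder value: substitute a real key before running anything.
export OPENAI_API_KEY="OPENAI_KEY_HERE"

# Hypothetical guard: fail early with a clear message when the key is missing.
require_api_key() {
  if [ -z "${OPENAI_API_KEY:-}" ]; then
    echo "OPENAI_API_KEY is not set" >&2
    return 1
  fi
}

require_api_key && echo "API key found"
```

Exporting the variable in the current shell affects only that session; the config file is the persistent alternative.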