openclaw/skills/openai-whisper-api/SKILL.md

---
name: openai-whisper-api
description: Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
homepage: https://platform.openai.com/docs/guides/speech-to-text
metadata:
  {
    "openclaw":
      {
        "emoji": "🌐",
        "requires": { "bins": ["curl"], "env": ["OPENAI_API_KEY"] },
        "primaryEnv": "OPENAI_API_KEY",
      },
  }
---

# OpenAI Whisper API (curl)

Transcribe an audio file via OpenAI’s `/v1/audio/transcriptions` endpoint.

## Quick start

```bash
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a
```

Defaults:

- Model: `whisper-1`
- Output: `<input>.txt`

## Useful flags

```bash
{baseDir}/scripts/transcribe.sh /path/to/audio.ogg --model whisper-1 --out /tmp/transcript.txt
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --language en
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --prompt "Speaker names: Peter, Daniel"
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --json --out /tmp/transcript.json
```

## API key

Set `OPENAI_API_KEY`, or configure it in `~/.openclaw/openclaw.json`:

```json5
{
  skills: {
    "openai-whisper-api": {
      apiKey: "OPENAI_KEY_HERE",
    },
  },
}
```
-												feat(skills): add media/transcription helpers

											
										
										
											2025-12-20 12:53:09 +00:00
+								---
 								name: openai-whisper-api
 								description: Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
-												chore(skills): add homepage metadata

											
										
										
											2025-12-20 21:12:48 +01:00
+								homepage: https://platform.openai.com/docs/guides/speech-to-text
-												chore: Also format `scripts` and `skills`.

											
										
										
											2026-01-31 21:21:09 +09:00
+								metadata:
 								  {
 								    "openclaw":
 								      {
-												fix(terminal): stabilize skills table width across Terminal.app and iTerm (#42849)

* Terminal: measure grapheme display width

* Tests: cover grapheme terminal width

* Terminal: wrap table cells by grapheme width

* Tests: cover emoji table alignment

* Terminal: refine table wrapping and width handling

* Terminal: stop shrinking CLI tables by one column

* Skills: use Terminal-safe emoji in list output

* Changelog: note terminal skills table fixes

* Skills: normalize emoji presentation across outputs

* Terminal: consume unsupported escape bytes in tables
											
										
										
											2026-03-11 09:13:10 -04:00
+								        "emoji": "🌐",
-												chore: Also format `scripts` and `skills`.

											
										
										
											2026-01-31 21:21:09 +09:00
+								        "requires": { "bins": ["curl"], "env": ["OPENAI_API_KEY"] },
 								        "primaryEnv": "OPENAI_API_KEY",
 								      },
 								  }
-												feat(skills): add media/transcription helpers

											
										
										
											2025-12-20 12:53:09 +00:00
+								---
 								# OpenAI Whisper API (curl)
 								Transcribe an audio file via OpenAI’s `/v1/audio/transcriptions` endpoint.
 								## Quick start
 								```bash
 								{baseDir}/scripts/transcribe.sh /path/to/audio.m4a
 								```
 								Defaults:
-												chore: Also format `scripts` and `skills`.

											
										
										
											2026-01-31 21:21:09 +09:00
-												feat(skills): add media/transcription helpers

											
										
										
											2025-12-20 12:53:09 +00:00
+								- Model: `whisper-1`
 								- Output: `<input>.txt`
 								## Useful flags
 								```bash
 								{baseDir}/scripts/transcribe.sh /path/to/audio.ogg --model whisper-1 --out /tmp/transcript.txt
 								{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --language en
 								{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --prompt "Speaker names: Peter, Daniel"
 								{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --json --out /tmp/transcript.json
 								```
 								## API key
-												chore: update openclaw naming

											
										
										
											2026-01-30 21:01:02 +01:00
+								Set `OPENAI_API_KEY`, or configure it in `~/.openclaw/openclaw.json`:
-												feat(skills): add media/transcription helpers

											
										
										
											2025-12-20 12:53:09 +00:00
 								```json5
 								{
 								  skills: {
 								    "openai-whisper-api": {
-												chore: Also format `scripts` and `skills`.

											
										
										
											2026-01-31 21:21:09 +09:00
+								      apiKey: "OPENAI_KEY_HERE",
 								    },
 								  },
-												feat(skills): add media/transcription helpers

											
										
										
											2025-12-20 12:53:09 +00:00
+								}
 								```