Skip to main content
Pythondvcrn

aimlapi-voice

Transcribe audio files (ogg, mp3, wav, etc.) using AIMLAPI. Use when the user provides audio messages or local audio files. Provides a reliable Python script with retries and polling.

Stars
15
Source
dvcrn/openclaw-skills-marketplace
Updated
2026-05-29
Slug
dvcrn--openclaw-skills-marketplace--aimlapi-voice
View on GitHubRaw SKILL.md

// install — copy + paste into any project

mkdir -p .claude/skills && curl -fsSL https://raw.githubusercontent.com/dvcrn/openclaw-skills-marketplace/HEAD/plugins/aimlapihello--aiml-voice/skills/aimlapi-voice/SKILL.md -o .claude/skills/aimlapi-voice.md

Drops the SKILL.md into .claude/skills/aimlapi-voice.md. Works with Claude Code, Cursor, and any agent that loads SKILL.md files from .claude/skills/.

AIMLAPI Voice Transcription

Overview

A robust skill for transcribing audio via AIMLAPI's specialized speech-to-text endpoints. It handles queuing, polling for results, and automatic MIME-type detection.

Quick Start

# Set your API key first (if not in env)
# export AIMLAPI_API_KEY="your-key-here"

# Transcribe a file
python {baseDir}/scripts/transcribe.py path/to/audio.ogg

Tasks

Process Voice Messages

When an audio file is received, use this script to extract the text.

python {baseDir}/scripts/transcribe.py <file_path> \
  --model "#g1_whisper-medium" \
  --verbose

Arguments

  • file: (Required) Path to the audio file.
  • --model: Model ID (default: #g1_whisper-medium).
  • --out: Path to save the transcript text.
  • --poll-interval: Seconds between status checks (default: 5).
  • --max-wait: Stop waiting after N seconds (default: 300).

Dependencies

  • Python 3
  • AIMLAPI_API_KEY set in environment or provided via --apikey-file.