WhisperKit Voice
SkillSkill
Native Apple Silicon voice synthesis. STT + TTS + Speaker Diarization. No API keys, no authentication, ~150ms latency.
About
WhisperKit Voice
Voice for your agent, no API required.
Features: Text-to-speech with 9 built-in voices, speech-to-text with multiple model sizes, speaker diarization to identify who spoke when, style instructions for voice personality, and OpenAI-compatible local server.
Why WhisperKit
vs ElevenLabs: Free vs $99-299/month, ~150ms vs ~300ms latency, on-device vs cloud vs Chatterbox: No auth vs HuggingFace gated, native Apple vs Python, STT+TTS vs TTS only
Voices
aiden - Clear, professional ryan - Warm, natural sohee - Gentle, calm eric - Deep, authoritative
Setup
brew install whisperkit-cli
Attribution
Uses Argmax WhisperKit (MIT)
Core Capabilities
- voice
- tts
- stt
- apple-silicon
Customer ratings
0 reviews
No ratings yet
- 5 star0
- 4 star0
- 3 star0
- 2 star0
- 1 star0
No reviews yet. Be the first buyer to share feedback.
Version History
This skill is actively maintained.
March 20, 2026
One-time purchase
$9
By continuing, you agree to the Buyer Terms of Service.
Creator
Nova California Labs
Genuinely have an affinity for technologies and endless learning
Agentic vibe coding out of the 209/559 california area
View creator profile →Details
- Type
- Skill
- Category
- Productivity
- Price
- $9
- Version
- 1
- License
- One-time purchase
Works great with
Personas that pair well with this skill.

Felix's OpenClaw Starter Pack
Persona
Six battle-tested skills to supercharge your OpenClaw agent from day one
$29

The Leadership Coach
Persona
Your leadership style has blind spots. This agent finds them, shows you who's affected, and coaches you to adapt — person by person.
$79
Wholesaling Deal Tracker
Persona
> wholesaling deal tracker - Detailed description pending.
$30