IzwiIzwi

Izwi Documentation

Welcome to the official documentation for Izwi — a local-first audio inference engine for text-to-speech, speech recognition, and voice AI workflows.

What is Izwi?

Izwi is a powerful, privacy-focused audio AI platform that runs entirely on your machine. No cloud services, no API keys, no data leaving your device.

Key capabilities:

  • Voice Mode — Real-time voice conversations with AI
  • Text-to-Speech — Generate natural speech from text
  • Voice Cloning — Clone any voice from a short audio sample
  • Voice Design — Create custom voices from text descriptions
  • Transcription — Convert audio to text with high accuracy
  • Diarization — Identify and separate multiple speakers
  • Chat — Text-based AI conversations

SectionDescription
Getting StartedInstall Izwi and run your first command
InstallationPlatform-specific installation guides
FeaturesLearn about each feature in detail
ModelsUnderstand and manage AI models
CLI ReferenceComplete command-line reference
TroubleshootingCommon issues and solutions

System Requirements

RequirementMinimumRecommended
macOS12.0+ (Monterey)14.0+ (Sonoma)
LinuxUbuntu 20.04+Ubuntu 22.04+
WindowsWindows 10Windows 11
RAM8 GB16 GB+
Storage10 GB free50 GB+ free
GPUApple Silicon / NVIDIA CUDA
Note: Izwi is optimized for Apple Silicon Macs with Metal acceleration. CUDA support is available for NVIDIA GPUs.

Getting Help


License

Izwi is open source software licensed under Apache 2.0.