IzwiIzwi

izwi tts

izwi tts

Generate speech from text.


Synopsis

izwi tts <TEXT> [OPTIONS]

Description

Converts text to speech using a TTS model. Supports multiple output formats, voice selection, and real-time streaming.


Arguments

ArgumentDescription
<TEXT>Text to synthesize (use - to read from stdin)

Options

OptionDescriptionDefault
-m, --model <MODEL>TTS model to useqwen3-tts-0.6b-base
-s, --speaker <VOICE>Voice/speaker (name or audio path)default
-o, --output <PATH>Output file pathstdout
-f, --format <FORMAT>Audio format: wav, mp3, ogg, flac, aacwav
-r, --speed <SPEED>Speech speed (0.5-2.0)1.0
-t, --temperature <TEMP>Sampling temperature0.7
--streamStream output in real-time
-p, --playPlay audio after generation

Examples

Basic usage

izwi tts "Hello, world!" --output hello.wav

Play immediately

izwi tts "Hello, world!" --play

Different format

izwi tts "Hello, world!" --format mp3 --output hello.mp3

Adjust speed

# Slower izwi tts "Speaking slowly" --speed 0.75 --output slow.wav # Faster izwi tts "Speaking quickly" --speed 1.5 --output fast.wav

Read from stdin

echo "Text from pipe" | izwi tts - --output piped.wav cat article.txt | izwi tts - --output article.wav

Voice cloning

izwi tts "Hello in cloned voice" \\ --model qwen3-tts-0.6b-customvoice \\ --speaker /path/to/reference.wav \\ --output cloned.wav

Voice design

izwi tts "Hello in designed voice" \\ --model qwen3-tts-0.6b-voicedesign \\ --speaker "A warm, friendly female voice" \\ --output designed.wav

Streaming with playback

izwi tts "Long text for streaming" --stream --play

Audio Formats

FormatExtensionNotes
wav.wavUncompressed, highest quality
mp3.mp3Compressed, widely compatible
ogg.oggOpen format, good compression
flac.flacLossless compression
aac.aacHigh efficiency compression

Models

ModelTypeDescription
qwen3-tts-0.6b-baseStandardGeneral-purpose TTS
qwen3-tts-0.6b-customvoiceCloningVoice cloning support
qwen3-tts-0.6b-voicedesignDesignVoice from descriptions
qwen3-tts-1.7b-*LargerHigher quality variants

See Also

  • Text-to-Speech Guide
  • Voice Cloning Guide
  • Voice Design Guide