izwi serve

Start the Izwi inference server.

Synopsis

izwi serve [OPTIONS]

Launches the HTTP API server that powers all Izwi functionality. The server provides:

Option	Description	Default
`--mode <MODE>`	Startup mode: `server`, `desktop`, `web`	`server`
`-H, --host <HOST>`	Host to bind to	`0.0.0.0`
`-p, --port <PORT>`	Port to listen on	`8080`
`-m, --models-dir <PATH>`	Models directory	Platform default
`--max-batch-size <N>`	Maximum batch size	`8`
`--metal`	Enable Metal GPU (macOS)	—
`-t, --threads <N>`	Number of CPU threads	Auto
`--max-concurrent <N>`	Max concurrent requests	`100`
`--timeout <SECONDS>`	Request timeout	`300`
`--log-level <LEVEL>`	Log level	`warn`
`--cors`	Enable CORS for all origins	—
`--no-ui`	Disable the web UI	—

Starts only the HTTP server:

izwi serve izwi serve --mode server

Access at http://localhost:8080

Starts the server and opens the native desktop application:

izwi serve --mode desktop

Starts the server and opens the web UI in your default browser:

izwi serve --mode web

izwi serve

izwi serve --port 9000

izwi serve --metal

izwi serve --models-dir /path/to/models

izwi serve \\ --host 0.0.0.0 \\ --port 8080 \\ --max-concurrent 200 \\ --timeout 600 \\ --log-level info

izwi serve --cors --log-level debug

Press Ctrl+C to gracefully shut down the server. Active requests will complete before shutdown.