Chat
Have text-based conversations with AI models running locally on your device.
Overview
Izwi's chat feature provides:
- Local AI — Models run entirely on your device
- Privacy — No data sent to external servers
- Multiple models — Choose from available chat models
- Context memory — Maintains conversation history
- System prompts — Customize AI behavior
Getting Started
Download a Chat Model
izwi pull qwen3-chat-0.6b-4bitStart Chatting
Command line:
izwi chatWeb UI:
http://localhost:8080/chatUsing the CLI
Interactive Mode
Start an interactive chat session:
izwi chatType your messages and press Enter. Type exit or quit to end.
Options
| Option | Description | Default |
|---|---|---|
--model, -m | Chat model to use | qwen3-chat-0.6b-4bit |
--system, -s | System prompt | — |
--voice, -v | Voice for spoken responses | — |
Examples
With custom system prompt:
izwi chat --system "You are a helpful coding assistant."With specific model:
izwi chat --model qwen3-chat-0.6b-4bitWith voice responses:
izwi chat --voice defaultUsing the Web UI
- Navigate to Chat in the sidebar
- Type your message in the input field
- Press Enter or click Send
- View the AI response
Features
- Conversation history — Scroll through past messages
- Clear chat — Start a fresh conversation
- Model selection — Switch between loaded models
- Copy responses — One-click copy
Using the API
Endpoint
POST /v1/chat/completionsRequest
{
"model": "qwen3-chat-0.6b-4bit",
"messages": [
{"role": "system", "content": "You are a helpful assistant."},
{"role": "user", "content": "Hello!"}
],
"temperature": 0.7,
"max_tokens": 1000
}Response
{
"id": "chat-123",
"object": "chat.completion",
"created": 1234567890,
"model": "qwen3-chat-0.6b-4bit",
"choices": [
{
"index": 0,
"message": {
"role": "assistant",
"content": "Hello! How can I help you today?"
},
"finish_reason": "stop"
}
]
}Streaming
For streaming responses, add "stream": true:
{
"model": "qwen3-chat-0.6b-4bit",
"messages": [...],
"stream": true
}Example (curl)
curl -X POST http://localhost:8080/v1/chat/completions \ -H "Content-Type: application/json" \ -d '{ "model": "qwen3-chat-0.6b-4bit", "messages": [{"role": "user", "content": "Hello!"}] }'System Prompts
Customize AI behavior with system prompts:
Coding assistant:
You are an expert programmer. Provide clear, well-commented code examples.Writing helper:
You are a professional editor. Help improve writing clarity and style.Concise responder:
You are a helpful assistant. Keep responses brief and to the point.Available Models
| Model | Size | Speed | Notes |
|---|---|---|---|
qwen3-chat-0.6b-4bit | ~400 MB | Fast | Compact, efficient |
More models coming soon.
Tips
- Be specific — Clear questions get better answers
- Use system prompts — Guide the AI's behavior
- Break down complex tasks — Ask step by step
- Provide context — Include relevant background
See Also
- Voice Mode — Spoken conversations
- Models — Download more models
- CLI Reference — Full command documentation