mirror of
https://github.com/Palm1r/QodeAssist.git
synced 2026-07-03 19:49:11 -04:00
67 lines
3.1 KiB
Markdown
67 lines
3.1 KiB
Markdown
# Configure for Ollama
|
|
|
|
1. Install [Ollama](https://ollama.com). Make sure to review the system requirements before installation.
|
|
2. Install a language models in Ollama via terminal. For example, you can run:
|
|
|
|
For standard computers (minimum 8GB RAM):
|
|
```bash
|
|
ollama run qwen2.5-coder:7b
|
|
```
|
|
For better performance (16GB+ RAM):
|
|
```bash
|
|
ollama run qwen2.5-coder:14b
|
|
```
|
|
For high-end systems (32GB+ RAM):
|
|
```bash
|
|
ollama run qwen2.5-coder:32b
|
|
```
|
|
|
|
3. Open Qt Creator settings (Edit > Preferences on Linux/Windows, Qt Creator > Preferences on macOS)
|
|
4. Navigate to QodeAssist > Providers and verify the bundled **Ollama (Native)** provider points at http://localhost:11434 (no API key needed)
|
|
5. Navigate to QodeAssist > General > Agent Pipelines — the Ollama agents are assigned by default:
|
|
- Code completion: **Ollama Completion — FIM** (needs a base/FIM model; use **Ollama Completion — Chat-style** for instruct models)
|
|
- Chat assistant: **Ollama Chat — Simple / Thinking / Gemma 4**
|
|
- Chat compression: **Ollama Compression — 8 GB** (or the 16/32 GB tier for your machine)
|
|
- Quick refactor: **Ollama Quick Refactor — Simple**
|
|
6. Point the agents at models you actually have (see the next section)
|
|
|
|
You're all set! QodeAssist is now ready to use in Qt Creator.
|
|
|
|
## Which models do I actually need?
|
|
|
|
You do **not** need a separate model for every agent. Each bundled Ollama agent
|
|
names a *default* model only as an example — you can point any agent at a model
|
|
you already have via its settings → **Change…** (a per-agent override; it does not
|
|
edit the bundled agent). **Seeing a model name on an agent is not a reason to
|
|
download it.**
|
|
|
|
The defaults cluster into a tiny set, so one or two pulls cover everyday use:
|
|
|
|
| Pull this | Unlocks |
|
|
|---|---|
|
|
| `qwen2.5-coder:7b` | Ollama Chat — Simple · Ollama Completion — FIM · Ollama Completion — Chat-style · Ollama Quick Refactor |
|
|
| `qwen3.5:9b` (or `:4b` on ~8 GB) | Ollama Chat — Thinking · Ollama Compression — 16/32 GB (`:4b` → Compression — 8 GB) |
|
|
|
|
Optional specialists — pull only if you want that capability:
|
|
|
|
| Pull this | For |
|
|
|---|---|
|
|
| `gemma4:12b` | Ollama Chat — Gemma 4 — agentic chat with vision + native reasoning |
|
|
| `theqtcompany/codellama-7b-qml` | Ollama Completion — QML (Qt) — Qt's QML-specific completion model |
|
|
|
|
Rule of thumb: pick the agent for the job, then either pull its named model **or**
|
|
swap it (Change…) for one you already have.
|
|
|
|
## Extended Thinking Mode
|
|
|
|
Ollama supports native reasoning for models trained for it (Qwen3.5, Gemma 4, DeepSeek-R1, QwQ, …). Reasoning is streamed into collapsible "Thinking" blocks in the chat.
|
|
|
|
Thinking is a property of the **agent**, not a global switch:
|
|
|
|
- For chat, pick a thinking agent in the chat panel: **Ollama Chat — Thinking** or **Ollama Chat — Gemma 4**
|
|
- For quick refactor, assign **Ollama Quick Refactor — Qwen3.5** or **— Gemma 4** in Agent Pipelines
|
|
- In your own agents, set `think = true` in the `[body]` table (top level, not under `[body.options]`)
|
|
|
|
Use a reasoning-capable model with these agents — a non-reasoning model simply ignores the flag.
|
|
|