mirror of
https://github.com/Palm1r/QodeAssist.git
synced 2026-07-01 02:29:13 -04:00
feat: Add more agents configs
This commit is contained in:
@@ -143,18 +143,18 @@ needs:
|
|||||||
|---|---|
|
|---|---|
|
||||||
| `Claude Base Chat` | Claude, Anthropic Messages (`/v1/messages`) |
|
| `Claude Base Chat` | Claude, Anthropic Messages (`/v1/messages`) |
|
||||||
| `OpenAI Base Chat` | OpenAI, Chat Completions (`/chat/completions`) |
|
| `OpenAI Base Chat` | OpenAI, Chat Completions (`/chat/completions`) |
|
||||||
| `OpenAI Responses Base` | OpenAI, Responses API (`/responses`) |
|
| `OpenAI Base Responses` | OpenAI, Responses API (`/responses`) |
|
||||||
| `Google Base Chat` | Google AI, Gemini `generateContent` |
|
| `Google Base Chat` | Google AI, Gemini `generateContent` |
|
||||||
| `Ollama Base Chat` | Ollama, native `/api/chat` |
|
| `Ollama Base Chat` | Ollama, native `/api/chat` |
|
||||||
| `Ollama FIM Base` | Ollama, native `/api/generate` fill-in-the-middle |
|
| `Ollama Base FIM` | Ollama, native `/api/generate` fill-in-the-middle |
|
||||||
|
|
||||||
For any OpenAI-compatible provider (Mistral, OpenRouter, LM Studio, llama.cpp,
|
For any OpenAI-compatible provider (Mistral, OpenRouter, LM Studio, llama.cpp,
|
||||||
DeepSeek, …) extend `OpenAI Base Chat` and override `provider_instance`.
|
DeepSeek, …) extend `OpenAI Base Chat` and override `provider_instance`.
|
||||||
|
|
||||||
Each bundled concrete agent (`Claude Sonnet Chat`, `Claude Code Completion`,
|
Each bundled concrete agent (`Claude Chat — Sonnet`, `Claude Completion`,
|
||||||
`OpenAI Chat Completions`, `OpenAI Responses Chat`, `Google Chat`,
|
`OpenAI Chat`, `OpenAI Chat — Responses`, `Google Chat`,
|
||||||
`Ollama Chat`, `Ollama FIM`) is itself a thin delta over one of these bases and
|
`Ollama Chat — Simple`, `Ollama Completion — FIM`) is itself a thin delta over one of these bases and
|
||||||
works as a parent too — `extends = "Claude Sonnet Chat"` inherits everything including
|
works as a parent too — `extends = "Claude Chat — Sonnet"` inherits everything including
|
||||||
the model.
|
the model.
|
||||||
|
|
||||||
## `[body]` — the request, literally
|
## `[body]` — the request, literally
|
||||||
|
|||||||
@@ -28,6 +28,31 @@ ollama run qwen2.5-coder:32b
|
|||||||
|
|
||||||
You're all set! QodeAssist is now ready to use in Qt Creator.
|
You're all set! QodeAssist is now ready to use in Qt Creator.
|
||||||
|
|
||||||
|
## Which models do I actually need?
|
||||||
|
|
||||||
|
You do **not** need a separate model for every agent. Each bundled Ollama agent
|
||||||
|
names a *default* model only as an example — you can point any agent at a model
|
||||||
|
you already have via its settings → **Change…** (a per-agent override; it does not
|
||||||
|
edit the bundled agent). **Seeing a model name on an agent is not a reason to
|
||||||
|
download it.**
|
||||||
|
|
||||||
|
The defaults cluster into a tiny set, so one or two pulls cover everyday use:
|
||||||
|
|
||||||
|
| Pull this | Unlocks |
|
||||||
|
|---|---|
|
||||||
|
| `qwen2.5-coder:7b` | Ollama Chat — Simple · Ollama Completion — FIM · Ollama Completion — Chat-style · Ollama Quick Refactor |
|
||||||
|
| `qwen3.5:9b` (or `:4b` on ~8 GB) | Ollama Chat — Thinking · Ollama Compression — 16/32 GB (`:4b` → Compression — 8 GB) |
|
||||||
|
|
||||||
|
Optional specialists — pull only if you want that capability:
|
||||||
|
|
||||||
|
| Pull this | For |
|
||||||
|
|---|---|
|
||||||
|
| `gemma4:12b` | Ollama Chat — Gemma 4 — agentic chat with vision + native reasoning |
|
||||||
|
| `theqtcompany/codellama-7b-qml` | Ollama Completion — QML (Qt) — Qt's QML-specific completion model |
|
||||||
|
|
||||||
|
Rule of thumb: pick the agent for the job, then either pull its named model **or**
|
||||||
|
swap it (Change…) for one you already have.
|
||||||
|
|
||||||
## Extended Thinking Mode
|
## Extended Thinking Mode
|
||||||
|
|
||||||
Ollama supports extended thinking mode for models that are capable of deep reasoning (such as DeepSeek-R1, QwQ, and similar reasoning models). This mode allows the model to show its step-by-step reasoning process before providing the final answer.
|
Ollama supports extended thinking mode for models that are capable of deep reasoning (such as DeepSeek-R1, QwQ, and similar reasoning models). This mode allows the model to show its step-by-step reasoning process before providing the final answer.
|
||||||
|
|||||||
@@ -1,30 +1,37 @@
|
|||||||
<RCC>
|
<RCC>
|
||||||
<qresource prefix="/agents">
|
<qresource prefix="/agents">
|
||||||
<file>claude_base_chat.toml</file>
|
<file>claude_base_chat.toml</file>
|
||||||
<file>claude_chat.toml</file>
|
<file>claude_chat_sonnet.toml</file>
|
||||||
<file>claude_opus_chat.toml</file>
|
<file>claude_chat_opus_xhigh.toml</file>
|
||||||
<file>claude_opus_max.toml</file>
|
<file>claude_chat_opus_max.toml</file>
|
||||||
<file>claude_completion.toml</file>
|
<file>claude_completion.toml</file>
|
||||||
<file>claude_compression.toml</file>
|
<file>claude_compression.toml</file>
|
||||||
<file>claude_quick_refactor.toml</file>
|
<file>claude_quick_refactor.toml</file>
|
||||||
<file>claude_quick_refactor_fast.toml</file>
|
<file>claude_quick_refactor_fast.toml</file>
|
||||||
<file>openai_base_chat.toml</file>
|
<file>openai_base_chat.toml</file>
|
||||||
<file>openai_chat_completions.toml</file>
|
<file>openai_chat.toml</file>
|
||||||
<file>openai_responses_base.toml</file>
|
<file>openai_base_responses.toml</file>
|
||||||
<file>openai_responses_chat.toml</file>
|
<file>openai_chat_responses.toml</file>
|
||||||
<file>google_base_chat.toml</file>
|
<file>google_base_chat.toml</file>
|
||||||
<file>google_chat.toml</file>
|
<file>google_chat.toml</file>
|
||||||
<file>ollama_base_chat.toml</file>
|
<file>ollama_base_chat.toml</file>
|
||||||
<file>ollama_chat.toml</file>
|
<file>ollama_chat_simple.toml</file>
|
||||||
<file>ollama_chat_completion.toml</file>
|
<file>ollama_chat_thinking.toml</file>
|
||||||
|
<file>ollama_completion_chat.toml</file>
|
||||||
<file>ollama_base_fim.toml</file>
|
<file>ollama_base_fim.toml</file>
|
||||||
<file>ollama_fim.toml</file>
|
<file>ollama_completion_fim.toml</file>
|
||||||
<file>ollama_codellama_qml_fim.toml</file>
|
<file>ollama_completion_qml.toml</file>
|
||||||
<file>ollama_quick_refactor.toml</file>
|
<file>ollama_quick_refactor_simple.toml</file>
|
||||||
<file>ollama_compression.toml</file>
|
<file>ollama_quick_refactor_qwen35.toml</file>
|
||||||
|
<file>ollama_quick_refactor_gemma4.toml</file>
|
||||||
|
<file>ollama_compression_8gb.toml</file>
|
||||||
|
<file>ollama_compression_16gb.toml</file>
|
||||||
|
<file>ollama_compression_32gb.toml</file>
|
||||||
|
<file>ollama_chat_gemma4.toml</file>
|
||||||
</qresource>
|
</qresource>
|
||||||
<qresource prefix="/roles">
|
<qresource prefix="/roles">
|
||||||
<file alias="qt-cpp-developer.md">roles/qt-cpp-developer.md</file>
|
<file alias="qt-cpp-developer.md">roles/qt-cpp-developer.md</file>
|
||||||
|
<file alias="agentic-coder.md">roles/agentic-coder.md</file>
|
||||||
<file alias="code-completion.md">roles/code-completion.md</file>
|
<file alias="code-completion.md">roles/code-completion.md</file>
|
||||||
<file alias="code-completion-c-like.md">roles/code-completion-c-like.md</file>
|
<file alias="code-completion-c-like.md">roles/code-completion-c-like.md</file>
|
||||||
<file alias="code-completion-qml.md">roles/code-completion-qml.md</file>
|
<file alias="code-completion-qml.md">roles/code-completion-qml.md</file>
|
||||||
|
|||||||
@@ -1,7 +1,7 @@
|
|||||||
schema_version = 1
|
schema_version = 1
|
||||||
|
|
||||||
extends = "Claude Base Chat"
|
extends = "Claude Base Chat"
|
||||||
name = "Claude Opus Max Chat"
|
name = "Claude Chat — Opus Max"
|
||||||
description = "Anthropic Claude Opus 4.8 — maximum-capability coding chat; adaptive thinking at max effort with 128k output. For frontier problems; higher cost and latency."
|
description = "Anthropic Claude Opus 4.8 — maximum-capability coding chat; adaptive thinking at max effort with 128k output. For frontier problems; higher cost and latency."
|
||||||
|
|
||||||
model = "claude-opus-4-8"
|
model = "claude-opus-4-8"
|
||||||
@@ -1,7 +1,7 @@
|
|||||||
schema_version = 1
|
schema_version = 1
|
||||||
|
|
||||||
extends = "Claude Base Chat"
|
extends = "Claude Base Chat"
|
||||||
name = "Claude Opus xHigh Chat"
|
name = "Claude Chat — Opus xHigh"
|
||||||
description = "Anthropic Claude Opus 4.8 — coding chat with adaptive thinking at xhigh effort, tuned for agentic and coding work."
|
description = "Anthropic Claude Opus 4.8 — coding chat with adaptive thinking at xhigh effort, tuned for agentic and coding work."
|
||||||
|
|
||||||
model = "claude-opus-4-8"
|
model = "claude-opus-4-8"
|
||||||
@@ -1,7 +1,7 @@
|
|||||||
schema_version = 1
|
schema_version = 1
|
||||||
|
|
||||||
extends = "Claude Base Chat"
|
extends = "Claude Base Chat"
|
||||||
name = "Claude Sonnet Chat"
|
name = "Claude Chat — Sonnet"
|
||||||
description = "Anthropic Claude — coding chat with adaptive thinking."
|
description = "Anthropic Claude — coding chat with adaptive thinking."
|
||||||
|
|
||||||
model = "claude-sonnet-4-6"
|
model = "claude-sonnet-4-6"
|
||||||
@@ -1,7 +1,7 @@
|
|||||||
schema_version = 1
|
schema_version = 1
|
||||||
|
|
||||||
extends = "Claude Base Chat"
|
extends = "Claude Base Chat"
|
||||||
name = "Claude Code Completion"
|
name = "Claude Completion"
|
||||||
description = "Anthropic Claude — code completion using the master <code_context> message format over the Messages API."
|
description = "Anthropic Claude — code completion using the master <code_context> message format over the Messages API."
|
||||||
|
|
||||||
model = "claude-haiku-4-5"
|
model = "claude-haiku-4-5"
|
||||||
|
|||||||
@@ -1,7 +1,7 @@
|
|||||||
schema_version = 1
|
schema_version = 1
|
||||||
|
|
||||||
extends = "Claude Base Chat"
|
extends = "Claude Base Chat"
|
||||||
name = "Claude Chat Compression"
|
name = "Claude Compression"
|
||||||
description = "Anthropic Claude Haiku — fast, low-cost conversation summarization for shorter chats. Carries the summary system prompt; no tools, no thinking."
|
description = "Anthropic Claude Haiku — fast, low-cost conversation summarization for shorter chats. Carries the summary system prompt; no tools, no thinking."
|
||||||
|
|
||||||
model = "claude-haiku-4-5"
|
model = "claude-haiku-4-5"
|
||||||
|
|||||||
@@ -1,7 +1,7 @@
|
|||||||
schema_version = 1
|
schema_version = 1
|
||||||
|
|
||||||
extends = "Claude Base Chat"
|
extends = "Claude Base Chat"
|
||||||
name = "Claude Quick Refactor Fast"
|
name = "Claude Quick Refactor — Fast"
|
||||||
description = "Anthropic Claude Haiku — fast single-shot inline refactor. Static output rules from :/tasks/quick-refactor.md; QuickRefactorHandler injects the live editor context (file, code, cursor/selection). No thinking; tools off."
|
description = "Anthropic Claude Haiku — fast single-shot inline refactor. Static output rules from :/tasks/quick-refactor.md; QuickRefactorHandler injects the live editor context (file, code, cursor/selection). No thinking; tools off."
|
||||||
|
|
||||||
model = "claude-haiku-4-5"
|
model = "claude-haiku-4-5"
|
||||||
|
|||||||
@@ -1,6 +1,6 @@
|
|||||||
schema_version = 1
|
schema_version = 1
|
||||||
|
|
||||||
name = "Ollama FIM Base"
|
name = "Ollama Base FIM"
|
||||||
description = "Ollama native /api/generate FIM request body. Abstract — extend it and set model."
|
description = "Ollama native /api/generate FIM request body. Abstract — extend it and set model."
|
||||||
abstract = true
|
abstract = true
|
||||||
|
|
||||||
|
|||||||
@@ -1,15 +0,0 @@
|
|||||||
schema_version = 1
|
|
||||||
|
|
||||||
extends = "Ollama Base Chat"
|
|
||||||
name = "Ollama Chat"
|
|
||||||
description = "Local Ollama coding chat (qwen2.5-coder)."
|
|
||||||
|
|
||||||
model = "qwen2.5-coder:7b"
|
|
||||||
tags = ["chat", "ollama", "local"]
|
|
||||||
|
|
||||||
system_prompt = """{{ read_file(":/roles/qt-cpp-developer.md") }}"""
|
|
||||||
|
|
||||||
[body.options]
|
|
||||||
num_predict = 2048
|
|
||||||
temperature = 0.7
|
|
||||||
keep_alive = "5m"
|
|
||||||
22
sources/agents/ollama_chat_gemma4.toml
Normal file
22
sources/agents/ollama_chat_gemma4.toml
Normal file
@@ -0,0 +1,22 @@
|
|||||||
|
schema_version = 1
|
||||||
|
|
||||||
|
extends = "Ollama Base Chat"
|
||||||
|
name = "Ollama Chat — Gemma 4"
|
||||||
|
description = "Local Ollama agentic coding chat (Gemma 4 12B by default) — native reasoning plus IDE tool use (read/edit/build), driven by the act-first agentic-coder role. Vision-capable."
|
||||||
|
|
||||||
|
model = "gemma4:12b"
|
||||||
|
enable_tools = true
|
||||||
|
enable_thinking = true
|
||||||
|
tags = ["chat", "ollama", "local", "16gb"]
|
||||||
|
|
||||||
|
system_prompt = """{{ read_file(":/roles/agentic-coder.md") }}"""
|
||||||
|
|
||||||
|
[body]
|
||||||
|
think = true
|
||||||
|
|
||||||
|
[body.options]
|
||||||
|
temperature = 1.0
|
||||||
|
top_p = 0.95
|
||||||
|
top_k = 64
|
||||||
|
num_predict = 8096
|
||||||
|
keep_alive = "5m"
|
||||||
15
sources/agents/ollama_chat_simple.toml
Normal file
15
sources/agents/ollama_chat_simple.toml
Normal file
@@ -0,0 +1,15 @@
|
|||||||
|
schema_version = 1
|
||||||
|
|
||||||
|
extends = "Ollama Base Chat"
|
||||||
|
name = "Ollama Chat — Simple"
|
||||||
|
description = "Local Ollama coding chat for any model — plain conversational assistant, no tools, no thinking (Qwen2.5-Coder 7B by default)."
|
||||||
|
|
||||||
|
model = "qwen3.5:4b"
|
||||||
|
tags = ["chat", "ollama", "local", "8gb"]
|
||||||
|
|
||||||
|
system_prompt = """{{ read_file(":/roles/qt-cpp-developer.md") }}"""
|
||||||
|
|
||||||
|
[body.options]
|
||||||
|
num_predict = 2048
|
||||||
|
temperature = 0.7
|
||||||
|
keep_alive = "5m"
|
||||||
19
sources/agents/ollama_chat_thinking.toml
Normal file
19
sources/agents/ollama_chat_thinking.toml
Normal file
@@ -0,0 +1,19 @@
|
|||||||
|
schema_version = 1
|
||||||
|
|
||||||
|
extends = "Ollama Base Chat"
|
||||||
|
name = "Ollama Chat — Thinking"
|
||||||
|
description = "Local Ollama agentic coding chat — tools + native reasoning, for any tools/thinking-capable model (Qwen3.5 9B by default). Acts through IDE tools (read/edit/build) via the act-first agentic-coder role."
|
||||||
|
|
||||||
|
model = "qwen3.5:9b"
|
||||||
|
enable_tools = true
|
||||||
|
enable_thinking = true
|
||||||
|
tags = ["chat", "ollama", "local", "16gb"]
|
||||||
|
|
||||||
|
system_prompt = """{{ read_file(":/roles/agentic-coder.md") }}"""
|
||||||
|
|
||||||
|
[body]
|
||||||
|
think = true
|
||||||
|
|
||||||
|
[body.options]
|
||||||
|
num_predict = 8192
|
||||||
|
keep_alive = "5m"
|
||||||
@@ -1,33 +0,0 @@
|
|||||||
schema_version = 1
|
|
||||||
|
|
||||||
name = "Ollama CodeLlama QML FIM"
|
|
||||||
description = "Local Ollama FIM for the CodeLlama-13B QML fine-tune (raw passthrough Modelfile, suffix-first markers)."
|
|
||||||
|
|
||||||
provider_instance = "Ollama (Native)"
|
|
||||||
endpoint = "/api/generate"
|
|
||||||
|
|
||||||
model = "theqtcompany/codellama-7b-qml"
|
|
||||||
tags = ["completion", "ollama", "local", "fim", "qml"]
|
|
||||||
|
|
||||||
[match]
|
|
||||||
file_patterns = ["*.qml"]
|
|
||||||
|
|
||||||
[body]
|
|
||||||
prompt = """{{ tojson("<SUF>" + ctx.suffix + "<PRE>" + ctx.prefix + "<MID>") }}"""
|
|
||||||
|
|
||||||
[body.options]
|
|
||||||
temperature = 0
|
|
||||||
top_p = 1
|
|
||||||
repeat_penalty = 1.05
|
|
||||||
num_predict = 500
|
|
||||||
stop = [
|
|
||||||
"<SUF>",
|
|
||||||
"<PRE>",
|
|
||||||
"</PRE>",
|
|
||||||
"</SUF>",
|
|
||||||
"< EOT >",
|
|
||||||
"\\end",
|
|
||||||
"<MID>",
|
|
||||||
"</MID>",
|
|
||||||
"##",
|
|
||||||
]
|
|
||||||
@@ -1,11 +1,11 @@
|
|||||||
schema_version = 1
|
schema_version = 1
|
||||||
|
|
||||||
extends = "Ollama Base Chat"
|
extends = "Ollama Base Chat"
|
||||||
name = "Ollama FIM-on-chat"
|
name = "Ollama Completion — Chat-style"
|
||||||
description = "Local Ollama code completion over /api/chat using the <code_context> message format — FIM-on-chat."
|
description = "Code completion via a chat request — the cursor sits in a <code_context> message on /api/chat, so the model just continues the code. Works with ANY normal chat/instruct model (qwen2.5-coder, qwen3, llama3, gemma, mistral, codellama-instruct, …), so use this when your model is NOT a dedicated FIM model. Tradeoff vs native 'Ollama Completion — FIM': a bit slower and may leak code fences or explanations, so slightly less precise."
|
||||||
|
|
||||||
model = "qwen2.5-coder:7b"
|
model = "qwen2.5-coder:7b"
|
||||||
tags = ["completion", "ollama", "local", "fim"]
|
tags = ["completion", "ollama", "local", "fim", "8gb"]
|
||||||
|
|
||||||
system_prompt = """
|
system_prompt = """
|
||||||
{%- if language == "qml" %}{{ read_file(":/roles/code-completion-qml.md") }}
|
{%- if language == "qml" %}{{ read_file(":/roles/code-completion-qml.md") }}
|
||||||
@@ -25,7 +25,7 @@ messages = """
|
|||||||
"""
|
"""
|
||||||
|
|
||||||
[body.options]
|
[body.options]
|
||||||
num_predict = 512
|
num_predict = 256
|
||||||
temperature = 0.2
|
temperature = 0.2
|
||||||
keep_alive = "5m"
|
keep_alive = "5m"
|
||||||
stop = ["</code_context>"]
|
stop = ["</code_context>"]
|
||||||
13
sources/agents/ollama_completion_fim.toml
Normal file
13
sources/agents/ollama_completion_fim.toml
Normal file
@@ -0,0 +1,13 @@
|
|||||||
|
schema_version = 1
|
||||||
|
|
||||||
|
extends = "Ollama Base FIM"
|
||||||
|
name = "Ollama Completion — FIM"
|
||||||
|
description = "Native fill-in-the-middle completion — uses the model's OWN FIM template (prompt+suffix on /api/generate). Fast and clean (no markdown or prose), but works ONLY with models that ship a FIM template, and those are few: the base / '-code' variants, NOT instruct/chat models. Verified to work: qwen2.5-coder (incl. -base), codellama:7b-code, deepseek-coder-v2 lite-base. A plain chat model outputs garbage here — use 'Ollama Completion — Chat-style' instead. Check a model: `ollama show <model> --modelfile` must mention 'Suffix'."
|
||||||
|
|
||||||
|
model = "qwen2.5-coder:7b"
|
||||||
|
tags = ["completion", "ollama", "local", "fim", "8gb"]
|
||||||
|
|
||||||
|
[body.options]
|
||||||
|
num_predict = 256
|
||||||
|
temperature = 0.2
|
||||||
|
keep_alive = "5m"
|
||||||
33
sources/agents/ollama_completion_qml.toml
Normal file
33
sources/agents/ollama_completion_qml.toml
Normal file
@@ -0,0 +1,33 @@
|
|||||||
|
schema_version = 1
|
||||||
|
|
||||||
|
name = "Ollama Completion — QML (Qt)"
|
||||||
|
description = "Specialized QML completion built for The Qt Company's CodeLlama-QML fine-tune (`theqtcompany/codellama-7b-qml` / `13b-qml`) — a model trained specifically for QML. Native fill-in-the-middle with the fine-tune's suffix-first markers; route `.qml` files here via [match]. Not a general agent — use only with a Qt QML model (theqtcompany/* or palm1r/codellama-13b-code-qml)."
|
||||||
|
|
||||||
|
provider_instance = "Ollama (Native)"
|
||||||
|
endpoint = "/api/generate"
|
||||||
|
|
||||||
|
model = "theqtcompany/codellama-7b-qml"
|
||||||
|
tags = ["completion", "ollama", "local", "fim", "qml", "8gb"]
|
||||||
|
|
||||||
|
[match]
|
||||||
|
file_patterns = ["*.qml"]
|
||||||
|
|
||||||
|
[body]
|
||||||
|
prompt = """{{ tojson("<SUF>" + ctx.suffix + "<PRE>" + ctx.prefix + "<MID>") }}"""
|
||||||
|
|
||||||
|
[body.options]
|
||||||
|
temperature = 0
|
||||||
|
top_p = 1
|
||||||
|
repeat_penalty = 1.05
|
||||||
|
num_predict = 500
|
||||||
|
stop = [
|
||||||
|
"<SUF>",
|
||||||
|
"<PRE>",
|
||||||
|
"</PRE>",
|
||||||
|
"</SUF>",
|
||||||
|
"< EOT >",
|
||||||
|
"\\end",
|
||||||
|
"<MID>",
|
||||||
|
"</MID>",
|
||||||
|
"##",
|
||||||
|
]
|
||||||
@@ -1,17 +0,0 @@
|
|||||||
schema_version = 1
|
|
||||||
|
|
||||||
extends = "Ollama Base Chat"
|
|
||||||
name = "Ollama Chat Compression"
|
|
||||||
description = "Local Ollama conversation summarization (qwen2.5-coder). Carries the summary system prompt; no tools."
|
|
||||||
|
|
||||||
model = "qwen2.5-coder:7b"
|
|
||||||
enable_tools = false
|
|
||||||
tags = ["compression", "ollama", "local"]
|
|
||||||
|
|
||||||
system_prompt = """{{ read_file(":/tasks/chat-compressor.md") }}"""
|
|
||||||
|
|
||||||
[body.options]
|
|
||||||
num_predict = 2048
|
|
||||||
temperature = 0.3
|
|
||||||
num_ctx = 8192
|
|
||||||
keep_alive = "5m"
|
|
||||||
17
sources/agents/ollama_compression_16gb.toml
Normal file
17
sources/agents/ollama_compression_16gb.toml
Normal file
@@ -0,0 +1,17 @@
|
|||||||
|
schema_version = 1
|
||||||
|
|
||||||
|
extends = "Ollama Base Chat"
|
||||||
|
name = "Ollama Compression — 16 GB"
|
||||||
|
description = "Conversation summarization tuned for ~16 GB RAM — qwen3.5:9b at num_ctx 8192 (~10 GB resident). General summarizer with moderate context. KV cache is reserved at load per num_ctx. No tools."
|
||||||
|
|
||||||
|
model = "qwen3.5:9b"
|
||||||
|
enable_tools = false
|
||||||
|
tags = ["compression", "ollama", "local", "16gb"]
|
||||||
|
|
||||||
|
system_prompt = """{{ read_file(":/tasks/chat-compressor.md") }}"""
|
||||||
|
|
||||||
|
[body.options]
|
||||||
|
num_predict = 2048
|
||||||
|
temperature = 0.3
|
||||||
|
num_ctx = 8192
|
||||||
|
keep_alive = "5m"
|
||||||
17
sources/agents/ollama_compression_32gb.toml
Normal file
17
sources/agents/ollama_compression_32gb.toml
Normal file
@@ -0,0 +1,17 @@
|
|||||||
|
schema_version = 1
|
||||||
|
|
||||||
|
extends = "Ollama Base Chat"
|
||||||
|
name = "Ollama Compression — 32 GB"
|
||||||
|
description = "Conversation summarization tuned for ~32 GB RAM — qwen3.5:9b at num_ctx 24576 (~20 GB resident). Long context for big chats. KV cache is reserved at load per num_ctx. No tools."
|
||||||
|
|
||||||
|
model = "qwen3.5:9b"
|
||||||
|
enable_tools = false
|
||||||
|
tags = ["compression", "ollama", "local", "32gb"]
|
||||||
|
|
||||||
|
system_prompt = """{{ read_file(":/tasks/chat-compressor.md") }}"""
|
||||||
|
|
||||||
|
[body.options]
|
||||||
|
num_predict = 2048
|
||||||
|
temperature = 0.3
|
||||||
|
num_ctx = 24576
|
||||||
|
keep_alive = "5m"
|
||||||
17
sources/agents/ollama_compression_8gb.toml
Normal file
17
sources/agents/ollama_compression_8gb.toml
Normal file
@@ -0,0 +1,17 @@
|
|||||||
|
schema_version = 1
|
||||||
|
|
||||||
|
extends = "Ollama Base Chat"
|
||||||
|
name = "Ollama Compression — 8 GB"
|
||||||
|
description = "Conversation summarization tuned for ~8 GB RAM — qwen3.5:4b at num_ctx 8192 (~5.5 GB resident). Smallest qwen3.5 (same family as the bigger tiers), decent context for the footprint. KV cache is reserved at load per num_ctx. No tools."
|
||||||
|
|
||||||
|
model = "qwen3.5:4b"
|
||||||
|
enable_tools = false
|
||||||
|
tags = ["compression", "ollama", "local", "8gb"]
|
||||||
|
|
||||||
|
system_prompt = """{{ read_file(":/tasks/chat-compressor.md") }}"""
|
||||||
|
|
||||||
|
[body.options]
|
||||||
|
num_predict = 2048
|
||||||
|
temperature = 0.3
|
||||||
|
num_ctx = 8192
|
||||||
|
keep_alive = "5m"
|
||||||
@@ -1,13 +0,0 @@
|
|||||||
schema_version = 1
|
|
||||||
|
|
||||||
extends = "Ollama FIM Base"
|
|
||||||
name = "Ollama FIM"
|
|
||||||
description = "Local Ollama FIM code completion (qwen2.5-coder)."
|
|
||||||
|
|
||||||
model = "qwen2.5-coder:7b"
|
|
||||||
tags = ["completion", "ollama", "local", "fim"]
|
|
||||||
|
|
||||||
[body.options]
|
|
||||||
num_predict = 256
|
|
||||||
temperature = 0.2
|
|
||||||
keep_alive = "5m"
|
|
||||||
@@ -1,16 +0,0 @@
|
|||||||
schema_version = 1
|
|
||||||
|
|
||||||
extends = "Ollama Base Chat"
|
|
||||||
name = "Ollama Quick Refactor"
|
|
||||||
description = "Local Ollama deterministic inline refactor (qwen2.5-coder). Static output rules from :/tasks/quick-refactor.md; QuickRefactorHandler injects the live editor context. No tools."
|
|
||||||
|
|
||||||
model = "qwen2.5-coder:7b"
|
|
||||||
enable_tools = false
|
|
||||||
tags = ["refactor", "ollama", "local"]
|
|
||||||
|
|
||||||
system_prompt = """{{ read_file(":/tasks/quick-refactor.md") }}"""
|
|
||||||
|
|
||||||
[body.options]
|
|
||||||
num_predict = 2048
|
|
||||||
temperature = 0.2
|
|
||||||
keep_alive = "5m"
|
|
||||||
19
sources/agents/ollama_quick_refactor_gemma4.toml
Normal file
19
sources/agents/ollama_quick_refactor_gemma4.toml
Normal file
@@ -0,0 +1,19 @@
|
|||||||
|
schema_version = 1
|
||||||
|
|
||||||
|
extends = "Ollama Base Chat"
|
||||||
|
name = "Ollama Quick Refactor — Gemma 4"
|
||||||
|
description = "Local Ollama smart inline refactor — gemma4:12b with native reasoning (thinks before editing). Static output rules from :/tasks/quick-refactor.md; QuickRefactorHandler injects the live editor context. No tools."
|
||||||
|
|
||||||
|
model = "gemma4:12b"
|
||||||
|
enable_tools = false
|
||||||
|
enable_thinking = true
|
||||||
|
tags = ["refactor", "ollama", "local", "16gb"]
|
||||||
|
|
||||||
|
system_prompt = """{{ read_file(":/tasks/quick-refactor.md") }}"""
|
||||||
|
|
||||||
|
[body]
|
||||||
|
think = true
|
||||||
|
|
||||||
|
[body.options]
|
||||||
|
num_predict = 4096
|
||||||
|
keep_alive = "5m"
|
||||||
19
sources/agents/ollama_quick_refactor_qwen35.toml
Normal file
19
sources/agents/ollama_quick_refactor_qwen35.toml
Normal file
@@ -0,0 +1,19 @@
|
|||||||
|
schema_version = 1
|
||||||
|
|
||||||
|
extends = "Ollama Base Chat"
|
||||||
|
name = "Ollama Quick Refactor — Qwen3.5"
|
||||||
|
description = "Local Ollama smart inline refactor — qwen3.5:9b with native reasoning (thinks before editing). Static output rules from :/tasks/quick-refactor.md; QuickRefactorHandler injects the live editor context. No tools."
|
||||||
|
|
||||||
|
model = "qwen3.5:9b"
|
||||||
|
enable_tools = false
|
||||||
|
enable_thinking = true
|
||||||
|
tags = ["refactor", "ollama", "local", "16gb"]
|
||||||
|
|
||||||
|
system_prompt = """{{ read_file(":/tasks/quick-refactor.md") }}"""
|
||||||
|
|
||||||
|
[body]
|
||||||
|
think = true
|
||||||
|
|
||||||
|
[body.options]
|
||||||
|
num_predict = 4096
|
||||||
|
keep_alive = "5m"
|
||||||
16
sources/agents/ollama_quick_refactor_simple.toml
Normal file
16
sources/agents/ollama_quick_refactor_simple.toml
Normal file
@@ -0,0 +1,16 @@
|
|||||||
|
schema_version = 1
|
||||||
|
|
||||||
|
extends = "Ollama Base Chat"
|
||||||
|
name = "Ollama Quick Refactor — Simple"
|
||||||
|
description = "Local Ollama deterministic inline refactor — fast single-shot, any model (Qwen2.5-Coder 7B by default). Static output rules from :/tasks/quick-refactor.md; QuickRefactorHandler injects the live editor context. No thinking, no tools."
|
||||||
|
|
||||||
|
model = "qwen2.5-coder:7b"
|
||||||
|
enable_tools = false
|
||||||
|
tags = ["refactor", "ollama", "local", "8gb"]
|
||||||
|
|
||||||
|
system_prompt = """{{ read_file(":/tasks/quick-refactor.md") }}"""
|
||||||
|
|
||||||
|
[body.options]
|
||||||
|
num_predict = 2048
|
||||||
|
temperature = 0.2
|
||||||
|
keep_alive = "5m"
|
||||||
@@ -1,6 +1,6 @@
|
|||||||
schema_version = 1
|
schema_version = 1
|
||||||
|
|
||||||
name = "OpenAI Responses Base"
|
name = "OpenAI Base Responses"
|
||||||
description = "OpenAI Responses API request body (/responses). Abstract — extend it and set model."
|
description = "OpenAI Responses API request body (/responses). Abstract — extend it and set model."
|
||||||
abstract = true
|
abstract = true
|
||||||
|
|
||||||
@@ -1,7 +1,7 @@
|
|||||||
schema_version = 1
|
schema_version = 1
|
||||||
|
|
||||||
extends = "OpenAI Base Chat"
|
extends = "OpenAI Base Chat"
|
||||||
name = "OpenAI Chat Completions"
|
name = "OpenAI Chat"
|
||||||
description = "OpenAI GPT-4o — coding chat via Chat Completions."
|
description = "OpenAI GPT-4o — coding chat via Chat Completions."
|
||||||
|
|
||||||
model = "gpt-4o"
|
model = "gpt-4o"
|
||||||
@@ -1,7 +1,7 @@
|
|||||||
schema_version = 1
|
schema_version = 1
|
||||||
|
|
||||||
extends = "OpenAI Responses Base"
|
extends = "OpenAI Base Responses"
|
||||||
name = "OpenAI Responses Chat"
|
name = "OpenAI Chat — Responses"
|
||||||
description = "OpenAI o4-mini — reasoning coding chat via the Responses API."
|
description = "OpenAI o4-mini — reasoning coding chat via the Responses API."
|
||||||
|
|
||||||
model = "o4-mini"
|
model = "o4-mini"
|
||||||
15
sources/agents/roles/agentic-coder.md
Normal file
15
sources/agents/roles/agentic-coder.md
Normal file
@@ -0,0 +1,15 @@
|
|||||||
|
You are an autonomous Qt/C++ coding agent working inside the Qt Creator IDE. You have tools to read, search, edit, and build the project. Use them to complete the user's request directly — act, do not wait for approval.
|
||||||
|
|
||||||
|
## Workflow
|
||||||
|
- **Do the task with tools.** When the request is clear, carry it out. Ask a question only when it is genuinely ambiguous or destructive — never to confirm an obvious change.
|
||||||
|
- **Read before editing.** Call read_file for the exact current content before changing a file, so your edit matches.
|
||||||
|
- **Edit through tools.** Apply changes with edit_file (create_new_file for new files). Do not paste whole files into chat when an edit will do.
|
||||||
|
- **Stay in the project source root.** Never create or edit files in the build directory.
|
||||||
|
- **Verify when it matters.** After non-trivial changes, build_project and check get_issues_list, then fix anything you broke.
|
||||||
|
|
||||||
|
## Code
|
||||||
|
- C++20, Qt6; match the surrounding style, naming, and patterns.
|
||||||
|
- Write the minimum that solves the task: no over-engineering, no TODOs, no debug code, no unrelated refactoring. Make sure it compiles.
|
||||||
|
|
||||||
|
## Reporting
|
||||||
|
- Be brief: a one-line summary of what changed and which files. Let the diff speak. Call out only non-obvious consequences.
|
||||||
Reference in New Issue
Block a user