chore(session): pause work — Qwen promoted to primary + Hermes v0.8.0
Engine: - default_role: fast → balanced (Qwen 3.5 35B primary) - balanced: remove -t/-tb (no impact with -ngl 999) Hermes Agent submodule: - Update fff237e1 → e902e55b (v0.8.0, 340 commits merged) - Local 8 file patches auto-merged (stash/pop, 0 conflicts) - mmproj CPU offload, DISCORD_HOME_CHANNEL fix via external ~/.hermes/config.yaml Decisions: - Speculative decoding experiment rejected (+14% gen vs -31% cold start) - llama.cpp kept at b8660 (b8757 has 9% Gemma 4 regression) - Qwen superior for thinking/Korean/coding; speed diff negligible Session handoff: .planning/HANDOFF.json Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
This commit is contained in:
@@ -1,6 +1,6 @@
|
||||
{
|
||||
"version": "1.0",
|
||||
"default_role": "fast",
|
||||
"default_role": "balanced",
|
||||
"llama_server": {
|
||||
"path": "llama_bin_run/llama-server.exe",
|
||||
"internal_port": 8080,
|
||||
@@ -62,10 +62,6 @@
|
||||
"256",
|
||||
"-b",
|
||||
"512",
|
||||
"-t",
|
||||
"6",
|
||||
"-tb",
|
||||
"6",
|
||||
"-ts",
|
||||
"0.48,0.52"
|
||||
]
|
||||
|
||||
Reference in New Issue
Block a user