Engine:
- default_role: fast → balanced (Qwen 3.5 35B primary)
- balanced: remove -t/-tb (no impact with -ngl 999)

Hermes Agent submodule:
- Update fff237e1 → e902e55b (v0.8.0, 340 commits merged)
- Local 8 file patches auto-merged (stash/pop, 0 conflicts)
- mmproj CPU offload, DISCORD_HOME_CHANNEL fix via external ~/.hermes/config.yaml

Decisions:
- Speculative decoding experiment rejected (+14% gen vs -31% cold start)
- llama.cpp kept at b8660 (b8757 has 9% Gemma 4 regression)
- Qwen superior for thinking/Korean/coding; speed diff negligible

Session handoff: .planning/HANDOFF.json

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
hermes-agent
@ e902e55b26