- --jinja + --chat-template-kwargs '{"enable_thinking":true}' 추가
- -cram 8192: context checkpoint를 GPU 대신 CPU RAM에 저장
(GPU CUDA OOM 크래시 방지 — cuMemSetAccess 실패 at device:1)
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
- --jinja + --chat-template-kwargs '{"enable_thinking":true}' 추가
- -cram 8192: context checkpoint를 GPU 대신 CPU RAM에 저장
(GPU CUDA OOM 크래시 방지 — cuMemSetAccess 실패 at device:1)
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>