diff --git a/.planning/REQUIREMENTS.md b/.planning/REQUIREMENTS.md index e7f15dc..29932ae 100644 --- a/.planning/REQUIREMENTS.md +++ b/.planning/REQUIREMENTS.md @@ -3,8 +3,8 @@ ## v1.1 Requirements ### Provider Connection (CONN) -- [ ] **CONN-01**: OpenClaude가 Variet Engine(`http://:8000/v1`)에 OpenAI-compatible 프로바이더로 연결된다. -- [ ] **CONN-02**: 프로바이더 프로필(`.openclaude-profile.json`)이 Variet Engine 전용으로 저장된다. +- [x] **CONN-01**: OpenClaude가 Variet Engine(`http://:8000/v1`)에 OpenAI-compatible 프로바이더로 연결된다. +- [x] **CONN-02**: 프로바이더 프로필(`.openclaude-profile.json`)이 Variet Engine 전용으로 저장된다. ### Model Routing (ROUTE) - [ ] **ROUTE-01**: 5-Tier agentRouting이 설정되어 에이전트 유형별(Explore, Plan, general-purpose 등) 모델이 자동 선택된다. @@ -15,7 +15,7 @@ - [ ] **AGENT-02**: 스트리밍 응답이 실시간으로 터미널에 출력된다. ### Build & Deploy (BUILD) -- [ ] **BUILD-01**: OpenClaude CLI가 Machine B에서 `bun install && bun run build`로 빌드되고 `node dist/cli.mjs`로 실행 가능하다. +- [x] **BUILD-01**: OpenClaude CLI가 Machine B에서 `bun install && bun run build`로 빌드되고 `node dist/cli.mjs`로 실행 가능하다. - [ ] **BUILD-02**: VS Code Extension이 `npx @vscode/vsce package`로 `.vsix` 패키징되어 Machine B의 VS Code에 설치 가능하다. - [ ] **BUILD-03**: Extension 설정(`launchCommand`, `useOpenAIShim`)이 Variet Engine용으로 구성된다. @@ -33,12 +33,12 @@ | REQ-ID | Phase | Status | |--------|-------|--------| -| CONN-01 | 03 | Pending | -| CONN-02 | 03 | Pending | +| CONN-01 | 03 | Complete | +| CONN-02 | 03 | Complete | | ROUTE-01 | 04 | Pending | | ROUTE-02 | 04 | Pending | | AGENT-01 | 04 | Pending | | AGENT-02 | 04 | Pending | -| BUILD-01 | 03 | Pending | +| BUILD-01 | 03 | Complete | | BUILD-02 | 05 | Pending | | BUILD-03 | 05 | Pending | diff --git a/.planning/STATE.md b/.planning/STATE.md index f3e7bdc..4fad493 100644 --- a/.planning/STATE.md +++ b/.planning/STATE.md @@ -1,23 +1,42 @@ +--- +gsd_state_version: 1.0 +milestone: v1.1 +milestone_name: milestone +status: planning +last_updated: "2026-04-07T12:25:10.234Z" +last_activity: 2026-04-07 +progress: + total_phases: 3 + completed_phases: 1 + total_plans: 1 + completed_plans: 1 +--- + # Project State ## Project Reference + A high-performance, locally-hosted AI assistant system built on two RTX 3060 12GB GPUs. It uses a "2+0" architecture where Machine A acts as a dedicated inference server running large language models, while Machine B handles the user interface (VS Code, Discord) and tool execution. ## Current Position -Phase: 03-cli-build-provider (Next) -Plan: — -Status: Roadmap defined, ready to plan Phase 03 -Last activity: 2026-04-07 — Milestone v1.1 roadmap created (3 phases) + +Phase: 04 +Plan: Not started +Status: Ready to plan +Last activity: 2026-04-07 ## Progress + [████████████████████] 100% (Phase 01: LLM Tuning) [████████████████████] 100% (Phase 02: API Engine) ## Completed Phases + - Phase 01 (LLM Tuning): 5개 모델 최적 설정 확정 (74.65 / 61.62 / 16.0 / 16.7 / 8.95 t/s) - Phase 02 (API Engine): Variet Engine v1.0 — FastAPI 프록시 + 핫스왑 + 503 보호 ## Recent Decisions + - 2+0 GPU Architecture (Machine A API Server, Machine B tools). - 5-tier model strategy: fast/balanced/deep-coder/deep-logic/ultra. - GPU 0 PCIe x4 제약 → 122B MoE는 GPU 1 단독 사용. @@ -26,11 +45,14 @@ Last activity: 2026-04-07 — Milestone v1.1 roadmap created (3 phases) - CLI-First 검증 전략: VS Code Extension 전 OpenClaude CLI로 에이전트 루프 먼저 검증. ## Pending Todos + 0 pending. ## Blockers/Concerns + None. ## Session Continuity + Last session: 2026-04-07T20:38:00+09:00 Milestone: v1.1 OpenClaude CLI Integration