Commit Graph

10 Commits

Author SHA1 Message Date
root
c491779248 Epic: Model Switching via Sidecar — Issues #2-#3
Issue #2: Manifest schema + Sidecar foundation
- sidecar/manifest.py: YAML manifest loading and profile validation
- sidecar/app.py: FastAPI sidecar service with /models/available, /models/status endpoints
- Router GET /v1/models: proxies to sidecar, returns OpenAI-compatible model list
- Tests: 12 manifest tests, 6 sidecar endpoint tests, 3 router tests (21 total)

Issue #3: Sidecar model switch + Router request queue
- Sidecar POST /models/switch: stops current llama-server, starts new one, polls for readiness
- Switch lock prevents concurrent switches (threading.Lock for TestClient compatibility)
- Router request queue: max 10 requests, 120s hard timeout, 429 when full
- Router automatic model detection: extracts model from chat body, matches against sidecar status
- Full proxy endpoint with Sidecar → Main PC routing and fallback chain
- Tests: 5 sidecar switch tests, 4 queue tests, 3 router integration tests (12 total)

Total: 33 tests, all passing
2026-06-15 00:49:24 +00:00
root
b2031d8b7a Added next changes 2026-06-15 00:09:31 +00:00
712fe041b1 test 2026-06-09 19:54:03 +01:00
1a7dd550ec added debug 2026-06-09 18:05:10 +01:00
d7090b1644 Fix build context, port conflict, and improve proxy/health-check logic 2026-06-09 17:34:07 +01:00
cb01b42f38 Cleanup: Remove redundant llama-slm service and use LXC IP 2026-06-09 12:41:32 +01:00
4ea94f7d60 Update IPs for Main PC and LXC Fallback Brain 2026-06-09 12:37:34 +01:00
Chiabur Aiode
8fab2f3801 .env 2026-06-09 13:57:22 +03:00
Chiabur Aiode
038e8f9f7c gitignore 2026-06-09 13:54:18 +03:00
0e05390be2 Initial commit: migrate intelligence-router files 2026-06-09 11:48:43 +01:00