intelligence-router

root 4ee85972ec		2026-06-16 20:54:32 +00:00
fix: convert underscores to hyphens in llama-server flag names, fix n_ctx→ctx-size rename

Two changes to fix 'error: invalid argument: --n-ctx' during model switch:

1. sidecar/app.py: Added _flag_key() converter that normalises underscores to hyphens in flag names and handles the n_ctx→ctx-size rename. The code now converts e.g. n_gpu_layers → n-gpu-layers, top_p → top-p, top_k → top-k, min_p → min-p before passing to llama-server CLI.
2. deploy/manifest.yaml: Updated all 20 profiles to use correct llama-server flag names: n_ctx→ctx-size, n_gpu_layers→n-gpu-layers, top_p→top-p, top_k→top-k, min_p→min-p. All flags now use hyphens, matching what llama-server actually accepts.
.hermes/plans	fix: add probe endpoints and no-model fallback for Hermes Desktop compatibility	2026-06-15 15:22:15 +00:00
deploy	fix: convert underscores to hyphens in llama-server flag names, fix n_ctx→ctx-size rename	2026-06-16 20:54:32 +00:00
docs	Added next changes	2026-06-15 00:09:31 +00:00
scripts	feat: add sync_models.py script to auto-update Hermes custom_providers from router model list	2026-06-15 21:10:36 +00:00
sidecar	fix: convert underscores to hyphens in llama-server flag names, fix n_ctx→ctx-size rename	2026-06-16 20:54:32 +00:00
tests	fix: change sidecar port from 8081 to 8080	2026-06-15 13:17:31 +00:00
.env	.env	2026-06-09 13:57:22 +03:00
.gitignore	Epic: Model Switching via Sidecar — Issues #2-#3	2026-06-15 00:49:24 +00:00
CONTEXT.md	Epic: Model Switching via Sidecar — Issues #4-#7 + #8 deployment	2026-06-15 01:13:36 +00:00
docker-compose.yml	fix: resolve port conflict between sidecar and llama-server	2026-06-15 15:31:31 +00:00
Dockerfile	Initial commit: migrate intelligence-router files	2026-06-09 11:48:43 +01:00
main.py	fix: add probe endpoints and no-model fallback for Hermes Desktop compatibility	2026-06-15 15:22:15 +00:00
pytest.ini	feat: add 15 model profiles to manifest.yaml	2026-06-15 12:34:46 +00:00
requirements.txt	Epic: Model Switching via Sidecar — Issues #4-#7 + #8 deployment	2026-06-15 01:13:36 +00:00