Go to file
root bcf45129f1 fix: add --host 0.0.0.0 to llama-server command
llama-server defaults to binding on 127.0.0.1 (localhost only).
When the router runs on a separate Docker host (10.0.4.100), all
chat completion requests fail with:

  PROXY EXCEPTION on primary http://10.0.4.11:8081/v1/chat/completions:
    ConnectError: All connection attempts failed

Added --host 0.0.0.0 after --port so llama-server listens on all
network interfaces, reachable from the Docker host.
2026-06-16 21:46:07 +00:00
.hermes/plans fix: add probe endpoints and no-model fallback for Hermes Desktop compatibility 2026-06-15 15:22:15 +00:00
deploy fix: convert underscores to hyphens in llama-server flag names, fix n_ctx→ctx-size rename 2026-06-16 20:54:32 +00:00
docs Added next changes 2026-06-15 00:09:31 +00:00
scripts feat: add sync_models.py script to auto-update Hermes custom_providers from router model list 2026-06-15 21:10:36 +00:00
sidecar fix: add --host 0.0.0.0 to llama-server command 2026-06-16 21:46:07 +00:00
tests fix: change sidecar port from 8081 to 8080 2026-06-15 13:17:31 +00:00
.env .env 2026-06-09 13:57:22 +03:00
.gitignore Epic: Model Switching via Sidecar — Issues #2-#3 2026-06-15 00:49:24 +00:00
CONTEXT.md Epic: Model Switching via Sidecar — Issues #4-#7 + #8 deployment 2026-06-15 01:13:36 +00:00
docker-compose.yml fix: resolve port conflict between sidecar and llama-server 2026-06-15 15:31:31 +00:00
Dockerfile Initial commit: migrate intelligence-router files 2026-06-09 11:48:43 +01:00
main.py fix: log exceptions on primary proxy target 2026-06-16 21:32:36 +00:00
pytest.ini feat: add 15 model profiles to manifest.yaml 2026-06-15 12:34:46 +00:00
requirements.txt Epic: Model Switching via Sidecar — Issues #4-#7 + #8 deployment 2026-06-15 01:13:36 +00:00