llama-server defaults to binding on 127.0.0.1 (localhost only). When the router runs on a separate Docker host (10.0.4.100), all chat completion requests fail with: PROXY EXCEPTION on primary http://10.0.4.11:8081/v1/chat/completions: ConnectError: All connection attempts failed Added --host 0.0.0.0 after --port so llama-server listens on all network interfaces, reachable from the Docker host. |
||
|---|---|---|
| .hermes/plans | ||
| deploy | ||
| docs | ||
| scripts | ||
| sidecar | ||
| tests | ||
| .env | ||
| .gitignore | ||
| CONTEXT.md | ||
| docker-compose.yml | ||
| Dockerfile | ||
| main.py | ||
| pytest.ini | ||
| requirements.txt | ||