Sidecar and llama-server were both configured on port 8080, causing llama-server to fail on startup (port already in use). - sidecar/app.py: LLAMA_SERVER_PORT → 8081 (sidecar stays on 8080) - docker-compose.yml: MAIN_PC_URL → port 8081 (router sends chat requests to llama-server, not the sidecar)
15 lines
342 B
YAML
15 lines
342 B
YAML
version: '3.8'
|
|
|
|
services:
|
|
# The Intelligence Router
|
|
router:
|
|
build: .
|
|
ports:
|
|
- "9001:9000"
|
|
environment:
|
|
- SIDECAR_URL=http://10.0.4.11:8080
|
|
- MAIN_PC_URL=http://10.0.4.11:8081/v1
|
|
- FALLBACK_SLM_URL=http://10.0.4.200:8080/v1
|
|
- OPENROUTER_API_KEY=${OPENROUTER_API_KEY:-}
|
|
restart: unless-stopped
|