Two changes to fix 'error: invalid argument: --n-ctx' during model switch:
1. sidecar/app.py: Added a _flag_key() converter that replaces
underscores with hyphens in flag names and handles the n_ctx→ctx-size
rename (a plain underscore swap would yield the invalid --n-ctx).
Options are now converted before being passed to the llama-server
CLI, e.g. n_gpu_layers→n-gpu-layers, top_p→top-p, top_k→top-k,
min_p→min-p.
2. deploy/manifest.yaml: Updated all 20 profiles to use the correct
llama-server flag names: n_ctx→ctx-size, n_gpu_layers→n-gpu-layers,
top_p→top-p, top_k→top-k, min_p→min-p. Every flag is now hyphenated,
matching the names llama-server accepts.