root e9790c00dc feat: add 15 model profiles to manifest.yaml
- Qwen3.6-27B: 3 profiles (balanced/thinking/extended)
- Gemma 4 12B: 4 profiles (Q6_K_XL and IQ4_XS variants)
- Gemma 4 26B-A4B: 3 profiles (Q4_K_M and IQ4_XS)
- Qwen3.6-35B-A3B: 3 profiles (fast/thinking/extended, non-MTP)
- Uncensored: 3 profiles (HauhauCS, Genesis APEX)
- Add pytest.ini for test discovery
- All profiles use KV cache quantization (q8_0/q4_0) for 64K-128K context
- Embedded sampling parameters per model family
- Based on research from r/LocalLLaMA, Unsloth benchmarks, HF model cards
2026-06-15 12:34:46 +00:00
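Note on the KV cache quantization bullet above: a rough sketch of why q8_0/q4_0 cache types matter at 64K-128K context. It assumes ggml-style block quantization (32 values plus one fp16 scale per block, so 34 bytes for q8_0 and 18 bytes for q4_0) and full attention in every layer. The layer count, KV head count, and head dimension below are hypothetical placeholders, not values taken from any profile in manifest.yaml. Models with sliding-window or hybrid attention will need less than this estimate.

```python
# Bytes per cached element. Assumes ggml-style blocks of 32 values
# with one fp16 scale: q8_0 = 34 bytes/block, q4_0 = 18 bytes/block.
BYTES_PER_ELEMENT = {
    "f16": 2.0,
    "q8_0": 34 / 32,
    "q4_0": 18 / 32,
}


def kv_cache_bytes(n_layers, n_kv_heads, head_dim, n_ctx,
                   type_k="f16", type_v="f16"):
    """Estimate KV cache size in bytes for a full-attention model."""
    # Elements stored per token for K (and the same count again for V).
    elements_per_token = n_layers * n_kv_heads * head_dim
    k_bytes = elements_per_token * BYTES_PER_ELEMENT[type_k]
    v_bytes = elements_per_token * BYTES_PER_ELEMENT[type_v]
    return int((k_bytes + v_bytes) * n_ctx)


if __name__ == "__main__":
    # Hypothetical architecture, for illustration only.
    n_layers, n_kv_heads, head_dim = 48, 8, 128
    for n_ctx in (65536, 131072):
        for cache_type in ("f16", "q8_0", "q4_0"):
            size = kv_cache_bytes(n_layers, n_kv_heads, head_dim, n_ctx,
                                  type_k=cache_type, type_v=cache_type)
            print(f"ctx={n_ctx:>6} cache={cache_type:<4} {size / 2**30:.2f} GiB")
```

With these placeholder dimensions, a 128K context comes to 24 GiB at f16, 12.75 GiB at q8_0, and 6.75 GiB at q4_0, which is the kind of headroom the profiles presumably rely on.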
deploy feat: add 15 model profiles to manifest.yaml 2026-06-15 12:34:46 +00:00
docs Added next changes 2026-06-15 00:09:31 +00:00
sidecar Epic: Model Switching via Sidecar — Issues #2-#3 2026-06-15 00:49:24 +00:00
tests Epic: Model Switching via Sidecar — Issues #4-#7 + #8 deployment 2026-06-15 01:13:36 +00:00
.env .env 2026-06-09 13:57:22 +03:00
.gitignore Epic: Model Switching via Sidecar — Issues #2-#3 2026-06-15 00:49:24 +00:00
CONTEXT.md Epic: Model Switching via Sidecar — Issues #4-#7 + #8 deployment 2026-06-15 01:13:36 +00:00
docker-compose.yml Epic: Model Switching via Sidecar — Issues #4-#7 + #8 deployment 2026-06-15 01:13:36 +00:00
Dockerfile Initial commit: migrate intelligence-router files 2026-06-09 11:48:43 +01:00
main.py Epic: Model Switching via Sidecar — Issues #4-#7 + #8 deployment 2026-06-15 01:13:36 +00:00
pytest.ini feat: add 15 model profiles to manifest.yaml 2026-06-15 12:34:46 +00:00
requirements.txt Epic: Model Switching via Sidecar — Issues #4-#7 + #8 deployment 2026-06-15 01:13:36 +00:00