added models, model-swap, ...

2026-06-26 08:13:33 -04:00
parent de1635872f
commit 224afbb3a6
18 changed files with 1659 additions and 243 deletions
--- a/pyinfra/framework/compose/qwen3-235b/README.md
+++ b/pyinfra/framework/compose/qwen3-235b/README.md
@@ -53,6 +53,15 @@ Disk: needs ~90 GB free on `/models`. Pull is bandwidth-bound; expect

 ## Bring up (M0.2 — first generation)

+Easy path — `swap-model` handles the stop-conflicting-services dance + waits for `/health`:
+
+```sh
+ssh framework swap-model 235b      # ~3-5 min on first cold load
+ssh framework /srv/docker/qwen3-235b/smoke.sh    # perf measure
+```
+
+Manual equivalent (for first-ever bring-up before the image is cached):
+
 ```sh
 cd /srv/docker/qwen3-235b
 docker compose pull       # already-cached image if you ran llama first