added models, model-swap, ...
@@ -53,6 +53,15 @@ Disk: needs ~90 GB free on `/models`. Pull is bandwidth-bound; expect
## Bring up (M0.2 — first generation)

Easy path: `swap-model` stops any conflicting services, starts the requested model, and waits for `/health`:

```sh
ssh framework swap-model 235b                       # ~3-5 min on first cold load
ssh framework /srv/docker/qwen3-235b/smoke.sh       # perf measure
```
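
If `swap-model` is not available, the wait-for-`/health` step can be approximated by hand. The following is a minimal sketch, not the actual `swap-model` implementation: `wait_for_health` is a hypothetical helper, and the host and port in the usage comment are placeholders that do not come from this repo.

```sh
# Hypothetical helper (not part of swap-model): retry a probe command
# until it succeeds or a deadline passes.
# Usage: wait_for_health TIMEOUT_SECONDS INTERVAL_SECONDS PROBE [ARGS...]
wait_for_health() {
    timeout_s=$1
    interval_s=$2
    shift 2
    elapsed=0
    # Run the probe; on failure, sleep and retry until the timeout is reached.
    while ! "$@" >/dev/null 2>&1; do
        if [ "$elapsed" -ge "$timeout_s" ]; then
            echo "health check timed out after ${elapsed}s" >&2
            return 1
        fi
        sleep "$interval_s"
        elapsed=$((elapsed + interval_s))
    done
    return 0
}

# Example call (URL is an assumption; substitute the real host and port):
# wait_for_health 600 5 curl -fsS http://localhost:8080/health
```

Passing the probe as a command keeps the retry loop independent of how health is checked, so the same helper works with `curl`, `docker inspect`, or any other check that signals readiness through its exit code.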

Manual equivalent (for first-ever bring-up before the image is cached):

```sh
cd /srv/docker/qwen3-235b
docker compose pull                                 # already-cached image if you ran llama first