Containerized local LLM stack for the Framework Desktop / Strix Halo,
plus the OpenCode harness on the Mac side.
- pyinfra/framework/: pyinfra deploy targeting the box
- llama.cpp (Vulkan), vLLM (ROCm), Ollama (ROCm with HSA override
for gfx1151), OpenWebUI
- Beszel (host + container + AMD GPU dashboard via sysfs)
- OpenLIT (LLM fleet metrics)
- Phoenix (per-trace agent waterfall)
- OpenHands (autonomous agent in a Docker sandbox)
- opencode/: OpenCode config + Phoenix bridge plugin (OTel exporter)
- install.sh deploys to ~/.config/opencode/
- StrixHaloSetup.md / StrixHaloMemory.md / Roadmap.md / TODO.md:
documentation and planning
- testing/qwen3-coder-30b/: small evaluation harness
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
pyinfra
One folder per station. Each subfolder is a self-contained pyinfra
deploy: inventory.py, deploy.py, run.sh, plus any compose files
or assets that ship to the host.
| Station | Host | Notes |
|---|---|---|
framework/ |
10.0.0.237 |
Framework Desktop (Strix Halo, 128 GB) — local LLM box |
To bring up a station, cd into its folder and run ./run.sh. See the
station's own README for prerequisites.