Containerized local LLM stack for the Framework Desktop / Strix Halo,
plus the OpenCode harness on the Mac side.
- pyinfra/framework/: pyinfra deploy targeting the box
- llama.cpp (Vulkan), vLLM (ROCm), Ollama (ROCm with HSA override
for gfx1151), OpenWebUI
- Beszel (host + container + AMD GPU dashboard via sysfs)
- OpenLIT (LLM fleet metrics)
- Phoenix (per-trace agent waterfall)
- OpenHands (autonomous agent in a Docker sandbox)
- opencode/: OpenCode config + Phoenix bridge plugin (OTel exporter)
- install.sh deploys to ~/.config/opencode/
- StrixHaloSetup.md / StrixHaloMemory.md / Roadmap.md / TODO.md:
documentation and planning
- testing/qwen3-coder-30b/: small evaluation harness
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
13 lines
463 B
Markdown
13 lines
463 B
Markdown
# pyinfra
|
|
|
|
One folder per station. Each subfolder is a self-contained pyinfra
|
|
deploy: `inventory.py`, `deploy.py`, `run.sh`, plus any compose files
|
|
or assets that ship to the host.
|
|
|
|
| Station | Host | Notes |
|
|
|---------|------|-------|
|
|
| [`framework/`](framework/README.md) | `10.0.0.237` | Framework Desktop (Strix Halo, 128 GB) — local LLM box |
|
|
|
|
To bring up a station, `cd` into its folder and run `./run.sh`. See the
|
|
station's own README for prerequisites.
|