Add OpenAI-compatible voice servers (faster-whisper + Kokoro)

Path B from VoiceModels.md — adds two new compose stacks alongside the
Wyoming pair so OpenWebUI/Conduit get voice without a Wyoming-shim:

- compose/faster-whisper.yml — fedirz/faster-whisper-server CPU image,
  large-v3-turbo by default, OpenAI /v1/audio/transcriptions on :8001.
  Built-in web UI for ad-hoc transcription.
- compose/kokoro.yml — ghcr.io/remsky/kokoro-fastapi-cpu, Kokoro-82M,
  OpenAI /v1/audio/speech on :8880.

Both run alongside (not instead of) Wyoming Whisper + Piper — Wyoming
keeps serving HA Assist, OpenAI-API serves OpenWebUI / Conduit. Memory
budget on Strix Halo accommodates everything plus Qwen3-Coder loaded
concurrently with plenty of headroom.

Homepage gets dedicated tiles for both. README documents the
OpenWebUI Audio configuration that wires the new endpoints. Conduit
inherits voice via OpenWebUI without app-side setup.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
This commit is contained in:
2026-05-08 14:42:45 -04:00
parent 36b8cfe835
commit 6db46d8f6a
7 changed files with 157 additions and 27 deletions

View File

@@ -74,20 +74,35 @@
container: phoenix
- Voice:
# Wyoming-protocol services have no web UI; tiles are informational
# (container status + port). Click-through goes nowhere meaningful.
- Whisper:
# Wyoming-protocol services have no web UI; tiles are informational.
# The OpenAI-compatible servers (faster-whisper, Kokoro) have UIs /
# APIs you can hit directly.
- Whisper (Wyoming):
icon: mdi-microphone-message
description: Speech-to-text (Wyoming :10300)
description: STT for Home Assistant Assist (Wyoming :10300)
server: localhost-docker
container: wyoming-whisper
- Piper:
- Piper (Wyoming):
icon: mdi-account-voice
description: Text-to-speech (Wyoming :10200)
description: TTS for Home Assistant Assist (Wyoming :10200)
server: localhost-docker
container: wyoming-piper
- faster-whisper:
icon: mdi-microphone
href: http://framework:8001
description: STT (OpenAI API) — large-v3-turbo, used by OpenWebUI/Conduit
server: localhost-docker
container: faster-whisper
- Kokoro:
icon: mdi-account-music
href: http://framework:8880/web
description: TTS (OpenAI API) — Kokoro-82M, used by OpenWebUI/Conduit
server: localhost-docker
container: kokoro
- External:
- SearXNG:
icon: searxng.svg