Add Wyoming voice stack to pyinfra + landscape doc

- Move piper-compose.yaml / whisper-compose.yaml from repo root into
  pyinfra/framework/compose/{piper,whisper}.yml; bind paths shifted to
  /srv/docker/{piper,whisper}/data on the box.
- deploy.py registers both stacks and provisions the data dirs.
- Homepage gets a "Voice" group with informational tiles (Wyoming has
  no web UI, so tiles show container status without click-through).
- New VoiceModels.md captures the May 2026 STT/TTS landscape, why the
  current Wyoming defaults aren't SOTA, and concrete upgrade paths
  (whisper-large-v3-turbo + faster-whisper-server, Kokoro, Sesame CSM,
  F5-TTS for cloning).

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
26
pyinfra/framework/compose/whisper.yml
Normal file
@@ -0,0 +1,26 @@
# Wyoming Whisper — speech-to-text over the Wyoming protocol.
# https://github.com/rhasspy/wyoming-whisper
#
# Wyoming is Home Assistant's voice protocol; it's also consumable by any
# Wyoming client. No web UI — this is a protocol server on TCP :10300.
#
# Model selection: `tiny-int8` is the smallest viable model (~75 MB),
# fast and good enough for command-style transcription. Bump to
# `base-int8` (140 MB) or `small-int8` (480 MB) for general dictation.
# Models are downloaded into /srv/docker/whisper/data on first start.
services:
  whisper:
    image: rhasspy/wyoming-whisper:latest
    container_name: wyoming-whisper
    restart: unless-stopped
    ports:
      - "10300:10300"
    volumes:
      - /srv/docker/whisper/data:/data
    command:
      - --model
      - tiny-int8
      - --language
      - en
      - --beam-size
      - "1"
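
Because the server has no web UI, one way to confirm it is answering on TCP :10300 is to send a Wyoming `describe` event and read the reply. The sketch below assumes the event framing described in the Wyoming project README (a JSON header line ending in a newline, then optional `data_length` bytes of extra JSON, then optional `payload_length` bytes of binary payload). The helper names `encode_event`, `read_event`, and `describe` are hypothetical and not part of any library, and the default host and port simply assume the compose port mapping shown here.

```python
import io
import json
import socket
from typing import Any, BinaryIO, Dict, Optional


def encode_event(event_type: str, data: Optional[Dict[str, Any]] = None) -> bytes:
    """Serialize an event with no payload as a single JSON header line."""
    header: Dict[str, Any] = {"type": event_type}
    if data:
        header["data"] = data
    return (json.dumps(header) + "\n").encode("utf-8")


def read_event(stream: BinaryIO) -> Dict[str, Any]:
    """Read one event: header line, optional extra JSON data, optional payload."""
    line = stream.readline()
    if not line:
        raise ConnectionError("connection closed before an event header arrived")
    header = json.loads(line)

    # Data may be inline in the header, in a separate JSON blob, or both.
    data = dict(header.get("data") or {})
    data_length = header.get("data_length") or 0
    if data_length:
        data.update(json.loads(stream.read(data_length)))

    payload_length = header.get("payload_length") or 0
    payload = stream.read(payload_length) if payload_length else b""
    return {"type": header["type"], "data": data, "payload": payload}


def describe(host: str = "127.0.0.1", port: int = 10300, timeout: float = 5.0) -> Dict[str, Any]:
    """Send `describe` and return the first event the server sends back."""
    with socket.create_connection((host, port), timeout=timeout) as sock:
        sock.sendall(encode_event("describe"))
        with sock.makefile("rb") as stream:
            return read_event(stream)


if __name__ == "__main__":
    reply = describe()
    # A healthy server is expected to answer with an `info` event.
    print(reply["type"], sorted(reply["data"].keys()))
```

Run it on the box (or point `host` at it) after the container is up. A connection error or timeout suggests the port mapping or the container itself needs a look, while any parsed reply shows the protocol server is alive.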