microWakeWord-Trainer-Nvidia-Docker

taco/microWakeWord-Trainer-Nvidia-Docker

mirror of https://github.com/TaterTotterson/microWakeWord-Trainer-Nvidia-Docker.git synced 2026-06-12 20:10:19 -06:00

Author	SHA1	Message	Date
MasterPhooey	94903783cb	blackwell/wham & chim datasets	2026-03-09 19:48:35 -05:00
MasterPhooey	747822e856	wsl2 fix	2026-02-01 08:45:12 -06:00
MasterPhooey	423bbd15f5	tweaks	2026-01-21 06:03:57 -06:00
Tater Totterson	8669ccdfd4	Improve CUDA/XLA setup and CPU fallback	2026-01-20 20:47:58 -06:00
MasterPhooey	d5e8d187a1	personal samples	2026-01-17 23:01:50 -06:00
MasterPhooey	2b9aa95903	personal samples	2026-01-17 20:24:43 -06:00
MasterPhooey	c52f92d3c9	cli + web recorder ui	2026-01-17 16:23:24 -06:00
MasterPhooey	5bc0f12a7f	cli + web recorder ui	2026-01-17 01:23:51 -06:00
George Joseph	cb81f7f02d	Train from the command line The files in the `cli` directory allow you to train wake words from the command line without needing to use the Jupyter notebook or a web browser. Basically, the logic from the notebook has been placed in separate shell scripts and python files wrapped by 3 high-level scripts that do the following: * setup_python_venv: Creates a Python virtual environment with all the packages needed to train. The venv is created in the container's /data directory and is therefore stored on the host, not in the container's root docker volume. * setup_training_datasets: Downloads, extracts and converts the MIT RIR, FMA, Audioset and Negative training reference datasets. Also stored in /data. * train_wake_word: Generates the wake word samples, augments them with the audio from the training datasets, and finally runs the microwakeword training. The resulting model tflite and json files are placed in the /data/output directory. See the README.md file for much more information.	2025-12-28 12:48:51 -07:00

9 Commits