Files
microWakeWord-Trainer-Nvidi…/README.md
Tater Totterson bdac2263a5 Update README.md
2025-09-27 15:04:16 -05:00

89 lines
2.3 KiB
Markdown
Raw Blame History

This file contains invisible Unicode characters
This file contains invisible Unicode characters that are indistinguishable to humans but may be processed differently by a computer. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.
This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.
<div align="center">
<img src="https://raw.githubusercontent.com/TaterTotterson/microWakeWord-Trainer-Nvidia-Docker/refs/heads/main/mmw.png" alt="MicroWakeWord Trainer Logo" width="100" />
<h1>microWakeWord Trainer Docker</h1>
</div>
# 🥔 MicroWakeWord Trainer Tater Approved
**✅ Tater Totterson tested & working on an NVIDIA RTX 3070 Laptop GPU (8 GB VRAM).**
Easily train microWakeWord detection models with this pre-built Docker image and JupyterLab notebook.
---
## 🚀 Quick Start
Follow these steps to get up and running:
### 1⃣ Pull the Pre-Built Docker Image
```bash
docker pull ghcr.io/tatertotterson/microwakeword:latest
```
---
### 2⃣ Run the Docker Container
```bash
docker run --rm -it \
--gpus all \
-p 8888:8888 \
-v $(pwd):/data \
ghcr.io/tatertotterson/microwakeword:latest
```
**What these flags do:**
- `--gpus all` → Enables GPU acceleration
- `-p 8888:8888` → Exposes JupyterLab on port 8888
- `-v $(pwd):/data` → Saves your work in the current folder
---
### 3⃣ Open JupyterLab
Visit [http://localhost:8888](http://localhost:8888) in your browser — the notebook UI will open.
---
### 4⃣ Set Your Wake Word
At the **top of the notebook**, find this line:
```bash
TARGET_WORD = "hey_tater" # Change this to your desired wake word
```
Change `"hey_tater"` to your desired wake word (phonetic spellings often work best).
---
### 5⃣ Run the Notebook
Run all cells in the notebook. This process will:
- Generate wake word samples
- Train a detection model
- Output a quantized `.tflite` model ready for on-device use
---
### 6⃣ Retrieve the Trained Model & JSON
When training finishes, download links for both the `.tflite` model and its `.json` manifest will be displayed in the last cell.
---
## 🔄 Resetting to a Clean State
If you need to start fresh:
1. Delete the `data` folder that was mapped to your Docker container.
2. Restart the container using the steps above.
3. A fresh copy of the notebook will be placed into the `data` directory.
---
## 🙌 Credits
This project builds upon the excellent work of [kahrendt/microWakeWord](https://github.com/kahrendt/microWakeWord).
Huge thanks to the original authors for their contributions to the open-source community!