Update README.md

This commit is contained in:
Tater Totterson
2025-09-27 15:04:16 -05:00
committed by GitHub
parent cef3daa1a7
commit bdac2263a5

View File

@@ -3,22 +3,27 @@
<h1>microWakeWord Trainer Docker</h1> <h1>microWakeWord Trainer Docker</h1>
</div> </div>
Easily train microWakeWord detection models with this pre-built Docker image. # 🥔 MicroWakeWord Trainer Tater Approved
## Quick Start **✅ Tater Totterson tested & working on an NVIDIA RTX 3070 Laptop GPU (8 GB VRAM).**
Easily train microWakeWord detection models with this pre-built Docker image and JupyterLab notebook.
Follow these steps to get started with the microWakeWord Trainer: ---
### 1. Pull the Pre-Built Docker Image ## 🚀 Quick Start
Follow these steps to get up and running:
### 1⃣ Pull the Pre-Built Docker Image
Pull the Docker image from Docker Hub:
```bash ```bash
docker pull ghcr.io/tatertotterson/microwakeword:latest docker pull ghcr.io/tatertotterson/microwakeword:latest
``` ```
### 2. Run the Docker Container ---
### 2⃣ Run the Docker Container
Start the container with a mapped volume for saving your data and expose the Jupyter Notebook:
```bash ```bash
docker run --rm -it \ docker run --rm -it \
--gpus all \ --gpus all \
@@ -26,53 +31,58 @@ docker run --rm -it \
-v $(pwd):/data \ -v $(pwd):/data \
ghcr.io/tatertotterson/microwakeword:latest ghcr.io/tatertotterson/microwakeword:latest
``` ```
--gpus all: Enables GPU acceleration.
-p 8888:8888: Exposes the Jupyter Notebook on port 8888.
-v $(pwd):/data: Maps the current directory to the container's /data directory for saving your files.
### 3. Access Jupyter Notebook **What these flags do:**
- `--gpus all` → Enables GPU acceleration
- `-p 8888:8888` → Exposes JupyterLab on port 8888
- `-v $(pwd):/data` → Saves your work in the current folder
---
### 3⃣ Open JupyterLab
Visit [http://localhost:8888](http://localhost:8888) in your browser — the notebook UI will open.
---
### 4⃣ Set Your Wake Word
At the **top of the notebook**, find this line:
Open your web browser and navigate to:
```bash ```bash
http://localhost:8888 TARGET_WORD = "hey_tater" # Change this to your desired wake word
``` ```
The notebook interface should appear.
### 4. Edit the Wake Word Change `"hey_tater"` to your desired wake word (phonetic spellings often work best).
Locate and edit the second cell in the notebook to specify your desired wake word: ---
```bash
target_word = 'khum_puter' # Phonetic spellings may produce better samples
```
Change 'khum_puter' to your desired wake word.
### 5. Run the Notebook ### 5️⃣ Run the Notebook
Run all cells in the notebook. The process will:
Generate wake word samples. Run all cells in the notebook. This process will:
Train a detection model. - Generate wake word samples
Output a quantized .tflite model for on-device use. - Train a detection model
- Output a quantized `.tflite` model ready for on-device use
### 6. Retrieve the Trained Model and JSON ---
Once the training is complete, the quantized .tflite model and .json will be available for download. Follow the instructions in the last cell of the notebook to download the model.
### 6⃣ Retrieve the Trained Model & JSON
When training finishes, download links for both the `.tflite` model and its `.json` manifest will be displayed in the last cell.
---
## 🔄 Resetting to a Clean State
## Resetting to a Clean State
If you need to start fresh: If you need to start fresh:
### Delete your data folder: 1. Delete the `data` folder that was mapped to your Docker container.
Locate and delete the data folder that was mapped to your Docker container. 2. Restart the container using the steps above.
3. A fresh copy of the notebook will be placed into the `data` directory.
### Restart the Docker container:
Run the container again using the steps provided above.
### Fresh notebook generated:
Upon restarting, a clean version of the training notebook will be placed in the newly created data directory.
This will reset your MicroWakeWord-Training-Docker environment to its initial state.
## Credits
This project builds upon the excellent work of [kahrendt/microWakeWord](https://github.com/kahrendt/microWakeWord). A huge thank you to the original authors for their contributions to the open-source community!
---
## 🙌 Credits
This project builds upon the excellent work of [kahrendt/microWakeWord](https://github.com/kahrendt/microWakeWord).
Huge thanks to the original authors for their contributions to the open-source community!