A desktop Text-to-Speech application powered by Kokoro 82M, providing high-quality voice synthesis through an intuitive PyQt6 interface.
- User-Friendly GUI: Clean PyQt6 interface for easy text-to-speech conversion
- Kokoro 82M Integration: Lightweight yet powerful TTS model with 82 million parameters
- Multi-Language Support: 9 language variants: American English, British English, Spanish, French, Hindi, Italian, Japanese, Brazilian Portuguese, and Mandarin Chinese
- Multiple Voices: Wide variety of male and female voices for each language
- Customizable Speech: Adjustable speed (50% - 200%) and volume control
- Real-time Playback: Instant audio playback with stop functionality
- Responsive Design: Threaded architecture keeps the UI responsive during synthesis
- Local Processing: Runs entirely on your machine without requiring external servers
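The threaded design mentioned in the feature list can be illustrated with a minimal sketch. It deliberately swaps PyQt6's threading for the standard library's `threading` and `queue` so that it runs without a GUI; `fake_synthesize`, `SynthesisWorker`, and `run_demo` are hypothetical names and are not taken from tts_app.py.

```python
import queue
import threading
import time


def fake_synthesize(text: str) -> bytes:
    """Hypothetical stand-in for the slow model call."""
    time.sleep(0.05)  # simulate synthesis latency
    return text.encode("utf-8")


class SynthesisWorker(threading.Thread):
    """Runs synthesis off the main thread and reports back through a queue."""

    def __init__(self, text: str, results: queue.Queue) -> None:
        super().__init__(daemon=True)
        self.text = text
        self.results = results
        self.stop_requested = threading.Event()  # set by a "Stop" action

    def run(self) -> None:
        try:
            audio = fake_synthesize(self.text)
        except Exception as exc:  # report failures instead of dying silently
            self.results.put(("error", str(exc)))
            return
        if self.stop_requested.is_set():
            self.results.put(("cancelled", b""))
        else:
            self.results.put(("finished", audio))


def run_demo(text: str) -> tuple:
    """Start a worker and poll for its result without blocking for long."""
    results: queue.Queue = queue.Queue()
    worker = SynthesisWorker(text, results)
    worker.start()
    ticks = 0
    while True:
        try:
            status, payload = results.get(timeout=0.01)
            break
        except queue.Empty:
            ticks += 1  # a GUI event loop would process user input here
    worker.join()
    return status, payload, ticks
```

In a PyQt6 application the same idea would typically be expressed with a worker thread that emits a signal when synthesis finishes, rather than a polled queue.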
- Python 3.12
- PyTorch (CPU or GPU support)
- No external server required - runs locally!
- Clone the repository:
git clone https://github.com/yourusername/vibe-tts.git
cd vibe-tts
- Install dependencies using uv:
uv sync
- Launch the application:
uv run python tts_app.py
- In the application:
- Enter or paste text in the text area
- Select language from the dropdown
- Choose a voice for the selected language
- Adjust speed (50% - 200%) and volume as needed
- Click "Speak" to synthesize and play audio
- Use "Stop" to halt playback
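As a rough illustration of the speed and volume controls described above, the sketch below converts slider percentages into multipliers. The function names are hypothetical, and the 0% to 100% volume range is an assumption; only the 50% to 200% speed range comes from this README.

```python
def percent_to_speed(percent: float, lo: float = 50.0, hi: float = 200.0) -> float:
    """Clamp a slider percentage to [lo, hi] and convert it to a multiplier."""
    clamped = max(lo, min(hi, percent))
    return clamped / 100.0


def scale_volume(samples: list[float], volume_percent: float) -> list[float]:
    """Apply a linear gain to audio samples (assumed volume range: 0-100%)."""
    gain = max(0.0, min(100.0, volume_percent)) / 100.0
    return [sample * gain for sample in samples]
```

Clamping before conversion means out-of-range input never reaches the synthesis step, whatever widget produced the value.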
- American English
- British English
- Spanish
- French
- Hindi
- Italian
- Japanese (requires: uv add misaki[ja])
- Brazilian Portuguese
- Mandarin Chinese (requires: uv add misaki[zh])
- American English: af_heart, af_bella, am_adam, am_michael
- British English: bf_emma, bf_isabella, bm_george, bm_lewis
- Spanish: ef_sofia, em_carlos
- French: ff_camille, fm_pierre
- And many more for each language!
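The voice IDs listed above appear to follow a two-letter prefix pattern: the first letter indicates the language and the second the gender. The sketch below decodes that pattern for the four languages whose voices are listed here; the mapping is inferred from those examples, and `describe_voice` is a hypothetical helper, not part of the application.

```python
# Prefix letters inferred from the voice IDs listed in this README.
LANGUAGE_PREFIXES = {
    "a": "American English",
    "b": "British English",
    "e": "Spanish",
    "f": "French",
}
GENDER_PREFIXES = {"f": "female", "m": "male"}


def describe_voice(voice_id: str) -> tuple[str, str, str]:
    """Split an ID such as 'af_heart' into (language, gender, name)."""
    prefix, sep, name = voice_id.partition("_")
    if not sep or len(prefix) != 2 or not name:
        raise ValueError(f"unexpected voice ID format: {voice_id!r}")
    lang_key, gender_key = prefix[0], prefix[1]
    if lang_key not in LANGUAGE_PREFIXES or gender_key not in GENDER_PREFIXES:
        raise ValueError(f"unknown voice prefix: {prefix!r}")
    return LANGUAGE_PREFIXES[lang_key], GENDER_PREFIXES[gender_key], name
```

A helper like this could be used to group voices by language when filling the voice dropdown.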
vibe-tts/
- tts_app.py # Main application with GUI and Riva integration
- main.py # Application entry point
- pyproject.toml # Project dependencies and metadata
- README.md # This file
- README_RIVA_SETUP.md # Detailed Riva server setup guide
- riva_quickstart_v2.19.0/ # Riva setup scripts and configuration
uv run pytest
uv run ruff format .
uv run ruff check . --fix
uv run pyright
- If you encounter errors with PyTorch, install it manually based on your system:
- CPU:
uv add torch --index-url https://download.pytorch.org/whl/cpu
- CUDA:
uv add torch --index-url https://download.pytorch.org/whl/cu121
- Verify system audio is working
- Check volume settings in both the app and system
- Ensure soundfile is properly installed:
uv add soundfile
- For Japanese support (quote the extra so that shells such as zsh do not treat the brackets as a glob pattern):
uv add "misaki[ja]"
- For Chinese support:
uv add "misaki[zh]"
- Some languages require espeak-ng. On Linux/WSL, install it with:
sudo apt-get install espeak-ng
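Before working through the troubleshooting items above one by one, a quick check can show which optional pieces are missing. This is a minimal sketch using only the standard library; `check_optional_dependencies` is a hypothetical helper. It only detects whether the `misaki` package is importable, not whether its `[ja]` or `[zh]` extras are installed.

```python
import importlib.util
import shutil


def check_optional_dependencies() -> dict[str, bool]:
    """Report whether each optional dependency can be found on this machine."""
    return {
        # espeak-ng is a command-line tool, so look for it on PATH.
        "espeak-ng": shutil.which("espeak-ng") is not None,
        # Python packages: find_spec returns None when a module is absent.
        "soundfile": importlib.util.find_spec("soundfile") is not None,
        "misaki": importlib.util.find_spec("misaki") is not None,
    }


if __name__ == "__main__":
    for name, found in check_optional_dependencies().items():
        print(f"{name}: {'found' if found else 'missing'}")
```

Run it inside the project environment (for example with uv run python) so that it inspects the same packages the application uses.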
- GPU acceleration is used automatically when CUDA is available
- On Apple Silicon Macs, the app uses MPS acceleration automatically
- The first run may be slower while the model is downloaded and cached
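The device selection described above could look roughly like the following. This is a sketch of a common PyTorch pattern, not the application's actual code; `select_device` and `detect_device` are hypothetical names, and preferring CUDA over MPS is an assumption.

```python
def select_device(cuda_available: bool, mps_available: bool) -> str:
    """Pick the fastest available backend, falling back to the CPU."""
    if cuda_available:
        return "cuda"
    if mps_available:
        return "mps"
    return "cpu"


def detect_device() -> str:
    """Query PyTorch for available backends; use the CPU if torch is missing."""
    try:
        import torch
    except ImportError:
        return "cpu"
    # Older PyTorch builds may not expose torch.backends.mps at all.
    mps_backend = getattr(torch.backends, "mps", None)
    mps_ok = mps_backend is not None and mps_backend.is_available()
    return select_device(torch.cuda.is_available(), mps_ok)
```

Keeping the decision in a pure function makes the priority order easy to test without a GPU.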
- Fork the repository
- Create a feature branch:
git checkout -b feature-name
- Make your changes following the guidelines in CLAUDE.md
- Run tests and formatters
- Commit your changes
- Push to your fork and submit a pull request
This project is licensed under the MIT License. See the LICENSE file for details.
- Built with Kokoro 82M by Hexgrad
- UI powered by PyQt6