This Python library handles text-to-speech conversion for applications needing low latency audio. It's designed to work with generators and LLM token streams, supporting local playback, streaming to other processes, or writing WAV files. It's built to integrate with various engines, from system voices to cloud APIs and local neural models.
Turn text into speech with minimal delay.
Developers building applications that require real-time or near real-time audio output from text.