GLM-TTS is a Windows app whose latest release can be downloaded as GLM-TTSsourcecode.tar.gz. It can be run online on OnWorks, a free hosting provider for workstations.
Download this app, GLM-TTS, and run it online with OnWorks for free.
Follow these instructions to run this app:
- 1. Download this application to your PC.
- 2. Open our file manager at https://www.onworks.net/myfiles.php?username=XXXXX using the username of your choice.
- 3. Upload this application to that file manager.
- 4. Start any OnWorks online OS emulator from this website, preferably the Windows online emulator.
- 5. From the OnWorks Windows OS you have just started, go to our file manager at https://www.onworks.net/myfiles.php?username=XXXXX using the username of your choice.
- 6. Download the application and install it.
- 7. Download Wine from your Linux distribution's software repositories. Once it is installed, you can double-click the app to run it with Wine. You can also try PlayOnLinux, a graphical front end for Wine that helps you install popular Windows programs and games.
Wine is an open-source Windows compatibility layer that runs Windows programs directly on a Linux desktop, with no copy of Windows required. In essence, Wine re-implements enough of Windows from scratch to run Windows applications without Windows itself.
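Because the release is distributed as a gzip-compressed tar archive, it has to be unpacked before its contents can be used. The following is a minimal Python sketch of that step, not an official installer: the function name `extract_archive` and the destination folder are illustrative assumptions, and only the archive name GLM-TTSsourcecode.tar.gz comes from this page.

```python
import tarfile
from pathlib import Path


def extract_archive(archive_path, dest_dir):
    """Unpack a .tar.gz archive into dest_dir and return the top-level names.

    Hypothetical helper for illustration; it is not part of GLM-TTS or OnWorks.
    """
    dest = Path(dest_dir).resolve()
    dest.mkdir(parents=True, exist_ok=True)
    with tarfile.open(archive_path, "r:gz") as tar:
        # Refuse members that would be written outside the destination folder.
        for member in tar.getmembers():
            target = (dest / member.name).resolve()
            if target != dest and dest not in target.parents:
                raise ValueError(f"unsafe path in archive: {member.name}")
        tar.extractall(dest)
    return sorted(p.name for p in dest.iterdir())


if __name__ == "__main__":
    archive = Path("GLM-TTSsourcecode.tar.gz")
    if archive.exists():
        # "glm-tts-src" is an assumed folder name; any writable folder works.
        print(extract_archive(archive, "glm-tts-src"))
    else:
        print(f"{archive} not found; download it first")
```

The path check is a precaution for archives obtained from a mirror; it rejects entries that try to escape the chosen folder.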
DESCRIPTION:
GLM-TTS is an advanced text-to-speech synthesis system built on large language model technologies. It focuses on producing high-quality, expressive, and controllable spoken output, with features such as emotion modulation and zero-shot voice cloning. It uses a two-stage architecture: a generative LLM first converts text into intermediate speech token sequences, and a flow-based neural model then converts those tokens into natural audio waveforms, enabling rich prosody and voice character even for unseen speakers. The system introduces a multi-reward reinforcement learning framework that jointly optimizes for voice similarity, emotional expressiveness, pronunciation, and intelligibility, yielding output that can rival commercial options in naturalness and expressiveness. GLM-TTS also supports phoneme-level control and hybrid text + phoneme input, giving developers precise control over pronunciation, which is critical for multilingual text and for languages rich in polyphonic characters.
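The two-stage data flow described above (text to discrete speech tokens, then tokens to a waveform) can be sketched with toy stand-ins. This is only an illustration of the pipeline shape: the function names, the vocabulary size, and the sample counts are assumptions, and the real system uses a generative LLM and a flow-based model where this sketch uses a character hash and sine tones.

```python
import math
from typing import List


def text_to_speech_tokens(text: str, vocab_size: int = 1024) -> List[int]:
    """Stage 1 stand-in: map text to discrete speech tokens.

    A real system would run a generative LLM here; this toy version
    derives one token per character so the data flow is visible.
    """
    return [(ord(ch) * 31 + i) % vocab_size for i, ch in enumerate(text)]


def speech_tokens_to_waveform(
    tokens: List[int], samples_per_token: int = 160, sample_rate: int = 16000
) -> List[float]:
    """Stage 2 stand-in: turn tokens into audio samples.

    A real system would run a flow-based neural model here; this toy
    version emits a short sine tone whose pitch depends on the token.
    """
    waveform: List[float] = []
    for tok in tokens:
        freq = 100.0 + (tok % 400)
        for n in range(samples_per_token):
            waveform.append(math.sin(2.0 * math.pi * freq * n / sample_rate))
    return waveform


def synthesize(text: str) -> List[float]:
    """Chain the two stages: text -> tokens -> waveform."""
    return speech_tokens_to_waveform(text_to_speech_tokens(text))


if __name__ == "__main__":
    samples = synthesize("hello")
    print(len(samples), "samples generated")
```

Keeping the two stages behind separate functions mirrors the split in the description: the token sequence is the only interface between them, so either stage could be replaced without touching the other.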
Features
- Zero-shot voice cloning from short prompt audio
- Multi-reward reinforcement learning for expressive prosody
- Two-stage LLM + Flow-based audio generation pipeline
- Support for phoneme-level control and hybrid inputs
- High-quality synthesis comparable to commercial TTS
- Streaming real-time speech synthesis
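One simple way to picture multi-reward optimization is to collapse several per-sample reward scores into a single scalar that a reinforcement learning step could maximize. The sketch below uses a normalized weighted sum; this page does not say how GLM-TTS actually combines its rewards, so the combination rule, the weights, and the function name `combine_rewards` are all assumptions made for illustration.

```python
from typing import Dict

# Reward names taken from the description; the scoring scale is assumed to be 0..1.
REWARD_NAMES = ("similarity", "emotion", "pronunciation", "intelligibility")


def combine_rewards(scores: Dict[str, float], weights: Dict[str, float]) -> float:
    """Return the weighted average of the reward scores named in `weights`.

    Hypothetical helper: a weighted sum is one common scalarization,
    not necessarily the one used by GLM-TTS.
    """
    missing = [name for name in weights if name not in scores]
    if missing:
        raise KeyError(f"missing reward scores: {missing}")
    total_weight = sum(weights.values())
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive value")
    return sum(weights[name] * scores[name] for name in weights) / total_weight


if __name__ == "__main__":
    equal_weights = {name: 1.0 for name in REWARD_NAMES}
    sample_scores = {
        "similarity": 0.9,
        "emotion": 0.7,
        "pronunciation": 0.8,
        "intelligibility": 0.95,
    }
    print(combine_rewards(sample_scores, equal_weights))
```

Raising one weight relative to the others shifts the optimization target toward that quality, which is the practical appeal of treating the rewards jointly instead of one at a time.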
Programming Language
Python
This application can also be fetched from https://sourceforge.net/projects/glm-tts.mirror/. It is hosted on OnWorks so that it can be run online as easily as possible from one of our free operating systems.