This is the Linux app named Real-Time Voice Cloning, whose latest release can be downloaded as Real-Time-Voice-Cloningsourcecode.tar.gz. It can be run online through OnWorks, a free hosting provider for workstations.
Download this app, Real-Time Voice Cloning, and run it online with OnWorks for free.
Follow these instructions to run this app:
- 1. Download this application to your PC.
- 2. Open our file manager at https://www.onworks.net/myfiles.php?username=XXXXX using the username of your choice.
- 3. Upload this application to that file manager.
- 4. Start the OnWorks Linux online, Windows online, or macOS online emulator from this website.
- 5. From the OnWorks Linux OS you have just started, go to our file manager at https://www.onworks.net/myfiles.php?username=XXXXX using the username of your choice.
- 6. Download the application, install it and run it.
SCREENSHOTS
Real-Time Voice Cloning
DESCRIPTION
Real-Time Voice Cloning is an influential deep-learning repository that demonstrates how to clone a voice from just a few seconds of audio and then generate arbitrary speech in that voice in near real time. It implements the SV2TTS pipeline (“Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis”) in three stages: a speaker encoder, a synthesizer, and a vocoder. The speaker encoder converts short audio clips into a fixed-dimensional speaker embedding that captures voice characteristics. A Tacotron-style synthesizer then uses this embedding to generate spectrograms from text, and a WaveRNN-based vocoder finally turns those spectrograms into audio. The repo includes both a command-line demo and a graphical “toolbox” application where you can load reference voices, type text, and hear the synthesized results interactively. It also provides scripts for preprocessing datasets (such as LibriSpeech) and for training each of the three components.
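The data flow through the three stages can be illustrated with a small Python sketch. This is not the repository's code or API: every function name here (encode_speaker, synthesize_mel, vocode, clone_voice) is hypothetical, the sizes (256-dimensional embedding, 80 mel channels, 200 samples per frame, 5 frames per character) are assumptions chosen for illustration, and each stage is a trivial arithmetic stand-in rather than a neural network. It only shows what each stage takes in and hands on.

```python
import math
from typing import List

# Illustrative sizes only; these are assumptions, not values taken from the project.
EMBED_DIM = 256        # length of the fixed-size speaker embedding
N_MELS = 80            # mel channels per spectrogram frame
HOP = 200              # audio samples produced per spectrogram frame
FRAMES_PER_CHAR = 5    # spectrogram frames emitted per input character


def encode_speaker(wav: List[float]) -> List[float]:
    """Stand-in speaker encoder: audio of any length -> unit-norm vector of EMBED_DIM."""
    emb = [0.0] * EMBED_DIM
    for i, sample in enumerate(wav):
        emb[i % EMBED_DIM] += sample          # fold the audio into a fixed number of bins
    norm = math.sqrt(sum(x * x for x in emb)) or 1.0   # avoid dividing by zero on silence
    return [x / norm for x in emb]


def synthesize_mel(text: str, embed: List[float]) -> List[List[float]]:
    """Stand-in synthesizer: text + speaker embedding -> list of mel frames."""
    frames = []
    for ch in text:
        base = (ord(ch) % 32) / 32.0          # toy "content" value derived from the character
        for _ in range(FRAMES_PER_CHAR):
            # Each frame mixes text content with the speaker embedding.
            frames.append([base + embed[m] for m in range(N_MELS)])
    return frames


def vocode(mel: List[List[float]]) -> List[float]:
    """Stand-in vocoder: mel frames -> waveform with HOP samples per frame."""
    wav: List[float] = []
    for frame in mel:
        level = sum(frame) / len(frame)
        wav.extend([level] * HOP)
    return wav


def clone_voice(reference_wav: List[float], text: str) -> List[float]:
    """Chain the three stages: encoder -> synthesizer -> vocoder."""
    embed = encode_speaker(reference_wav)     # stage 1: who is speaking
    mel = synthesize_mel(text, embed)         # stage 2: what is said, in that voice
    return vocode(mel)                        # stage 3: spectrogram to audio


if __name__ == "__main__":
    reference = [math.sin(i / 10.0) for i in range(3000)]   # fake reference clip
    audio = clone_voice(reference, "hello")
    print(len(audio))
```

The point of the sketch is the interface between stages: the embedding has the same length whatever the duration of the reference clip, so the synthesizer and vocoder never see the reference audio itself.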
Features
- Full SV2TTS pipeline with encoder, synthesizer, and WaveRNN-style vocoder implemented in Python
- Ability to clone a voice from a few seconds of reference audio and synthesize arbitrary text in that voice
- GUI “toolbox” demo for interactive experimentation with multiple speakers and texts
- CLI demos (demo_cli.py) for scripted, non-GUI voice cloning workflows
- Preprocessing and training scripts for popular datasets like LibriSpeech plus automatic pretrained model download
- Supports both GPU and CPU modes via simple launch flags, making it usable on a range of hardware
Programming Language
Python
Categories
This is an application that can also be fetched from https://sourceforge.net/projects/real-time-voice-cloning.mirror/. It has been hosted in OnWorks so that it can be run online in the easiest way from one of our free operating systems.
