Name: Real-Time Voice Cloning download for Linux
Brand: OnWorks
SKU: e7baf0fe3a39371060ef8c7838294acf
Availability: OnlineOnly
Rating: 4.93 (2205 reviews)

This is the Linux app named Real-Time Voice Cloning, whose latest release can be downloaded as Real-Time-Voice-Cloningsourcecode.zip. It can be run online through OnWorks, a free hosting provider for workstations.

Download this app, Real-Time Voice Cloning, and run it online with OnWorks for free.

Follow these instructions to run this app:

- 1. Download this application to your PC.

- 2. Open our file manager at https://www.onworks.net/myfiles.php?username=XXXXX, using the username that you want.

- 3. Upload this application to that file manager.

- 4. Start the OnWorks Linux online, Windows online, or macOS online emulator from this website.

- 5. From the OnWorks Linux OS you have just started, go to our file manager at https://www.onworks.net/myfiles.php?username=XXXXX, using the username that you want.

- 6. Download the application, install it, and run it.

DESCRIPTION

Real-Time Voice Cloning is an influential deep-learning repository that demonstrates how to clone a voice from just a few seconds of audio and then generate arbitrary speech in that voice in near real time. It implements the SV2TTS pipeline (“Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis”) in three stages: a speaker encoder, a synthesizer, and a vocoder. In the first stage, short audio clips are converted into a fixed-dimensional speaker embedding that captures voice characteristics. In the second, a Tacotron-style synthesizer uses this embedding to generate spectrograms from text. In the third, a WaveRNN-based vocoder turns those spectrograms into audio. The repo includes both a command-line demo and a graphical “toolbox” application where you can load reference voices, type text, and hear the synthesized results interactively. It also provides scripts for preprocessing datasets (such as LibriSpeech) and for training each of the three components.
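The three-stage data flow described above can be illustrated with a toy sketch. This is not the repository's code or API: the function names (`encode_speaker`, `synthesize_spectrogram`, `vocode`, `clone_voice`), the embedding size, and the arithmetic inside each stage are hypothetical stand-ins for the neural networks, chosen only to show how a fixed-size speaker embedding conditions the spectrogram that the vocoder then turns into samples.

```python
import math
from typing import List

EMBEDDING_DIM = 8  # toy size chosen for illustration, not the real model's


def encode_speaker(waveform: List[float], dim: int = EMBEDDING_DIM) -> List[float]:
    """Stage 1 stand-in: map audio of any length to a fixed-size vector."""
    if not waveform:
        raise ValueError("reference audio is empty")
    chunk = max(1, math.ceil(len(waveform) / dim))
    features = []
    for i in range(dim):
        part = waveform[i * chunk:(i + 1) * chunk]
        # Mean absolute amplitude per chunk; zero when the audio runs out.
        features.append(sum(abs(x) for x in part) / len(part) if part else 0.0)
    # Scale to unit length (left unchanged if the audio is pure silence).
    norm = math.sqrt(sum(f * f for f in features)) or 1.0
    return [f / norm for f in features]


def synthesize_spectrogram(text: str, embedding: List[float]) -> List[List[float]]:
    """Stage 2 stand-in: one frame per character, scaled by the embedding."""
    return [[(ord(ch) % 32) / 32.0 * e for e in embedding] for ch in text]


def vocode(spectrogram: List[List[float]], samples_per_frame: int = 4) -> List[float]:
    """Stage 3 stand-in: expand each frame into a few audio samples."""
    audio: List[float] = []
    for frame in spectrogram:
        level = sum(frame) / len(frame)
        audio.extend(
            level * math.sin(2 * math.pi * n / samples_per_frame)
            for n in range(samples_per_frame)
        )
    return audio


def clone_voice(reference_audio: List[float], text: str) -> List[float]:
    """Chain the three stages: audio -> embedding -> spectrogram -> audio."""
    embedding = encode_speaker(reference_audio)
    spectrogram = synthesize_spectrogram(text, embedding)
    return vocode(spectrogram)
```

The point of the sketch is the interface between stages: the embedding has the same length whether the reference clip is short or long, and only that vector, not the reference audio itself, is passed on to the synthesizer.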

Features

Full SV2TTS pipeline with encoder, synthesizer, and WaveRNN-style vocoder implemented in Python
Ability to clone a voice from a few seconds of reference audio and synthesize arbitrary text in that voice
GUI “toolbox” demo for interactive experimentation with multiple speakers and texts
CLI demos (demo_cli.py) for scripted, non-GUI voice cloning workflows
Preprocessing and training scripts for popular datasets like LibriSpeech plus automatic pretrained model download
Supports both GPU and CPU modes via simple launch flags, making it usable on a range of hardware
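As a rough illustration of the launch-flag idea in the last feature, the sketch below builds a command line for the CLI demo. The helper `build_demo_command` is hypothetical, and the `--cpu` flag name is an assumption; check the script's own `--help` output before relying on it.

```python
import sys
from typing import List


def build_demo_command(script: str = "demo_cli.py", use_cpu: bool = False) -> List[str]:
    """Return the argument list for launching a demo script."""
    command = [sys.executable, script]
    if use_cpu:
        # Assumed flag name for forcing CPU mode; verify with --help.
        command.append("--cpu")
    return command


if __name__ == "__main__":
    # Print the command instead of running it, so the sketch has no side effects.
    print(" ".join(build_demo_command(use_cpu=True)))
```

The returned list could be passed to `subprocess.run` from the directory that contains the script.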

Programming Language

Python
