Tacotron-2
Tacotron-2 is a TensorFlow
implementation of DeepMind's Tacotron-2
end-to-end text-to-speech architecture,
which predicts mel spectrograms from raw
text and th...
HiFi-GAN
HiFi-GAN is a GAN-based neural vocoder
designed to generate high-fidelity
speech waveforms from mel spectrograms
with exceptional efficiency. It
introduces a g...
VoxCPM
VoxCPM is a tokenizer-free
text-to-speech system that models speech
in a continuous space, aiming for
extremely realistic, context-aware
synthesis and true-to-...
Auto Synced Translated Dubs
Auto-Synced-Translated-Dubs is a
toolchain that automatically translates
and re-dubs videos using AI voices while
keeping the new speech aligned to the
origina...
VideLibri
VideLibri lists the books you have
borrowed from a public library and lets
you search the library catalog from your
local device. It has all the usual
features...
jackson-core
This project contains core low-level
incremental ("streaming") parser
and generator abstractions used by
Jackson Data Processor. It also includes
the d...
DeepSeekMath-V2
DeepSeekMath-V2 is a large-scale
open-source AI model designed
specifically for advanced mathematical
reasoning, theorem proving, and rigorous
proof verificati...
IMS Toucan
IMS-Toucan is a toolkit for training,
using, and teaching state-of-the-art
text-to-speech systems, built at the
Institute for Natural Language
Processing (IMS)...
Parallel WaveGAN
Parallel WaveGAN is an unofficial
PyTorch implementation of several
state-of-the-art non-autoregressive
neural vocoders, centered on Parallel
WaveGAN but also ...