FastRTC
FastRTC is a Python library designed to
simplify real-time communication (RTC),
especially for audio and video streaming
applications. It abstracts away much o...
Enter
MetaVoice-1B
MetaVoice in the form of its source
repository metavoice-src is a
large-scale text-to-speech (TTS) model.
Specifically, the base model
(MetaVoice-1B) use...
Enter
Equalizer APO
Equalizer APO is a parametric / graphic
equalizer for Windows. It is implemented
as an Audio Processing Object (APO) for
the system effect infrastructure intro...
Enter
abogen
abogen is a tool designed to generate
audiobooks (or speech narrations) from
textual sources such as EPUBs, PDFs, or
plain text, with synchronized captions.
In...
Enter
Jupyter Notebook Tools for Sphinx
nbsphinx is a Sphinx extension that
provides a source parser for *.ipynb
files. Custom Sphinx directives are used
to show Jupyter Notebook code cells (and
of c...
Enter
MooseStack
MooseStack is an opinionated starter
stack that assembles a modern web
application foundationproject
structure, build tooling, and deployment
scriptsso teams...
Enter
WhisperLive
WhisperLive is a nearly live
implementation of OpenAIs Whisper model
focused on real-time transcription. It
runs as a serverclient system in which
the serv...
Enter
MARS5
MARS5-TTS is CAMB.AIs open-source
English speech model designed for
high-quality text-to-speech and voice
emulation. It uses a two-stage
architecture that com...
Enter
TTS WebUI
TTS-WebUI is a unified Gradio + React
web interface that brings together a
large ecosystem of text-to-speech, voice
conversion, and audio generation models
und...
Enter