This is the Linux app named RLax whose latest release can be downloaded as RLax0.1.8sourcecode.tar.gz. It can be run online in the free hosting provider OnWorks for workstations.
Download and run online this app named RLax with OnWorks for free.
Дотримуйтесь цих інструкцій, щоб запустити цю програму:
- 1. Завантажив цю програму на свій ПК.
- 2. Введіть у наш файловий менеджер https://www.onworks.net/myfiles.php?username=XXXXX із потрібним ім'ям користувача.
- 3. Завантажте цю програму в такий файловий менеджер.
- 4. Запустіть онлайн-емулятор OnWorks Linux або Windows або онлайн-емулятор MACOS з цього веб-сайту.
- 5. З ОС OnWorks Linux, яку ви щойно запустили, перейдіть до нашого файлового менеджера https://www.onworks.net/myfiles.php?username=XXXXX з потрібним іменем користувача.
- 6. Завантажте програму, встановіть її та запустіть.
СКРЕНИ:
RLax
ОПИС:
RLax (pronounced “relax”) is a JAX-based library developed by Google DeepMind that provides reusable mathematical building blocks for constructing reinforcement learning (RL) agents. Rather than implementing full algorithms, RLax focuses on the core functional operations that underpin RL methods—such as computing value functions, returns, policy gradients, and loss terms—allowing researchers to flexibly assemble their own agents. It supports both on-policy and off-policy learning, as well as value-based, policy-based, and model-based approaches. RLax is fully JIT-compilable with JAX, enabling high-performance execution across CPU, GPU, and TPU backends. The library implements tools for Bellman equations, return distributions, general value functions, and policy optimization in both continuous and discrete action spaces. It integrates seamlessly with DeepMind’s Haiku (for neural network definition) and Optax (for optimization), making it a key component in modular RL pipelines.
Функції
- Modular reinforcement learning primitives (values, returns, and policies)
- JAX-optimized for GPU/TPU acceleration and automatic differentiation
- Supports on-policy and off-policy learning paradigms
- Implements distributional value functions and general value functions
- Integrates with Haiku and Optax for neural network and optimization pipelines
- Comprehensive testing and examples for reproducibility and educational use
Мова програмування
Python, оболонка Unix
Категорії
This is an application that can also be fetched from https://sourceforge.net/projects/rlax.mirror/. It has been hosted in OnWorks in order to be run online in an easiest way from one of our free Operative Systems.