OBLITERATUS is a Linux app whose latest release can be downloaded as OBLITERATUSsourcecode.zip. It can be run online through OnWorks, a free hosting provider for workstations.
Download this app and run it online with OnWorks for free.
Follow these instructions to run the app:
- 1. Download this application to your PC.
- 2. Open our file manager at https://www.onworks.net/myfiles.php?username=XXXXX, using the username of your choice.
- 3. Upload the application to that file manager.
- 4. Start the OnWorks Linux online, Windows online, or macOS online emulator from this website.
- 5. From the OnWorks Linux OS you have just started, go to our file manager at https://www.onworks.net/myfiles.php?username=XXXXX, using the username of your choice.
- 6. Download the application, install it, and run it.
DESCRIPTION:
OBLITERATUS is an advanced open-source toolkit designed to analyze and modify the internal behavior of large language models by identifying and removing mechanisms responsible for refusal or restricted responses. It implements a set of techniques collectively referred to as “abliteration,” which target specific internal representations within neural networks to alter how models respond to certain prompts. Unlike traditional fine-tuning approaches, OBLITERATUS operates directly on model activations, enabling behavioral changes without retraining the model. The toolkit provides a full pipeline for probing, analyzing, and modifying model behavior, including visualization tools that help researchers understand where and how refusal mechanisms are encoded. It supports multiple analytical methods such as PCA and SVD to locate these behavioral directions within model layers.
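The idea of locating a behavioral direction with SVD and then removing it can be illustrated with a minimal sketch. This is not the OBLITERATUS API: the function names `find_direction` and `ablate` are hypothetical, the activations are synthetic NumPy arrays rather than values captured from a real model, and the choice to run SVD on activations shifted by the mean of the second group is an assumption made for illustration.

```python
import numpy as np


def find_direction(harmful, harmless):
    """Return a unit vector for the dominant direction separating two activation sets.

    Both inputs are arrays of shape (n_prompts, hidden_dim).
    Hypothetical helper, not part of OBLITERATUS.
    """
    # Shift the first group by the mean of the second, so the dominant
    # component of the result is the offset between the two groups.
    diffs = harmful - harmless.mean(axis=0)
    # Rows of vt are right singular vectors, ordered by singular value.
    _, _, vt = np.linalg.svd(diffs, full_matrices=False)
    return vt[0]  # unit norm by construction


def ablate(acts, direction):
    """Remove the component of each activation along `direction`."""
    coeffs = acts @ direction
    return acts - np.outer(coeffs, direction)


# Synthetic stand-in for model activations (assumption: 16-dim hidden state,
# with the "refusal" signal planted along coordinate 3).
rng = np.random.default_rng(0)
hidden_dim = 16
true_dir = np.zeros(hidden_dim)
true_dir[3] = 1.0
harmless = rng.normal(size=(200, hidden_dim))
harmful = rng.normal(size=(200, hidden_dim)) + 5.0 * true_dir

direction = find_direction(harmful, harmless)
cleaned = ablate(harmful, direction)

# How well the recovered direction matches the planted one (sign-agnostic),
# and how much of it is left in the activations after ablation.
alignment = abs(float(direction @ true_dir))
residual = float(np.abs(cleaned @ direction).max())
print(f"alignment={alignment:.3f}, residual={residual:.2e}")
```

In this toy setup the projection step leaves the rest of each activation vector untouched, which is the sense in which behavior can be changed without retraining. A real workflow would need to collect activations from specific model layers and decide where to apply the projection, which this sketch does not cover.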
Features
- Identification and removal of refusal behaviors in language models
- Techniques such as PCA and SVD for analyzing model activations
- Modification of model behavior without retraining
- Visualization tools for understanding internal model representations
- Python API for advanced experimentation and integration
- Optional telemetry for contributing to collaborative research
Programming Language
Python
This application can also be fetched from https://sourceforge.net/projects/obliteratus.mirror/. It is hosted on OnWorks so that it can be run online in the easiest way from one of our free operating systems.