Transformer Debugger is a Linux app whose latest release can be downloaded as transformer-debuggersourcecode.tar.gz. It can be run online through OnWorks, a free hosting provider for workstations.
Download this app, Transformer Debugger, and run it online with OnWorks for free.
Follow these instructions to run this app:
- 1. Download this application to your PC.
- 2. Open our file manager at https://www.onworks.net/myfiles.php?username=XXXXX, using the username that you want.
- 3. Upload this application to that file manager.
- 4. Start the OnWorks Linux online, Windows online, or macOS online emulator from this website.
- 5. From the OnWorks Linux OS you have just started, go to our file manager at https://www.onworks.net/myfiles.php?username=XXXXX, using the username that you want.
- 6. Download the application, install it, and run it.
Transformer Debugger
DESCRIPTION:
Transformer Debugger (TDB) is a research tool developed by OpenAI’s Superalignment team to investigate and interpret the behaviors of small language models. It combines automated interpretability methods with sparse autoencoders, enabling researchers to analyze how specific neurons, attention heads, and latent features contribute to a model’s outputs. TDB allows users to intervene directly in the forward pass of a model and observe how such interventions change predictions, making it possible to answer questions such as why a token was selected or why an attention head focused on a certain input. It automatically identifies and explains the most influential components, highlights activation patterns, and maps relationships across circuits within the model. The tool includes both a React-based neuron viewer for exploring model components and a backend activation server for running inference and serving data.
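The idea of intervening in a forward pass and ranking components by their influence can be illustrated with a toy example. The sketch below does not use TDB's code or API; it assumes a hypothetical two-layer network with hand-picked weights (`W_IN`, `W_OUT`) and illustrative helper names (`forward`, `rank_neurons_by_effect`), zero-ablates one hidden neuron at a time, and measures how the logit difference between two output tokens changes.

```python
import math

# Hypothetical toy weights, chosen for illustration only (not from TDB or GPT-2).
W_IN = [
    [1.0, -0.5],
    [0.5, 1.0],
    [-1.0, 0.25],
]  # 3 hidden neurons x 2 inputs
W_OUT = [
    [2.0, -1.0, 0.5],
    [-1.0, 1.5, 0.5],
]  # 2 output "tokens" x 3 hidden neurons


def forward(x, ablate=None):
    """Run the toy model; if `ablate` is a neuron index, zero its activation."""
    # ReLU hidden layer
    hidden = [max(0.0, sum(w * xi for w, xi in zip(row, x))) for row in W_IN]
    if ablate is not None:
        hidden[ablate] = 0.0  # the intervention: zero-ablate one neuron
    # Linear readout to one logit per output token
    return [sum(w * h for w, h in zip(row, hidden)) for row in W_OUT]


def softmax(logits):
    """Convert logits to probabilities (numerically stable form)."""
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def rank_neurons_by_effect(x, target=0, distractor=1):
    """Rank hidden neurons by how much ablating each one changes the
    logit difference between the `target` and `distractor` tokens."""
    base = forward(x)
    base_diff = base[target] - base[distractor]
    effects = []
    for i in range(len(W_IN)):
        out = forward(x, ablate=i)
        effects.append((i, base_diff - (out[target] - out[distractor])))
    # Largest absolute effect first
    return sorted(effects, key=lambda e: abs(e[1]), reverse=True)


x = [1.0, 1.0]
print(softmax(forward(x)))            # probabilities without intervention
print(softmax(forward(x, ablate=1)))  # probabilities with neuron 1 ablated
print(rank_neurons_by_effect(x))      # [(1, -3.75), (0, 1.5), (2, 0.0)]
```

In this toy setup, neuron 1 has the largest effect on the logit difference, while neuron 2 has none because its ReLU activation is already zero for this input. A real tool works on actual model activations rather than hand-written weights, so this is only a sketch of the underlying idea.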
Features
- Investigates behaviors of small language models with interpretability tools
- Intervenes in the forward pass to test effects on outputs
- Identifies and explains neuron, attention head, and latent activations
- Provides a React-based neuron viewer for interactive exploration
- Includes an activation server and inference hooks for GPT-2 models
- Offers collated activation datasets for deeper analysis
Programming Language
Python, TypeScript
This application can also be fetched from https://sourceforge.net/projects/transformer-debugger.mirror/. It is hosted on OnWorks so that it can be run online in the easiest way from one of our free operating systems.