Profile Data download for Linux

This is the Linux app named Profile Data whose latest release can be downloaded as profile-datasourcecode.tar.gz. It can be run online in the free hosting provider OnWorks for workstations.

 
 

Download and run online this app named Profile Data with OnWorks for free.

Follow these instructions in order to run this app:

- 1. Downloaded this application in your PC.

- 2. Enter in our file manager https://www.onworks.net/myfiles.php?username=XXXXX with the username that you want.

- 3. Upload this application in such filemanager.

- 4. Start the OnWorks Linux online or Windows online emulator or MACOS online emulator from this website.

- 5. From the OnWorks Linux OS you have just started, goto our file manager https://www.onworks.net/myfiles.php?username=XXXXX with the username that you want.

- 6. Download the application, install it and run it.

SCREENSHOTS:


Profile Data


DESCRIPTION:

profile-data is a repository that publishes profiling traces and metrics from DeepSeek’s training and inference infrastructure (especially during DeepSeek-V3 / R1 experiments). The profiling data targets insights into computation-communication overlap, pipeline scheduling (e.g. DualPipe), and how MoE / EP / parallelism strategies interact in real systems. The repository contains JSON trace files like train.json, prefill.json, decode.json, and associated assets. Users can load them into tools like Chrome tracing to inspect GPU idle times, overlapping operations, and scheduling alignment. The idea is to bring transparency to internal efficiency tradeoffs, enabling researchers to reproduce, analyze, or improve on DeepSeek’s parallelism strategies. The README explains how trace data corresponds to forward/backward chunks, settings (e.g. EP64, TP1, 4K sequence length), and notes that pipeline communication is excluded for simplicity.



Features

  • Profiling traces (JSON) of training, prefill, and inference phases
  • Support for Chrome tracing (and similar visualization tools)
  • Documentation of experimental settings (e.g. MoE, pipeline shape)
  • Transparency into compute-communication overlap and pipeline bubble behavior
  • Public access to real system performance data from DeepSeek stack
  • Enables external analysis, validation, and benchmarking of parallelism strategies



Categories

AI Models

This is an application that can also be fetched from https://sourceforge.net/projects/profile-data.mirror/. It has been hosted in OnWorks in order to be run online in an easiest way from one of our free Operative Systems.



Latest Linux & Windows online programs


Categories to download Software & Programs for Windows & Linux