This is the Linux app named PRM800K whose latest release can be downloaded as prm800ksourcecode.tar.gz. It can be run online in the free hosting provider OnWorks for workstations.
Download and run online this app named PRM800K with OnWorks for free.
Ikuti petunjuk ini untuk menjalankan aplikasi ini:
- 1. Download aplikasi ini di PC Anda.
- 2. Masuk ke file manager kami https://www.onworks.net/myfiles.php?username=XXXXX dengan username yang anda inginkan.
- 3. Upload aplikasi ini di filemanager tersebut.
- 4. Jalankan emulator online OnWorks Linux atau Windows online atau emulator online MACOS dari situs web ini.
- 5. Dari OS Linux OnWorks yang baru saja Anda mulai, buka file manager kami https://www.onworks.net/myfiles.php?username=XXXXX dengan nama pengguna yang Anda inginkan.
- 6. Download aplikasinya, install dan jalankan.
SCREENSHOT:
PRM800K
DESKRIPSI:
PRM800K is a process supervision dataset accompanying the paper Let’s Verify Step by Step, providing 800,000 step-level correctness labels on model-generated solutions to problems from the MATH dataset. The repository releases the raw labels and the labeler instructions used in two project phases, enabling researchers to study how human raters graded intermediate reasoning. Data are stored as newline-delimited JSONL files tracked with Git LFS, where each line is a full solution sample that can contain many step-level labels and rich metadata such as labeler UUIDs, timestamps, generation identifiers, and quality-control flags. Each labeled step can include multiple candidate completions with ratings of -1, 0, or +1, optional human-written corrections (phase 1), and a chosen completion index, along with a final finish reason such as found_error, solution, bad_problem, or give_up.
Fitur
- 800,000 step-level correctness labels for MATH problems via JSONL
- Detailed schema with labeler IDs, timestamps, generations, QC flags, and finish reasons
- Multi-candidate step ratings of -1, 0, +1 with optional human-completion entries
- Labeler instruction docs for both phase 1 and phase 2
- Python grading logic using math normalization and sympy equivalence checks
- Nonstandard MATH train/test split and large-scale scored samples with PRM/ORM eval scripts
Bahasa Pemrograman
Ular sanca
KATEGORI
This is an application that can also be fetched from https://sourceforge.net/projects/prm800k.mirror/. It has been hosted in OnWorks in order to be run online in an easiest way from one of our free Operative Systems.