This is the Linux app named GLM-V whose latest release can be downloaded as GLM-Vsourcecode.tar.gz. It can be run online in the free hosting provider OnWorks for workstations.
Download and run online this app named GLM-V with OnWorks for free.
Follow these instructions in order to run this app:
- 1. Downloaded this application in your PC.
- 2. Enter in our file manager https://www.onworks.net/myfiles.php?username=XXXXX with the username that you want.
- 3. Upload this application in such filemanager.
- 4. Start the OnWorks Linux online or Windows online emulator or MACOS online emulator from this website.
- 5. From the OnWorks Linux OS you have just started, goto our file manager https://www.onworks.net/myfiles.php?username=XXXXX with the username that you want.
- 6. Download the application, install it and run it.
SCREENSHOTS
Ad
GLM-V
DESCRIPTION
GLM-V is an open-source vision-language model (VLM) series from ZhipuAI that extends the GLM foundation models into multimodal reasoning and perception. The repository provides both GLM-4.5V and GLM-4.1V models, designed to advance beyond basic perception toward higher-level reasoning, long-context understanding, and agent-based applications. GLM-4.5V builds on the flagship GLM-4.5-Air foundation (106B parameters, 12B active), achieving state-of-the-art results on 42 benchmarks across image, video, document, GUI, and grounding tasks. It introduces hybrid training for broad-spectrum reasoning and a Thinking Mode switch to balance speed and depth of reasoning. GLM-4.1V-9B-Thinking incorporates reinforcement learning with curriculum sampling (RLCS) and Chain-of-Thought reasoning, outperforming models much larger in scale (e.g., Qwen-2.5-VL-72B) across many benchmarks.
Features
- Bilingual (Chinese/English) multimodal reasoning and perception
- GLM-4.5V: hybrid-trained flagship with state-of-the-art benchmark scores
- GLM-4.1V-9B-Thinking: reasoning-focused model with RLCS and CoT mechanisms
- Long-context support (up to 64k) and flexible input (images, video, documents)
- GUI agent capabilities with platform-aware prompts and precise grounding
- Thinking Mode switch to toggle between fast and deep reasoning outputs
Programming Language
Python
Categories
This is an application that can also be fetched from https://sourceforge.net/projects/glm-v.mirror/. It has been hosted in OnWorks in order to be run online in an easiest way from one of our free Operative Systems.