CogVLM2 download for Linux

This is the Linux app named CogVLM2 whose latest release can be downloaded as CogVLM2sourcecode.tar.gz. It can be run online in the free hosting provider OnWorks for workstations.

 
 

Download and run online this app named CogVLM2 with OnWorks for free.

Follow these instructions in order to run this app:

- 1. Downloaded this application in your PC.

- 2. Enter in our file manager https://www.onworks.net/myfiles.php?username=XXXXX with the username that you want.

- 3. Upload this application in such filemanager.

- 4. Start the OnWorks Linux online or Windows online emulator or MACOS online emulator from this website.

- 5. From the OnWorks Linux OS you have just started, goto our file manager https://www.onworks.net/myfiles.php?username=XXXXX with the username that you want.

- 6. Download the application, install it and run it.

SCREENSHOTS:


CogVLM2


DESCRIPTION:

CogVLM2 is the second generation of the CogVLM vision-language model series, developed by ZhipuAI and released in 2024. Built on Meta-Llama-3-8B-Instruct, CogVLM2 significantly improves over its predecessor by providing stronger performance across multimodal benchmarks such as TextVQA, DocVQA, and ChartQA, while introducing extended context length support of up to 8K tokens and high-resolution image input up to 1344×1344. The series includes models for both image understanding and video understanding, with CogVLM2-Video supporting up to 1-minute videos by analyzing keyframes. It supports bilingual interaction (Chinese and English) and has open-source versions optimized for dialogue and video comprehension. Notably, the Int4 quantized version allows efficient inference on GPUs with only 16GB of memory. The repository offers demos, API servers, fine-tuning examples, and integration with OpenAI API-compatible endpoints, making it accessible for both researchers and developers.



Features

  • Supports both image and video understanding with multimodal reasoning
  • Up to 8K context length and 1344×1344 image resolution input
  • Bilingual support for English and Chinese interactions
  • Quantized Int4 version for efficient inference on 16GB GPUs
  • Outperforms previous open-source models on TextVQA, DocVQA, and ChartQA
  • Provides demos for CLI, Gradio, API server, and fine-tuning workflows


Programming Language

Python


Categories

Large Language Models (LLM), AI Models

This is an application that can also be fetched from https://sourceforge.net/projects/cogvlm2.mirror/. It has been hosted in OnWorks in order to be run online in an easiest way from one of our free Operative Systems.



Latest Linux & Windows online programs


Categories to download Software & Programs for Windows & Linux