This is the Windows app named Vidi2 whose latest release can be downloaded as vidisourcecode.zip. It can be run online in the free hosting provider OnWorks for workstations.
Download and run online this app named Vidi2 with OnWorks for free.
Follow these instructions in order to run this app:
- 1. Downloaded this application in your PC.
- 2. Enter in our file manager https://www.onworks.net/myfiles.php?username=XXXXX with the username that you want.
- 3. Upload this application in such filemanager.
- 4. Start any OS OnWorks online emulator from this website, but better Windows online emulator.
- 5. From the OnWorks Windows OS you have just started, goto our file manager https://www.onworks.net/myfiles.php?username=XXXXX with the username that you want.
- 6. Download the application and install it.
- 7. Download Wine from your Linux distributions software repositories. Once installed, you can then double-click the app to run them with Wine. You can also try PlayOnLinux, a fancy interface over Wine that will help you install popular Windows programs and games.
Wine is a way to run Windows software on Linux, but with no Windows required. Wine is an open-source Windows compatibility layer that can run Windows programs directly on any Linux desktop. Essentially, Wine is trying to re-implement enough of Windows from scratch so that it can run all those Windows applications without actually needing Windows.
SCREENSHOTS:
Vidi2
DESCRIPTION:
Vidi is a family of large multimodal models developed for deep video understanding and editing tasks, integrating vision, audio, and language to allow sophisticated querying and manipulation of video content. It’s designed to process long-form, real-world videos and answer complex queries such as “when in this clip does X happen?” or “where in the frame is object Y during that moment?” — offering temporal retrieval, spatio-temporal grounding (i.e. locating objects over time + space), and even video question answering. Vidi targets applications like intelligent video editing, automated video search, content analysis, and editing assistance, enabling users to efficiently locate relevant segments and objects in hours-long footage. The system is built with open-source release in mind, giving developers access to model code, inference scripts, and evaluation pipelines so they can reproduce research results or integrate Vidi into their own video-processing workflows.
Features
- Multimodal video understanding: processes video + audio + possibly metadata/text to answer complex queries
- Temporal retrieval: identifies time ranges in long videos corresponding to given text queries
- Spatio-temporal grounding: finds bounding boxes of target objects across time when relevant
- Video question answering: supports QA over video content rather than only retrieval or segmentation
- Open-source release with model code, inference scripts, and evaluation pipelines — reproducible research and integration-friendly
- Designed for long-context videos — capable of handling extended footage instead of only short clips
Programming Language
Python
Categories
This is an application that can also be fetched from https://sourceforge.net/projects/vidi2.mirror/. It has been hosted in OnWorks in order to be run online in an easiest way from one of our free Operative Systems.