Open Vision Agents by Stream is a Linux app whose latest release can be downloaded as v0.2.2sourcecode.tar.gz. It can be run online through OnWorks, a free hosting provider for workstations.
Download this app and run it online with OnWorks for free.
Follow these instructions to run this app:
- 1. Download this application to your PC.
- 2. Open our file manager at https://www.onworks.net/myfiles.php?username=XXXXX, using the username that you want.
- 3. Upload this application to that file manager.
- 4. Start the OnWorks Linux online, Windows online, or macOS online emulator from this website.
- 5. From the OnWorks Linux OS you have just started, go to our file manager at https://www.onworks.net/myfiles.php?username=XXXXX, using the username that you want.
- 6. Download the application, install it, and run it.
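The release is distributed as a gzip-compressed source tarball, so the "install" part of the last step starts with unpacking it. The following is a minimal sketch of that unpacking step using only Python's standard library. The function name `extract_source_archive` and the destination directory are illustrative assumptions, not something provided by OnWorks or by the project.

```python
import tarfile
from pathlib import Path
from typing import List


def extract_source_archive(archive: str, dest: str) -> List[str]:
    """Unpack a .tar.gz archive into `dest` and return the member names.

    Hypothetical helper for illustration; it is not part of the project.
    """
    dest_path = Path(dest)
    dest_path.mkdir(parents=True, exist_ok=True)
    with tarfile.open(archive, "r:gz") as tar:
        names = tar.getnames()
        # Newer Python versions provide extraction filters that reject
        # unsafe members (absolute paths, links escaping the target dir).
        if hasattr(tarfile, "data_filter"):
            tar.extractall(dest_path, filter="data")
        else:
            tar.extractall(dest_path)
    return names
```

For example, `extract_source_archive("v0.2.2sourcecode.tar.gz", "open-vision-agents")` would unpack the downloaded release into a local `open-vision-agents` directory. How the project is installed after unpacking depends on the instructions shipped inside the archive.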
Open Vision Agents by Stream
DESCRIPTION
Open Vision Agents by Stream is an open-source framework from Stream for building real-time, multimodal AI agents that watch, listen, and respond to live video streams. It combines video understanding models, such as YOLO and Roboflow-based detectors, with real-time large language models like OpenAI Realtime and Gemini Live to create interactive experiences. The framework uses Stream’s ultra-low-latency edge network so that agents can join sessions quickly and keep audio and video latency very low while processing frames and generating responses. Developers work with an agent abstraction that connects video edge providers, LLMs, and processors into pipelines, which makes it easier to orchestrate tasks like object detection, pose estimation, and conversational guidance. The project includes SDKs for React, Android, iOS, Flutter, React Native, and Unity, so it can be integrated into a wide variety of client environments such as mobile apps, web apps, and games.
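To make the pipeline idea concrete, here is a minimal, self-contained sketch of an agent that runs frames through processors and passes what they find to a language model. It does not use the framework's actual API. Every name in it (`Frame`, `Detection`, `Processor`, `KeywordDetector`, `echo_llm`, `Agent`) is a hypothetical stand-in, and the detector and LLM are stubs rather than real YOLO or OpenAI calls.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Protocol


@dataclass
class Frame:
    """One video frame; `labels` stands in for pixel data in this sketch."""
    index: int
    labels: List[str] = field(default_factory=list)


@dataclass
class Detection:
    label: str
    frame_index: int


class Processor(Protocol):
    """Anything that turns a frame into zero or more detections."""

    def process(self, frame: Frame) -> List[Detection]: ...


class KeywordDetector:
    """Stub detector (hypothetical); a real one would run a vision model."""

    def __init__(self, wanted: List[str]) -> None:
        self.wanted = set(wanted)

    def process(self, frame: Frame) -> List[Detection]:
        return [
            Detection(label, frame.index)
            for label in frame.labels
            if label in self.wanted
        ]


def echo_llm(prompt: str) -> str:
    """Stub for a real-time LLM call; it only echoes the prompt."""
    return f"Agent saw: {prompt}"


class Agent:
    """Connects processors and an LLM callable into one pipeline."""

    def __init__(self, processors: List[Processor], llm: Callable[[str], str]) -> None:
        self.processors = processors
        self.llm = llm

    def handle_frame(self, frame: Frame) -> str:
        # Run every processor on the frame and collect their detections.
        detections: List[Detection] = []
        for processor in self.processors:
            detections.extend(processor.process(frame))
        if not detections:
            return ""  # nothing detected, so the LLM is not called
        summary = ", ".join(d.label for d in detections)
        return self.llm(summary)
```

In this sketch, `Agent([KeywordDetector(["person", "ball"])], echo_llm).handle_frame(Frame(0, ["person", "tree", "ball"]))` returns `"Agent saw: person, ball"`. Keeping processors behind a small interface is one way to swap detection, pose, or custom logic without changing the agent itself. The real framework may structure this differently.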
Features
- Framework for multimodal agents that process live video, audio, and text
- Integrations with YOLO, Roboflow, and real-time LLMs like OpenAI and Gemini
- Ultra-low-latency streaming via Stream’s global edge network
- Agent abstraction with processors for detection, pose, and custom logic
- SDKs for React, Android, iOS, Flutter, React Native, and Unity
- Ready-made examples for sports coaching, safety monitoring, and interactive apps
Programming Language
Python
This application can also be fetched from https://sourceforge.net/projects/open-vision-ag-stream.mirror/. It is hosted on OnWorks so that it can be run online in the easiest way from one of our free operating systems.
