Name: DeepEval download for Linux
Brand: OnWorks
SKU: 206743d229fc07b181c51d37d84b1498
Availability: OnlineOnly
Rating: 4.92 (2339 reviews)

This is the Linux app named DeepEval whose latest release can be downloaded as MetricsforAIagents,multi-turnsyntheticdatageneration,andmore!sourcecode.tar.gz. It can be run online in the free hosting provider OnWorks for workstations.

Download and run online this app named DeepEval with OnWorks for free.

Follow these instructions in order to run this app:

- 1. Downloaded this application in your PC.

- 2. Enter in our file manager https://www.onworks.net/myfiles.php?username=XXXXX with the username that you want.

- 3. Upload this application in such filemanager.

- 4. Start the OnWorks Linux online or Windows online emulator or MACOS online emulator from this website.

- 5. From the OnWorks Linux OS you have just started, goto our file manager https://www.onworks.net/myfiles.php?username=XXXXX with the username that you want.

- 6. Download the application, install it and run it.

Download App Run in Ubuntu Run in Fedora Run in Windows Sim Run in MACOS Sim

SCREENSHOTS

DeepEval

DESCRIPTION

DeepEval is a simple-to-use, open-source LLM evaluation framework, for evaluating and testing large-language model systems. It is similar to Pytest but specialized for unit testing LLM outputs. DeepEval incorporates the latest research to evaluate LLM outputs based on metrics such as G-Eval, hallucination, answer relevancy, RAGAS, etc., which uses LLMs and various other NLP models that run locally on your machine for evaluation. Whether your application is implemented via RAG or fine-tuning, LangChain, or LlamaIndex, DeepEval has you covered. With it, you can easily determine the optimal hyperparameters to improve your RAG pipeline, prevent prompt drifting, or even transition from OpenAI to hosting your own Llama2 with confidence.

Features

Large variety of ready-to-use LLM evaluation metrics (all with explanations) powered by ANY LLM of your choice
Red team your LLM application for 40+ safety vulnerabilities in a few lines of code
Documentation available
Examples available
Evaluate your entire dataset in bulk in under 20 lines of Python code in parallel. Do this via the CLI in a Pytest-like manner, or through our evaluate() function
Create your own custom metrics that are automatically integrated with DeepEval's ecosystem by inheriting DeepEval's base metric class

Programming Language

Python

DeepEval download for Linux

SCREENSHOTS

DESCRIPTION

Features

Programming Language

Categories