EnglishFrenchSpanish

Ad


OnWorks favicon

Unsupervised TXT classifier

Free download Unsupervised TXT classifier Linux app to run online in Ubuntu online, Fedora online or Debian online

This is the Linux app named Unsupervised TXT classifier whose latest release can be downloaded as classifier.zip. It can be run online in the free hosting provider OnWorks for workstations.

Download and run online this app named Unsupervised TXT classifier with OnWorks for free.

Follow these instructions in order to run this app:

- 1. Downloaded this application in your PC.

- 2. Enter in our file manager https://www.onworks.net/myfiles.php?username=XXXXX with the username that you want.

- 3. Upload this application in such filemanager.

- 4. Start the OnWorks Linux online or Windows online emulator or MACOS online emulator from this website.

- 5. From the OnWorks Linux OS you have just started, goto our file manager https://www.onworks.net/myfiles.php?username=XXXXX with the username that you want.

- 6. Download the application, install it and run it.

SCREENSHOTS

Ad


Unsupervised TXT classifier


DESCRIPTION

This program is made to address two most common issues with the known classifying algorithms. First, over-training and second, shortage of data for a training of categories. Instead, each TXT file is a category on its own, rather than an assigned category. In a way, this is similar to clustering but not really a clustering algorithm since there is some training involved. The summarizer from Classifier4J has been adjusted to accept two inputs (lets call them A and B). Then, the summarizer gets trained with A to summarize a document B, and vice versa. This extracts a relevant structure for both documents (and thus avoids the over-training) which are then compared using the Vector-Space analysis to give a range of belonging of one document to another (and thus avoids the shortage of information). This method can be used to create the user-defined classes by merging texts of certain categories and then to calculate the relevant distances between the documents, but this is not necessary.



Audience

Education, Developers, Testers



Programming Language

Java


Categories

Artificial Intelligence, Information Analysis, Linguistics

This is an application that can also be fetched from https://sourceforge.net/projects/txtclassifier/. It has been hosted in OnWorks in order to be run online in an easiest way from one of our free Operative Systems.


Ad