This is the command ifile that can be run in the OnWorks free hosting provider using one of our multiple free online workstations such as Ubuntu Online, Fedora Online, Windows online emulator or MAC OS online emulator
ifile - core executable for the ifile mail filtering system
ifile [-b file] [-q|-Q] [-g] [-k] [-o] [-v num] [lexing options] file ...
ifile -c -q|-Q [-T threshold] [-b file] [-g] [-k] [-o] [lexing options] file ...
ifile [-b file] [-d folder] [-i folder|-u folder] [-g] [-k] [-o] [-v num] [lexing options]
ifile -r [-b file]
ifile is a mail filter client that uses machine learning to classify e-mail into
folders/mail boxes. The algorithm that it uses is called Naive Bayes. Basically, naive
bayes considers each document an unordered collection of words and classifies by matching
the document distribution with the most closely matching folder/mailbox distribution.
Location to read/store ifile database. Default is ~/.idata
equivalent of "ifile -v 0 | head -1 | cut -f1 -d". Must be used with -q or -Q.
Delete the statistics for each of files from the category folder
Show the word-probability calculations for folder
Create and store debugging information in ~/.ifile.log
Add the statistics for each of the files to the category folder
Leave in the database words that occur infrequently (normally they are tossed)
For each of the files, temporarily removes file from folder, performs query and
then reinserts file in folder. Database is not modified.
Uses document bit-vector representation. Count each word once per document.
Output rating scores for each of the files
For each of the files, output rating scores and add statistics for the folder with
the highest score
When used with both -c and -q, output the two highest ranking categories if their
score differs by at most threshold / 1000, which can be used to detect border
cases. When used with -q only and any threshold > 0, output the score difference
percentage. For example,
ifile -T1 -q foo.txt
might result in
If so, then
ifile -T93 -q -c foo.txt
will result in
ifile -T92 -q -c foo.txt
will result in
Erases all currently stored information
Same as 'insert' except only adds stats if folder already exists
Amount of output while running: 0=silent, 1=quiet, 2=progress, 3=verbose, 4=debug
Lex words as sequences of alphabetic characters (default)
Only lex space-separated character sequences which are composed entirely of
Skip all of the header lines except Subject:, From: and To:
Ignore portion of message after first char characters. Use entire message if char
set to 0. Default is 50,000.
Just tokenize and print, don't do any other processing. Documents are returned as
a list of word, frequency pairs.
Do not throw out overly frequent (stoplist) words when lexing
Use 'Porter' stemming algorithm when lexing documents
Lex words as sequences of space separated characters
If no files are specified on the command line, ifile will use standard input as its
message to process.
Give this help list
Give a short usage message
Print program version
Mandatory or optional arguments to long options are also mandatory or optional for any
corresponding short options.
Use ifile online using onworks.net services