This is the Linux app named Crawl-By-Example (Heritrix plugin) whose latest release can be downloaded as byexample-distr-0.1.tar.gz. It can be run online in the free hosting provider OnWorks for workstations.
Download and run online this app named Crawl-By-Example (Heritrix plugin) with OnWorks for free.
Follow these instructions in order to run this app:
- 1. Downloaded this application in your PC.
- 2. Enter in our file manager https://www.onworks.net/myfiles.php?username=XXXXX with the username that you want.
- 3. Upload this application in such filemanager.
- 4. Start the OnWorks Linux online or Windows online emulator or MACOS online emulator from this website.
- 5. From the OnWorks Linux OS you have just started, goto our file manager https://www.onworks.net/myfiles.php?username=XXXXX with the username that you want.
- 6. Download the application, install it and run it.
Crawl-By-Example (Heritrix plugin)
Ad
DESCRIPTION
Crawl-By-Example runs a crawl, which classifies the processed pages by subjects and finds the best pages according to examples provided by the operator. Crawl-By-Example is a plugin to the Heritrix crawler, and was done as a part of GSoC06 program.Audience
Science/Research, Advanced End Users, Developers
User interface
Web-based
Programming Language
Java
This is an application that can also be fetched from https://sourceforge.net/projects/crawlbyexample/. It has been hosted in OnWorks in order to be run online in an easiest way from one of our free Operative Systems.
 
 














