DCTFinder is a Linux application whose latest release can be downloaded as dct-finder-2015-01-22.jar. It can be run online for free with OnWorks, a free hosting provider for workstations.
Follow these instructions to run the application:
- 1. Download this application to your PC.
- 2. Open our file manager at https://www.onworks.net/myfiles.php?username=XXXXX with the username of your choice.
- 3. Upload the application to that file manager.
- 4. Start the OnWorks Linux online, Windows online, or macOS online emulator from this website.
- 5. From the OnWorks Linux OS you have just started, go to our file manager at https://www.onworks.net/myfiles.php?username=XXXXX with the same username.
- 6. Download the application, install it, and run it.
DESCRIPTION
Web pages do not offer reliable metadata about their creation date and time. However, obtaining the document creation time is a necessary step before temporal normalization systems can be applied to web pages. DCTFinder is a system that parses a web page and extracts its title and creation date from its content. DCTFinder combines heuristic title detection, supervised learning with Conditional Random Fields (CRFs) for document date extraction, and rule-based creation time recognition. DCTFinder is released under the CeCILL free software license agreement.
The system is described in the following paper (see 'Files' section):
Xavier Tannier. "Extracting News Web Page Creation Time with DCTFinder". Proceedings of the 9th Language Resources and Evaluation Conference (LREC 2014). Reykjavik, Iceland.
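To give a rough idea of what a rule-based date recognition step can look like, here is a minimal Java sketch. It is not DCTFinder's code and does not use its API: the class name DateRuleSketch, the method findFirstDate, and the single YYYY-MM-DD rule are all assumptions made for illustration. The real system combines several rules with a CRF model, as described in the paper.

```java
import java.time.DateTimeException;
import java.time.LocalDate;
import java.util.Optional;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical illustration only; not part of DCTFinder.
public class DateRuleSketch {

    // Assumed rule: a numeric date written as YYYY-MM-DD.
    private static final Pattern ISO_DATE =
            Pattern.compile("\\b(\\d{4})-(\\d{2})-(\\d{2})\\b");

    /** Returns the first valid calendar date found in the text, if any. */
    public static Optional<LocalDate> findFirstDate(String text) {
        Matcher m = ISO_DATE.matcher(text);
        while (m.find()) {
            try {
                return Optional.of(LocalDate.of(
                        Integer.parseInt(m.group(1)),
                        Integer.parseInt(m.group(2)),
                        Integer.parseInt(m.group(3))));
            } catch (DateTimeException e) {
                // The pattern matched but the value is not a real date
                // (for example month 13), so keep scanning.
            }
        }
        return Optional.empty();
    }

    public static void main(String[] args) {
        // Page text would normally come from a parsed web page.
        System.out.println(findFirstDate("Published 2015-01-22 by the newsroom"));
    }
}
```

A single pattern like this misses most real-world date formats (month names, relative dates, other languages), which is one reason a system such as DCTFinder pairs rules with supervised learning.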
Audience
Information Technology, Science/Research
Programming Language
Java
This application can also be fetched from https://sourceforge.net/projects/dctfinder/. It is hosted on OnWorks so that it can be run online more easily from one of our free operating systems.