PROGRAM:

NAME


djvu2hocr - DjVu to hOCR converter

SYNOPSIS


djvu2hocr [option...] djvu-file

djvu2hocr {--version | --help | -h}

DESCRIPTION


djvu2hocr converts hidden text from a DjVu file to the hOCR[1] format.

OPTIONS


Input selection options
-p, --pages=page-range
Specifies pages to convert. page-range is a comma-separated list of sub-ranges. Each
sub-range is either a single page (e.g. 17) or a contiguous range of pages
(e.g. 37-42). Pages are numbered from 1.

The default is to convert all pages.
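
The page-range syntax described above can be illustrated with a short Python sketch. This is not djvu2hocr's own parser; the function name parse_page_range is hypothetical, and the handling of malformed input (raising ValueError) is an assumption made for the example.

```python
def parse_page_range(spec: str) -> list[int]:
    """Expand a page-range string such as "17,37-42" into page numbers.

    Illustrative only: mirrors the documented syntax (comma-separated
    sub-ranges, pages numbered from 1), not the tool's real parser.
    """
    pages = []
    for sub_range in spec.split(","):
        # A sub-range is either "N" or "N-M".
        first, sep, last = sub_range.strip().partition("-")
        start = int(first)
        end = int(last) if sep else start
        # Assumption: reject page 0 and descending ranges.
        if start < 1 or end < start:
            raise ValueError(f"invalid sub-range: {sub_range!r}")
        pages.extend(range(start, end + 1))
    return pages


print(parse_page_range("17,37-42"))  # [17, 37, 38, 39, 40, 41, 42]
```

A string such as "17,37-42" would therefore select seven pages in total.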

Text segmentation options
--word-segmentation=simple
Use the same word segmentation as found in the DjVu file.

This is the default.

--word-segmentation=uax29
Use the Unicode Text Segmentation[2] algorithm to break lines into words, possibly
fixing word segmentation found in the DjVu file.

HTML output options
--title=title
Specifies the document title.

The default is “DjVu hidden text layer”.

--css=style
Add the specified CSS style to the document.

For example, --css='.ocrx_line { display: block; }' can be used to visually preserve
line breaks.
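
When djvu2hocr is driven from a script, passing the options as an argument list avoids shell-quoting problems with values such as the CSS rule above. The following Python sketch only assembles the command line from the options documented on this page; the helper name build_command and the file name book.djvu are hypothetical, and nothing is executed.

```python
import shlex


def build_command(djvu_file, pages=None, title=None, css=None, uax29=False):
    """Assemble an argument list for djvu2hocr from documented options."""
    argv = ["djvu2hocr"]
    if pages:
        argv.append(f"--pages={pages}")
    if uax29:
        argv.append("--word-segmentation=uax29")
    if title:
        argv.append(f"--title={title}")
    if css:
        argv.append(f"--css={css}")
    argv.append(djvu_file)  # the DjVu file comes last, as in the synopsis
    return argv


# Hypothetical input file; the CSS rule is the one from the --css example.
argv = build_command("book.djvu", pages="1-3",
                     css=".ocrx_line { display: block; }")
# shlex.join shows how the same command would be quoted in a POSIX shell.
print(shlex.join(argv))
```

The resulting list can be handed to subprocess.run(argv) unchanged, so the braces and spaces in the CSS value never pass through a shell.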

Other options
--version
Output version information and exit.

-h, --help
Display help and exit.

PORTABILITY


djvu2hocr uses a custom extension to hOCR to retain characters that cannot be directly
represented in an HTML/XML document. For example, the control character BEL (^G, U+0007) is
converted into the following HTML chunk: <span class="djvu_char" title="#x07"> </span>
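
A consumer of the hOCR output may want to map these spans back to the original characters. The Python sketch below is one way to do that, assuming every such span has exactly the shape shown in the example (class first, then a title of the form #x followed by hexadecimal digits); the function name restore_djvu_chars is hypothetical, and a real HTML parser would be more robust than a regular expression.

```python
import re

# Assumption: spans always look like the example in the text above.
DJVU_CHAR = re.compile(
    r'<span class="djvu_char" title="#x([0-9A-Fa-f]+)">[^<]*</span>'
)


def restore_djvu_chars(html: str) -> str:
    """Replace each djvu_char span with the character its title encodes."""
    return DJVU_CHAR.sub(lambda m: chr(int(m.group(1), 16)), html)


sample = 'ring<span class="djvu_char" title="#x07"> </span>bell'
print(repr(restore_djvu_chars(sample)))  # 'ring\x07bell'
```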
