hocr2djvused - Online in the Cloud

This is the command hocr2djvused that can be run in the OnWorks free hosting provider using one of our multiple free online workstations such as Ubuntu Online, Fedora Online, Windows online emulator or MAC OS online emulator

Run in Ubuntu Run in Fedora Run in Windows Sim Run in MACOS Sim

PROGRAM:

NAME

hocr2djvused - hOCR to djvused script converter

SYNOPSIS

hocr2djvused [option...] [hocr-file...]

DESCRIPTION

hocr2djvused reads one or more hOCR[1] files (as produced by OCRopus[2] or Cuneiform[3] or
Tesseract[4]) and converts them to a djvused script.

Unless a filename is explicitly provided on the command line, hOCR is read from the
standard input.

OPTIONS

Text segmentation options
-t lines, --details lines
Record location of every line. Don't record locations of particular words or
characters.

-t words, --details=words
Record location of every line and every word. Don't record locations of particular
characters.

This is the default.

-t chars, --details=chars
Record location of every line, every word and every character.

--word-segmentation=simple
Consider each non-empty sequence of non-whitespace characters a single word.

This is the default, despite being linguistically incorrect.

--word-segmentation=uax29
Use the Unicode Text Segmentation[5] algorithm to break lines into words.

This options break assumptions of some DjVu tools that words are separated by spaces,
and therefore is it not recommended.

Other options
--rotation=n
Assume that DjVu pages are rotated by n degrees.

--page-size=widthxheight
Specifies that page size is width pixels × height pixels.

This option is required for hOCR generated by Cuneiform (< 0.8) and superfluous
otherwise.

--html5
Use a HTML5 parser[6], which is more robust but slower than the default parser.

--fix-utf8
Attempt to fix UTF-8 encoding issues and eliminate unwanted control characters.

This option might be needed for hOCR generated by Cuneiform[7] or Tesseract[8].

--version
Output version information and exit.

-h, --help
Display help and exit.

Use hocr2djvused online using onworks.net services

Latest Linux & Windows online programs

ESPnet

ESPnet is a comprehensive end-to-end
speech processing toolkit covering a
wide spectrum of tasks, including
automatic speech recognition (ASR),
text-to-speech ...

Enter

Flax

Flax is a flexible neural-network
library for JAX that embraces functional
programming while offering ergonomic
module abstractions. Its design
separates pure ...

Enter

Electron Packager

Electron Packager is a command line
tool and Node.js library that bundles
Electron-based application source code
with a renamed Electron executable and
support...

Enter

ni is a lightweight command-line tool
that simplifies package manager usage
across JavaScript projects. It detects
and runs the correct package manager
command...

Enter

Matomo

Google Analytics alternative that
protects your data and your
customers' privacy. Take back
control with Matomo a powerful web
analytics platform that gi...

Enter

Optional Fund Assistant

Self-selected fund assistants can view
the funds you are concerned about in
real-time and help you quickly obtain
real-time data. It can be used to view
the re...

Enter

Directory Lister

Directory Lister is a simple,
self-hosted PHP application that lets
users browse and share the contents of
directories over the web. It creates a
web interface...

Enter

Stlite

Stlite is a WebAssembly-powered
framework that enables Streamlit
applications to run entirely in the
browser without requiring a Python
backend server. It achi...

Enter

Halloy - IRC Client

Halloy is an open-source IRC client
written in Rust, utilizing the Iced GUI
library. It aims to provide a simple and
fast client for Mac, Windows, and Linux
pl...

Enter

hocr2djvused - Online in the Cloud

PROGRAM:

NAME

SYNOPSIS

DESCRIPTION

OPTIONS

Latest Linux & Windows online programs

Categories to download Software & Programs for Windows & Linux