
gendict - Online in the Cloud


NAME


gendict - Compiles word list into ICU string trie dictionary

SYNOPSIS


gendict [ --uchars | --bytes --transform transform ] [ -h, -?, --help ] [ -V, --version ]
[ -c, --copyright ] [ -v, --verbose ] [ -i, --icudatadir directory ] input-file
output-file

DESCRIPTION


gendict reads the word list from input-file and writes a string trie dictionary to
output-file. By convention this data file has the .dict extension.

Words begin at the beginning of a line and are terminated by the first whitespace. Lines
that begin with whitespace are ignored.
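
The line rules above can be sketched in Python. This is only an illustration of the format as described here, not gendict's own parser: the function name extract_words is hypothetical, "whitespace" is assumed to mean whatever Python's str.isspace accepts, and empty lines are assumed to be skipped.

```python
def extract_words(lines):
    """Yield (word, rest) pairs from word-list lines.

    A word starts at the beginning of a line and ends at the first
    whitespace character. Lines that begin with whitespace are ignored.
    Skipping empty lines is an assumption, not something the man page states.
    """
    for raw in lines:
        line = raw.rstrip("\r\n")
        if not line or line[0].isspace():
            continue
        # Split once on the first run of whitespace: word, then the remainder.
        parts = line.split(None, 1)
        word = parts[0]
        rest = parts[1].strip() if len(parts) > 1 else ""
        yield word, rest
```

For example, feeding it the lines "alpha<TAB>3", "  ignored" and "beta" would yield the pairs ("alpha", "3") and ("beta", "").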

OPTIONS


-h, -?, --help
Print help about usage and exit.

-V, --version
Print the version of gendict and exit.

-c, --copyright
Embed the standard ICU copyright into the output-file.

-v, --verbose
Display extra informative messages during execution.

-i, --icudatadir directory
Look for any necessary ICU data files in directory. For example, the file
pnames.icu must be located when ICU's data is not built as a shared library. The
default ICU data directory is specified by the environment variable ICU_DATA. Most
configurations of ICU do not require this argument.

--uchars
Set the output trie type to UChar. Mutually exclusive with --bytes.

--bytes
Set the output trie type to Bytes. Mutually exclusive with --uchars.

--transform transform
Set the transform type. Should only be specified with --bytes. The only transform
currently supported is offset-<hex-number>, which specifies an offset to subtract
from all input characters. The offset transform also maps U+200D to 0xFF and
U+200C to 0xFE, for compatibility with languages that require these characters.
A transform must be specified for a bytes trie and, when applied to the non-value
characters in the input-file, must produce output between 0x00 and 0xFF.

input-file
The source file to read.

output-file
The file to write the output dictionary to.
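
The offset transform described under --transform can be illustrated with a short Python sketch. It follows only the wording above (subtract the offset, map U+200D to 0xFF and U+200C to 0xFE, require results between 0x00 and 0xFF) and is not taken from the gendict source. The function name offset_transform and the offset 0x0E00 used in the usage note are assumptions chosen for illustration.

```python
ZWJ = 0x200D   # ZERO WIDTH JOINER, mapped to 0xFF
ZWNJ = 0x200C  # ZERO WIDTH NON-JOINER, mapped to 0xFE


def offset_transform(word, offset):
    """Map each character of word to one byte using the offset transform.

    Raises ValueError when a character falls outside 0x00..0xFF after the
    offset is subtracted, mirroring the range requirement in the text.
    """
    out = bytearray()
    for ch in word:
        cp = ord(ch)
        if cp == ZWJ:
            out.append(0xFF)
        elif cp == ZWNJ:
            out.append(0xFE)
        else:
            value = cp - offset
            if not 0x00 <= value <= 0xFF:
                raise ValueError(
                    "U+%04X is out of range for offset 0x%X" % (cp, offset)
                )
            out.append(value)
    return bytes(out)
```

With an assumed offset of 0x0E00, the characters U+0E01 and U+0E02 become the bytes 0x01 and 0x02, while a Latin letter such as "A" is rejected because subtracting the offset gives a negative value.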

CAVEATS


The input-file is assumed to be encoded in UTF-8. The integers in the input-file that are
used as values must be made up of ASCII digits. They may be specified either in hex, by
using a 0x prefix, or in decimal. Exactly one of --bytes or --uchars must be specified.
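
The value rules above can be sketched as a small Python check. This is an illustration under stated assumptions, not gendict's actual parsing code: the name parse_value is hypothetical, hex digits are assumed to be accepted in either case, and only the lowercase 0x prefix named in the text is recognized.

```python
import re

# Both patterns use explicit ASCII ranges, so non-ASCII digits never match.
_DECIMAL = re.compile(r"[0-9]+")
_HEX = re.compile(r"0x[0-9a-fA-F]+")


def parse_value(text):
    """Parse a dictionary value given in decimal or 0x-prefixed hex."""
    if _HEX.fullmatch(text):
        return int(text[2:], 16)
    if _DECIMAL.fullmatch(text):
        return int(text, 10)
    raise ValueError("not an ASCII decimal or 0x-prefixed hex integer: %r" % text)
```

Explicit patterns are used instead of Python's int() alone because int() also accepts digits from other scripts, which the text above rules out.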

ENVIRONMENT


ICU_DATA
Specifies the directory containing ICU data. Defaults to ${prefix}/share/icu/55.1/.
Some tools in ICU depend on the presence of the trailing slash, so make sure it is
present whenever ICU_DATA is set.
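
One way to guard against a missing trailing slash is to normalize the value before handing it to a tool. The Python sketch below assumes a Unix-style "/" separator; the function name icu_data_env and the directory in the usage note are hypothetical.

```python
def icu_data_env(directory, base_env=None):
    """Return a copy of base_env with ICU_DATA set and ending in a slash."""
    env = dict(base_env or {})
    # Append the slash only when it is missing, so existing values are kept.
    env["ICU_DATA"] = directory if directory.endswith("/") else directory + "/"
    return env
```

Calling icu_data_env("/opt/icu/share/icu/55.1") would give a mapping whose ICU_DATA entry is "/opt/icu/share/icu/55.1/", suitable for passing as the environment of a child process.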

AUTHORS


Maxime Serrano

VERSION


1.0

COPYRIGHT


Copyright (C) 2012 International Business Machines Corporation and others
