htload - Online in the Cloud


NAME


htload - reads in an ASCII-text version of the document database

SYNOPSIS


htload [options]

DESCRIPTION


Htload reads in an ASCII-text version of the document database, in the same format that
htdump and the -t option of htdig write out. Note that this overwrites data in your
databases, so use it with great care.

OPTIONS


-a Use alternate work files. Tells htload to append .work to database files, allowing
it to operate on a second set of databases.

-c configfile
Use the specified configfile instead of the default.

-i Initial. Do not use any old databases. This is accomplished by first erasing the
databases.

-v Verbose mode. This doesn't have much effect.
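
The options above can be combined on one command line. The following is a minimal Python sketch that assembles the argument list from the four documented flags. The function name build_htload_argv and its parameter names are hypothetical, chosen for illustration; only the flags -a, -c, -i and -v come from the list above. The sketch prints the command rather than running it, since htload overwrites data.

```python
def build_htload_argv(configfile=None, alternate=False, initial=False, verbose=False):
    """Assemble an htload command line from the documented options.

    The function and parameter names are hypothetical; only the flags
    -a, -c, -i and -v are taken from the OPTIONS list.
    """
    argv = ["htload"]
    if alternate:
        argv.append("-a")            # operate on the .work set of databases
    if configfile is not None:
        argv += ["-c", configfile]   # use this configfile instead of the default
    if initial:
        argv.append("-i")            # erase the old databases first
    if verbose:
        argv.append("-v")            # verbose mode
    return argv


if __name__ == "__main__":
    # The config path is a placeholder, not a real default location.
    print(" ".join(build_htload_argv(configfile="/path/to/htdig.conf", alternate=True)))
```

To actually execute the command, the returned list could be passed to subprocess.run(argv, check=True).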

FILE FORMATS


Document Database
Each line in the file starts with the document id, followed by a tab-separated list of
fieldname : value pairs. The fields always appear in the order listed below:

u URL

t Title

a State (0 = normal, 1 = not found, 2 = not indexed, 3 = obsolete)

m Last modification time as reported by the server

s Size in bytes

H Excerpt

h Meta description

l Time of last retrieval

L Count of the links in the document (outgoing links)

b Count of the links to the document (incoming links or backlinks)

c HopCount of this document

g Signature of the document used for duplicate-detection

e E-mail address to use for a notification message from htnotify

n Date to send out a notification e-mail message

S Subject for a notification e-mail message

d The text of links pointing to this document. (e.g. <a
href="/docURL">description</a>)

A Anchors in the document (i.e. <A NAME=...)
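
The line layout described above can be read with a few lines of Python. This is a sketch under stated assumptions, not the parser ht://Dig itself uses: the descriptive key names in DOC_FIELDS are invented for readability, whitespace around the colon is tolerated because it is unclear whether real dumps contain it, and all values are kept as raw strings (including the d and A fields, whose internal encoding is not described here).

```python
# Hypothetical readable names for the one-letter field codes listed above.
DOC_FIELDS = {
    "u": "url", "t": "title", "a": "state", "m": "modified", "s": "size",
    "H": "excerpt", "h": "meta_description", "l": "last_retrieval",
    "L": "outgoing_links", "b": "backlinks", "c": "hopcount",
    "g": "signature", "e": "notify_email", "n": "notify_date",
    "S": "notify_subject", "d": "link_descriptions", "A": "anchors",
}


def parse_document_line(line):
    """Split one dump line into the document id and its fields."""
    doc_id, *pairs = line.rstrip("\n").split("\t")
    record = {"id": doc_id}
    for pair in pairs:
        # Split on the first colon only, so URLs keep their own colons.
        name, sep, value = pair.partition(":")
        if not sep:
            continue  # not a fieldname : value pair; skip it
        name = name.strip()
        record[DOC_FIELDS.get(name, name)] = value.strip()
    return record


if __name__ == "__main__":
    # Made-up sample line for illustration only.
    sample = "0\tu:http://example.com/\tt:Home\ta:0\ts:1234\n"
    print(parse_document_line(sample))
```

Keeping the values as strings avoids guessing at numeric or date formats; a caller that knows the format can convert individual fields afterwards.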

Word Database
While htdump and htload don't deal with the word database directly, it is covered here
because you must handle it when copying the ASCII databases from one system to
another. The initial word database produced by htdig is already in ASCII format;
htmerge produces a binary version of it for use by htsearch. So, when you copy over
the ASCII version of the document database produced by htdump, copy over the word
list as well. Then run htload to build the binary document database on the target
system, followed by htmerge to build the word index.
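
The order of operations on the target system can be sketched in Python as follows. This assumes that both ASCII files have already been copied to wherever the target's configuration expects them and that htload and htmerge are on the PATH. The function name rebuild_on_target and its parameters are hypothetical; htmerge is run without options because its options are not covered here.

```python
import subprocess


def rebuild_on_target(htload_options=(), run=subprocess.run):
    """Run htload, then htmerge, stopping at the first failure.

    `run` is injectable so the sequence can be checked without touching
    any real databases; by default it is subprocess.run.
    """
    commands = (
        ["htload", *htload_options],  # rebuild the binary document database
        ["htmerge"],                  # then build the word index
    )
    for argv in commands:
        # check=True makes subprocess.run raise if the command exits non-zero,
        # so htmerge is never run after a failed htload.
        run(argv, check=True)


if __name__ == "__main__":
    # Dry run: print the commands instead of executing them.
    rebuild_on_target(run=lambda argv, check: print(" ".join(argv)))
```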

Each line in the word list file starts with the word, followed by a tab-separated list
of fieldname : value pairs. The fields always appear in the order listed below, with
the last two being optional:

i Document ID

l Location of word in document (1 to 1000)

w Weight of word based on scoring factors

c Count of word's appearances in document, if more than 1

a Anchor number if word occurred after a named anchor
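
A word list line can be read the same way. The sketch below is illustrative only: the key names in WORD_FIELDS are invented, values are kept as strings, and a missing c field is interpreted as a count of 1, which is an inference from the "if more than 1" wording above rather than something stated outright.

```python
# Hypothetical readable names for the one-letter field codes listed above.
WORD_FIELDS = {
    "i": "doc_id", "l": "location", "w": "weight", "c": "count", "a": "anchor",
}


def parse_word_line(line):
    """Split one word list line into the word and its fields."""
    word, *pairs = line.rstrip("\n").split("\t")
    entry = {"word": word}
    for pair in pairs:
        name, sep, value = pair.partition(":")
        if not sep:
            continue  # not a fieldname : value pair; skip it
        name = name.strip()
        entry[WORD_FIELDS.get(name, name)] = value.strip()
    # Assumption: the count field is omitted when the word appears once.
    entry.setdefault("count", "1")
    return entry


if __name__ == "__main__":
    # Made-up sample line for illustration only.
    print(parse_word_line("example\ti:0\tl:12\tw:3\n"))
```

The optional anchor field is simply absent from the result when the line does not carry it, so callers can test for it with entry.get("anchor").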
