OnWorks favicon

mmseg - Online in the Cloud

Run mmseg in OnWorks free hosting provider over Ubuntu Online, Fedora Online, Windows online emulator or MAC OS online emulator

This is the command mmseg that can be run in the OnWorks free hosting provider using one of our multiple free online workstations such as Ubuntu Online, Fedora Online, Windows online emulator or MAC OS online emulator



mmseg - maximum matching segment Chinese text.


mmseg -d dict_file [option]... [corpus_file]...


mmseg is a tool for segmenting Chinese text into words using maximum matching algorithm.
mmseg segments corpus_file, or standard input if no filename is specified, and write the
segmented result to standard output.


-d dict_file
Use dict_file as lexicon. A default lexicon can be found at

-f,--format (text|bin)
Output Format, can be 'text' or 'bin'. default 'bin'. Normally, in text mode, word
text are output, while in binary mode, binary short integer of the word-ids are
written to stdout.

-s, --stok STOK_ID
Sentence token id. Default 10. It will be written to output in binary mode after
every sentence.

-i, --show-id
Show Id info. Under text output format mode, attach id after known words. If under
binary mode, print id(s) in text.

-a, --ambiguious-id AMBI-ID
Ambiguious means ABC => A BC or AB C. If specified (AMBI-ID != 0), The sequence ABC
will not be segmented, in binary mode, the AMBI-ID is written out; in text mode,
"<ambi>ABC</ambi>" will be output. Default is 0.


Under binary mode, consecutive id of 0 are merged into one 0. Under text mode, no space
are inserted between unknown-words.

Use mmseg online using onworks.net services

Free Servers & Workstations

Download Windows & Linux apps

  • 1
    Clementine is a multi-platform music
    player and library organizer inspired by
    Amarok 1.4. It has a fast and
    easy-to-use interface, and allows you to
    search and ...
    Download Clementine
  • 2
    ATTENTION: Cumulative update 2.4.3 has
    been released!! The update works for any
    previous 2.x.x version. If upgrading
    from version v1.x.x, please download and
    Download XISMuS
  • 3
    Modular headtracking program that
    supports multiple face-trackers, filters
    and game-protocols. Among the trackers
    are the SM FaceAPI, AIC Inertial Head
    Tracker ...
    Download facetracknoir
  • 4
    PHP QR Code
    PHP QR Code
    PHP QR Code is open source (LGPL)
    library for generating QR Code,
    2-dimensional barcode. Based on
    libqrencode C library, provides API for
    creating QR Code barc...
    Download PHP QR Code
  • 5
    Cuckoo Sandbox
    Cuckoo Sandbox
    Cuckoo Sandbox uses components to
    monitor the behavior of malware in a
    Sandbox environment; isolated from the
    rest of the system. It offers automated
    analysis o...
    Download Cuckoo Sandbox
  • 6
    Play YouTube video on LMS (porting of
    Triode's to YouTbe API v3) This is
    an application that can also be fetched
    Download LMS-YouTube
  • 7
    dotnet sdk
    dotnet sdk
    Core functionality needed to createNET
    Core projects, that is shared between
    Visual Studio and CLI. There are no fees
    or licensing costs, including for
    Download dotnet sdk
  • More »

Linux commands