OnWorks favicon

mlpack_lsh - Online in the Cloud

Run mlpack_lsh in OnWorks free hosting provider over Ubuntu Online, Fedora Online, Windows online emulator or MAC OS online emulator

This is the command mlpack_lsh that can be run in the OnWorks free hosting provider using one of our multiple free online workstations such as Ubuntu Online, Fedora Online, Windows online emulator or MAC OS online emulator



mlpack_lsh - all k-approximate-nearest-neighbor search with lsh


mlpack_lsh [-h] [-v] [-B int] [-d string] [-H double] [-m string] [-k int] [-n string] [-M string] [-K int] [-q string] [-r string] [-S int] [-s int] [-L int] -V


This program will calculate the k approximate-nearest-neighbors of a set of points using
locality-sensitive hashing. You may specify a separate set of reference points and query
points, or just a reference set which will be used as both the reference and query set.

For example, the following will return 5 neighbors from the data for each point in
'input.csv' and store the distances in 'distances.csv' and the neighbors in the file

$ lsh -k 5 -r input.csv -d distances.csv -n neighbors.csv

The output files are organized such that row i and column j in the neighbors output file
corresponds to the index of the point in the reference set which is the i'th nearest
neighbor from the point in the query set with index j. Row i and column j in the
distances output file corresponds to the distance between those two points.

Because this is approximate-nearest-neighbors search, results may be different from run to
run. Thus, the --seed option can be specified to set the random seed.


--bucket_size (-B) [int]
The size of a bucket in the second level hash. Default value 500.
--distances_file (-d) [string] File to output distances into. Default value ’'.

--hash_width (-H) [double]
The hash width for the first-level hashing in the LSH preprocessing. By default,
the LSH class automatically estimates a hash width for its use. Default value 0.

--help (-h)
Default help info.

--info [string]
Get help on a specific module or option. Default value ''. --input_model_file
(-m) [string] File to load LSH model from. (Cannot be specified with
--reference_file.) Default value ’'.

--k (-k) [int]
Number of nearest neighbors to find. Default value 0. --neighbors_file (-n)
[string] File to output neighbors into. Default value ’'. --output_model_file (-M)
[string] File to save LSH model to. Default value ''.

--projections (-K) [int]
The number of hash functions for each table Default value 10.

--query_file (-q) [string]
File containing query points (optional). Default value ''. --reference_file (-r)
[string] File containing the reference dataset. Default value ''.
--second_hash_size (-S) [int] The size of the second level hash table. Default
value 99901.

--seed (-s) [int]
Random seed. If 0, 'std::time(NULL)' is used. Default value 0.

--tables (-L) [int]
The number of hash tables to be used. Default value 30.

--verbose (-v)
Display informational messages and the full list of parameters and timers at the
end of execution.

--version (-V)
Display the version of mlpack.



For further information, including relevant papers, citations, and theory, For further
information, including relevant papers, citations, and theory, consult the documentation
found at http://www.mlpack.org or included with your consult the documentation found at
http://www.mlpack.org or included with your DISTRIBUTION OF MLPACK. DISTRIBUTION OF


Use mlpack_lsh online using onworks.net services

Free Servers & Workstations

Download Windows & Linux apps

  • 1
    OpenFOAM - The Open Source CFD Toolbox.
    OpenFOAM is a generic, programmable
    software tool for Computational Fluid
    Dynamics (CFD). For more information,
    see: - ...
    Download OpenFOAM
  • 2
    PostInstallerF will install all the
    software that Fedora Linux and others
    doesn't include by default, after
    running Fedora for the first time. Its
    easy for...
    Download PostInstallerF
  • 3
    The strace project has been moved to
    https://strace.io. strace is a
    diagnostic, debugging and instructional
    userspace tracer for Linux. It is used
    to monitor a...
    Download strace
  • 4
    JasperReports Library
    JasperReports Library
    JasperReports Library is the
    world's most popular open source
    business intelligence and reporting
    engine. It is entirely written in Java
    and it is able to ...
    Download JasperReports Library
  • 5
    Numerical Python
    Numerical Python
    NEWS: NumPy 1.11.2 is the last release
    that will be made on sourceforge. Wheels
    for Windows, Mac, and Linux as well as
    archived source distributions can be fou...
    Download Numerical Python
  • 6
    CMU Sphinx
    CMU Sphinx
    CMUSphinx is a speaker-independent large
    vocabulary continuous speech recognizer
    released under BSD style license. It is
    also a collection of open source tools ...
    Download CMU Sphinx
  • 7
    Old Feren OS Repositories
    Old Feren OS Repositories
    This was the Official Repository for
    Feren OS. To add the latest one, run
    this command: (16.04-based) echo
    "deb ...
    Download Old Feren OS Repositories
  • More »

Linux commands