EnglishFrenchSpanish

Ad


OnWorks favicon

dnaindex - Online in the Cloud

Run dnaindex in OnWorks free hosting provider over Ubuntu Online, Fedora Online, Windows online emulator or MAC OS online emulator

This is the command dnaindex that can be run in the OnWorks free hosting provider using one of our multiple free online workstations such as Ubuntu Online, Fedora Online, Windows online emulator or MAC OS online emulator

PROGRAM:

NAME


dnaindex - index dna file for use with ANFO

SYNOPSIS


dnaindex [ option ... ]

DESCRIPTION


dnaindex builds an index for a dna file. Dna files must be indexed to be useable with
anfo(1), it is possible to have multiple indices for the same dna file.

OPTIONS


-V, --version
Print version number and exit.

-o file, --output file
Write output to file. file customarily ends in .idx. Default is
genomename_wordsize.idx.

-g file, --genome file
Read the genome from file. This file name is also stored in the resulting index so
it can be found automatically whenever the index is used. It is therefore best if
file is just a file name without path.

-G dir, --genome-dir dir
Add dir to the genome search path. This is useful if the genome to be indexed is
not yet in the place where it will later be used.

-d text, --description text
Add text as description to the index. This is purely informative.

-s size, --wordsize size
Set the wordsize to size. A smaller wordsize increases precision at the expense of
higher computational investment. The default is 12, which with a stride of 8
yields a good compromise.

-S num, --stride num
Set the stride to num. Only one out of num possible words of dna is actually
indexed. A smaller stride increases precicion at the expense of a bigger index.
The default is 8, which in conjunction with a wordsize of 12 yields a good
compromise.

-l lim, --limit lim
Prevents the indexing of words that occur more often than lim times. This can be
used to ignore repetitive seeds and save the space to store them. A good default
depends on the size of the genome being indexed, something like 500 works for the
human genome with wordsize 12 and stride 8.

-h, --histogram
Produce a histogram of word frequencies. This can be used to get an indea how the
frequency distribution to select an appropriate value for --limit.

-v, --verbose
Print a progress indicator during operation.

NOTES


dnaindex is limited to genomes no longer than 4 gigabases due to its use of 32 bit
indices. The index is quite large, so depending on parameters, a 64 bit platform is
needed for genomes in the gigabase range.

If a genome contains IUPAC ambiguity codes, the affected seeds need to be expanded. If
there are many ambiguity codes in a small region, that results in an unacceptably large
index.

ENVIRONMENT


ANFO_PATH
Colon separated list of directories searched for genome files.

Use dnaindex online using onworks.net services


Free Servers & Workstations

Download Windows & Linux apps

  • 1
    Asuswrt-Merlin
    Asuswrt-Merlin
    Asuswrt-Merlin is a third party
    firmware for select Asus wireless
    routers. Based on the Asuswrt firmware
    developed by Asus, it brings tweaks, new
    features and ...
    Download Asuswrt-Merlin
  • 2
    Atom
    Atom
    Atom is a text editor that's
    modern, approachable and full-featured.
    It's also easily customizable- you
    can customize it to do anything and be
    able to ...
    Download Atom
  • 3
    Osu!
    Osu!
    Osu! is a simple rhythm game with a well
    thought out learning curve for players
    of all skill levels. One of the great
    aspects of Osu! is that it is
    community-dr...
    Download Osu!
  • 4
    LIBPNG: PNG reference library
    LIBPNG: PNG reference library
    Reference library for supporting the
    Portable Network Graphics (PNG) format.
    Audience: Developers. Programming
    Language: C. This is an application that
    can also...
    Download LIBPNG: PNG reference library
  • 5
    Metal detector based on  RP2040
    Metal detector based on RP2040
    Based on Raspberry Pi Pico board, this
    metal detector is included in pulse
    induction metal detectors category, with
    well known advantages and disadvantages.
    RP...
    Download Metal detector based on RP2040
  • 6
    PAC Manager
    PAC Manager
    PAC is a Perl/GTK replacement for
    SecureCRT/Putty/etc (linux
    ssh/telnet/... gui)... It provides a GUI
    to configure connections: users,
    passwords, EXPECT regula...
    Download PAC Manager
  • More »

Linux commands

Ad