EnglishFrenchSpanish

Ad


OnWorks favicon

tika-python download for Linux

Free download tika-python Linux app to run online in Ubuntu online, Fedora online or Debian online

This is the Linux app named tika-python whose latest release can be downloaded as ApacheTika2.6.0Release.zip. It can be run online in the free hosting provider OnWorks for workstations.

Download and run online this app named tika-python with OnWorks for free.

Follow these instructions in order to run this app:

- 1. Downloaded this application in your PC.

- 2. Enter in our file manager https://www.onworks.net/myfiles.php?username=XXXXX with the username that you want.

- 3. Upload this application in such filemanager.

- 4. Start the OnWorks Linux online or Windows online emulator or MACOS online emulator from this website.

- 5. From the OnWorks Linux OS you have just started, goto our file manager https://www.onworks.net/myfiles.php?username=XXXXX with the username that you want.

- 6. Download the application, install it and run it.

SCREENSHOTS

Ad


tika-python


DESCRIPTION

A Python port of the Apache Tika library that makes Tika available using the Tika REST Server. This makes Apache Tika available as a Python library, installable via Setuptools, Pip and easy to install. To use this library, you need to have Java 7+ installed on your system as tika-python starts up the Tika REST server in the background. To get this working in a disconnected environment, download a tika server file (both tika-server.jar and tika-server.jar.md5, which can be found here) and set the TIKA_SERVER_JAR environment variable to TIKA_SERVER_JAR="file:////tika-server.jar" which successfully tells python-tika to "download" this file and move it to /tmp/tika-server.jar and run as a background process. This is the only way to run python-tika without internet access. Without this set, the default is to check the tika version and pull latest every time from Apache.



Features

  • Parser Interface (backwards compat prior to REST)
  • The parser interface extracts text and metadata using the /rmeta interface
  • Optionally, you can pass Tika server URL along with the call what's useful for multi-instance execution
  • Specify Output Format To XHTML
  • The unpack interface handles both metadata and text extraction in a single call
  • Internally returns back a tarball of metadata and text entries that is internally unpacked


Programming Language

Python


Categories

Text Processing, Healthcare, Machine Learning

This is an application that can also be fetched from https://sourceforge.net/projects/tika-python.mirror/. It has been hosted in OnWorks in order to be run online in an easiest way from one of our free Operative Systems.


Free Servers & Workstations

Download Windows & Linux apps

  • 1
    SportMusik
    SportMusik
    Mit dem Programm kann man schnell und
    einfach Pausen bei Sportveranstaltungen
    mit Musik �berbr�cken. Hierf�r haben sie
    die M�glichkeit, folgende Wiedergabvaria...
    Download SportMusik
  • 2
    DavMail POP/IMAP/SMTP/Caldav to Exchange
    DavMail POP/IMAP/SMTP/Caldav to Exchange
    Ever wanted to get rid of Outlook ?
    DavMail is a
    POP/IMAP/SMTP/Caldav/Carddav/LDAP
    gateway allowing users to use any mail
    client with Exchange and Office 365, e...
    Download DavMail POP/IMAP/SMTP/Caldav to Exchange
  • 3
    DivFix++
    DivFix++
    DivFix++ is yours AVI video repair and
    preview software. It designed for repair
    and preview files which are on download
    from ed2k(emule), torrent, gnutella, ftp...
    Download DivFix++
  • 4
    JBoss Community
    JBoss Community
    Community driven projects featuring the
    latest innovations for cutting edge
    apps. Our flagship project JBoss AS is
    the leading Open Source,
    standards-compliant...
    Download JBoss Community
  • 5
    Django Filer
    Django Filer
    django Filer is a file management
    application for django that makes
    handling files and images a breeze.
    django-filer is a file management
    application for djang...
    Download Django Filer
  • 6
    xCAT
    xCAT
    Extreme Cluster Administration Toolkit.
    xCAT is a scalable cluster management
    and provisioning tool that provides
    hardware control, discovery, and OS
    diskful/di...
    Download xCAT
  • More »

Linux commands

Ad