Install latest/stable of Datashare

Ubuntu 16.04 or later?

Make sure snap support is enabled in your Desktop store.


Install using the command line

sudo snap install datashare

Don't have snapd? Get set up for snaps.

Channel Version Published

Details for Datashare

License

  • AGPL-1.0-or-later

Last updated

  • 18 December 2025 - latest/stable
  • 3 December 2025 - latest/edge

Websites


Contact


Report a Snap Store violation

Share this snap

Generate an embeddable card to be shared on external websites.

Datashare is a self-hosted search engine for documents.

Datashare is a self-hosted search engine for documents, using Apache Tika and Apache Tesseract to read thousands of file formats. This tool is developed by the International Consortium of Investigative Journalists (ICIJ), famously known for its groundbreaking investigations into the offshore world (Pandora Papers, Panama Papers, etc).

It also provides:

  • Many search filters (file types, creation date, languages, tags, etc)
  • Search in batch (with a CSV)
  • Search results download
  • Tagging and recommendation
  • Named Entities recognition with CoreNLP
  • Optical characters recognition with Apache Tesseract

After the installation, open a terminal and use the following command to start Datashare:

datashare

Datashare should now be available on http://localhost:8080 🚀


Install Datashare on your Linux distribution

Choose your Linux distribution to get detailed installation instructions. If yours is not shown, get more details on the installing snapd documentation.


Where people are using Datashare

Users by distribution (log)

Ubuntu 24.04