Install latest/stable of Datashare

Ubuntu 16.04 or later?

Make sure snap support is enabled in your Desktop store.


Install using the command line

sudo snap install datashare

Don't have snapd? Get set up for snaps.

Channel Version Published

Datashare is a self-hosted search engine for documents.

Datashare is a self-hosted search engine for documents, using Apache Tika and Apache Tesseract to read hundreds of file formats. Datashare is developed by the International Consortium of Investigative Journalists (ICIJ), famously known for its groundbreaking investigations into the offshore world (Pandora Papers, Panama Papers, etc).

Datashare is based on Apache Tika and supports thousands of files format.

It also provides:

  • Many search filters (file types, creation date, languages, tags, etc)
  • Search in batch (with a CSV)
  • Search results download
  • Tagging and recommendation
  • Named Entities recognition with CoreNLP
  • Optical characters recognition with Apache Tesseract

Details for Datashare

License
  • AGPL-1.0-or-later

Last updated
  • 5 November 2024 - latest/stable
  • 18 November 2024 - latest/edge

Websites

Contact

Report a Snap Store violation

Share this snap

Generate an embeddable card to be shared on external websites.


Install Datashare on your Linux distribution

Choose your Linux distribution to get detailed installation instructions. If yours is not shown, get more details on the installing snapd documentation.


Where people are using Datashare

Users by distribution (log)

Ubuntu 24.04
Ubuntu 22.04
Ubuntu 24.10