Downloading Data

Downloading Document Text

Downloading a small amount of documents can be done via the export results button in Search view.

  1. First, choose a project and index to work on first.

  2. Complete a search that gives you the small amount of documents you want to download.

_images/export1.png
  1. Click on the Export results button.

_images/export2.png
  1. In Export search, select the fields you wish to export, in this example we chose the text field and the facts field.

The results will be saved as a .jsonl (jsonlines format) file, which is machine readable and could also be opened in a text/code editor.

To download a larger amount of documents or a small amount of very long documents, using a script is advised.



Accessing Document PDF

  1. Find the project and index the document is stored in.

  2. Find an interesting document by looking through the index or completing a search.

_images/metafile1.png
  1. You can access a document’s original PDF in NextCloud by selecting the meta.url field in Toggle Columns (See Search view options).

_images/metafile2.png

Now you can see the NextCloud links in the meta.url field.

  1. Click on the link to open or download the PDF.