Backup and Recovery

Before deploying PSPDFKit Document Engine in a production environment, set up a backup strategy with scheduled automated backups and a tested disaster recovery plan.

Document Engine uses PostgreSQL as a data store, while binary assets including PDFs are stored in a Docker volume. You need to make backups of both.

PostgreSQL

We recommend using managed databases and relying on the vendor-specific backup management.

Assets

Built-In Storage

When you use the built-in storage option, all assets are backed up with the PostgreSQL backup.

S3

Using the S3-compatible backend means you need a separate backup routine, but consider that:

  • As Document Engine stores files by their SHA checksums, most of the time, a daily, incremental backup will suffice.

  • You should schedule the asset storage backup right after the PostgreSQL database backup to prevent data from drifting between the two.