Backup and Recovery
Before deploying PSPDFKit Document Engine in a production environment, set up a backup strategy with scheduled automated backups and a tested disaster recovery plan.
Document Engine uses PostgreSQL as a data store, while binary assets including PDFs are stored in a Docker volume. You need to make backups of both.
PostgreSQL
We recommend using managed databases and relying on the vendor-specific backup management.
Assets
Built-In Storage
When you use the built-in storage option, all assets are backed up with the PostgreSQL backup.
S3
Using the S3-compatible backend means you need a separate backup routine, but consider that:
-
As Document Engine stores files by their SHA checksums, most of the time, a daily, incremental backup will suffice.
-
You should schedule the asset storage backup right after the PostgreSQL database backup to prevent data from drifting between the two.