pg_dumpall: AI-Optimized Technical Reference
Core Function
PostgreSQL cluster-wide backup tool that creates logical dumps of entire PostgreSQL instances including all databases, users, roles, tablespaces, and global objects.
Critical Performance Characteristics
Speed and Resource Requirements
- Performance: 100GB cluster = 2-3 hours backup time
- Threading: Single-threaded only (no parallel processing, unlike pg_dump's -j option)
- Memory: Can exhaust system memory on large clusters with no memory limiting options
- Progress: No progress indicator - runs blindly until completion or failure
- Disk Space: Requires 2-3x database size for uncompressed dump
Failure Modes and Consequences
- All-or-nothing: If the backup fails halfway through, you restart from the beginning (no resume capability)
- Authentication hell: Prompts for a password for every database unless a .pgpass file is configured
- Memory exhaustion: OOM errors on large clusters (solutions: add RAM or split the job into per-database dumps)
- Tablespace path dependency: Hardcoded paths cause restore failures on different servers
Configuration That Actually Works
Production-Ready Commands
# Basic with compression and timestamping
pg_dumpall -h localhost -U postgres --no-password | gzip > "cluster_$(date +%Y%m%d_%H%M%S).sql.gz"
# With database exclusion
pg_dumpall --exclude-database=test_db | gzip > cluster_backup.sql.gz
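If you only need the global objects (roles, tablespaces) without any database contents, pg_dumpall's --globals-only mode is a much faster companion to per-database dumps. A minimal sketch, using the same host and user as the examples above:
# Globals only: roles and tablespaces, no table data
pg_dumpall --globals-only -h localhost -U postgres --no-password > "globals_$(date +%Y%m%d).sql"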
Authentication Setup (Critical)
# .pgpass file format: hostname:port:database:username:password
echo "localhost:5432:*:postgres:yourpassword" > ~/.pgpass
chmod 600 ~/.pgpass
Alternative: Environment variable
export PGPASSWORD=yourpassword
pg_dumpall | gzip > cluster_backup.sql.gz
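Before starting a multi-hour dump, confirm that passwordless authentication actually works. A quick sanity check (with -w, psql fails instead of prompting):
# Should print the database list without any password prompt
psql -h localhost -U postgres -w -c '\l'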
Decision Matrix: pg_dumpall vs Alternatives
Capability | pg_dumpall | pg_dump | pg_basebackup |
---|---|---|---|
Speed | Slowest (single-thread) | Fast (parallel -j) | Fastest |
Scope | Entire cluster | Single database | Entire cluster |
Global objects | ✅ Users/roles/tablespaces | ❌ Database only | ✅ Everything |
Parallel processing | ❌ None | ✅ -j workers | ✅ Inherent |
Progress tracking | ❌ None | ✅ Verbose mode | ✅ Progress shown |
Partial restore | ❌ All-or-nothing | ✅ Selective | ❌ All-or-nothing |
Cross-version | ✅ Works | ✅ Works | ❌ Same major only |
When pg_dumpall is Required (vs Better Alternatives)
Use pg_dumpall When:
- Migrating entire PostgreSQL clusters between servers
- Cloning development environments needing identical user setups
- Version upgrades requiring all global objects preserved
- Backing up user accounts and permissions alongside data
Use pg_dump Instead When:
- Daily database backups (faster, parallel processing)
- Individual database restoration needed
- Large databases (parallel workers reduce dump time significantly; see the sketch below)
- Progress monitoring required
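For that large-database case, a sketch of what the parallel alternative might look like (parallel jobs require pg_dump's directory format; mydb and the output path are placeholders):
# Directory-format dump of one database with 4 parallel workers
pg_dump -h localhost -U postgres -Fd -j 4 -f /backups/mydb_dir mydb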
Use pg_basebackup When:
- Hot standby setup required
- Fastest full cluster backup needed
- Binary-level cluster replication
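For those scenarios, a minimal pg_basebackup invocation might look like the sketch below; the replication user and target directory are placeholders, and the role needs the REPLICATION privilege:
# Binary copy of the whole cluster: tar format, gzip-compressed, with a progress report
pg_basebackup -h localhost -U replicator -D /backups/base -Ft -z -P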
Critical Warnings and Gotchas
Version Compatibility Rule
Always use the NEWER version's pg_dumpall for cross-version migrations
- Correct: Use PostgreSQL 17's pg_dumpall to back up a PostgreSQL 13 server
- Wrong: Use PostgreSQL 13's pg_dumpall when the target is PostgreSQL 17
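On a host with multiple client versions installed, call the newer binary explicitly. The path below assumes the Debian/Ubuntu package layout, and old-server is a placeholder:
# Dump an older (v13) server using the v17 client binaries
/usr/lib/postgresql/17/bin/pg_dumpall -h old-server -U postgres | gzip > cluster_v13.sql.gz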
Tablespace Path Issues
- Backup includes hardcoded CREATE TABLESPACE paths
- Restore fails if paths don't exist on target server
- Solutions: create identical paths on the target, use the --no-tablespaces flag, or manually edit the SQL (see the sketch below)
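A sketch of the --no-tablespaces route, which drops the hardcoded tablespace assignments so everything lands in the default tablespace on restore:
# Dump without CREATE TABLESPACE statements or per-object tablespace assignments
pg_dumpall --no-tablespaces -h localhost -U postgres | gzip > cluster_no_ts.sql.gz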
System Database Limitations
- Cannot exclude template0, template1 databases
- If system databases have issues, entire dump fails
- No partial recovery options
Restore Process Reality
Basic Restore
psql -U postgres -f cluster_backup.sql
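If the dump was compressed as in the backup examples, stream it straight into psql; adding ON_ERROR_STOP makes the restore abort loudly on the first error instead of scrolling past it:
# Restore a gzipped cluster dump, stopping at the first error
gunzip -c cluster_backup.sql.gz | psql -U postgres -v ON_ERROR_STOP=1 -d postgres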
Restore Failure Handling
- No incremental restore capability
- No "skip broken parts" option
- Failure midway = restart from beginning
- Always test restores on throwaway servers first
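One cheap way to get a throwaway server is a disposable container. This sketch assumes Docker and the official postgres image, with the password and port chosen arbitrarily:
# Scratch instance on port 5433, restore into it, then destroy it
docker run -d --name pg-restore-test -e POSTGRES_PASSWORD=test -p 5433:5432 postgres:17
sleep 10   # give initdb a moment to finish
gunzip -c cluster_backup.sql.gz | PGPASSWORD=test psql -h localhost -p 5433 -U postgres -d postgres
docker rm -f pg-restore-test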
Resource Requirements
Prerequisites
- Superuser access (needs system catalogs and role information)
- .pgpass file or PGPASSWORD environment variable
- Disk space: 2-3x database size uncompressed (quick check sketched below)
- Memory: Proportional to largest database + metadata
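A quick sanity check of the disk-space prerequisite: compare the cluster's on-disk size with the free space at the dump destination (the /backups path is a placeholder):
# Total size of all databases, then free space where the dump will be written
psql -U postgres -t -c "SELECT pg_size_pretty(sum(pg_database_size(datname))) FROM pg_database;"
df -h /backups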
Time Investment
- Setup: 15-30 minutes (authentication configuration)
- Execution: 1-3 hours per 100GB
- Testing: Required for each backup strategy change
- Recovery planning: Critical for production use
Common Failure Scenarios
Password Authentication Loops
Password for user postgres:
Password for user postgres:
Password for user postgres:
[continues for every database...]
Solution: Configure .pgpass file (mandatory for production)
Memory Exhaustion
Symptoms: OOM killer terminates pg_dumpall process
Solutions:
- Increase system RAM
- Split the job into individual database dumps (see the sketch below)
- Note: pg_dumpall itself offers no memory-limiting options
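A sketch of the split-job approach: dump globals once with pg_dumpall, then loop over non-template databases with individual pg_dump runs (the custom format -Fc also allows selective restores later):
# Globals (roles, tablespaces) once, then one compressed dump per database
pg_dumpall --globals-only -U postgres > globals.sql
for db in $(psql -U postgres -At -c "SELECT datname FROM pg_database WHERE NOT datistemplate;"); do
  pg_dump -U postgres -Fc -f "${db}.dump" "$db"
done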
Corrupted Compressed Backups
Risk: Gzip pipeline failures create corrupted .gz files
Mitigation: Always test compressed backups before relying on them
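Two cheap safeguards: run backup scripts with set -o pipefail so a failed pg_dumpall doesn't exit successfully just because gzip did, and verify the archive afterwards. Note that gunzip -t only proves the gzip stream is intact, not that the SQL inside restores cleanly:
# Abort the script if any stage of the pipeline fails
set -o pipefail
pg_dumpall -U postgres | gzip > cluster_backup.sql.gz
# Structural integrity check of the finished archive
gunzip -t cluster_backup.sql.gz && echo "archive OK"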
Alternative Tools Comparison
For Large-Scale Production
- pgBackRest: Parallel backup/restore, incremental backups, enterprise features
- Barman: Disaster recovery focus, continuous archiving, point-in-time recovery
- WAL-G: Cloud storage archiving, fast restoration, supports multiple engines (PostgreSQL, MySQL, MS SQL)
Performance Benchmarks (Approximate)
- pg_dumpall: 100GB = 2-3 hours (single-thread)
- pg_dump with -j 4: 100GB = 45-90 minutes (4 workers)
- pg_basebackup: 100GB = 15-30 minutes (binary copy)
Operational Intelligence Summary
pg_dumpall is the "nuclear option" - use only when you need everything or have no other choice. For most production scenarios, pg_dump with parallel workers or pg_basebackup provides better performance and flexibility. The tool's single-threaded, all-or-nothing nature makes it unsuitable for large production databases where backup time and recovery flexibility matter.
Success criteria: Working .pgpass file, sufficient disk space, tested restore procedures, and realistic time expectations. Failure to meet any of these results in painful backup experiences and potential data recovery failures.
Useful Links for Further Investigation
Link | Description |
---|---|
pg_dumpall Documentation | The official documentation for pg_dumpall, providing comprehensive and detailed information on its usage, options, and capabilities for full database backups, though it can be quite technical. |
PostgreSQL Backup and Restore | A comprehensive overview of all available PostgreSQL backup and restore methods, detailing various strategies and tools for ensuring data integrity and recovery. |
Password File Documentation | Official documentation explaining how to configure and use the .pgpass password file, enabling users to avoid constant password prompts for PostgreSQL connections and operations. |
Backup PostgreSQL Using pg_dump and pg_dumpall | A practical guide offering a clear comparison between pg_dump and pg_dumpall, complete with real-world examples to illustrate their usage and best practices for PostgreSQL backups. |
Introduction to Postgres Backups | An introductory overview of PostgreSQL backup strategies provided by Crunchy Data, covering fundamental concepts and essential techniques for reliable data protection. |
PostgreSQL Backup Guide | A comprehensive PostgreSQL backup guide from Neon, detailing the usage and differences between pg_dump and pg_dumpall for various database backup scenarios. |
pg_dump vs pg_dumpall - Which One to Use | A Stack Overflow discussion comparing pg_dump and pg_dumpall, offering insights from real user experiences and scenarios to help determine the appropriate tool for database backups. |
Improving pg_dump Performance | A Stack Overflow thread dedicated to discussing various strategies and performance optimization tips for improving the speed and efficiency of pg_dump and pg_restore operations. |
Password Prompt Issues | A Stack Overflow discussion addressing common password prompt issues encountered during PostgreSQL database backups, providing solutions and workarounds for authentication problems. |
pg_dump Documentation | The official documentation for pg_dump, a powerful utility specifically designed for performing single database backups, often considered a more flexible and granular choice than pg_dumpall. |
Barman | Barman (Backup and Recovery Manager) is an open-source, enterprise-grade tool for disaster recovery of PostgreSQL servers, enabling continuous archiving and point-in-time recovery. |
WAL-G | WAL-G is a fast archival restoration for PostgreSQL, MySQL, and MS SQL, designed for continuous archiving of WAL files to cloud storage, ensuring robust point-in-time recovery capabilities. |
pgBackRest | pgBackRest is a robust, full-featured backup and restore solution for PostgreSQL, offering parallel backup and restore, incremental backups, and comprehensive archiving capabilities for large databases. |