Scaling and auto-failover Hasura with PostgreSQL

This is a test experiment with several popular PostgreSQL replication setups, to better understand scaling solutions for Hasura GraphQL Engine with Postgres.

The tests are categorized into 2 versions: Hasura Core (OSS) and Read Replica on Hasura Pro.

TLDR

This is a summary table of Postgres replication features with Hasura:

| System | Load balancing | Auto-failover | Difficulty |
| --- | --- | --- | --- |
| Streaming Replication | | | Easy |
| Streaming Replication + Hasura Pro | ✔️ | Read | Easy |
| repmgr | | ✔️ | Easy |
| repmgr + Hasura Pro | ✔️ | ✔️ (*) | Easy |
| PgBouncer | | | Easy |
| PgBouncer + Hasura Pro | ✔️ | Read | Easy |
| PgPool | ✔️ | ✔️ (**) | Medium |
| HAProxy + PgBouncer + (xinetd or Patroni) | ✔️ | ✔️ | Hard |

(*): works well with 3 nodes.

(**): requires extra components: repmgr, Docker/Kubernetes.

  • If you use cloud SQL services (Google Cloud, Amazon Web Services, Azure...), or you don't care about auto-failover, Streaming Replication + Hasura Pro gives the best performance with load balancing. For Hasura Core, the workaround is to use 2 Hasura instances, each connecting to one Postgres server. Read-only apps use the standby Hasura instance, or you use 2 GraphQL clients (read and write) and select the correct client according to the request: query and subscription (read) vs mutation (write); see the sketch after this list. You can optionally add PgBouncer if your application needs many concurrent connections.
  • If you deploy on on-premise servers and high availability is critical, repmgr + Streaming Replication + Hasura Pro solves both the auto-failover and the load balancing problems. For Hasura Core, it is also possible with a PgPool or Patroni + HAProxy + PgBouncer setup. However, the downside is that it is more complicated to deploy, with extra server cost that isn't cheaper than Hasura Pro's Read Replica solution.
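A minimal sketch of the two-instance Hasura Core workaround mentioned above. The image tag, container names, hostnames pg-0/pg-1, credentials, and the shared Docker network are illustrative assumptions, not the exact setup from this repo:

```sh
# Read-write instance, pointed at the master; send mutations here.
docker run -d --name hasura-write --network pg-net -p 8080:8080 \
  -e HASURA_GRAPHQL_DATABASE_URL="postgres://postgres:postgres@pg-0:5432/postgres" \
  hasura/graphql-engine:v1.2.1

# Read-only instance, pointed at the standby; send queries/subscriptions here.
docker run -d --name hasura-read --network pg-net -p 8081:8080 \
  -e HASURA_GRAPHQL_DATABASE_URL="postgres://postgres:postgres@pg-1:5432/postgres" \
  hasura/graphql-engine:v1.2.1
```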

The sections below are detailed experiments with multiple replication setups.

Built-in Hasura features

Failover multi-hosts connections

Because Hasura GraphQL Engine uses the low-level libpq library binding, it supports built-in failover multi-host connections. However, Hasura Core can only connect to one PostgreSQL server at a time. Some advanced features such as Read-Write replicas are only available on Pro.
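For example, several hosts can be listed in a single connection string passed to HASURA_GRAPHQL_DATABASE_URL; libpq tries them in order when one is unreachable. The hostnames and credentials below are placeholders:

```sh
# libpq multi-host URI: pg-1 is only used if pg-0 cannot be reached.
export HASURA_GRAPHQL_DATABASE_URL="postgres://postgres:postgres@pg-0:5432,pg-1:5432/postgres"
```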

Read Replica

This is a cool feature of Hasura Pro that supports load balancing between the Master and Standby replicas. All write queries are executed on the Master, and read queries (query, subscription) are executed on Standby nodes. Moreover, Hasura Pro can load balance across multiple standby nodes, so we can horizontally scale read queries.

In this article, I will test read-replica using the Hasura Pro v1.2.1-pro.1 version.
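A rough sketch of how a read-replica setup is wired up through environment variables; the HASURA_GRAPHQL_READ_REPLICA_URLS variable, hostnames and credentials here are assumptions for illustration, not the exact contents of the compose files in this repo:

```sh
# Writes (mutations) go to the master URL; reads (queries, subscriptions)
# are load balanced across the comma-separated replica URLs.
export HASURA_GRAPHQL_DATABASE_URL="postgres://postgres:postgres@pg-0:5432/postgres"
export HASURA_GRAPHQL_READ_REPLICA_URLS="postgres://postgres:postgres@pg-1:5432/postgres,postgres://postgres:postgres@pg-2:5432/postgres"
```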

PostgreSQL Streaming Replication

Streaming replication in PostgreSQL works on log shipping. Every transaction in postgres is written to a transaction log called WAL (write-ahead log) to achieve durability. A slave uses these WAL segments to continuously replicate changes from its master.

  • Replication type: master-slave
  • Cluster setup: 1 master pg-0, 1 slave pg-1
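To confirm that WAL streaming is running, you can query the standard pg_stat_replication view on the master; the psql connection parameters below are placeholders:

```sh
# Run against the master (pg-0): one row per connected standby.
psql -h pg-0 -U postgres -d postgres \
  -c "SELECT client_addr, state, sent_lsn, replay_lsn, sync_state FROM pg_stat_replication;"
```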

Hasura Core

Use Cases

1: stop master node (pg-0)

  • Standby node pg-1 is still in read-only mode
  • Hasura can switch to pg-1. However, because Hasura Core requires read-write mode, it keeps throwing the transaction error cannot set transaction read-write mode during recovery

2: start master after stopped

  • Hasura still connects to the slave node pg-1. This gets worse, because Hasura thinks the transaction error is just a normal postgres-error. pg-1 is still alive, so Hasura can't disconnect the current connection. The workaround is restarting pg-1

3: restart pg-1

  • Hasura switches back to master pg-0 in seconds

Conclusion

Pros

  • Simple to deploy and configure settings
  • Used as a backup standby node that can switch to master

Cons

  • Doesn't support auto-failover. The database admin has to switch manually
  • Multi-host connections are a bad choice. Hasura will get stuck on the slave node if the master node is down

Hasura Pro

  • Set pg-1, pg-2 as read replica
  • Configuration: docker-compose.pro.postgres.yaml

Use Cases

1: stop master node (pg-0)

  • Hasura Pro throws connection errors on the write replica. However, read queries still work

2: start master after stopped

  • Hasura Pro tries connecting to pg-0, then continues to work

3: stop pg-1

  • Now read queries are unreachable

4: add pg-0 as a second read replica, then stop pg-1

  • My expectation was that, if pg-1 is down, pg-0 would take both the read and write roles. However, Hasura Pro only runs several queries and then hangs. There is a conflict between the repeatable read and read-write ISOLATION LEVEL

Conclusion

Pros

  • Simple to deploy and configure settings
  • Used as a backup standby node that can switch to master
  • Utilizes the power of the standby node through read-replicas. Load balancing between master and standby

Cons

  • Doesn't support auto-failover. The database admin has to switch manually
  • Extra cost for Pro license

When to use

If you use cloud SQL services (AWS RDS, Google Cloud SQL), you don't really need to care about auto-failover; the cloud provider manages the servers for you. This solution is suitable for backup, or for using the standby node as a read-only database for read-only applications (exporters, query reports...). Moreover, with read-replica, the Master node can share load pressure with the Standby.

PostgreSQL Replication Manager (repmgr)

repmgr is an open-source tool suite for managing replication and failover in a cluster of PostgreSQL servers. It enhances PostgreSQL's built-in hot-standby capabilities with tools to set up standby servers, monitor replication, and perform administrative tasks such as failover or manual switchover operations.

  • Replication type: master-standby
  • Cluster setup: 1 master pg-0, 1 slave pg-1
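For reference, cluster state and manual promotion/switchover in repmgr are handled through its CLI; the config file path below is an assumption:

```sh
# Show the current cluster topology (primary/standby roles and connection status).
repmgr -f /etc/repmgr.conf cluster show

# Promote a standby to primary manually (run on the standby node).
repmgr -f /etc/repmgr.conf standby promote

# Controlled switchover: promote this standby and demote the old primary.
repmgr -f /etc/repmgr.conf standby switchover
```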

Hasura Core

Use cases

1: stop master node (pg-0)

  • repmgr automatically switches standby (pg-1) to master in 30 seconds
  • However, Hasura takes 2-3 minutes to switch to pg-1. It works smoothly after that

2: start master after stopped

  • pg-1 is still master node. pg-0 is marked as standby

3: repeatedly stop and start master and slave nodes

  • Hasura switches back to master faster than before (about 5 seconds)

Conclusion

The first time, the auto-failover is slow (about 3 minutes). Later times are faster. The reconnect time is dynamic because the Postgres nodes need time to recover.

Pros:

  • Support automatic failover
  • Simple to deploy and configure settings with docker

Hasura Pro

1 Master - 1 Standby

  • Set pg-1 as read replica
  • Hasura Pro database-url connects to both Postgres nodes
  • Hasura Pro replicas connect to both Postgres nodes (see the sketch after this list)
  • Configuration: docker-compose.pro.repmgr.yaml
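A sketch of the idea in the list above: both the write URL and the replica list include every node, so Hasura can still reach whichever node repmgr promotes. Variable names, hosts and credentials are assumptions, not the exact contents of docker-compose.pro.repmgr.yaml:

```sh
# libpq multi-host URIs: if pg-0 is down, the connection falls back to pg-1.
export HASURA_GRAPHQL_DATABASE_URL="postgres://postgres:postgres@pg-0:5432,pg-1:5432/postgres"
export HASURA_GRAPHQL_READ_REPLICA_URLS="postgres://postgres:postgres@pg-0:5432,pg-1:5432/postgres"
```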

1: stop master node (pg-0)

  • repmgr automatically switches standby (pg-1) to master in 30 seconds
  • pg-1 takes responsibility as both write and read replica
  • Hasura Pro can switch the Master database URL to pg-1. However, it hangs on both read and write queries because of a transaction isolation conflict

2: start master after stopped

  • pg-1 is still master node. pg-0 is marked as standby
  • Hasura switches to the correct read replica and runs smoothly from then on

1 Master - 2 Standby

Because Hasura Pro can switch back to the correct read replica, we can try a 1 Master + 2 Standby setup.

  • Set pg-1, pg-2 as read replica
  • Hasura Pro database-url connects to 3 Postgres nodes
  • Hasura Pro replicas connect to 3 Postgres nodes
  • Configuration: docker-compose.pro.repmgr3.yaml

1: stop master node (pg-0)

  • repmgr automatically switches standby (pg-1) to master in 30 seconds
  • pg-1 takes responsibility as both write and read replica
  • Hasura Pro can switch the Master database URL to pg-1. Thanks to the additional read replica, Hasura Pro can solve the auto-failover problem.

Note: bitnami-docker-postgresql-repmgr focuses on the Kubernetes Helm chart. Their expectation is that, when the master node is stopped, all standby nodes except the promoted one should be restarted. However, in a Docker environment, standby nodes can't be restarted automatically, so the Hasura Pro read replica gets stuck at that standby node.
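For context, a standby node built on the bitnami postgresql-repmgr image is configured roughly like this; the variable names follow that image's documentation and the values are placeholders, so treat this as a sketch rather than the repo's exact compose file:

```sh
# Environment for one standby container (pg-1) using bitnami/postgresql-repmgr.
export POSTGRESQL_POSTGRES_PASSWORD=postgres
export POSTGRESQL_PASSWORD=postgres
export REPMGR_PASSWORD=repmgrpassword
export REPMGR_PRIMARY_HOST=pg-0                 # initial primary node
export REPMGR_PARTNER_NODES=pg-0,pg-1,pg-2      # every node in the cluster
export REPMGR_NODE_NAME=pg-1
export REPMGR_NODE_NETWORK_NAME=pg-1
```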

Conclusion

Pros

  • Support automatic failover
  • Simple to deploy and configure settings
  • Utilizes the power of the standby node through read-replicas. Load balancing between master and standby

Cons

  • There will be a transaction isolation conflict when the master node is stopped

When to use

Auto-failover is critical on on-prem infrastructure, although Docker/Kubernetes systems have auto-restart policies to reduce downtime. Read-replica works well with repmgr, but it is safer with 2 or more standby nodes.

PgBouncer

PgBouncer is used as a lightweight connection pooler proxy over Postgres instances. It doesn't have built-in support for load balancing multiple servers or for failover. We have to set up complex extensions to support them (Patroni, HAProxy...).

How about PgBouncer + repmgr? I set up this combo to see if it works.

  • Replication type: master-standby
  • Cluster setup: 1 master pg-0, 1 slave pg-1
  • PgBouncer setup: 2 nodes, connecting to pg-0 and pg-1 respectively (see the config sketch after this list)
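A rough sketch of the pgbouncer-0 configuration (pgbouncer-1 points at pg-1 instead); the database name, credentials and pool sizes are placeholders:

```sh
cat > pgbouncer.ini <<'EOF'
[databases]
postgres = host=pg-0 port=5432 dbname=postgres

[pgbouncer]
listen_addr = 0.0.0.0
listen_port = 6432
auth_type = md5
auth_file = /etc/pgbouncer/userlist.txt
pool_mode = session
max_client_conn = 500
default_pool_size = 20
EOF
```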

Hasura Core

Use case

1: stop master node (pg-0)

  • repmgr automatically switches standby (pg-1) to master in 30 seconds
  • However, pgbouncer-0 is still running, so the GraphQL engine can't switch to pg-1

2: start master after stopped

  • Similar to the Postgres replication test. pg-0 becomes a standby node now. The GraphQL engine keeps showing the read-write mode error

Hasura Pro

The behavior is similar. Hasura can't know when the Postgres server is stopped because it connects through the PgBouncer proxy.

Conclusion

This architecture isn't suitable for failover connections with Hasura Core.

Pros

  • Simple to deploy and configure settings
  • Lightweight connection pooling, which helps increase performance when there are many concurrent connections
  • Utilizes the power of the standby node through read-replicas. Load balancing between master and standby

Cons

  • Doesn't support automatic failover
  • Prevents Hasura from doing auto-failover, because Hasura can't detect the health status of the Postgres instance

When to use

PgBouncer can extend the current streaming replication architecture if you don't care about auto-failover, or it can be combined with an advanced auto-failover and load balancer setup.

PgPool II

PgPool-II is a proxy software that sits between PostgreSQL servers and their clients. It looks like an all-in-one solution:

  • Connection Pooling
  • Load Balancing
  • Automated fail over
  • Online Recovery
  • Replication
  • Limiting Exceeding Connections
  • Watchdog
  • In Memory Query Cache

At first look, PgPool could replace the read-replica of Hasura Pro because of its Load Balancing. So let's try some experiments.

Setup:

Load Balancing

After all services start up, I run some read queries. However, all queries are executed on the master pg-0 node only.

| node_id | hostname | port | status | lb_weight | role | select_cnt | load_balance_node | replication_delay | replication_state | replication_sync_state | last_status_change |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 0 | pg-0 | 5432 | up | 0.500000 | primary | 82 | false | 0 | | | 2020-05-16 18:54:06 |
| 1 | pg-1 | 5432 | up | 0.500000 | standby | 0 | true | 0 | | | 2020-05-16 18:55:02 |
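The statistics above come from PgPool's show pool_nodes command, which can be run through the PgPool port like a normal SQL statement; the host, port and user below are placeholders:

```sh
# select_cnt shows how many SELECTs each backend has served.
psql -h pgpool -p 5432 -U postgres -d postgres -c "show pool_nodes;"
```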

Why? First of all, PgPool-II's load balancing is "session based", not "statement based". That means the DB node selection for load balancing is decided at the beginning of a session, so all SQL statements are sent to the same DB node until the session ends. (PgPool FAQ)

In this case, because Hasura creates a long-lived connection pool at startup, it connects to pg-0 first, and all queries on these connections go through the master. This is different from Read-replica, which connects to the master and slave nodes at once and then load balances through its read-write policy logic.

Therefore, we need more concurrent connections to utilize PgPool's Load Balancing. I run a load test with 100 queries/second, and fortunately, load balancing works well.

| node_id | hostname | port | status | lb_weight | role | select_cnt | load_balance_node | replication_delay | replication_state | replication_sync_state | last_status_change |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 0 | pg-0 | 5432 | up | 0.500000 | primary | 2621 | false | 0 | | | 2020-05-17 04:27:21 |
| 1 | pg-1 | 5432 | up | 0.500000 | standby | 3076 | true | 0 | | | 2020-05-17 04:27:21 |

PgPool load balances read queries across all master and standby nodes. The priority depends on lb_weight. To simulate a read-replica similar to Hasura Pro, you can set a low lb_weight on the master.
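The relevant pgpool.conf settings look roughly like this; the parameter names are standard PgPool-II options, while the hostnames and weights are illustrative:

```sh
cat >> pgpool.conf <<'EOF'
load_balance_mode = on

backend_hostname0 = 'pg-0'
backend_port0 = 5432
backend_weight0 = 0.1     # low weight on the master: most reads go to the standby
backend_flag0 = 'ALLOW_TO_FAILOVER'

backend_hostname1 = 'pg-1'
backend_port1 = 5432
backend_weight1 = 0.9
backend_flag1 = 'ALLOW_TO_FAILOVER'
EOF
```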

Auto-failover

Use cases

1. Stop master pg-0

  • repmgr automatically switches standby (pg-1) to master
  • PGPool routes pg-1 as the master node. Hasura reconnects and continues to work

2. Start pg-0

  • pg-0 becomes standby node.
  • However, PGPool doesn't reload pg-0's online status. show pool_nodes still shows it as down. This is expected behavior (FAQ)

3. Stop pg-1

  • pg-0 becomes master
  • PGPool thinks all backend nodes are down and stops working. It needs to be restarted, or the nodes need to be manually re-attached to reload

What's wrong with PgPool?

This is expected behavior of PgPool.

Why does not Pgpool-II automatically recognize a database comes back online? It would be technically possible but we don't think it's a safe feature. Consider a streaming replication configuration. When a standby comes back online, it does not necessarily means it connects to the current primary node. It may connect to a different primary node , or even it's not a standby any more. If Pgpool-II automatically recognizes such that standby as online, SELECTs to the standby node will return different result as the primary, which is a disaster for database applications. Also please note that "pgpool reload" does not do anything for recognizing the standby node as online. It just reloads configuration files. Please note that in Pgpool-II 4.1 or later, it is possible to automatically make a standby server online if it's safe enough. See configuration parameter "auto_failback" for more information.

So, by default PgPool doesn't reload the online status. The auto_failback configuration is only available from PgPool 4.1. Before that, DBA engineers used workarounds such as a restart policy for the PgPool container/pod based on health checks in Docker/Kubernetes. The downside is that client connections are disconnected.
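On PgPool 4.1 or later, the health check and auto_failback parameters look roughly like this; the option names are standard pgpool.conf settings, while the intervals and credentials are placeholders:

```sh
cat >> pgpool.conf <<'EOF'
# Health checking lets PgPool notice when a backend goes down or comes back.
health_check_period = 10
health_check_timeout = 20
health_check_user = 'postgres'
health_check_password = 'postgres'

# PgPool 4.1+: automatically re-attach a recovered standby when it is safe.
auto_failback = on
auto_failback_interval = 60
EOF
```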

Conclusion

PgPool is an alternative to Read-replica on Hasura Pro. However, the DBA needs to understand how it works. Moreover, PgPool itself also needs to scale for performance, and a single point of failure is a potential issue.

Pros

  • All in one Connection Pooling and Load balancer
  • Alternative to Read-replica

Cons

  • Needs more work to set up and configure.
  • PgPool instances also need scaling to avoid a single point of failure. Server cost isn't cheaper than Hasura Pro pricing at scale.
  • Auto-failover is tricky. Need advanced infrastructure setup with Docker/Kubernetes to ensure high availability.
