
Add configuration for Hikari connection pool for database evolutions #480

Merged
merged 1 commit into from
Aug 21, 2024

Conversation

rebecca-thompson
Contributor

@rebecca-thompson rebecca-thompson commented Aug 21, 2024

Our Typerighter PROD deployments have been failing. Upon investigation, the last instance in the auto-scaling group doesn't pass health checks, failing with the following exception:

org.postgresql.util.PSQLException: FATAL: remaining connection slots are reserved for non-replication superuser connections

The PROD RDS database is currently a db.t4g.micro which has max_connections of 81, with at least 5 of those connections reserved for internal Postgres processes.

The max pool size for database operations is set to 7, so you would think that even when doubling the auto-scaling group to 6 instances during deployment, we would be under the threshold. However, the app also uses applicationEvolutions to run database evolutions when the app starts. This uses a separate Hikari connection pool, whose size defaults to 10. The initial connections per instance can then reach 17, which is TOO MANY :)
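The arithmetic above can be checked with a quick back-of-the-envelope calculation, using only the numbers quoted in this description:

```python
# Capacity check using the figures from this PR description.
max_connections = 81   # db.t4g.micro default max_connections
reserved = 5           # reserved for internal Postgres processes
instances = 6          # auto-scaling group doubled during deployment
app_pool = 7           # pool size for regular database operations
evolutions_pool = 10   # Hikari default used by applicationEvolutions

per_instance = app_pool + evolutions_pool   # 17
peak = instances * per_instance             # 102
available = max_connections - reserved      # 76

print(f"peak={peak}, available={available}, exhausted={peak > available}")
```

Peak demand (102) comfortably exceeds the usable slots (76), which matches the `remaining connection slots are reserved` error seen on PROD.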

What does this change?

This sets the Hikari connection pool to something small and sensible, given the size of the database.
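For context, Play exposes HikariCP settings through `application.conf`. The actual diff isn't shown here, so the key and value below are illustrative of the kind of change this PR makes, not the literal configuration merged:

```hocon
# Hypothetical sketch — cap the pool used by the default datasource (and hence
# by database evolutions) to a small number; the value 2 is illustrative.
db.default.hikaricp.maximumPoolSize = 2
```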

How to test

When deployed to CODE, we should see the number of connections in the RDS dashboard spike less sharply during deployment. The total number of connections should also decrease. The Rule Manager app should still come up and work as expected (https://manager.typerighter.code.dev-gutools.co.uk/) - we haven't touched the connection pool size for regular database operations, so performance of the app should be unaffected.

How can we measure success?

PROD deploys no longer fail

@rebecca-thompson rebecca-thompson requested a review from a team as a code owner August 21, 2024 09:22
Contributor

@jonathonherbert jonathonherbert left a comment

This is a convincing explanation – do we know what changed that suddenly meant our new instances were exhausting the connection pool?

@rebecca-thompson
Contributor Author

This is a convincing explanation – do we know what changed that suddenly meant our new instances were exhausting the connection pool?

Honestly not sure. There were some failed PROD deployments in June that correspond to high db connections, and then it seemed to settle down until last week, when deploys consistently started failing. From the graphs it looks like sometimes the app holds onto the connections for long enough to fail the deploy, and other times it doesn't.

[Screenshot: RDS database connections graph, 2024-08-21 10:43]

@rebecca-thompson rebecca-thompson merged commit 6f475da into main Aug 21, 2024
4 checks passed
@rebecca-thompson rebecca-thompson deleted the bt/add-hikari-cp-config branch August 21, 2024 09:47
@prout-bot
Copy link

Seen on Rule Manager (merged by @rebecca-thompson 10 minutes and 34 seconds ago) Please check your changes!

@prout-bot

Overdue on Checker (merged by @rebecca-thompson 15 minutes and 3 seconds ago) What's gone wrong?
