Hello flyte-squadron, I'm attempting to deploy
flyte-core v1.10.6
via terragrunt on gcp and the apply is failing due to the helm release for flyte timing out (context deadline exceeded).
I've identified that the init run-migrations container for
flyteadmin
and
datacatalog
is failing to connect to the database:
`failed to initialize database, got error failed to connect to `host=
DB_IP user=
DB_USER database=flyte`: dial error (timeout: dial tcp
DB_IP5432 connect: connection timed out)`
Do you all have any suggestions for triaging this failing database connection?
I've copied the flyteadmin run-migrations container spec, and have modified it to run ubuntu to allow me to test with bash. With that manual spec applied (with all secrets mounted), I was able to reproduce connection timeout with
psql
but wasn't able to find any more clues towards network misconfigurations.
*Now thinking it has to be a vpc firewall misconfiguration. So less of a flyte question but any net test tips greatly appreciated