Andrew
04/04/2024, 10:17 PMGlime
04/04/2024, 10:17 PMterraform plan
to see the changes that will be applied and then terraform apply
to apply the changes. This step will update the infrastructure as needed based on the changes you made in the Terraform configuration. 4. Update Flyte Components: If the upgrade involves changes to the Flyte components themselves (for example, new Docker images for Flyte services), these changes should be reflected in the Helm chart values. When you apply the Terraform changes, it should automatically update the Flyte components if you have configured Terraform to manage your Helm releases. 5. Verify the Upgrade: After applying the changes, verify that the Flyte cluster is running the new version correctly. You can check the version of the deployed Flyte components and ensure that all services are operational. Please note, the steps above are a general guide. The specific steps for your upgrade might vary based on the Flyte version you are upgrading from and to, as well as your specific Terraform and Flyte configuration. Always refer to the release notes of the Flyte version for any version-specific upgrade instructions. Additionally, it's a good practice to perform the upgrade in a staging environment before applying it to your production environment to ensure that the upgrade process goes smoothly and does not disrupt your workflows.David Espejo (he/him)
04/08/2024, 3:49 PMAndrew
04/10/2024, 3:30 AMAndrew
04/10/2024, 3:11 PMError: context deadline exceeded
Its ever since I updated and had that flyte
flyteadmin
database issue, which I fixed in the code, but I'm not sure if I ended up in a weird state. Whenever I try to run terraform and get that timeout error, some of the pods are on CrashLoopBackOff
, but without any real errors. Any tips?David Espejo (he/him)
04/10/2024, 3:56 PMkubectl describe
on the failing pods, isn't there anything in the events section?Andrew
04/10/2024, 4:07 PMdatacatalog
got Init:Error
as status
flyteadmin
got Init:CrashLoopBackOff
flytescheduler
got CrashLoopBackOff
syncresources
got Error
Back-off restarting failed container run-migrations in pod datacatalog
That's what all of them say, so not really an error, but something with the run-migrations
stepAndrew
04/10/2024, 4:28 PMDavid Espejo (he/him)
04/10/2024, 4:29 PMkubectl logs <flyteadmin-pod-name> -c run-migrations -n flyte
Andrew
04/10/2024, 4:31 PM[error] failed to initialize database, got error failed to connect to host=<ip> user=flyte database=flyte: failed SASL auth (FATAL: password authentication failed for user "flyte" (SQLSTATE 28P01))
Ah, that's it. It's still trying to use flyte
instead of flyteadmin
for some reasonAndrew
04/10/2024, 4:32 PMadditional_databases = [
{
name = "flyteadmin"
charset = ""
collation = ""
}
]
additional_users = [
{
name = "flyteadmin"
password = ""
random_password = true
}
]
I did revert these after it failed when I pulled the new code. Not sure if its in a weird state or somethingDavid Espejo (he/him)
04/10/2024, 4:38 PMflyteadmin
CloudSQL instance with a flyteadmin
user but your values file point to a flyte
DB with a flyte
username.
Changing your values file to reflect that should make it work.
I wonder how it has worked before but let's fix this first.Andrew
04/10/2024, 4:43 PMAndrew
04/10/2024, 4:45 PMAndrew
04/10/2024, 4:53 PMhelm repo update
and then terraform apply
, does that seem right? It just listed update in-place
as the terraform executionDavid Espejo (he/him)
04/10/2024, 5:26 PMhelm ls -n flyte
?Andrew
04/10/2024, 5:28 PMNAME NAMESPACE REVISION UPDATED STATUS CHART APP VERSION
flyte-core flyte 46 2024-04-10 10:43:34.72876 -0600 MDT deployed flyte-core-v1.10.0
David Espejo (he/him)
04/10/2024, 5:29 PMAndrew
04/10/2024, 5:30 PMDavid Espejo (he/him)
04/10/2024, 5:31 PM