Error if a Site is created but old configuration is still present #2324
If a Site is deleted and recreated quickly (for example, by automation), the skupper-router ConfigMap owned by the previous site may still be present, because the owned resource has not yet been deleted. The controller now fails if it finds a router configuration that is not owned by the currently active site. Fixes skupperproject#2323.
@fgiorgetti I am happy that you were able to reproduce this issue. Unfortunately, I was unable to reproduce it in our lower environments; it is happening only in our production environment.
@vsomwanshi I was able to reproduce it by quickly deleting and creating a site in an automated way, through a script. Once a site is deleted and another site is created, the new site is processed before the old resources, owned by the deleted site, have been removed. That causes the error, which, as you pointed out in the issue, can be recovered from by restarting the skupper-controller pod. Can you share some details on the procedure you are following in production that leads to it? Is it possible that you have two sites created in the same namespace at the time you're deleting one? That could potentially be a similar trigger. Or, once you remove a site, is there any GitOps operator applying a new site definition?
@fgiorgetti Please find my comments inline. "Can you share some details on the procedure you guys are following in production to reproduce it? Is it possible that you guys have 2 sites created on the same namespace at the time you're deleting it? This could potentially be a similar trigger to that. Or eventually once you remove a site, is there any gitops operator applying a new site definition?" I am not sure, but somehow I am unable to reproduce this issue in our lower environments. Could it be happening in production because, as mentioned in the comment above, we have 55 Skupper sites created in one OpenShift cluster, and each site has 14 listeners and 5 connectors? Is it creating more events and due to which
@fgiorgetti, or anyone else who can answer this: the fix you are applying would be part of the latest release, right? may be skupper If I need to go with this release in future in our environments. [1] During upgrade phase from [2] I believe no downtime is required for this upgrade process, but I am asking for confirmation so that I can take it to management accordingly. [3] There is no need to touch the sites, and the Skupper links do not need to be recreated either. Thank you.
In case a Site is deleted and recreated quickly (e.g., through automation), the skupper-router ConfigMap owned by the previous site may still be present.
The controller now fails if it finds a router configuration that is not owned by the currently active site.
Fixes #2323.