Support localhost connection to k8s api server #1077

yanjunz97 · 2025-04-22T03:15:46Z

This PR lets the NSX Operator connect to the k8s API server through localhost when cpvm eth1 is down.

Testing done:
Override the Kubernetes service host with an unreachable address and observe that the NSX Operator
runs as expected by connecting to the k8s API server through localhost.

Signed-off-by: Yanjun Zhou <[email protected]>

codecov-commenter · 2025-04-22T05:57:05Z

Codecov Report

Attention: Patch coverage is 85.29412% with 5 lines in your changes missing coverage. Please review.

Project coverage is 75.79%. Comparing base (6822956) to head (f6c0470).

Files with missing lines	Patch %	Lines
pkg/util/kubernetes.go	87.50%	3 Missing and 1 partial ⚠️
cmd/main.go	0.00%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #1077      +/-   ##
==========================================
+ Coverage   75.77%   75.79%   +0.01%     
==========================================
  Files         145      146       +1     
  Lines       19708    19740      +32     
==========================================
+ Hits        14934    14962      +28     
- Misses       3863     3866       +3     
- Partials      911      912       +1

Flag	Coverage Δ
unit-tests	`75.79% <85.29%> (+0.01%)`	⬆️

Files with missing lines	Coverage Δ
pkg/util/cert.go	`55.15% <100.00%> (ø)`
cmd/main.go	`0.00% <0.00%> (ø)`
pkg/util/kubernetes.go	`87.50% <87.50%> (ø)`


heypnus · 2025-04-29T08:56:37Z

cmd/main.go

 func main() {
 	log.Info("Starting NSX Operator")
-	mgr, err := ctrl.NewManager(ctrl.GetConfigOrDie(), ctrl.Options{
+	mgr, err := ctrl.NewManager(pkgutil.GetConfig(), ctrl.Options{


So the API server address switch can only occur at startup, right? Then what happens if eth1 goes down while the NSX Operator is running?

yanjunz97 · 2025-05-07

Yes, it only occurs in the startup stage.

The current case is that when WCP is enabled between backup and restore, cpvm eth1 will be down after the NSX restore, and we rely on NSX Operator to recover it. In this case NSX Operator always restarts, because the NSX connection goes down during the restore. In other cases where eth1 may be down, should we always expect the NSX or WCP side to bring it back, so that it is acceptable for NSX Operator not to work during that time?

If there is a use case where NSX Operator should switch from the cluster IP to localhost at runtime, maybe we can leverage the liveness probe to force NSX Operator to restart. Actually, we need to refactor the liveness probe in a follow-up PR, because it currently checks through eth1, i.e. it sends a GET to an address like http://172.26.0.3:8384/healthz

yanjunz97 · 2025-05-07 (edited)

I've checked this in HA mode and found that the operator restarts automatically after eth1 goes down, because the lease renewal fails.
Updated: In non-HA mode, however, the operator does not restart, and API server calls fail with errors like {"error": "Put \"https://172.24.0.1:443/apis/crd.nsx.vmware.com/v1alpha1/namespaces/ns-1/subnetsets/pod-default/status\": http2: client connection lost"}

zhengxiexie · 2025-09-13T09:36:33Z

Can one of the admins verify this patch?

Support localhost connection to k8s api server

f6c0470

Signed-off-by: Yanjun Zhou <[email protected]>

yanjunz97 force-pushed the local-k8s-api branch from a8cdaff to f6c0470 Compare April 22, 2025 05:41
