🐛 Describe the bug
if we use podGroupSize, the httpserver will be only launched in the rank=0 server, rest pods are pure GPU workers. In that case, we should not fetch the endpoint. this is kind of missing in autoscaler now.
Steps to Reproduce
PodGroupSize != 1
Expected behavior
autoscaler should only fetch the pods with rank=0.
Environment
nightly