about DP scheduler #4829
Unanswered
guojinrong-nn
asked this question in
Q&A
Replies: 2 comments 1 reply
-
|
A good question. I guess it may be implemented in vllm and sglang just reuse it innerly. |
Beta Was this translation helpful? Give feedback.
0 replies
-
It's false. In the latest main, we have already removed the vllm dependency. |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
I have some questions about DeepSeek’s DP (data parallelism) setup. Does the scheduler need to ensure that both the prefill and decode phases of the same request are assigned to the same DP rank? This is because the KV cache is stored on the rank where the prefill computation was performed. Is my understanding correct? If so, where is the implementation about this, I didn't found it.
Beta Was this translation helpful? Give feedback.
All reactions