-
Notifications
You must be signed in to change notification settings - Fork 555
Pull requests: vllm-project/vllm-ascend
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[CI] Add multi-nodes EPLB configs of DeepSeek-R1-W8A8 & Qwen3-235B-W8A8
module:tests
#4144
opened Nov 12, 2025 by
845473182
Loading…
[WIP] Update newest
documentation
Improvements or additions to documentation
module:ops
module:tests
#4142
opened Nov 12, 2025 by
22dimensions
Loading…
[KV-Sharing] Support KV-Sharing feature in CLA models
#4138
opened Nov 12, 2025 by
MengqingCao
•
Draft
[bugfix]Prevent overwriting drafters lm-head and embed_tokens
#4134
opened Nov 12, 2025 by
HF-001
Loading…
[P/D] [Bugfix] mooncake connector device_ids out of index
module:tests
#4125
opened Nov 11, 2025 by
LCAIZJ
Loading…
[Test]Add ut test qwen3_moe and sfa
module:tests
#4121
opened Nov 11, 2025 by
ForBetterCodeNine
Loading…
[OPS] support triton causal_conv1d_fn ops
module:ops
module:tests
#4119
opened Nov 11, 2025 by
QilaiZhang
Loading…
mix_placement
module:core
module:ops
module:quantization
#4118
opened Nov 11, 2025 by
Mercykid-bash
Loading…
[Bugfix][SHM] Use writer lock by default and remove redundant env
#4117
opened Nov 11, 2025 by
slippersss
Loading…
[Fixbug] Fix Qwen2-Audio-7B-Instruct accuracy test
accuracy-test
enable all accuracy test for PR
module:tests
ready-for-test
start test by label for PR
#4108
opened Nov 10, 2025 by
zhangxinyuehfad
Loading…
[Bugfix] fix mtp profile run error where main model and mtp model use different quantization
module:ops
module:quantization
module:tests
#4102
opened Nov 10, 2025 by
realliujiaxu
Loading…
[Doc] add qwen3 reranker tutorial
documentation
Improvements or additions to documentation
#4101
opened Nov 10, 2025 by
Jeaniowang
Loading…
[cherry-pick][v0.11.0-dev][bugfix] Change seq_lens in dummy attn_metadata to max_query_len
#4099
opened Nov 10, 2025 by
Angazenn
Loading…
[feature] support pcp + mtp (in pd co-locate scenario)
module:tests
ready
read for review
ready-for-test
start test by label for PR
#4098
opened Nov 10, 2025 by
zhangsicheng5
Loading…
[main][bugfix] Change seq_lens in dummy attn_metadata to max_query_len
ready
read for review
ready-for-test
start test by label for PR
#4097
opened Nov 10, 2025 by
Angazenn
Loading…
feat:support suffix decoding in vllm-ascend using PIECEWISE
#4091
opened Nov 10, 2025 by
Cyclone-07
Loading…
Previous Next
ProTip!
Filter pull requests by the default branch with base:main.