Skip to content

Pull requests: GeeeekExplorer/nano-vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[ADD] Add TTFT, TPOT metrics in tqdm bars.
#133 opened Nov 16, 2025 by mumupika Loading…
Add Qwen3-VL multimodal support
#132 opened Nov 11, 2025 by 86MaxCao Loading…
feat: Add support for Qwen2.5vl model
#123 opened Nov 4, 2025 by SoulSniper1212 Loading…
Add FuseMoeLinear and support Qwen3-Moe
#116 opened Oct 16, 2025 by Tokisakix Loading…
Add Multi-Environment Support for Nano-vLLM
#113 opened Oct 10, 2025 by BaoZhuhan Loading…
fix uv sync bug
#104 opened Sep 17, 2025 by CzealChen Loading…
future: add qwen2 and llama support
#88 opened Jul 28, 2025 by leo-hancock Loading…
[ROCm] add amd gpu guide and performance
#84 opened Jul 25, 2025 by billishyahao Loading…
Update README.md to add AMD GPU instructions
#83 opened Jul 25, 2025 by zhangnju Loading…
add Qwen2 model support
#70 opened Jul 6, 2025 by Zlzzzupup Loading…
Optimize block management in decode phase
#68 opened Jul 4, 2025 by xiaohajiayou Loading…
Fix bug in block manager's may_append
#66 opened Jul 3, 2025 by yue-zhang-2025 Loading…
Fix: can_append function returns incorrect result
#65 opened Jul 2, 2025 by YjyJeff Loading…
Add Serving Benchmark Script
#29 opened Jun 21, 2025 by tiannuo-yang Loading…
ProTip! Updated in the last three days: updated:>2025-11-23.