Skip to content

Commit 46f9a93

Browse files
author
wangyongjun
committed
[Doc] add qwen3 reranker tutorials
Signed-off-by: Jeaniowang <[email protected]>
1 parent ca01a6d commit 46f9a93

File tree

2 files changed

+3
-2
lines changed

2 files changed

+3
-2
lines changed

docs/source/tutorials/index.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -6,8 +6,8 @@
66
single_npu
77
single_npu_qwen2.5_vl
88
single_npu_qwen2_audio
9-
single_npu_qwen3_embedding
10-
single_npu_qwen3_reranker
9+
qwen3_embedding
10+
qwen3_reranker
1111
single_npu_qwen3_quantization
1212
single_npu_qwen3_w4a4
1313
multi_npu_qwen3_next

docs/source/tutorials/qwen3_embedding.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -4,6 +4,7 @@
44
The Qwen3 Embedding model series is the latest proprietary model of the Qwen family, specifically designed for text embedding and ranking tasks. Building upon the dense foundational models of the Qwen3 series, it provides a comprehensive range of text embeddings and reranking models in various sizes (0.6B, 4B, and 8B). This guide describes how to run the model with vLLM Ascend. Note that only 0.9.2rc1 and higher versions of vLLM Ascend support the model.
55

66
## Deployment
7+
*only support single npu*
78

89
Using the Qwen3-Embedding-8B model as an example, first run the docker container with the following command:
910

0 commit comments

Comments
 (0)