[Doc] add qwen3 reranker tutorials

wangyongjun · wangyongjun · commit 46f9a936047a · 2025-11-16T20:20:19.000+08:00
Signed-off-by: Jeaniowang <1104133197@qq.com>
diff --git a/docs/source/tutorials/index.md b/docs/source/tutorials/index.md
@@ -6,8 +6,8 @@
 single_npu
 single_npu_qwen2.5_vl
 single_npu_qwen2_audio
-single_npu_qwen3_embedding
-single_npu_qwen3_reranker
+qwen3_embedding
+qwen3_reranker
 single_npu_qwen3_quantization
 single_npu_qwen3_w4a4
 multi_npu_qwen3_next
diff --git a/docs/source/tutorials/qwen3_embedding.md b/docs/source/tutorials/qwen3_embedding.md
@@ -4,6 +4,7 @@
 The Qwen3 Embedding model series is the latest proprietary model of the Qwen family, specifically designed for text embedding and ranking tasks. Building upon the dense foundational models of the Qwen3 series, it provides a comprehensive range of text embeddings and reranking models in various sizes (0.6B, 4B, and 8B). This guide describes how to run the model with vLLM Ascend. Note that only 0.9.2rc1 and higher versions of vLLM Ascend support the model.
 
 ## Deployment
+*only support single npu*
 
 Using the Qwen3-Embedding-8B model as an example, first run the docker container with the following command: