Fix const casting in non-writable numpy-based tensors #3086

sgonorov · 2025-11-28T12:13:17Z

Fix const casting in non-writable numpy-based tensors.

src/cpp/src/image_generation/image_processor.cpp

Copilot

Pull request overview

This PR fixes issues related to const casting in non-writable numpy-based tensors by creating writable copies of tensors before processing. The changes ensure that read-only numpy arrays (where writeable = False) can be safely used as inputs to various pipeline operations.

Key changes:

Added defensive tensor copying in C++ code to handle read-only numpy tensors
Added test coverage to verify read-only tensors work correctly across different pipeline types
Modified tensor handling to use copy_to() instead of direct assignment or pointer sharing

Reviewed changes

Copilot reviewed 8 out of 8 changed files in this pull request and generated 1 comment.

Show a summary per file

File	Description
tests/python_tests/test_vlm_pipeline.py	Added test case for VLM pipeline with read-only image tensors
tests/python_tests/test_llm_pipeline_static.py	Added test case for static LLM pipeline with read-only input tensors
tests/python_tests/test_llm_pipeline.py	Added test case for LLM pipeline with read-only input tensors
src/cpp/src/visual_language/phi4mm/classes.cpp	Creates writable copy of image tensor before processing
src/cpp/src/visual_language/inputs_embedder.cpp	Uses memcpy to create independent tensor copies instead of sharing pointers
src/cpp/src/speculative_decoding/speculative_decoding_stateful.cpp	Creates writable copies of input_ids and attention_mask tensors
src/cpp/src/llm/pipeline_static.cpp	Creates writable copies of input_ids and attention_mask tensors
src/cpp/src/image_generation/image_processor.cpp	Creates writable copy of image tensor before processing

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

tests/python_tests/test_llm_pipeline.py

…33120) ### Details: - The tensor can be created as read-only view when input pointer is constant. This kind of tensor can be used as model inputs as model input should be read-only. There is issue to use `data()` if input tensor is not const object the acquired pointer is not constant which throw exception to indicate that there is intention to modify of read-only tensor. - Update core, and plugins to convert tensor object to const when get data pointer to call correct overload - Update tests to use read-only tensor for inference on different devices. ### Blocked PR: - openvinotoolkit/openvino.genai#3086 ### Tickets: - CVS-176653 --------- Signed-off-by: Pawel Raasz <[email protected]> Signed-off-by: Raasz, Pawel <[email protected]>

Copilot

Pull request overview

Copilot reviewed 3 out of 3 changed files in this pull request and generated 3 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2025-12-16T11:08:43Z

tests/python_tests/test_vlm_pipeline.py



+@parametrize_one_model_sdpa
+def test_vlm_readonly_image_tensor(ov_pipe_model: VlmModelInfo, cat_image_32x32):


The test calls ov_pipe.generate() but does not verify the output or assert that generation succeeded. Consider adding an assertion to check that the result is not empty or meets expected criteria to ensure the read-only tensor is properly handled.

Copilot · 2025-12-16T11:08:44Z

tests/python_tests/test_llm_pipeline_static.py

+
+@pytest.mark.parametrize("llm_model", MODELS_LIST, indirect=True)
+@pytest.mark.parametrize("npu_config", PIPELINE_CONFIGS, indirect=True)
+def test_readonly_input_tensor(npu_model: LLMPipeline):


The test calls npu_model.generate() twice but does not capture or validate the outputs. Add assertions to verify that generation completes successfully and produces expected results when using read-only tensors.

Copilot · 2025-12-16T11:08:44Z

tests/python_tests/test_llm_pipeline.py



+@pytest.mark.parametrize("llm_model", ["katuni4ka/tiny-random-phi3"], indirect=True)
+def test_readonly_input_tensor(ov_pipe: ov_genai.LLMPipeline) -> None:


The test calls ov_pipe.generate() twice but does not validate the outputs. Add assertions to confirm that generation succeeds and produces non-empty results when using read-only tensors.

sgonorov requested review from AlexanderKalistratov, Wovchena, sbalandi and yatarkan November 28, 2025 12:13

sgonorov self-assigned this Nov 28, 2025

github-actions bot added category: image generation Image generation pipelines category: visual language Visual language pipeline category: LLM LLM pipeline (stateful, static) category: speculative decoding Speculative decoding category: GGUF GGUF file reader labels Nov 28, 2025

sgonorov marked this pull request as ready for review November 28, 2025 14:35

Wovchena requested changes Dec 3, 2025

View reviewed changes

src/cpp/src/image_generation/image_processor.cpp Outdated Show resolved Hide resolved

Wovchena requested a review from Copilot December 3, 2025 05:23

Copilot AI reviewed Dec 3, 2025

View reviewed changes

tests/python_tests/test_llm_pipeline.py Show resolved Hide resolved

sgonorov requested a review from Wovchena December 3, 2025 08:25

praasz mentioned this pull request Dec 4, 2025

Correct use Tensor::data() for model inputs when read-only tensors openvinotoolkit/openvino#33120

Merged

Fix const casting in non-writable numpy-based tensors

b2df610

sgonorov force-pushed the fix_const_casting branch from a2554a2 to b2df610 Compare December 15, 2025 13:04

Revert fix

0a51213

github-actions bot removed category: image generation Image generation pipelines category: speculative decoding Speculative decoding labels Dec 15, 2025

sgonorov added 2 commits December 15, 2025 19:22

Fromat fix

588c71c

Fixed lint

02751be

sgonorov requested a review from praasz December 16, 2025 10:50

Wovchena approved these changes Dec 16, 2025

View reviewed changes

Wovchena requested a review from Copilot December 16, 2025 11:02

Wovchena enabled auto-merge December 16, 2025 11:02

Copilot AI reviewed Dec 16, 2025

View reviewed changes

Wovchena added this pull request to the merge queue Dec 18, 2025

Merged via the queue into openvinotoolkit:master with commit 87f9d9e Dec 18, 2025
254 of 271 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix const casting in non-writable numpy-based tensors #3086

Fix const casting in non-writable numpy-based tensors #3086

sgonorov commented Nov 28, 2025

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Dec 16, 2025

Uh oh!

Copilot AI Dec 16, 2025

Uh oh!

Copilot AI Dec 16, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants



		@parametrize_one_model_sdpa
		def test_vlm_readonly_image_tensor(ov_pipe_model: VlmModelInfo, cat_image_32x32):



		@pytest.mark.parametrize("llm_model", ["katuni4ka/tiny-random-phi3"], indirect=True)
		def test_readonly_input_tensor(ov_pipe: ov_genai.LLMPipeline) -> None:

Fix const casting in non-writable numpy-based tensors #3086

Fix const casting in non-writable numpy-based tensors #3086

Conversation

sgonorov commented Nov 28, 2025

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Copilot AI Dec 16, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Dec 16, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Dec 16, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants