Conversation

@maykcaldas (Collaborator) commented Jul 18, 2025

This fixes the fake streaming mode we previously had with acompletion_iter: setting stream=True in both call and call_single now returns an AsyncGenerator.

It also allows string inputs in call for convenience.

TODO: Make sure other repos will not break because of the typing change. I added typing overloads to avoid that.
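
A minimal sketch of what those typing overloads could look like; only the names call, stream, and LLMResult come from this PR, and the class bodies here are illustrative assumptions, not the repo's actual code:

    from collections.abc import AsyncGenerator
    from dataclasses import dataclass
    from typing import Literal, overload


    @dataclass
    class LLMResult:  # placeholder for lmi's real result class
        text: str


    class LLMModel:  # illustrative stand-in, not the repo's actual class
        @overload
        async def call(
            self, messages: str, *, stream: Literal[False] = False
        ) -> list[LLMResult]: ...

        @overload
        async def call(
            self, messages: str, *, stream: Literal[True]
        ) -> AsyncGenerator[LLMResult, None]: ...

        async def call(
            self, messages: str, *, stream: bool = False
        ) -> list[LLMResult] | AsyncGenerator[LLMResult, None]:
            if stream:
                return self._stream(messages)
            return [LLMResult(text=f"echo: {messages}")]

        async def _stream(self, messages: str) -> AsyncGenerator[LLMResult, None]:
            # An async generator function: calling it returns an AsyncGenerator
            # without awaiting, so `await call(..., stream=True)` hands one back.
            for token in messages.split():
                yield LLMResult(text=token)

With this shape, a type checker infers list[LLMResult] for stream=False calls and AsyncGenerator[LLMResult, None] for stream=True calls, so downstream repos that never pass stream keep their existing types.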

@maykcaldas self-assigned this Jul 18, 2025
Copilot AI review requested due to automatic review settings Jul 18, 2025 07:20
Copilot AI (Contributor) left a comment

Pull Request Overview

This PR implements streaming functionality for the LMI (Language Model Interface) by enabling the stream parameter in both call and call_single methods. The change replaces the previous "fake streaming" mode with proper streaming that returns an AsyncGenerator when stream=True.

Key changes include:

  • Updated method signatures to support streaming with type overloads for both streaming and non-streaming modes
  • Added string input support for convenience in the call method
  • Modified the streaming implementation to yield results incrementally rather than accumulating them (sketched below)
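
The last bullet is the core behavioral change. A hedged sketch of the before/after shape, using a generic chunk source rather than lmi's actual provider call:

    from collections.abc import AsyncGenerator, AsyncIterable


    async def stream_results(chunks: AsyncIterable[str]) -> AsyncGenerator[str, None]:
        # Real streaming: surface each chunk to the caller as it arrives.
        async for chunk in chunks:
            yield chunk


    async def fake_stream_results(chunks: AsyncIterable[str]) -> list[str]:
        # The old "fake streaming": drain the stream, then return everything
        # at once, so callers see no incremental output.
        return [chunk async for chunk in chunks]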

Reviewed Changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 4 comments.

  • packages/lmi/src/lmi/llms.py: Implements streaming functionality with method overloads, updates type annotations from AsyncIterable to AsyncGenerator, and adds string input support
  • packages/lmi/tests/test_llms.py: Adds type assertions to ensure backwards compatibility and proper return types for streaming vs non-streaming calls (see the test sketch below)
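
A pytest-style sketch of the kind of return-type assertions the test row describes, reusing the illustrative LLMModel from the overload sketch above (pytest-asyncio assumed; this is not the repo's actual test):

    from collections.abc import AsyncGenerator

    import pytest


    @pytest.mark.asyncio
    async def test_call_return_types() -> None:
        model = LLMModel()

        results = await model.call("hello world")
        assert isinstance(results, list)  # non-streaming: list of results

        stream = await model.call("hello world", stream=True)
        assert isinstance(stream, AsyncGenerator)  # streaming: async generator
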
Comments suppressed due to low confidence (1)

packages/lmi/src/lmi/llms.py:699

  • This line is a duplicate of the assertion removed on line 699 in the diff. One of the duplicate assertions was properly removed, but this appears to be an inconsistency in the diff presentation.
        state["__dict__"].pop("_router", None)

    Raises:
        ValueError: If the LLM type is unknown.
    """
    if isinstance(messages, str):
Collaborator commented:

If we want to do this here, can we remove it from call_single?
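
One way to honor this suggestion, sketched under the assumption of a simple Message stand-in (not lmi's real message class): coerce the string once, in call, and have call_single delegate rather than repeating the check.

    from dataclasses import dataclass


    @dataclass
    class Message:  # stand-in for lmi's real message type
        role: str
        content: str


    def normalize_messages(messages: str | list[Message]) -> list[Message]:
        # Single point of coercion: call() runs this, and call_single()
        # delegates to call(), so it never sees a bare string.
        if isinstance(messages, str):
            return [Message(role="user", content=messages)]
        return messages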

Comment on lines 283 to 284
A list of LLMResult objects containing the result of the call when stream=False,
or an AsyncGenerator[LLMResult] when stream=True.
Collaborator commented:

Two comments:

  • Indent the later lines by 4
  • Consider not restating the type hints/variable names, as that can lead to drift over time

For example:

When not streaming, it's a list of result objects for each call, otherwise
    it's an async generator of result objects.

Comment on lines +397 to +400
if tools:
    raise NotImplementedError("Using tools with streaming is not supported")
if callbacks:
    raise NotImplementedError("Using callbacks with streaming is not supported")
Collaborator commented:

Can you reword to say "not yet supported"

Also, can you add a comment to each of these saying why they're not supported?
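
One way to apply both requests; the helper name and the "why" comments below are placeholder assumptions, since the real reasons belong to the PR author:

    def check_streaming_support(tools: list | None, callbacks: list | None) -> None:
        # Hypothetical helper wrapping the two guards from the diff.
        if tools:
            # Assumed reason: tool-call deltas are not yet reassembled across chunks.
            raise NotImplementedError("Using tools with streaming is not yet supported")
        if callbacks:
            # Assumed reason: callbacks currently expect a complete result, not partial chunks.
            raise NotImplementedError("Using callbacks with streaming is not yet supported")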

) -> LLMResult: ...

@overload
async def call_single( # type: ignore[overload-cannot-match]
@jamesbraza (Collaborator) commented Jul 18, 2025

This type ignore: why do you have it? We should have no type ignores here, IMO; otherwise it means the typing is wrong.
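
For context, overload-cannot-match is typically reported when an earlier overload's parameter types are as broad as or broader than a later one's, so the later overload can never be selected. A hedged sketch of the usual fix, narrowing each overload with Literal so neither shadows the other (LLMResult and the implementation body are placeholders, not the repo's code):

    from collections.abc import AsyncGenerator
    from typing import Literal, overload


    class LLMResult:  # placeholder for lmi's real result class
        ...


    @overload
    async def call_single(
        messages: str, *, stream: Literal[False] = False
    ) -> LLMResult: ...


    @overload
    async def call_single(
        messages: str, *, stream: Literal[True]
    ) -> AsyncGenerator[LLMResult, None]: ...


    async def call_single(
        messages: str, *, stream: bool = False
    ) -> LLMResult | AsyncGenerator[LLMResult, None]:
        raise NotImplementedError  # implementation elided in this sketch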
