fix: Use eval_duration for output TPS calculations in Ollama LLM provider #4568

jonathanortega2023 · 2025-10-21T06:48:02Z

Pull Request Type

🐛 fix

Relevant Issues

resolves #4567

What is in this change?

Updated LLMPerformanceMonitor to support provider duration timing for more accurate output TPS calculations
measureAsyncFunction accepts duration from output.usage.duration
Modified measureStream to accept duration from reportedUsage.duration if provided by the LLM provider
Updated Ollama provider to use eval_duration for more accurate TPS calculation

This is a generic enhancement that any provider can leverage to supply more accurate timing information, while maintaining backward compatibility with providers that don't supply duration metrics.

Developer Validations

I ran yarn lint from the root of the repo & committed changes
Relevant documentation has been updated
I have tested my code functionality
Docker build succeeds locally

…c field

shatfield4

LGTM, refactored to work with streaming and utilizes LLMPerformanceMonitor.

timothycarambat

This PR does not address duration timing in measureAsyncFunction and if we are going to fix it, we should do so in both places.

We should also either have duration be an override-able property in the PerformanceMontior or just change it specifically in Ollama's AI provider so its impact is limited.

If we make changes to the entire utility then we cannot be sure what the side-effects are.

Since we are going to override eval_duration it should be a custom property name so that it does not get confused with the actual value from ollama if we ever want to try to use it in another place.

server/utils/helpers/chat/LLMPerformanceMonitor.js

server/utils/AiProviders/ollama/index.js

…al param

fix: Use eval_duration for output TPS calculations and add as a metri…

ca905f9

…c field

timothycarambat requested a review from shatfield4 October 22, 2025 17:22

timothycarambat assigned shatfield4 Oct 22, 2025

timothycarambat added the blocked label Oct 22, 2025

shatfield4 added 3 commits October 22, 2025 15:33

refactor usage of eval_duration from ollama metrics

5a62844

Merge branch 'master' into pr-4568-branch

a233373

move eval_duration to usage

809e5ac

shatfield4 approved these changes Oct 22, 2025

View reviewed changes

shatfield4 requested a review from timothycarambat October 22, 2025 23:09

shatfield4 assigned timothycarambat and unassigned shatfield4 Oct 22, 2025

shatfield4 added PR:needs review Needs review by core team and removed blocked labels Oct 22, 2025

shatfield4 changed the title ~~fix: Use eval_duration for output TPS calculations and add as a metri…~~ fix: Use eval_duration for output TPS calculations in Ollama LLM provider Oct 22, 2025

timothycarambat requested changes Oct 24, 2025

View reviewed changes

server/utils/helpers/chat/LLMPerformanceMonitor.js Outdated Show resolved Hide resolved

server/utils/AiProviders/ollama/index.js Show resolved Hide resolved

timothycarambat assigned shatfield4 and unassigned timothycarambat Oct 24, 2025

shatfield4 added 2 commits October 24, 2025 18:44

overwrite duration in ollama provider wip measureAsyncFunction option…

7831b16

…al param

allow for overloaded duration in measureAsyncFunction

b33c84d

shatfield4 requested a review from timothycarambat October 28, 2025 00:03

shatfield4 assigned timothycarambat and unassigned shatfield4 Oct 28, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

fix: Use eval_duration for output TPS calculations in Ollama LLM provider #4568

fix: Use eval_duration for output TPS calculations in Ollama LLM provider #4568

Uh oh!

jonathanortega2023 commented Oct 21, 2025 •

edited by shatfield4

Loading

Uh oh!

shatfield4 left a comment

Uh oh!

timothycarambat left a comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

fix: Use eval_duration for output TPS calculations in Ollama LLM provider #4568

Are you sure you want to change the base?

fix: Use eval_duration for output TPS calculations in Ollama LLM provider #4568

Uh oh!

Conversation

jonathanortega2023 commented Oct 21, 2025 • edited by shatfield4 Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Pull Request Type

Relevant Issues

What is in this change?

What is in this change?

Developer Validations

Uh oh!

shatfield4 left a comment

Choose a reason for hiding this comment

Uh oh!

timothycarambat left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

jonathanortega2023 commented Oct 21, 2025 •

edited by shatfield4

Loading