Commit c7b1bd2

Merge pull request #130 from Kuadrant/token-rate-limit-sequence-diagram-patch
0013-ai-policies: Update token rate limit sequence diagram
2 parents abdd1e0 + 1e45fcb commit c7b1bd2

1 file changed: +2 -2 lines changed

rfcs/0013-ai-policies.md

Lines changed: 2 additions & 2 deletions
@@ -253,7 +253,7 @@ sequenceDiagram

 %% pre-model-server token rate limiting check
 GW->>GW: Parse model from request body
-GW->>L: ShouldRateLimit (hits_addend: 0)
+GW->>L: CheckRateLimit (read only op)
 alt Limit not reached
 L-->>GW: Rate limit OK
 else Limit reached
@@ -270,7 +270,7 @@ sequenceDiagram
 GW->>GW: Parse usage metrics from response body

 %% update token usage count via Limitador
-GW->>L: ShouldRateLimit (hits_addend: func(usage_metrics))
+GW->>L: ReportRateLimit (hits_addend: func(usage_metrics))
 L-->>GW: Acknowledge token count update

 %% final inference response: deliver back to client
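
The two changed diagram steps split one call into a read-only check before the model server and a counter update after it. The following is a minimal sketch of that split, not Limitador's actual API: `TokenLimiter`, `check_rate_limit`, `report_rate_limit`, `handle_request`, and the `total_tokens` usage field are all hypothetical names chosen for illustration, and the limiter is a plain in-memory counter.

```python
from dataclasses import dataclass, field


@dataclass
class TokenLimiter:
    """Hypothetical in-memory stand-in for the limiter (L in the diagram)."""

    limit: int
    used: dict = field(default_factory=dict)

    def check_rate_limit(self, key: str) -> bool:
        """Read-only check: True while the key is under its limit."""
        return self.used.get(key, 0) < self.limit

    def report_rate_limit(self, key: str, hits_addend: int) -> int:
        """Add hits_addend to the key's counter and return the new total."""
        self.used[key] = self.used.get(key, 0) + hits_addend
        return self.used[key]


def handle_request(limiter: TokenLimiter, key: str, call_model):
    """Gateway flow (GW in the diagram): check, call the model, report usage."""
    if not limiter.check_rate_limit(key):
        return None  # limit reached: the model server is never called
    response = call_model()
    # Assumption: usage metrics expose a total token count under this key.
    limiter.report_rate_limit(key, response["usage"]["total_tokens"])
    return response
```

Because the check does not modify the counter, a request that starts under the limit can still push usage past it once its tokens are reported; in this sketch, only the next request is rejected.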
