We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
2 parents abdd1e0 + 1e45fcb commit c7b1bd2Copy full SHA for c7b1bd2
rfcs/0013-ai-policies.md
@@ -253,7 +253,7 @@ sequenceDiagram
253
254
%% pre-model-server token rate limiting check
255
GW->>GW: Parse model from request body
256
- GW->>L: ShouldRateLimit (hits_addend: 0)
+ GW->>L: CheckRateLimit (read only op)
257
alt Limit not reached
258
L-->>GW: Rate limit OK
259
else Limit reached
@@ -270,7 +270,7 @@ sequenceDiagram
270
GW->>GW: Parse usage metrics from response body
271
272
%% update token usage count via Limitador
273
- GW->>L: ShouldRateLimit (hits_addend: func(usage_metrics))
+ GW->>L: ReportRateLimit (hits_addend: func(usage_metrics))
274
L-->>GW: Acknowledge token count update
275
276
%% final inference response: deliver back to client
0 commit comments