
feat: add service_tier parameter to Responses API LLM #5342

Closed
piyush-gambhir wants to merge 1 commit into livekit:main from piyush-gambhir:feat/responses-service-tier-param

Conversation

@piyush-gambhir
Contributor

Summary

The Chat Completions LLM (openai.LLM) already supports the service_tier parameter for configuring priority/flex/default processing per request. The Responses API LLM (openai.responses.LLM) is missing this parameter, even though the underlying OpenAI Responses API supports it.

This PR adds service_tier to the Responses LLM for parity.

Changes

livekit-plugins/livekit-plugins-openai/.../responses/llm.py (1 file, 6 lines):

  • Add service_tier: NotGivenOr[str] to _LLMOptions
  • Add service_tier parameter to LLM.__init__()
  • Pass service_tier through in chat() via extra kwargs (see the sketch below)
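
A minimal sketch of the shape of the change (illustrative only: the sentinel and signatures here are simplified stand-ins; the real plugin uses livekit.agents' NotGivenOr/NOT_GIVEN helpers):

from dataclasses import dataclass
from typing import Any

_NOT_GIVEN = object()  # stand-in for livekit.agents' NOT_GIVEN sentinel

@dataclass
class _LLMOptions:
    model: str
    service_tier: Any = _NOT_GIVEN  # e.g. "priority", "flex", "default"

class LLM:
    def __init__(self, *, model: str, service_tier: Any = _NOT_GIVEN) -> None:
        self._opts = _LLMOptions(model=model, service_tier=service_tier)

    def chat(self, **kwargs: Any) -> None:
        extra: dict[str, Any] = {}
        if self._opts.service_tier is not _NOT_GIVEN:
            # forwarded verbatim into the Responses API request body
            extra["service_tier"] = self._opts.service_tier
        # ... extra is merged into the client's responses.create(...) call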

Usage

from livekit.plugins.openai import responses

llm = responses.LLM(
    model="gpt-5.4",
    service_tier="priority",  # now supported
)
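
Per OpenAI's API reference, documented service_tier values include "auto", "default", "flex", and "priority"; the parameter is typed as a plain str here and forwarded to the API unchanged.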

Backward Compatible

  • Defaults to NOT_GIVEN, so existing code is unaffected (see the sketch below)
  • Matches the existing pattern used by the Chat Completions LLM
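
For illustration, a call that omits the parameter behaves exactly as before (a sketch assuming only the NOT_GIVEN default described above):

from livekit.plugins.openai import responses

llm = responses.LLM(model="gpt-5.4")  # service_tier omitted
# _opts.service_tier stays NOT_GIVEN, so no service_tier key is added to
# the Responses API request body; existing behavior is unchanged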

@piyush-gambhir force-pushed the feat/responses-service-tier-param branch 2 times, most recently from 8f6a34a to b61e517 on April 4, 2026 at 23:11
The Chat Completions LLM (openai.LLM) already supports the service_tier
parameter for configuring priority/flex/default processing. This adds
the same parameter to the Responses API LLM (openai.responses.LLM) for
parity.

OpenAI's Responses API accepts service_tier in the request body:
https://platform.openai.com/docs/api-reference/responses/create

Changes (responses/llm.py only):
- Add service_tier to _LLMOptions dataclass
- Add service_tier parameter to LLM.__init__()
- Pass service_tier through in chat() via extra kwargs
@piyush-gambhir force-pushed the feat/responses-service-tier-param branch from b61e517 to ce469bc on April 4, 2026 at 23:40
@piyush-gambhir
Contributor Author

The CI failure is from test_blockguard.py::TestStress::test_many_short_blocks — a pre-existing flaky test in livekit-blockguard that's unrelated to this PR.

The test runs 20 × time.sleep(0.02) with a 500ms threshold and expects no blocking detection. On the CI runner (macOS), cumulative scheduling jitter causes the event loop heartbeat to exceed 500ms between watchdog polls, triggering a false positive. This happens because the test's total blocking time (20 × 20ms = 400ms) is close to the threshold, and any GC pause or scheduling delay pushes it over.

This PR only modifies livekit-plugins-openai/responses/llm.py — no relation to blockguard.
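
For context, a hypothetical reconstruction of the failing pattern (not the actual livekit-blockguard test; the constants come from the description above):

import time

BLOCK_THRESHOLD_S = 0.5  # watchdog flags the loop after 500 ms without a heartbeat

def test_many_short_blocks() -> None:
    # 20 * 20 ms = 400 ms of cumulative sleeping, leaving only ~100 ms of
    # headroom under the 500 ms threshold; GC pauses or scheduler jitter on
    # the macOS CI runner can consume that margin and trip a false positive
    for _ in range(20):
        time.sleep(0.02)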

@piyush-gambhir
Contributor Author

Closing to re-trigger CI (flaky blockguard test failure unrelated to this change).
