feat: Add n support for TRT-LLM (Stacked PR on top of #8744) by indrajit96 · Pull Request #8746 · ai-dynamo/dynamo

indrajit96 · 2026-04-27T04:32:02Z

Overview:

Add TRT-LLM backend support for the OpenAI-compatible n field on top of PR #8744.
This PR is intentionally scoped to TRT-LLM only. The shared OpenAI/Rust response contract and vLLM plumbing are handled in the parent PR.

WHY:

PR #8744 adds the shared Dynamo contract for carrying multiple OpenAI choices by preserving each backend output index.

TensorRT-LLM supports n in SamplingParams as the number of sequences to generate, and trtllm-serve exposes an OpenAI-compatible /v1/chat/completions endpoint. Dynamo needed the TRT-LLM-specific plumbing to pass n through and keep streamed choices separated.

References:

TRT-LLM OpenAI-compatible server: https://nvidia.github.io/TensorRT-LLM/commands/trtllm-serve.html
TRT-LLM SamplingParams.n: https://nvidia.github.io/TensorRT-LLM/llm-api/reference.html

Details:

components/src/dynamo/trtllm/llm_engine.py
- Preserves TRT-LLM output choice indexes.
- Tracks cumulative token offsets per choice so each Dynamo chunk emits only the new token delta for that choice.
- Reports completion usage using all returned choices.
components/src/dynamo/trtllm/request_handlers/handler_base.py
- Passes n through to TRT-LLM sampling params.
- Preserves output index in streamed chunks.
- Tracks token and logprob offsets per choice index for interleaved n > 1 output streams.
- Keeps TRT-LLM’s internal best_of field aligned with n when needed, because TRT-LLM validates best_of >= n.
components/src/dynamo/trtllm/tests/test_trtllm_handler_base.py
- Adds unit coverage that n is applied to sampling params.
- Verifies the internal best_of compatibility adjustment for TRT-LLM validation.
docs/backends/trtllm/trtllm-reference-guide.md
- Documents the TRT-LLM n > 1 behavior relevant to this backend path.
tests/serve/test_trtllm.py
- Adds pre-merge serve coverage validating that a chat request with n=2 returns two choices.

Where should the reviewer start?

Start with:

components/src/dynamo/trtllm/request_handlers/handler_base.py

That file contains the main TRT-LLM streaming path and the per-choice cursor logic for n > 1.

Then review:

components/src/dynamo/trtllm/llm_engine.py
components/src/dynamo/trtllm/tests/test_trtllm_handler_base.py
tests/serve/test_trtllm.py
docs/backends/trtllm/trtllm-reference-guide.md

Related Issues: (use one of the action keywords Closes / Fixes / Resolves / Relates to)

closes GitHub issue: #xxx

Summary by CodeRabbit

Release Notes

New Features
- Added full support for requesting multiple response choices using the n parameter with TensorRT-LLM backend
- Implemented independent token tracking per output choice for accurate streaming and token accounting
Bug Fixes
- Fixed token delta computation to correctly process all output choices instead of only the first
- Corrected completion token counting to aggregate properly across all generated choices
Documentation
- Added configuration guide for enabling multiple response choices with TensorRT-LLM, including required environment variable setup

Signed-off-by: Indrajit Bhosale <iamindrajitb@gmail.com>

github-actions · 2026-04-27T04:34:17Z

🌿 Fern Docs Preview: https://nvidia-preview-e1741288-895f-498f-8998-4ab1ab83c957.docs.buildwithfern.com/dynamo/dev

Signed-off-by: Indrajit Bhosale <iamindrajitb@gmail.com>

…mo into ibhosale/n-options-output

Signed-off-by: Indrajit Bhosale <iamindrajitb@gmail.com>

…tllm Signed-off-by: Indrajit Bhosale <iamindrajitb@gmail.com> # Conflicts: # components/src/dynamo/trtllm/tests/test_trtllm_handler_base.py

Signed-off-by: Indrajit Bhosale <iamindrajitb@gmail.com>

indrajit96 added 3 commits April 26, 2026 21:01

Add n support for OpenAI contract and vLLM

4aaa241

Add n support for TRT-LLM

cb20aeb

Fix Rust compilation issue

ea99fc1

Signed-off-by: Indrajit Bhosale <iamindrajitb@gmail.com>

indrajit96 requested review from keivenchang, nv-yna and tanmayv25 April 27, 2026 04:32

indrajit96 requested review from a team as code owners April 27, 2026 04:32

pull-request-size Bot added the size/L label Apr 27, 2026

github-actions Bot added documentation Improvements or additions to documentation backend::trtllm Relates to the trtllm backend labels Apr 27, 2026

indrajit96 changed the title ~~Add n support for TRT-LLM~~ feat: Add n support for TRT-LLM Apr 27, 2026

Merge branch 'ibhosale/n-options-output' into ibhosale/n-options-trtllm

cd9ce4e

github-actions Bot added the feat label Apr 27, 2026

copy-pr-bot Bot temporarily deployed to GITLAB April 27, 2026 04:32 Inactive

indrajit96 added 4 commits April 26, 2026 22:18

Merge branch 'main' into ibhosale/n-options-output

25dc472

Fix MyPy issues

9c393f4

Signed-off-by: Indrajit Bhosale <iamindrajitb@gmail.com>

Merge branch 'ibhosale/n-options-output' of github.com:ai-dynamo/dyna…

f60beac

…mo into ibhosale/n-options-output

Merge branch 'ibhosale/n-options-output' into ibhosale/n-options-trtllm

7e36239

copy-pr-bot Bot temporarily deployed to GITLAB April 27, 2026 07:13 Inactive

copy-pr-bot Bot temporarily deployed to GITLAB April 27, 2026 18:23 Inactive

indrajit96 added 2 commits April 27, 2026 11:40

Merge branch 'main' into ibhosale/n-options-output

5292ca7

Merge branch 'ibhosale/n-options-output' into ibhosale/n-options-trtllm

cc3d394

copy-pr-bot Bot temporarily deployed to GITLAB April 27, 2026 18:42 Inactive

indrajit96 changed the title ~~feat: Add n support for TRT-LLM~~ feat: Add n support for TRT-LLM (Stacked PR on top of https://github.com/ai-dynamo/dynamo/pull/8744 Apr 27, 2026

indrajit96 changed the title ~~feat: Add n support for TRT-LLM (Stacked PR on top of https://github.com/ai-dynamo/dynamo/pull/8744~~ feat: Add n support for TRT-LLM #8744 Apr 27, 2026

indrajit96 changed the title ~~feat: Add n support for TRT-LLM #8744~~ feat: Add n support for TRT-LLM (Stacked PR on top of https://github.com/ai-dynamo/dynamo/pull/8744) Apr 27, 2026

indrajit96 changed the title ~~feat: Add n support for TRT-LLM (Stacked PR on top of https://github.com/ai-dynamo/dynamo/pull/8744)~~ feat: Add n support for TRT-LLM (Stacked PR on top of #8744) Apr 27, 2026

Fix mypy typing issues

2a8277c

Signed-off-by: Indrajit Bhosale <iamindrajitb@gmail.com>

copy-pr-bot Bot temporarily deployed to GITLAB April 29, 2026 02:23 Inactive

nv-yna reviewed Apr 29, 2026

View reviewed changes

Comment thread components/src/dynamo/trtllm/llm_engine.py

Fix review comments

3f5f128

Signed-off-by: Indrajit Bhosale <iamindrajitb@gmail.com>

copy-pr-bot Bot temporarily deployed to GITLAB April 29, 2026 17:15 Inactive

Merge remote-tracking branch 'origin/main' into ibhosale/n-options-tr…

6b2e4bf

…tllm Signed-off-by: Indrajit Bhosale <iamindrajitb@gmail.com> # Conflicts: # components/src/dynamo/trtllm/tests/test_trtllm_handler_base.py

copy-pr-bot Bot temporarily deployed to GITLAB April 29, 2026 17:20 Inactive

nv-yna approved these changes Apr 29, 2026

View reviewed changes

indrajit96 enabled auto-merge (squash) April 29, 2026 17:26

copy-pr-bot Bot temporarily deployed to GITLAB April 29, 2026 17:37 Inactive

Merge branch 'main' into ibhosale/n-options-trtllm

93e2b0c

copy-pr-bot Bot temporarily deployed to GITLAB April 29, 2026 18:53 Inactive

copy-pr-bot Bot temporarily deployed to GITLAB April 29, 2026 19:21 Inactive

Merge branch 'main' into ibhosale/n-options-trtllm

6ae4489

copy-pr-bot Bot temporarily deployed to GITLAB April 29, 2026 19:32 Inactive

copy-pr-bot Bot temporarily deployed to GITLAB April 29, 2026 19:39 Inactive

Merge branch 'main' into ibhosale/n-options-trtllm

3dcc23a

copy-pr-bot Bot temporarily deployed to GITLAB April 29, 2026 20:25 Inactive

Merge branch 'main' into ibhosale/n-options-trtllm

f12e182

copy-pr-bot Bot temporarily deployed to GITLAB April 29, 2026 21:36 Inactive

copy-pr-bot Bot temporarily deployed to GITLAB April 29, 2026 22:24 Inactive

Merge branch 'main' into ibhosale/n-options-trtllm

3fbc4e8

copy-pr-bot Bot temporarily deployed to GITLAB April 29, 2026 23:43 Inactive

copy-pr-bot Bot temporarily deployed to GITLAB April 30, 2026 00:04 Inactive

Merge branch 'main' into ibhosale/n-options-trtllm

3cbaba1

copy-pr-bot Bot temporarily deployed to GITLAB April 30, 2026 00:18 Inactive

copy-pr-bot Bot temporarily deployed to GITLAB April 30, 2026 00:41 Inactive

indrajit96 merged commit e314a9f into main Apr 30, 2026
154 of 156 checks passed

indrajit96 deleted the ibhosale/n-options-trtllm branch April 30, 2026 05:33

keivenchang mentioned this pull request Apr 30, 2026

test(revalidate): feat: Add n support for TRT-LLM (Stacked PR on top of #8744) #8746 #8897

Closed

furionw pushed a commit that referenced this pull request May 2, 2026

feat: Add n support for TRT-LLM (Stacked PR on top of #8744) (#8746)

644b066

Signed-off-by: Indrajit Bhosale <iamindrajitb@gmail.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Add n support for TRT-LLM (Stacked PR on top of #8744)#8746

feat: Add n support for TRT-LLM (Stacked PR on top of #8744)#8746
indrajit96 merged 24 commits into
mainfrom
ibhosale/n-options-trtllm

indrajit96 commented Apr 27, 2026 •

edited by coderabbitai Bot

Loading

Uh oh!

github-actions Bot commented Apr 27, 2026 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

indrajit96 commented Apr 27, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Overview:

WHY:

Details:

Where should the reviewer start?

Related Issues: (use one of the action keywords Closes / Fixes / Resolves / Relates to)

Summary by CodeRabbit

Release Notes

Uh oh!

github-actions Bot commented Apr 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

indrajit96 commented Apr 27, 2026 •

edited by coderabbitai Bot

Loading

github-actions Bot commented Apr 27, 2026 •

edited

Loading