Add structured-output migration repro test#1
Conversation
Signed-off-by: Julien Mancuso <jmancuso@nvidia.com>
Signed-off-by: Julien Mancuso <jmancuso@nvidia.com>
…amo#6655) Signed-off-by: PeaBrane <yanrpei@gmail.com>
Signed-off-by: alec-flowers <aflowers@nvidia.com> Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
…otes (ai-dynamo#6648) Signed-off-by: Dan Gil <dagil@nvidia.com>
Signed-off-by: Dan Gil <dagil@nvidia.com>
…i-dynamo#6662) Signed-off-by: Dan Gil <dagil@nvidia.com> Signed-off-by: dagil-nvidia <dagil@nvidia.com> Signed-off-by: Harrison Saturley-Hall <hsaturleyhal@nvidia.com> Co-authored-by: Harrison Saturley-Hall <hsaturleyhal@nvidia.com>
…namo#6663) Signed-off-by: PeaBrane <yanrpei@gmail.com>
Signed-off-by: ashnamehrotra <ashnamehrotra@gmail.com> Signed-off-by: Hannah Zhang <hannahz@nvidia.com>
…i-dynamo#6679) Signed-off-by: hongkuanz <hongkuanz@nvidia.com>
…es (ai-dynamo#6682) Signed-off-by: Anant Sharma <anants@nvidia.com>
…metadata to backends (ai-dynamo#6692) (ai-dynamo#6718) Signed-off-by: Julien Mancuso <jmancuso@nvidia.com>
Signed-off-by: Hannah Zhang <hannahz@nvidia.com>
Signed-off-by: Julien Mancuso <jmancuso@nvidia.com>
…6650) (ai-dynamo#6706) Signed-off-by: Anant Sharma <anants@nvidia.com>
Signed-off-by: hongkuanz <hongkuanz@nvidia.com>
Signed-off-by: Dan Gil <dagil@nvidia.com> Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
Signed-off-by: PeaBrane <yanrpei@gmail.com>
Signed-off-by: Harrison Saturley-Hall <hsaturleyhal@nvidia.com>
Signed-off-by: Harrison Saturley-Hall <hsaturleyhal@nvidia.com>
…amo#6753) Signed-off-by: Indrajit Bhosale <iamindrajitb@gmail.com>
…age type is pvc (ai-dynamo#6752) (ai-dynamo#6755) Signed-off-by: Julien Mancuso <jmancuso@nvidia.com>
…iation (ai-dynamo#6651) (ai-dynamo#6776) Signed-off-by: Guan Luo <41310872+GuanLuo@users.noreply.github.com>
Signed-off-by: Qi Wang <qiwa@nvidia.com>
…rker (ai-dynamo#6765) Signed-off-by: Krishnan Prashanth <kprashanth@nvidia.com>
…s (http://nvbugs/5936491/1) (ai-dynamo#6772) Signed-off-by: Matej Kosec <mkosec@nvidia.com>
…amo#7283 (ai-dynamo#7284) Signed-off-by: Dan Gil <dagil@nvidia.com>
…mo#7306) Signed-off-by: Harrison King Saturley-Hall <hsaturleyhal@nvidia.com> Signed-off-by: Anant Sharma <anants@nvidia.com> Co-authored-by: Anant Sharma <anants@nvidia.com>
Signed-off-by: Hannah Zhang <hannahz@nvidia.com> Signed-off-by: Dan Gil <dagil@nvidia.com> Co-authored-by: hhzhang16 <54051230+hhzhang16@users.noreply.github.com>
Signed-off-by: Dan Gil <dagil@nvidia.com> Signed-off-by: akshatha-k <33278067+akshatha-k@users.noreply.github.com> Co-authored-by: akshatha-k <33278067+akshatha-k@users.noreply.github.com>
…7312) (ai-dynamo#7332) Signed-off-by: Dmitry Tokarev <dtokarev@nvidia.com> Signed-off-by: Harrison King Saturley-Hall <hsaturleyhal@nvidia.com> Co-authored-by: Dmitry Tokarev <dtokarev@nvidia.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Signed-off-by: Harrison King Saturley-Hall <hsaturleyhal@nvidia.com>
…ynamo#7336, ai-dynamo#7350, ai-dynamo#7352) (ai-dynamo#7354) Signed-off-by: Dan Gil <dagil@nvidia.com> Signed-off-by: dagil-nvidia <dagil@nvidia.com> Co-authored-by: coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com> Co-authored-by: Neal Vaidya <nealv@nvidia.com>
ai-dynamo#7404) Signed-off-by: Harrison King Saturley-Hall <hsaturleyhal@nvidia.com> Co-authored-by: Biswa Panda <biswa.panda@gmail.com>
…7410) Signed-off-by: Harrison King Saturley-Hall <hsaturleyhal@nvidia.com> Signed-off-by: Harrison Saturley-Hall <hsaturleyhal@nvidia.com>
Signed-off-by: Dan Gil <dagil@nvidia.com> Signed-off-by: Neal Vaidya <nealv@nvidia.com> Signed-off-by: athreesh <anish.maddipoti@utexas.edu> Signed-off-by: Anish <80174047+athreesh@users.noreply.github.com> Signed-off-by: akshatha-k <akshutk@gmail.com> Signed-off-by: Nikhar Maheshwari <nikharm@nvidia.com> Signed-off-by: Keiven Chang <keivenchang@users.noreply.github.com> Signed-off-by: Dmitry Tokarev <dtokarev@nvidia.com> Co-authored-by: Neal Vaidya <nealv@nvidia.com> Co-authored-by: Anish <80174047+athreesh@users.noreply.github.com> Co-authored-by: akshatha-k <33278067+akshatha-k@users.noreply.github.com> Co-authored-by: akshatha-k <akshutk@gmail.com> Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com> Co-authored-by: nikharm <nikharm@nvidia.com> Co-authored-by: Keiven C <213854356+keivenchang@users.noreply.github.com> Co-authored-by: Keiven Chang <keivenchang@users.noreply.github.com> Co-authored-by: Dmitry Tokarev <dtokarev@nvidia.com>
Signed-off-by: Harrison King Saturley-Hall <hsaturleyhal@nvidia.com>
…to be use Kimi's tokenizer and fix tiktoken multi-byte handling (ai-dynamo#7424)
…mo#7433) Signed-off-by: Dan Gil <dagil@nvidia.com> Co-authored-by: Ben Hamm <ben.hamm@gmail.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Signed-off-by: Harrison King Saturley-Hall <hsaturleyhal@nvidia.com>
…o#7412) (ai-dynamo#7429) Signed-off-by: Harrison King Saturley-Hall <hsaturleyhal@nvidia.com> Co-authored-by: Biswa Panda <biswa.panda@gmail.com>
Signed-off-by: Dan Gil <dagil@nvidia.com>
a10f970 to
43f38e4
Compare
There was a problem hiding this comment.
💡 Codex Review
Here are some automated review suggestions for this pull request.
Reviewed commit: a10f9700e4
ℹ️ About Codex in GitHub
Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you
- Open a pull request for review
- Mark a draft as ready
- Comment "@codex review".
If Codex has suggestions, it will comment; otherwise it will react with 👍.
Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".
| @pytest.mark.vllm | ||
| @pytest.mark.gpu_1 | ||
| @pytest.mark.e2e | ||
| @pytest.mark.post_merge |
There was a problem hiding this comment.
Mark the repro test non-blocking until migration fix lands
This new test is tagged post_merge, and the post-merge workflow selects all vllm and gpu_1 tests with (pre_merge or post_merge) markers, so it will run in nightly CI; because the test asserts json.loads(response_text) succeeds after forced migration (the known repro path), it will keep the post-merge pipeline red on environments where the migration bug is still present. Please gate it with xfail/skip (or remove the post_merge marker) until the product fix is merged.
Useful? React with 👍 / 👎.
|
Reran the repro on the rebased Command: source /tmp/dynamo-install-mv3o75/.venv-main/bin/activate
cd /tmp/dynamo-install-mv3o75
python -m pytest -q tests/fault_tolerance/migration/test_vllm_structured.py::test_request_migration_vllm_aggregated_structured_output -sLatest result on Observed behavior differs from the earlier
So on rebased |
43f38e4 to
a10f970
Compare
|
This PR is stale because it has been open 30 days with no activity. Remove stale label or comment or this will be closed in 5 days. |
|
This PR has been closed due to inactivity. If you believe this PR is still relevant, please feel free to reopen it with additional context or information. |
Summary
dynamo.frontendand realdynamo.vllmworkersresponse_formatJSON schema requestWhat this reproduces
On the current
v1.0.1-based environment, request migration is happening, but the resumed structured-output stream becomes invalid JSON.Observed in the targeted test run on
gotenks:Stream disconnected... recreating stream...ongoing_request: 1, new_request: 0json.loads(...)then fails with:Repro command
Latest result