Feat/OpenAI backend worker #3
Merged
Conversation
- Implement a Dynamo worker that forwards requests to a local OpenAI-compatible server
- Support streaming chat/completions responses
- Coalesce streamed tool call arguments (sketched below)
- Normalize chat template fields before forwarding requests
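The tool-call coalescing deserves a closer look: OpenAI-compatible servers stream tool calls as deltas whose argument JSON arrives in fragments, and a consumer usually wants the fully assembled call. A minimal sketch of the idea, assuming OpenAI-style `delta.tool_calls` entries (`merge_tool_call_deltas` is a hypothetical name, not the PR's actual helper):

```python
# Minimal sketch of coalescing streamed tool-call arguments, assuming
# OpenAI-style `delta.tool_calls` entries. `merge_tool_call_deltas` is a
# hypothetical helper name, not the PR's actual code.

def merge_tool_call_deltas(deltas: list[dict]) -> list[dict]:
    """Accumulate tool-call deltas by index into complete calls."""
    calls: dict[int, dict] = {}
    for delta in deltas:
        for tc in delta.get("tool_calls") or []:
            call = calls.setdefault(
                tc["index"], {"id": None, "name": "", "arguments": ""}
            )
            if tc.get("id"):
                call["id"] = tc["id"]
            fn = tc.get("function") or {}
            if fn.get("name"):
                call["name"] += fn["name"]
            if fn.get("arguments"):
                # Argument JSON arrives in fragments; concatenate in order.
                call["arguments"] += fn["arguments"]
    return [calls[i] for i in sorted(calls)]

deltas = [
    {"tool_calls": [{"index": 0, "id": "call_1",
                     "function": {"name": "get_weather", "arguments": ""}}]},
    {"tool_calls": [{"index": 0, "function": {"arguments": '{"city": "Par'}}]},
    {"tool_calls": [{"index": 0, "function": {"arguments": 'is"}'}}]},
]
print(merge_tool_call_deltas(deltas))
# [{'id': 'call_1', 'name': 'get_weather', 'arguments': '{"city": "Paris"}'}]
```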
Diff excerpts from the review threads, all touching the same cancel-then-await pattern (a self-contained sketch follows the excerpts):

```python
return
task.cancel()
with contextlib.suppress(asyncio.CancelledError):
    await task
```

```python
future.cancel()
try:
    await future
except asyncio.CancelledError:
    ...
```

```python
)
try:
    yield event_source
except BaseException as exc:
    ...
```
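The excerpts share one discipline: cancel the in-flight awaitable, then await it while suppressing only the `CancelledError` that the cancellation itself produced, so any other error still propagates. A self-contained sketch of that pattern (`cancel_and_reap` and `long_stream` are illustrative names, not the PR's code):

```python
import asyncio
import contextlib

async def cancel_and_reap(task: asyncio.Task) -> None:
    """Cancel a task, then await it while suppressing only the
    CancelledError that the cancellation itself raises."""
    task.cancel()
    with contextlib.suppress(asyncio.CancelledError):
        await task

async def main() -> None:
    async def long_stream() -> None:
        await asyncio.sleep(3600)  # stands in for an in-flight response stream

    task = asyncio.create_task(long_stream())
    await asyncio.sleep(0)  # give the task a chance to start
    await cancel_and_reap(task)
    assert task.cancelled()

asyncio.run(main())
```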
Summary
Add an OpenAI-compatible backend worker for Dynamo so SGLang and vLLM can handle chat processing through their own OpenAI-compatible APIs, reducing the lag between upstream engine changes and Dynamo support.
Problem
Dynamo’s native SGLang/vLLM paths are still tightly coupled to model-specific chat processing, which means upstream engine changes to chat templating, reasoning, tool calling, and related request/response behavior often take time to be reflected in Dynamo. That lag makes it harder to stay current with engine behavior and support new upstream capabilities quickly.
Solution
This PR adds an OpenAI-compatible backend worker for Dynamo that forwards chat/completions requests to a colocated engine’s OpenAI-compatible endpoint, so chat processing can be delegated to the engine itself.
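A minimal sketch of the forwarding idea, assuming the `openai` Python client and a colocated engine at a placeholder URL; the PR's worker plugs into Dynamo's runtime rather than running a standalone loop like this:

```python
# Sketch: delegate chat processing to a colocated OpenAI-compatible server.
# The base_url, model name, and use of the `openai` client are assumptions.
import asyncio
from openai import AsyncOpenAI

client = AsyncOpenAI(base_url="http://localhost:8000/v1", api_key="unused")

async def forward_chat(request: dict):
    """Forward a chat/completions request and re-yield the streamed chunks."""
    stream = await client.chat.completions.create(
        model=request["model"],
        messages=request["messages"],
        stream=True,
    )
    async for chunk in stream:
        yield chunk  # each chunk flows straight back to the caller

async def main() -> None:
    request = {"model": "my-model", "messages": [{"role": "user", "content": "Hi"}]}
    async for chunk in forward_chat(request):
        if chunk.choices and chunk.choices[0].delta.content:
            print(chunk.choices[0].delta.content, end="", flush=True)

asyncio.run(main())
```

Because chunks are re-yielded as-is, engine-side behavior such as chat templating, reasoning, and tool calling flows through without Dynamo reimplementing it.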
To support that cleanly, this PR also:
- renames `chat_template_args` to `chat_template_kwargs` (sketched after this list)
- surfaces vLLM's streamed reasoning as `reasoning_content`
- adds the `dynamo.openai_backend.sglang` and `dynamo.openai_backend.vllm` modules
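A sketch of the normalization implied by the first two items; the helper names are hypothetical, and the source field for vLLM's streamed reasoning is an assumption here, not taken from the PR:

```python
# Sketch: normalize request/response fields around the forwarded engine call.
# `normalize_request` and `reasoning_from_delta` are hypothetical helpers.

def normalize_request(body: dict) -> dict:
    """Rename `chat_template_args` to the engines' `chat_template_kwargs`."""
    body = dict(body)  # shallow copy; leave the caller's request untouched
    if "chat_template_args" in body:
        body.setdefault("chat_template_kwargs", body.pop("chat_template_args"))
    return body

def reasoning_from_delta(delta: dict) -> str | None:
    """Surface streamed reasoning under `reasoning_content`.
    (`reasoning` as a fallback source field is an assumption.)"""
    return delta.get("reasoning_content") or delta.get("reasoning")

print(normalize_request({"model": "m", "chat_template_args": {"enable_thinking": True}}))
# {'model': 'm', 'chat_template_kwargs': {'enable_thinking': True}}
```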