
fix(api): record completions even when client disconnects #58

Merged
pescn merged 10 commits into main from fix/issue-21-abort-handling
Jan 24, 2026

Conversation

@pescn
Contributor

@pescn pescn commented Jan 24, 2026

Summary

Fixes #21: Completions are now properly recorded to the database even when the client disconnects mid-stream or times out.

Key changes:

  • Added aborted status to track client-disconnected requests
  • Created StreamingContext class with abort event detection via request.signal
  • Updated all three API endpoints (completions, messages, responses) to handle aborts
  • Added frontend support for displaying aborted status with amber indicators
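
The abort-detection pattern behind these changes can be sketched as below. This is a minimal, hypothetical reconstruction (the real StreamingContext also tracks tokens, TTFT, and tool calls); every name except request.signal is an assumption:

```typescript
// Hypothetical minimal sketch of the StreamingContext abort pattern:
// register a listener on the request's AbortSignal and persist an
// "aborted" record exactly once if the client disconnects.
type Status = "pending" | "completed" | "failed" | "aborted";

class MiniStreamingContext {
  private saved = false;
  status: Status = "pending";
  private onAbort = () => this.save("aborted");

  constructor(private signal?: AbortSignal) {
    signal?.addEventListener("abort", this.onAbort);
    // An already-aborted signal never fires "abort" for new listeners,
    // so that case must be handled explicitly.
    if (signal?.aborted) this.onAbort();
  }

  isAborted(): boolean {
    return this.signal?.aborted ?? false;
  }

  save(status: Status): void {
    if (this.saved) return; // guard against double writes
    this.saved = true;
    this.status = status; // the real code would await a DB write here
  }

  cleanup(): void {
    this.signal?.removeEventListener("abort", this.onAbort);
  }
}

const ac = new AbortController();
const ctx = new MiniStreamingContext(ac.signal);
ac.abort(); // simulate a client disconnect mid-stream
ctx.save("completed"); // late save is ignored: already saved as aborted
console.log(ctx.status, ctx.isAborted()); // aborted true
ctx.cleanup();
```

The once-only guard is what lets the error path call save unconditionally without risking the aborted record being overwritten.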

Test plan

  • Run python_test_code/test_abort_handling.py - all 3 tests pass:
    • Normal streaming: completed
    • Streaming abort: aborted
    • Non-streaming abort: aborted
  • Frontend build passes with no pattern matching errors
  • Database migration applied successfully
  • Manual testing: verify aborted requests appear in dashboard with amber badge

🤖 Generated with Claude Code

Summary by CodeRabbit

  • New Features

    • Added an "Aborted" request status, shown with an amber indicator in the request list, request details, and embeddings list.
  • Improvements

    • Streaming and non-streaming handlers are now aware of client disconnects: on interruption they keep collecting and reliably persist the data produced so far, distinguish aborts from failures, cut noisy logs on disconnect, and ensure the final status is persisted.
  • Localization

    • Added English and Chinese strings to support displaying the "Aborted" status.


This fix ensures that LLM completions are properly recorded to the database
even when the client disconnects mid-stream or times out.

Backend changes:
- Add 'aborted' status to CompletionsStatusEnum
- Create StreamingContext class to manage streaming state with abort detection
- Register abort event listener on request.signal to save completion on disconnect
- Update processStreamingResponse to use StreamingContext
- Update processNonStreamingResponse to await database writes and detect aborts
- Apply changes to all three endpoints: completions, messages, responses

Frontend changes:
- Add 'aborted' status handling in ts-pattern exhaustive matching
- Add amber indicator badges for aborted requests in requests and embeddings tables
- Add i18n translations for 'Aborted' status (en-US: "Aborted", zh-CN: "已中止")

Database migration:
- Add 'aborted' value to completions_status enum

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
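
The enum change above has to land in two places that must stay in sync. A sketch follows; the migration is presumably a single statement (shown in the comment), and the isTerminal helper is purely illustrative:

```typescript
// The DB side is likely a single migration statement, e.g.:
//   ALTER TYPE "public"."completions_status" ADD VALUE 'aborted';
// The TypeScript union must mirror the PostgreSQL enum exactly.
type CompletionsStatusEnumType = "pending" | "completed" | "failed" | "aborted";

// Illustrative helper (not from the PR): every status except "pending" is terminal.
const isTerminal = (s: CompletionsStatusEnumType): boolean => s !== "pending";

console.log(isTerminal("aborted")); // true
```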
@coderabbitai

coderabbitai Bot commented Jan 24, 2026

Note

Other AI code review bot(s) detected

CodeRabbit has detected other AI code review bot(s) in this pull request and will avoid duplicating their findings in the review comments. This may lead to a less comprehensive review.

📝 Walkthrough

Walkthrough

Adds a StreamingContext and threads an AbortSignal through both streaming and non-streaming response handling; marks the completion as aborted and persists it when the client disconnects; extends the database and TypeScript schema enums with aborted; adds frontend translations and UI indicator support.

Changes

Cohort / File(s) — Summary of changes

Database migration and metadata
backend/drizzle/0010_noisy_deathbird.sql, backend/drizzle/meta/0010_snapshot.json, backend/drizzle/meta/_journal.json
Adds the value 'aborted' to the PostgreSQL enum public.completions_status, with the migration script, snapshot, and journal entry. Watch for: keeping the production DB enum in sync and migration ordering.

Backend schema
backend/src/db/schema.ts
Extends CompletionsStatusEnum in the TypeScript/DB schema with "aborted"; must stay consistent with the database enum.

Streaming context
backend/src/utils/streaming-context.ts
New StreamingContext: manages streaming state (text/thinking, tokens, TTFT, tool calls), accepts an AbortSignal, and is responsible for building and persisting the completion on abort/completion/failure and for consuming tokens. Review concurrency, re-entrancy, and error handling.

API: completions/messages/responses handlers
backend/src/api/v1/completions.ts, backend/src/api/v1/messages.ts, backend/src/api/v1/responses.ts
Adds an optional signal?: AbortSignal parameter to the streaming and non-streaming handlers; replaces the previous ad-hoc state management with StreamingContext, saves the aborted status on client abort, and adjusts logging, error branches, and the upstream call chain (confirm the signal is propagated correctly along the whole chain).

Frontend localization
frontend/src/i18n/locales/en-US.json, frontend/src/i18n/locales/zh-CN.json
Adds translation keys for the Aborted status (multiple languages).

Frontend UI status indicators
frontend/src/pages/embeddings/columns.tsx, frontend/src/pages/requests/columns.tsx, frontend/src/pages/requests/detail-panel/header.tsx
Adds an aborted branch at the status renderers, shown as an amber indicator while preserving exhaustiveness checks.
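
The exhaustiveness point in the frontend rows can be illustrated without the ts-pattern dependency. A plain-TS stand-in follows (status names come from this PR; the color strings are assumptions for illustration):

```typescript
type CompletionsStatus = "pending" | "completed" | "failed" | "aborted";

// Plain-TS stand-in for a ts-pattern .exhaustive() match: the `never`
// assignment in the default branch makes the compiler reject any
// status value left unhandled.
function statusColor(status: CompletionsStatus): string {
  switch (status) {
    case "pending":
      return "gray";
    case "completed":
      return "green";
    case "failed":
      return "red";
    case "aborted":
      return "amber"; // the branch this PR adds
    default: {
      const unreachable: never = status;
      return unreachable;
    }
  }
}

console.log(statusColor("aborted")); // amber
```

Adding "aborted" to the union without touching this function would fail type-checking, which is exactly the safety net the PR relies on.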

Sequence Diagram(s)

sequenceDiagram
    participant Client
    participant Server as API_Server
    participant Context as StreamingContext
    participant DB as Database

    Client->>Server: Request (streaming/non-streaming, with AbortSignal)
    Server->>Server: Create completion (status="pending")
    Server->>Context: new StreamingContext(completion, bearer, apiKey, begin, signal)

    loop Receive upstream chunks
        Server->>Server: Read upstream chunk
        Server->>Context: recordTTFT() / accumulate chunk / count tokens
        alt Client not aborted
            Server->>Client: Push chunk
        else Client aborted
            Server-->>Client: Stop pushing (no more yields)
        end
    end

    alt Normal completion
        Server->>Context: saveCompletion("completed")
        Context->>DB: Insert/update completion (status=completed, parts, tokens, ttft)
        DB-->>Context: OK
        Context->>Server: consumeTokens()
    else Client abort
        Client-->>Server: AbortSignal fires
        Context->>Context: isAborted() -> true
        Context->>DB: Save completion (status=aborted, chunks received so far)
        DB-->>Context: OK
    else Upstream/processing error
        Server->>Context: saveCompletion("failed", error)
        Context->>DB: Save failed status
        DB-->>Context: OK
    end

    Context->>Context: cleanup()

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~60 minutes

Poem

🐰 The Rabbit's Abort Ode
Softly in the stream I sniff the wind, 🫧
When the first chunk arrives I note the thrill,
If the wind falls still I stamp the mark of "aborted",
The database keeps it for me; the stream leaves a trace after all.

🚥 Pre-merge checks | ✅ 4 | ❌ 1
❌ Failed checks (1 warning)
| Check name | Status | Explanation | Resolution |
| --- | --- | --- | --- |
| Docstring Coverage | ⚠️ Warning | Docstring coverage is 75.00%, below the required threshold of 80.00%. | Write docstrings for the functions missing them to satisfy the coverage threshold. |
✅ Passed checks (4 passed)
| Check name | Status | Explanation |
| --- | --- | --- |
| Description Check | ✅ Passed | Check skipped: CodeRabbit's high-level summary is enabled. |
| Title check | ✅ Passed | The title clearly and accurately summarizes the main change: the API now records completions even when the client disconnects. |
| Linked Issues check | ✅ Passed | The PR fully implements all goals of Issue #21: adding the 'aborted' status, detecting client aborts, ensuring complete data is saved on disconnect, and showing the aborted status in the UI. |
| Out of Scope Changes check | ✅ Passed | All changes relate to Issue #21: the schema's aborted status, AbortSignal in the API handlers, abort logic in StreamingContext, and the frontend's aborted display. |


✨ Finishing touches
  • 📝 Generate docstrings

📜 Recent review details

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 80684f9 and babc04a.

📒 Files selected for processing (3)
  • backend/src/api/v1/completions.ts
  • backend/src/api/v1/messages.ts
  • backend/src/api/v1/responses.ts
🧰 Additional context used
🧬 Code graph analysis (3)
backend/src/api/v1/messages.ts (3)
backend/src/utils/completions.ts (1)
  • addCompletions (29-64)
backend/src/adapters/index.ts (2)
  • getUpstreamAdapter (85-92)
  • getResponseAdapter (60-66)
backend/src/utils/streaming-context.ts (1)
  • StreamingContext (18-153)
backend/src/api/v1/completions.ts (4)
backend/src/utils/completions.ts (1)
  • addCompletions (29-64)
backend/src/adapters/index.ts (2)
  • getUpstreamAdapter (85-92)
  • getResponseAdapter (60-66)
backend/src/utils/streaming-context.ts (1)
  • StreamingContext (18-153)
backend/src/utils/redisClient.ts (1)
  • set (71-85)
backend/src/api/v1/responses.ts (3)
backend/src/utils/completions.ts (1)
  • addCompletions (29-64)
backend/src/adapters/index.ts (2)
  • getUpstreamAdapter (85-92)
  • getResponseAdapter (60-66)
backend/src/utils/streaming-context.ts (1)
  • StreamingContext (18-153)
🔇 Additional comments (10)
backend/src/api/v1/completions.ts (4)

199-222: Abort handling for the non-streaming response is correct.

The code saves the completion record before returning and sets the final status based on signal?.aborted. Consuming tokens even when the client disconnects is reasonable: the upstream provider was already called, so usage must be metered accurately.


247-264: The StreamingContext integration is solid.

The streaming path uses StreamingContext to manage state and keeps collecting upstream data even after a client disconnect, ensuring complete metering and auditing. The loggedAbort flag prevents duplicate logs.


345-378: Final save and cleanup for streaming responses are complete.

The code handles every case: normal completion, client disconnect, and saving on error. The ctx.isSaved() check effectively prevents duplicate saves.


631-675: The alreadySaved check in non-streaming error handling is correct.

Checking completion.status !== "pending" to decide whether a save already happened prevents duplicate writes after a consumeTokens failure. All addCompletions calls are wrapped in try-catch, so a failed database write cannot block sending the error response.

backend/src/api/v1/responses.ts (3)

245-377: The StreamingContext integration in responses.ts matches completions.ts.

The streaming logic, tool-call collection, abort detection, and final-save logic are consistent with completions.ts, while correctly using the OpenAI Response API format.


555-562: Streaming error responses correctly use SSE.

Unlike completions.ts, the Response API streams over SSE, and the event: error\ndata: {...}\n\n format conforms to the spec.


626-672: Non-streaming error handling is consistent with completions.ts.

The alreadySaved check and try-catch wrapping match the other endpoints, and the error response format correctly follows the Response API spec.

backend/src/api/v1/messages.ts (3)

256-307: Tool-call handling is now implemented correctly.

An earlier review noted that messages.ts was missing tool-call handling for content_block_start and content_block_stop. It is now consistent with completions.ts, correctly collecting tool-call metadata and assembling arguments.


539-546: The streaming error response format conforms to the Anthropic API spec.

The SSE error response correctly uses Anthropic's { type: "error", error: {...} } structure.


610-657: Non-streaming error handling is consistent with the other endpoints.

The alreadySaved check pattern and error format match completions.ts and responses.ts, while correctly using the Anthropic API error structure.



Comment @coderabbitai help to get the list of available commands and usage tips.

@gemini-code-assist

Summary of Changes

Hello @pescn, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request addresses a critical issue where API completion records were not reliably saved if a client disconnected prematurely or a request timed out. By introducing an aborted status and a dedicated StreamingContext for robust state management, the system now ensures that all completion attempts are properly logged, providing a more accurate and comprehensive history of API usage, regardless of client-side interruptions.

Highlights

  • Database Schema Update: Introduced a new 'aborted' status to the completions_status enum in the database, allowing for explicit tracking of requests terminated by client disconnection.
  • Robust Streaming Context: Implemented a StreamingContext class to centralize state management and ensure that completion records are saved to the database even if the client disconnects or the request times out during streaming or non-streaming operations.
  • API Endpoint Integration: Modified the completions, messages, and responses API endpoints to leverage the new StreamingContext and AbortSignal to detect client disconnections and correctly log requests with the aborted status.
  • Frontend UI Support: Added internationalization keys and UI components to the frontend to display the new aborted status with an amber indicator in the requests and embeddings dashboards.
Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

| Feature | Command | Description |
| --- | --- | --- |
| Code Review | /gemini review | Performs a code review for the current pull request in its current state. |
| Pull Request Summary | /gemini summary | Provides a summary of the current pull request in its current state. |
| Comment | @gemini-code-assist | Responds in comments when explicitly tagged, both in pull request comments and review comments. |
| Help | /gemini help | Displays a list of available commands. |

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counterproductive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for GitHub and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Footnotes

  1. Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution.


@gemini-code-assist gemini-code-assist Bot left a comment


Code Review

This pull request introduces a robust mechanism to handle client disconnects during API requests, ensuring that completions are logged even if the client aborts. This is achieved by adding an aborted status and implementing a StreamingContext class that leverages request.signal to detect disconnections. The refactoring of the completions, messages, and responses API endpoints to use this new context class is a significant improvement, centralizing state management and cleanup logic for streaming responses. The changes are well-structured and also improve correctness by awaiting database operations. The frontend has also been updated to reflect the new aborted status. I have one suggestion to improve logging for aborted streaming requests.

Comment thread backend/src/utils/streaming-context.ts Outdated

@coderabbitai coderabbitai Bot left a comment


Actionable comments posted: 3

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (2)
backend/src/api/v1/responses.ts (1)

245-310: Fix exception-log noise on abort.

The catch block currently calls logger.error unconditionally, so when a client abort triggers an upstream exception it produces needless error-log noise. Add a condition so the error is logged only in the non-abort case.

Confirmed: StreamingContext saves the completion record correctly on abort and marks isSaved, preventing duplicate saves.

Suggested fix
-  logger.error("Stream processing error", error);
+  if (!ctx.isAborted()) {
+    logger.error("Stream processing error", error);
+  }
backend/src/api/v1/messages.ts (1)

234-295: Guard the streaming exception log against the abort case.

The catch logs errors unconditionally, so an exception triggered by an abort creates noise. Confirmed: StreamingContext calls saveCompletion("aborted") immediately on abort and sets isSaved = true, and the isSaved() check in the subsequent catch block prevents the aborted record from being overwritten as completed. The suggested change reduces spurious abort logs:

Suggested improvement
-  logger.error("Stream processing error", error);
+  if (!ctx.isAborted()) {
+    logger.error("Stream processing error", error);
+  }
🤖 Fix all issues with AI agents
In `@backend/src/api/v1/completions.ts`:
- Around line 620-634: If request.signal.aborted is true the current branch
skips updating the completion, leaving it stuck as "pending"; modify the abort
branch in the error handling around processNonStreamingResponse to persist an
aborted status and a minimal audit entry: set completion.status = "aborted" (or
similar), call await addCompletions(completion, bearer, { level: "info" |
"warn", message: "Request aborted before processing", details: { type:
"abortedDuringProcessing" } }) and ensure set.status = 499 (or appropriate) and
yield a short aborted response; this change should be applied next to the
existing request.signal.aborted check so completion and logs are always written
even if processing threw after the client aborted.

In `@backend/src/utils/streaming-context.ts`:
- Around line 138-142: The current TPM consumption only runs when both
this.inputTokens and this.outputTokens are > 0 which can skip charging when
output is 0 (e.g., aborted streams); clamp negative/undefined token counts to
zero (e.g., safeInput = Math.max(0, this.inputTokens), safeOutput = Math.max(0,
this.outputTokens)), compute totalTokens = safeInput + safeOutput and call await
consumeTokens(this.apiKeyRecord.id, this.apiKeyRecord.tpmLimit, totalTokens)
whenever totalTokens > 0; keep references to this.apiKeyRecord,
this.inputTokens, this.outputTokens and consumeTokens to locate the change.
- Around line 55-65: The abort handler registration should handle the case where
signal is already aborted before listener registration: after creating and
adding this.abortHandler via signal.addEventListener in the constructor (or
initializer) of the StreamingContext, immediately check if (signal.aborted) and
if so invoke this.abortHandler() to ensure saveCompletion("aborted") runs;
reference the existing symbols this.abortHandler, signal, saveCompletion, and
logger and ensure the handler is invoked safely (guard against undefined) so the
aborted path is processed even when the AbortSignal was triggered prior to
listener registration.
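
The two streaming-context fixes prescribed above can be sketched in isolation (function shapes are assumptions; in the PR this logic lives inside StreamingContext):

```typescript
// 1) Clamp token counts so partial usage is still charged when the
//    output is 0 (e.g. an aborted stream with only input tokens).
function totalBillableTokens(inputTokens: number, outputTokens: number): number {
  const safeInput = Math.max(0, inputTokens);
  const safeOutput = Math.max(0, outputTokens);
  return safeInput + safeOutput; // charge via consumeTokens whenever > 0
}

// 2) Register an abort handler that also covers a signal that was
//    already aborted before the listener could be attached (the
//    "abort" event never fires retroactively for new listeners).
function registerAbort(signal: AbortSignal | undefined, onAbort: () => void): void {
  signal?.addEventListener("abort", onAbort);
  if (signal?.aborted) onAbort();
}

const ac = new AbortController();
ac.abort(); // aborted before registration
registerAbort(ac.signal, () => console.log("saved as aborted"));
console.log(totalBillableTokens(120, 0)); // 120
```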
📜 Review details

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 50f9873 and 1fbef4f.

📒 Files selected for processing (13)
  • backend/drizzle/0010_noisy_deathbird.sql
  • backend/drizzle/meta/0010_snapshot.json
  • backend/drizzle/meta/_journal.json
  • backend/src/api/v1/completions.ts
  • backend/src/api/v1/messages.ts
  • backend/src/api/v1/responses.ts
  • backend/src/db/schema.ts
  • backend/src/utils/streaming-context.ts
  • frontend/src/i18n/locales/en-US.json
  • frontend/src/i18n/locales/zh-CN.json
  • frontend/src/pages/embeddings/columns.tsx
  • frontend/src/pages/requests/columns.tsx
  • frontend/src/pages/requests/detail-panel/header.tsx
🧰 Additional context used
🧬 Code graph analysis (4)
frontend/src/pages/requests/detail-panel/header.tsx (1)
frontend/src/components/ui/indicator-badge.tsx (1)
  • IndicatorBadge (6-16)
frontend/src/pages/requests/columns.tsx (1)
frontend/src/components/ui/indicator-badge.tsx (1)
  • MiniIndicatorBadge (18-32)
backend/src/utils/streaming-context.ts (2)
backend/src/db/schema.ts (2)
  • ToolCallType (78-85)
  • CompletionsStatusEnumType (155-155)
backend/src/utils/completions.ts (1)
  • addCompletions (29-64)
frontend/src/pages/embeddings/columns.tsx (1)
frontend/src/components/ui/indicator-badge.tsx (1)
  • MiniIndicatorBadge (18-32)
🔇 Additional comments (20)
backend/src/db/schema.ts (1)

149-155: The enum and the type both gain aborted
The pgEnum and the TypeScript union stay in sync; end-to-end status alignment looks good.

backend/src/utils/streaming-context.ts (2)

71-83: TTFT recording and abort detection are clear
The logic is simple and direct, easy to maintain.


145-160: cleanup releases the listener properly
The abort-listener removal and state teardown are handled clearly.

backend/drizzle/meta/_journal.json (1)

74-80: The new migration journal entry is consistent
The entry format matches the existing records.

frontend/src/i18n/locales/zh-CN.json (1)

93-98: "Aborted" localization coverage is complete
The request list, detail header, and embeddings list are all covered.

Also applies to: 118-124, 192-197

frontend/src/pages/requests/detail-panel/header.tsx (1)

65-69: The new aborted indicator in the detail header is consistent
The color scheme matches the other statuses.

frontend/src/i18n/locales/en-US.json (1)

92-97: The English "Aborted" strings are in place
In sync with the Chinese keys, covering the requests and embeddings views.

Also applies to: 117-123, 191-196

frontend/src/pages/requests/columns.tsx (1)

33-46: The new aborted status indicator matches the existing pattern.

It keeps the same rendering and i18n usage as pending/completed/failed.

frontend/src/pages/embeddings/columns.tsx (1)

21-42: The aborted status display stays consistent.

Same structure and styling as the other status branches, giving uniform visual feedback.

backend/drizzle/0010_noisy_deathbird.sql (1)

1-1: Adding aborted to the enum aligns with the business semantics.

The migration is consistent with the new status.

backend/drizzle/meta/0010_snapshot.json (1)

1-1002: Auto-generated snapshot file; nothing to review manually.

backend/src/api/v1/responses.ts (3)

26-28: Introducing StreamingContext is reasonable.

It gives streaming a unified entry point for abort management and persistence.


166-213: Non-streaming abort handling is clear.

On signal.aborted it records aborted and writes to the database, matching the goal of the fix.


486-587: Passing the signal at the route layer and suppressing error responses on abort is well done.

It avoids redundant error output and duplicate logs after a client disconnect.

backend/src/api/v1/messages.ts (3)

26-28: Introducing StreamingContext helps unify streaming state management.


156-203: The non-streaming abort save logic works as expected.

It writes aborted on client interruption, reducing missed records.


470-571: Signal propagation and abort-time error suppression are handled well.

Fewer meaningless error responses and logs after a client disconnect.

backend/src/api/v1/completions.ts (3)

33-33: With StreamingContext, streaming aggregation and persistence are clearer.

The context centralizes content aggregation, TTFT, and save logic, making the flow more consistent and easier to trace.

Also applies to: 227-356


535-552: The exception branches at the streaming route layer are cleaner.

Less noise, no duplicate responses, better overall readability.


163-215: The non-streaming path persists before returning and distinguishes terminal states 👍

Writing to the database before returning helps guarantee record completeness.

Also applies to: 608-617


Comment thread backend/src/api/v1/completions.ts Outdated
Comment thread backend/src/utils/streaming-context.ts Outdated
Comment thread backend/src/utils/streaming-context.ts Outdated
pescn and others added 2 commits January 24, 2026 21:33
Addresses review comments from CodeRabbit and Gemini:

1. StreamingContext improvements:
   - Pass "Client disconnected" reason to saveCompletion for better logging
   - Handle already-aborted signals (AbortSignal spec: addEventListener
     won't trigger for already-aborted signals)
   - Fix TPM consumption to use Math.max(0, ...) for token values,
     ensuring partial usage is charged even when output is 0

2. Non-streaming abort handling:
   - Save completion as "aborted" when processing throws after client
     disconnect (previously left as "pending")

3. Streaming error logging:
   - Add ctx.isAborted() check before logger.error to reduce noise
     when client disconnects

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Fix no-base-to-string lint error by using instanceof Error check
instead of String(error ?? "...") pattern.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
@pescn
Contributor Author

pescn commented Jan 24, 2026

/gemini review


@gemini-code-assist gemini-code-assist Bot left a comment


Code Review

This pull request introduces robust handling for client disconnects during API requests, ensuring that completions are logged with a new aborted status. The implementation is centered around a new StreamingContext class, which is a clean and effective way to manage state and handle abort signals. The changes are consistently applied across the completions, messages, and responses endpoints.

My review focuses on improving the robustness of the error handling logic in the non-streaming paths. I've identified a critical issue where a failure to log a completion could lead to an unhandled exception, preventing an error response from being sent to the client. I've provided a detailed comment and code suggestion for each of the affected API endpoint files to address this.

Comment thread backend/src/api/v1/completions.ts
Comment thread backend/src/api/v1/messages.ts
Comment thread backend/src/api/v1/responses.ts
Address Gemini critical review:
1. Remove flawed `completion.status === "pending"` check that could
   prevent logging when status was already updated
2. Use .catch() instead of await for addCompletions to handle database
   failures gracefully without blocking error response

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
@pescn
Contributor Author

pescn commented Jan 24, 2026

/gemini review


@gemini-code-assist gemini-code-assist Bot left a comment


Code Review

This pull request introduces a robust mechanism for handling client disconnects by adding an aborted status and a StreamingContext class. This is a significant improvement for ensuring completion records are saved reliably, especially for streaming responses. The logic is well-encapsulated and consistently applied across the completions, messages, and responses API endpoints. I've noted a few areas in the non-streaming error handlers where database calls should be awaited to guarantee data persistence. Overall, this is a very solid and valuable change.

Comment on lines +625 to +653
if (request.signal.aborted) {
// Client disconnected - save as aborted
completion.status = "aborted";
addCompletions(completion, bearer, {
level: "info",
message: "Client disconnected during non-streaming response",
details: {
type: "completionError",
data: { type: "aborted", msg: errorMsg },
},
}).catch((logError: unknown) => {
logger.error("Failed to log aborted completion after processing error", logError);
});
} else {
logger.error("Failed to process response", error);
completion.status = "failed";
addCompletions(completion, bearer, {
level: "error",
message: `Response processing error: ${errorMsg}`,
details: {
type: "completionError",
data: { type: "processingError", msg: errorMsg },
},
}).catch((logError: unknown) => {
logger.error("Failed to log completion after processing error", logError);
});
set.status = 500;
yield JSON.stringify({ error: "Failed to process response" });
}


high

In the error handling logic for non-streaming requests, the addCompletions calls are not awaited. This creates a "fire-and-forget" situation where the application might terminate or the response might be sent before the database operation completes, potentially leading to lost completion records in error scenarios. To ensure data is persisted reliably, you should await these calls.

          if (request.signal.aborted) {
            // Client disconnected - save as aborted
            completion.status = "aborted";
            try {
              await addCompletions(completion, bearer, {
                level: "info",
                message: "Client disconnected during non-streaming response",
                details: {
                  type: "completionError",
                  data: { type: "aborted", msg: errorMsg },
                },
              });
            } catch (logError) {
              logger.error("Failed to log aborted completion after processing error", logError);
            }
          } else {
            logger.error("Failed to process response", error);
            completion.status = "failed";
            try {
              await addCompletions(completion, bearer, {
                level: "error",
                message: `Response processing error: ${errorMsg}`,
                details: {
                  type: "completionError",
                  data: { type: "processingError", msg: errorMsg },
                },
              });
            } catch (logError) {
              logger.error("Failed to log completion after processing error", logError);
            }
            set.status = 500;
            yield JSON.stringify({ error: "Failed to process response" });
          }

Comment on lines +559 to 591
if (request.signal.aborted) {
// Client disconnected - save as aborted
completion.status = "aborted";
addCompletions(completion, bearer, {
level: "info",
message: "Client disconnected during non-streaming response",
details: {
type: "completionError",
data: { type: "aborted", msg: errorMsg },
},
}).catch((logError: unknown) => {
logger.error("Failed to log aborted completion after processing error", logError);
});
} else {
logger.error("Failed to process response", error);
completion.status = "failed";
addCompletions(completion, bearer, {
level: "error",
message: `Response processing error: ${errorMsg}`,
details: {
type: "completionError",
data: { type: "processingError", msg: errorMsg },
},
}).catch((logError: unknown) => {
logger.error("Failed to log completion after processing error", logError);
});
set.status = 500;
yield JSON.stringify({
type: "error",
error: { type: "api_error", message: "Failed to process response" },
});
}
}


high

The addCompletions calls within this catch block are not awaited. This could lead to a race condition where the function returns or the process exits before the completion record is saved to the database, especially during error handling. It's important to await these calls to ensure the completion status is reliably recorded.

          if (request.signal.aborted) {
            // Client disconnected - save as aborted
            completion.status = "aborted";
            try {
              await addCompletions(completion, bearer, {
                level: "info",
                message: "Client disconnected during non-streaming response",
                details: {
                  type: "completionError",
                  data: { type: "aborted", msg: errorMsg },
                },
              });
            } catch (logError) {
              logger.error("Failed to log aborted completion after processing error", logError);
            }
          } else {
            logger.error("Failed to process response", error);
            completion.status = "failed";
            try {
              await addCompletions(completion, bearer, {
                level: "error",
                message: `Response processing error: ${errorMsg}`,
                details: {
                  type: "completionError",
                  data: { type: "processingError", msg: errorMsg },
                },
              });
            } catch (logError) {
              logger.error("Failed to log completion after processing error", logError);
            }
            set.status = 500;
            yield JSON.stringify({
              type: "error",
              error: { type: "api_error", message: "Failed to process response" },
            });
          }

Comment thread backend/src/api/v1/responses.ts
pescn and others added 2 commits January 24, 2026 21:54
…eaming responses

Addresses reviewer feedback:
- Set Content-Type: text/event-stream for SSE streaming responses
- Set Content-Type: application/json for non-streaming JSON responses

When using async generators in Elysia, the content-type cannot be auto-detected,
so we need to explicitly set the appropriate headers based on the request type.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
…tors

Per Elysia documentation, async generators should:
- Use `return` for non-streaming responses (Elysia converts to normal JSON response)
- Use `yield` only for streaming responses

This ensures:
- Non-streaming responses get proper `application/json` content-type
- Streaming responses get proper streaming behavior
- Early error responses are returned as normal JSON, not streamed

References:
- https://elysiajs.com/essential/handler
- elysiajs/elysia#742

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
@pescn
Contributor Author

pescn commented Jan 24, 2026

Fix for the Content-Type issue

After consulting the Elysia Handler documentation and Issue #742, the reviewer's point is confirmed correct:

Elysia's async generator behavior

  1. yield = streaming response: when yield is used, Elysia treats the response as a stream
  2. return without yield = normal response: if a generator returns without yielding first, Elysia automatically converts it into a normal JSON response

The previous problem

Previously all responses used yield, including early errors and non-streaming successes, so every response was treated as a stream.

The fix (commit 65bbdb8)

  • Early errors: return the error object (Elysia sets application/json automatically)
  • Streaming success: forward the stream with yield*
  • Non-streaming success: return the parsed object

This lets Elysia distinguish the response types and set the appropriate Content-Type.
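
The yield-vs-return rule described in this comment can be sketched with a bare async generator (the handler shape is simplified; the real handlers take Elysia's context):

```typescript
// Sketch of the rule: `return` without a prior `yield` behaves like a
// normal JSON response in Elysia, while `yield` makes the response a stream.
async function* handler(streaming: boolean, ok: boolean) {
  if (!ok) {
    return { error: "invalid request" }; // early error: plain JSON
  }
  if (!streaming) {
    return { id: "cmpl-1", done: true }; // non-streaming: plain JSON
  }
  // streaming: yield SSE chunks
  yield "data: chunk-1\n\n";
  yield "data: chunk-2\n\n";
}
```

Draining the generator shows the distinction: for the non-streaming call the very first next() resolves with done: true and the returned object as its value, which is what lets Elysia emit it as a normal application/json response instead of a stream.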

@pescn
Contributor Author

pescn commented Jan 24, 2026

/gemini review

@pescn
Contributor Author

pescn commented Jan 24, 2026

@coderabbitai review

@coderabbitai

coderabbitai Bot commented Jan 24, 2026

✅ Actions performed

Review triggered.

Note: CodeRabbit is an incremental review system and does not re-review already reviewed commits. This command is applicable only when automatic reviews are paused.


@gemini-code-assist gemini-code-assist Bot left a comment


Code Review

This pull request introduces robust handling for client disconnects during API requests, a critical feature for improving reliability. It correctly adds an 'aborted' status to the database and frontend, and refactors the streaming logic into a new StreamingContext class. This new class centralizes the complex state management and ensures that completions are saved even when the client disconnects, which is a significant improvement. The changes are well-structured across all three API endpoints (completions, messages, responses). My review includes a few suggestions to enhance the consistency of asynchronous database operations within error handling blocks to further improve robustness.

Comment thread backend/src/api/v1/completions.ts Outdated
Comment thread backend/src/api/v1/completions.ts Outdated
Comment thread backend/src/api/v1/messages.ts Outdated
Comment thread backend/src/api/v1/messages.ts Outdated
Comment thread backend/src/api/v1/responses.ts Outdated
Comment thread backend/src/api/v1/responses.ts Outdated
Changed from fire-and-forget .catch() pattern to await with try-catch
for addCompletions calls in non-streaming error handlers.

With the refactoring to use `return` instead of `yield`, the fire-and-forget
pattern is problematic because:
1. The function returns immediately after return statement
2. The pending promise may not complete before the function exits

Using await ensures the completion record is reliably saved before
returning the response or exiting the handler.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
@pescn

pescn commented Jan 24, 2026

Fix for Gemini's follow-up feedback (commit 3399b60)

Gemini pointed out that after the refactor (return instead of yield), the fire-and-forget .catch() pattern may leave the database write unfinished when the function has already returned.

What changed

Replaced addCompletions().catch() with await addCompletions() in a try-catch:

// before (fire-and-forget)
addCompletions(completion, bearer, {...}).catch((logError) => {
  logger.error("...", logError);
});
return;

// after (await with try-catch)
try {
  await addCompletions(completion, bearer, {...});
} catch (logError: unknown) {
  logger.error("...", logError);
}
return;

This guarantees:

  1. The database write completes before the response is returned
  2. Database errors are properly caught and logged
  3. A database problem cannot cause the response itself to fail

The fix covers three files:

  • completions.ts - lines 626, 641
  • messages.ts - lines 562, 577
  • responses.ts - lines 578, 593
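The timing difference the commit addresses can be shown with a small stand-alone sketch; saveCompletion below is a hypothetical stand-in for the real addCompletions call, and the point is the ordering, not the names.

```typescript
// Contrast fire-and-forget persistence with awaited persistence.

let saved = false;

async function saveCompletion(): Promise<void> {
  await Promise.resolve(); // simulate an async database write
  saved = true;
}

function fireAndForgetHandler(): string {
  // The write is started but not awaited; the handler returns first,
  // so the write may still be pending when the response goes out.
  saveCompletion().catch((e) => console.error(e));
  return "response";
}

async function awaitedHandler(): Promise<string> {
  // The write is guaranteed to finish before the response is returned,
  // and write failures are logged without breaking the response.
  try {
    await saveCompletion();
  } catch (e) {
    console.error(e);
  }
  return "response";
}
```

Immediately after fireAndForgetHandler() returns, the simulated write has not yet completed; after awaitedHandler() resolves, it always has.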

@pescn

pescn commented Jan 24, 2026

/gemini review


@gemini-code-assist gemini-code-assist Bot left a comment


Code Review

This pull request introduces a robust mechanism for handling client disconnects during API requests, ensuring that completions are logged even if the client aborts. The core of this change is the new StreamingContext class, which is a great addition for managing the lifecycle of streaming responses and handling abort signals. The changes are well-implemented across the completions, messages, and responses APIs.

My review identifies one critical issue in the new StreamingContext that would cause incorrect data to be saved for two of the three updated API endpoints. Overall, this is a valuable improvement to the system's reliability.

Comment thread backend/src/utils/streaming-context.ts
Previously, when a client disconnected mid-stream:
1. The abort handler immediately saved partial data
2. The for-await loop broke, discarding remaining chunks

Now the behavior is:
1. No immediate save on abort (removed abort handler)
2. Continue reading all chunks from upstream
3. Only skip yielding to disconnected client
4. Save full response with "aborted" status when stream ends

This ensures complete data is captured for observability even when
clients disconnect before the stream finishes.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
@pescn

pescn commented Jan 24, 2026

Fix: keep collecting chunks after the client disconnects (commit 64b28ed)

Problem

Previous behavior:

  1. When the client disconnected, the abort handler immediately saved the partial data
  2. The for await loop exited via break
  3. Chunks still arriving from upstream were discarded

Behavior after the fix

  1. The immediate save in the abort handler is removed
  2. All upstream chunks are still read (but no longer yielded to the disconnected client)
  3. Once the stream ends naturally, the complete data is saved with status "aborted"

Code change

// before
for await (const chunk of chunks) {
  if (ctx.isAborted()) {
    break;  // discards remaining chunks
  }
  // ... collect and yield
}

// after
for await (const chunk of chunks) {
  const clientAborted = ctx.isAborted();

  // Always collect data
  if (chunk.type === "content_block_delta") {
    ctx.textParts.push(chunk.delta.text);
  }

  // Only yield if the client is still connected
  if (!clientAborted) {
    yield serialized;
  }
}

// Save with the appropriate status at the end
if (ctx.isAborted()) {
  await ctx.saveCompletion("aborted", "Client disconnected");
} else {
  await ctx.saveCompletion("completed");
}

This ensures the full LLM response is recorded even when the client disconnects early, which supports later analysis and billing.
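The collect-all, yield-while-connected loop can be simulated synchronously; a chunk counter stands in for request.signal, and the chunk shape is illustrative rather than the real stream event type.

```typescript
// Synchronous simulation of the collect-all, yield-while-connected loop.

interface Chunk {
  type: string;
  text: string;
}

function relay(
  chunks: Chunk[],
  abortedAfter: number, // client disconnects after this many chunks
): { collected: string[]; yielded: string[] } {
  const collected: string[] = [];
  const yielded: string[] = [];
  let seen = 0;
  for (const chunk of chunks) {
    const clientAborted = seen >= abortedAfter;
    // Always collect, so the full response can be saved for metering.
    if (chunk.type === "content_block_delta") {
      collected.push(chunk.text);
    }
    // Only forward to a client that is still connected.
    if (!clientAborted) {
      yielded.push(chunk.text);
    }
    seen++;
  }
  return { collected, yielded };
}
```

With four chunks and a disconnect after two, all four are collected while only the first two are forwarded, mirroring the streaming behavior described above.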

Log an info-level message when we first detect that a client has disconnected
during streaming. This helps track client disconnect events while still
continuing to collect all upstream data for accurate token metering.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

@coderabbitai coderabbitai Bot left a comment


Actionable comments posted: 4

🤖 Fix all issues with AI agents
In `@backend/src/api/v1/completions.ts`:
- Around line 631-669: The outer catch is calling addCompletions again even if
processNonStreamingResponse already saved the completion; to prevent duplicates,
introduce a saved flag or state on the Completion lifecycle (e.g., a boolean
like completionSaved) that processNonStreamingResponse sets when it successfully
persists, or have processNonStreamingResponse return a result indicating
“saved”; then in the outer catch (around processNonStreamingResponse /
consumeTokens) check that flag/result before calling addCompletions or changing
completion.status (referencing processNonStreamingResponse, consumeTokens,
addCompletions, and the completion object) so addCompletions is only invoked
once for the same completion.

In `@backend/src/api/v1/messages.ts`:
- Around line 567-608: Before calling addCompletions in the catch block (both
the aborted branch and the failed branch) guard the save with a check that
completion.status === "pending" to avoid duplicate saves; i.e., inspect the
completion.status field and skip calling addCompletions (and skip related
logging) if it has already been changed from "pending" (this affects the abort
branch that sets completion.status = "aborted" and the error branch that sets
completion.status = "failed", so perform the check before invoking
addCompletions for those cases).
- Around line 234-295: The streaming loop in messages.ts only handles
content_block_delta and thus misses content_block_start/stop tool-call events;
update the for-await loop (where upstreamAdapter.parseStreamResponse(resp) is
iterated and ctx.textParts / thinkingParts are collected) to also handle
chunk.type === "content_block_start" and "content_block_stop" like
completions.ts: on content_block_start record the tool metadata into
ctx.indexToIdMap using ctx.nextToolCallIndex and increment
ctx.nextToolCallIndex, and on content_block_stop finalize any necessary
tool-call entry so the resulting completion saved via
StreamingContext.saveCompletion includes all tool-call entries; ensure
responseAdapter.serializeStreamChunk/yield logic remains unchanged for non-tool
chunks.

In `@backend/src/api/v1/responses.ts`:
- Around line 583-624: The catch block may double-save the completion via
addCompletions if the record was already persisted earlier (same issue as in
completions.ts); introduce and check an alreadySaved flag (either on the
completion object or a local boolean captured in the surrounding scope) before
calling addCompletions in both the aborted and failed branches, skip
logging/saving when alreadySaved is true, and ensure the code that first
persists the completion (the spot in processNonStreamingResponse/consumeTokens
that currently calls addCompletions) sets alreadySaved = true so subsequent
catch handlers do not duplicate the record.
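The alreadySaved guard described in the fix instructions above can be sketched as follows; the Completion shape and the saveOnce/addCompletions names here are illustrative stand-ins, not the project's actual API.

```typescript
// Sketch of a save-once guard to prevent duplicate completion records.

interface Completion {
  status: "pending" | "completed" | "aborted" | "failed";
}

let writes = 0;

async function addCompletions(_c: Completion): Promise<void> {
  writes++; // stand-in for the real database insert
}

async function saveOnce(
  c: Completion & { alreadySaved?: boolean },
  status: "completed" | "aborted" | "failed",
): Promise<void> {
  if (c.alreadySaved) return; // a later catch block must not re-save
  c.status = status;
  await addCompletions(c);
  c.alreadySaved = true;
}
```

A second call from an outer catch block becomes a no-op, so the record is written exactly once and the first status wins.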
📜 Review details

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 64b28ed and 80684f9.

📒 Files selected for processing (3)
  • backend/src/api/v1/completions.ts
  • backend/src/api/v1/messages.ts
  • backend/src/api/v1/responses.ts
🧰 Additional context used
🧬 Code graph analysis (3)
backend/src/api/v1/messages.ts (3)
backend/src/utils/completions.ts (1)
  • addCompletions (29-64)
backend/src/adapters/index.ts (2)
  • getUpstreamAdapter (85-92)
  • getResponseAdapter (60-66)
backend/src/utils/streaming-context.ts (1)
  • StreamingContext (18-153)
backend/src/api/v1/responses.ts (3)
backend/src/utils/completions.ts (1)
  • addCompletions (29-64)
backend/src/adapters/index.ts (2)
  • getUpstreamAdapter (85-92)
  • getResponseAdapter (60-66)
backend/src/utils/streaming-context.ts (1)
  • StreamingContext (18-153)
backend/src/api/v1/completions.ts (3)
backend/src/utils/completions.ts (1)
  • addCompletions (29-64)
backend/src/adapters/index.ts (2)
  • getUpstreamAdapter (85-92)
  • getResponseAdapter (60-66)
backend/src/utils/streaming-context.ts (1)
  • StreamingContext (18-153)
🔇 Additional comments (10)
backend/src/api/v1/completions.ts (5)

199-215: Abort handling for non-streaming responses is correct.

signal?.aborted is checked after the response is parsed, and the record is saved as "aborted" or "completed" accordingly. Using await ensures the database write completes before returning, in line with the PR's goal.


247-264: Abort detection and logging for streaming responses are implemented correctly.

The loggedAbort flag avoids duplicate log entries, and upstream data continues to be collected after the client disconnects to guarantee complete metering data. This matches the PR goal of preserving the full LLM output for metering and analysis.


329-341: Stream data is only yielded while the client is connected.

The !clientAborted condition gates yield, so no data is sent after the client disconnects while the upstream response continues to be collected. Works as intended.


357-376: Streaming error handling distinguishes aborted from non-aborted cases.

When the client has aborted and an error occurs, the record is saved with status "aborted" and the error message attached; the error is re-thrown only while the client is still connected. This avoids unnecessary error propagation while ensuring the completion record is saved.


549-566: The outer error handler for streaming correctly propagates the abort signal.

request.signal is passed through to processStreamingResponse, and the outer catch block skips error logging and response emission when the client has aborted.

backend/src/api/v1/responses.ts (3)

198-213: Abort handling for non-streaming responses matches completions.ts.

The implementation follows the same pattern as completions.ts, keeping API behavior consistent.


245-311: Streaming responses handle aborts correctly via StreamingContext.

The implementation matches completions.ts; the Response API's streaming path collects only text and thinking content, which fits that API's semantics.


512-518: Streaming error responses use the correct SSE format.

The error event format event: error\ndata: ... conforms to the OpenAI Response API's SSE convention.

backend/src/api/v1/messages.ts (2)

187-202: Abort handling for non-streaming responses is consistent with the other endpoints.

The Anthropic Messages API implementation follows the same pattern as completions.ts and responses.ts.


496-502: Streaming error responses use the Anthropic API error format.

The error event format follows the Anthropic Messages API convention.


Comment thread backend/src/api/v1/completions.ts
Comment thread backend/src/api/v1/messages.ts
Comment thread backend/src/api/v1/messages.ts
Comment thread backend/src/api/v1/responses.ts
Address CodeRabbit review feedback:
- Add alreadySaved check in catch blocks to prevent duplicate database
  writes if processNonStreamingResponse already saved the completion
  before consumeTokens throws an error
- Add tool call handling (content_block_start/stop) to messages.ts and
  responses.ts streaming to match completions.ts behavior

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
@pescn pescn merged commit 300e813 into main Jan 24, 2026
2 checks passed
@pescn pescn deleted the fix/issue-21-abort-handling branch January 24, 2026 15:43

Development

Successfully merging this pull request may close these issues.

[Bug] Completion API does NOT handle abort properly

1 participant