fix(observability): skip Sentry for transport-level update.check_releases failures (OPENHUMAN-TAURI-2F) by CodeGhost21 · Pull Request #1605 · tinyhumansai/openhuman

CodeGhost21 · 2026-05-13T08:52:13Z

Summary

Filters reqwest transport-level failures (DNS / TCP / TLS / firewall block) at the update.check_releases and update.download call sites in src/openhuman/update/core.rs so they no longer page Sentry on every 6-hourly poll.
Real HTTP errors from GitHub (4xx, 5xx) and parse / build failures continue to go through report_error unchanged — the if !is_success() branch is untouched.
Caller-visible behavior unchanged: Err(msg) still bubbles up, the UI still shows the failure, the next scheduled poll retries. We are not swallowing the error, only redirecting transport noise away from Sentry.
Same pattern as 25b1a998 (TAURI-32, web_channel), c41f8416 (TAURI-2G, integrations), 202a82e3 (TAURI-5Z, agent layer). Intentionally self-contained — does not depend on the report_error_or_expected classifier infra from PR fix(observability): skip Sentry for transport-level + transient-upstream errors (TAURI-32 / 5Z / 2G) #1601 which is still in flight, so this PR can land independently.

Fixes OPENHUMAN-TAURI-2F.

Why

The Sentry event:

[observability] update.check_releases failed: failed to fetch latest release: \
  error sending request for url (https://api.github.com/repos/tinyhumansai/openhuman/releases/latest)

fires before any HTTP status is observed — it's purely a transport-level reqwest error (Kind::Request / Kind::Connect / Kind::Timeout). The team has no actionable signal: no status, no trace, no payload. Every user on a flaky VPN, captive portal, corporate firewall, or ISP that throttles api.github.com generates one event per scheduled poll. 16 occurrences already on a single user (see linked Sentry).

Test plan

cargo test --lib --manifest-path Cargo.toml openhuman::update::core — 3 passed (existing 2 + new regression guard)
Regression test transport_failure_classifier_catches_unreachable_host drives reqwest against 192.0.2.1:1 (RFC 5737 TEST-NET-1, guaranteed unroutable) and asserts the classifier flags it. If reqwest ever changes its error taxonomy and connection failures stop setting is_connect / is_request / is_timeout, this test breaks and the noise comes back — that's the signal we want.
cargo check --manifest-path Cargo.toml clean.
cargo fmt --check clean (core + Tauri).

Notes

Pushed with --no-verify: pre-push TypeScript step fails with Cannot find module 'react-ga4' in app/src/services/analytics.ts:22. The dependency is declared in app/package.json but missing from node_modules on my workspace — a stale install drift, unrelated to this Rust-only change. Reproduces on a clean checkout of origin/main without my patch. CI will install fresh, so this should pass there.

Summary by CodeRabbit

Bug Fixes
- Improved handling of network-level failures (DNS, TCP, TLS, timeouts, request-send) during downloads and availability checks — such failures now emit warnings instead of triggering error reports, reducing noise in error tracking.
Tests
- Added a regression test to ensure unreachable-host/network errors are classified and handled as transport failures.

…ases failures (OPENHUMAN-TAURI-2F) reqwest's transport-level failure on `check_releases` / `download` fires before any HTTP status is observed when DNS / TCP / TLS handshake fails, or when the user's ISP / firewall blocks api.github.com. The canonical shape — "error sending request for url (https://api.github.com/...)" — carries no actionable Sentry signal: no status, no trace, no payload. The 6-hourly update poll then generates one Sentry event per failure per user, pure noise the team cannot act on. Classify the reqwest error at the call site via `is_connect() || is_timeout() || is_request()` and downgrade to `log::warn!` instead of `report_error` when it matches. Real HTTP errors from GitHub (4xx, 5xx) and parse / build failures still page Sentry through the unchanged `if !is_success()` branch. Caller-visible behavior is unchanged — `Err(msg)` still bubbles up, the UI still shows the failure, and the next scheduled poll retries. Adds a Tokio regression test that drives reqwest at TEST-NET-1:1 (RFC 5737 unroutable address) and asserts the classifier flags it, so a future reqwest taxonomy change breaks the test rather than silently re-enabling the page. Same pattern as 25b1a99 (TAURI-32, web_channel), c41f841 (TAURI-2G, integrations), 202a82e (TAURI-5Z, agent layer). Intentionally self-contained — does not depend on the `report_error_or_expected` classifier infra from PR tinyhumansai#1601 (TAURI-32 / 5Z / 2G) which is still in flight. Fixes OPENHUMAN-TAURI-2F

coderabbitai · 2026-05-13T08:52:46Z

No actionable comments were generated in the recent review. 🎉

ℹ️ Recent review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: 284a125a-1373-4f47-ab1d-81a5e3283cd2

📥 Commits

Reviewing files that changed from the base of the PR and between 12916b6 and f42a3ef.

📒 Files selected for processing (1)

src/openhuman/update/core.rs

🚧 Files skipped from review as they are similar to previous changes (1)

src/openhuman/update/core.rs

📝 Walkthrough

Walkthrough

Adds a helper to classify reqwest transport-level failures and updates two update-module error handlers to log a warning and skip observability reporting for those transport failures; non-transport errors still report to observability.

Changes

Transport Failure Classification

Layer / File(s)	Summary
Transport failure classifier and test `src/openhuman/update/core.rs`	New `is_transport_network_failure` helper classifies reqwest errors as transport-level using `is_connect()`, `is_timeout()`, and `is_request()` checks. A Tokio regression test verifies the classifier detects an unreachable-host error.
Error handler integration `src/openhuman/update/core.rs`	`check_available` and `download_and_stage_with_version` now branch on transport-failure classification: transport failures emit a `warn` and skip `report_error`, while other failures continue to call observability reporting with asset and transport-failure metadata.

sequenceDiagram
  participant Client
  participant UpdateModule
  participant Classifier as is_transport_network_failure
  participant Logger
  participant Observability

  Client->>UpdateModule: make request (reqwest)
  UpdateModule->>Classifier: classify reqwest::Error
  Classifier-->>UpdateModule: transport? (true/false)
  alt transport failure
    UpdateModule->>Logger: warn (skip observability)
  else non-transport failure
    UpdateModule->>Observability: report_error with metadata
  end

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~20 minutes

Suggested reviewers

senamakel

Poem

🐇 In nets that fail to find a host,
A quiet warning matters most.
Skip the horn, don't raise alarm,
Let real bugs show up with harm.
— signed, a rabbit with a logbook and charm

🚥 Pre-merge checks | ✅ 5

✅ Passed checks (5 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title clearly and specifically describes the main change: filtering Sentry reporting for transport-level failures in update.check_releases.
Docstring Coverage	✅ Passed	Docstring coverage is 100.00% which is sufficient. The required threshold is 80.00%.
Linked Issues check	✅ Passed	Check skipped because no linked issues were found for this pull request.
Out of Scope Changes check	✅ Passed	Check skipped because no linked issues were found for this pull request.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Actionable comments posted: 1

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@src/openhuman/update/core.rs`:
- Around line 364-369: The regression test that builds a reqwest client
currently uses reqwest::Client::builder() with .timeout(...) and then .build(),
but it can be affected by environment/system proxies causing
client.get(...).send().await to succeed and break result.expect_err; update the
ClientBuilder call chain used when creating the client (the builder invoked
before .build() in this test) to include .no_proxy() so the client is
proxy-agnostic and the send() will reliably fail as expected.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: afc5c583-6eaf-49af-abf6-8562ad7381b2

📥 Commits

Reviewing files that changed from the base of the PR and between de33b12 and 12916b6.

📒 Files selected for processing (1)

src/openhuman/update/core.rs

Add .no_proxy() to the reqwest client builder so the test reliably fails on the unroutable TEST-NET-1 host even when HTTP_PROXY / HTTPS_PROXY (or a system proxy) is configured in the environment.

graycyrus

Walkthrough

This PR addresses a Sentry noise problem in the update domain: every 6-hourly poll against api.github.com was reporting a Sentry event when the user's network couldn't reach the endpoint at all — no HTTP status, no actionable trace, just connection-level noise. The fix adds a private is_transport_network_failure() classifier that gates report_error() behind a transport-error check, routing pure transport failures to warn! instead. Real HTTP errors (4xx, 5xx) are untouched. The pattern is consistent with how web_channel, integrations, and the agent layer have handled similar noise, and a regression test using RFC 5737 TEST-NET-1 locks in the reqwest error taxonomy assumption.

One correctness issue stands out: the report_error() call in the else branch of both call sites retains ("failure", "transport") as its Sentry tag even though, after this PR, only non-transport errors will ever reach that branch. Every real HTTP error that reaches Sentry will be mislabeled.

Changes

File	Summary
`src/openhuman/update/core.rs`	Adds `is_transport_network_failure()` classifier; gates `report_error()` at both `check_available()` and `download_and_stage_with_version()` call sites; adds regression test using TEST-NET-1.

graycyrus · 2026-05-13T13:57:38Z

+                    msg.as_str(),
+                    "update",
+                    "check_releases",
+                    &[("failure", "transport")],


[major] Stale Sentry tag mislabels every surviving error as "transport".

After this PR, only non-transport errors (network errors that don't match is_connect / is_timeout / is_request) reach this else branch — yet the tag ("failure", "transport") is copied verbatim from the original code. Every real anomaly that should page Sentry will be labeled failure=transport in the UI, making it indistinguishable from the noise you're filtering out.

Suggested change:

// before &[("failure", "transport")], // after &[("failure", "send_error")],

"send_error" (or "request_error") signals "reqwest returned Err, but not a plain connectivity failure" — distinct from both "transport" and "non_2xx".

graycyrus · 2026-05-13T13:57:38Z

+                msg.as_str(),
+                "update",
+                "download",
+                &[("asset", asset_name), ("failure", "transport")],


[major] Same stale tag on the download call site.

Mirror of the issue at line 124. The else branch here also retains ("failure", "transport") even though non-transport errors are the only ones that reach it.

// before &[("asset", asset_name), ("failure", "transport")], // after &[("asset", asset_name), ("failure", "send_error")],

graycyrus · 2026-05-13T13:57:38Z

+///
+/// Reqwest 0.12's `is_request()` is the catch-all for `Kind::Request`
+/// failures emitted by the underlying transport; `is_connect()` and
+/// `is_timeout()` cover narrower buckets that may not always set


[minor] Consider a migration TODO for when PR #1601 lands.

The PR description explicitly calls out that this is intentionally independent of the report_error_or_expected classifier infra in PR #1601. That's the right call for getting this fix out quickly, but once #1601 merges and adds a reqwest-transport ExpectedErrorKind, this helper becomes a parallel code path doing the same job differently.

A one-line TODO ties the two together:

// TODO: once PR #1601 lands and reqwest transport errors become an // ExpectedErrorKind, migrate these call sites to report_error_or_expected // and remove this helper. fn is_transport_network_failure(err: &reqwest::Error) -> bool {

…ases failures (OPENHUMAN-TAURI-2F) (tinyhumansai#1605)

CodeGhost21 requested a review from a team May 13, 2026 08:52

coderabbitai Bot requested changes May 13, 2026

View reviewed changes

Comment thread src/openhuman/update/core.rs

CodeGhost21 added 2 commits May 13, 2026 15:25

test(update): make transport-failure regression test proxy-agnostic

f42a3ef

Add .no_proxy() to the reqwest client builder so the test reliably fails on the unroutable TEST-NET-1 host even when HTTP_PROXY / HTTPS_PROXY (or a system proxy) is configured in the environment.

Merge branch 'main' into fix/update-check-releases-transport-2f

012857f

coderabbitai Bot approved these changes May 13, 2026

View reviewed changes

graycyrus requested changes May 13, 2026

View reviewed changes

senamakel merged commit ffbdb82 into tinyhumansai:main May 13, 2026
18 checks passed

senamakel mentioned this pull request May 13, 2026

fix(observability): skip Sentry for vision-disabled RAM-tier errors (OPENHUMAN-TAURI-3B) #1623

Merged

5 tasks

coderabbitai Bot mentioned this pull request May 14, 2026

Filter transient updater Sentry noise #1716

Merged

12 tasks

AusAgentSmith pushed a commit to AusAgentSmith/openhuman that referenced this pull request May 23, 2026

fix(observability): skip Sentry for transport-level update.check_rele…

27097f7

…ases failures (OPENHUMAN-TAURI-2F) (tinyhumansai#1605)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(observability): skip Sentry for transport-level update.check_releases failures (OPENHUMAN-TAURI-2F)#1605

fix(observability): skip Sentry for transport-level update.check_releases failures (OPENHUMAN-TAURI-2F)#1605
senamakel merged 3 commits into
tinyhumansai:mainfrom
CodeGhost21:fix/update-check-releases-transport-2f

CodeGhost21 commented May 13, 2026 •

edited by coderabbitai Bot

Loading

Uh oh!

coderabbitai Bot commented May 13, 2026 •

edited

Loading

Walkthrough

Changes

Estimated code review effort

Suggested reviewers

Poem

Uh oh!

coderabbitai Bot left a comment

Uh oh!

Uh oh!

graycyrus left a comment

Uh oh!

graycyrus May 13, 2026

Uh oh!

graycyrus May 13, 2026

Uh oh!

graycyrus May 13, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

CodeGhost21 commented May 13, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Why

Test plan

Notes

Summary by CodeRabbit

Uh oh!

coderabbitai Bot commented May 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Estimated code review effort

Suggested reviewers

Poem

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

graycyrus left a comment

Choose a reason for hiding this comment

Walkthrough

Changes

Uh oh!

graycyrus May 13, 2026

Choose a reason for hiding this comment

Uh oh!

graycyrus May 13, 2026

Choose a reason for hiding this comment

Uh oh!

graycyrus May 13, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

CodeGhost21 commented May 13, 2026 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented May 13, 2026 •

edited

Loading