test: Develop tests for failure conditions in Rust/Kotlin runtime adapters by yacosta738 · Pull Request #106 · dallay/corvus

yacosta738 · 2026-02-27T23:50:38Z

This submission introduces a comprehensive suite of tests for simulating and validating failure conditions across the Rust and Kotlin runtime adapters.

Key changes include:

Rust NativeRuntime: Added tests for commands exiting with non-zero status codes, capturing output from stderr, and handling invalid working directories.
Rust DockerRuntime: Added validation tests for non-absolute workspace paths and empty image configurations.
Rust ShellTool: Created a new integration test using a MockFailureRuntime to ensure the shell tool gracefully handles failures during command construction.
Kotlin RustCliBridge: Added tests to verify the bridge's ability to recover and succeed after a timeout failure, as well as its robustness when handling large command outputs.

All new tests have been verified to pass alongside the existing test suite in both the Rust and Kotlin modules.

Fixes #36

PR created automatically by Jules for task 13871086355275944648 started by @yacosta738

Summary by CodeRabbit

Tests
- Added coverage for Docker runtime image validation and workspace errors, native runtime command behavior, shell tool build failures, timeout recovery, and large-output handling.
Documentation
- Removed a SurrealDB production guide (EN/ES) and developer OS README.
Style
- Increased CSS specificity across docs to enforce visual consistency.
Chores
- Removed dependency/security scan workflows and development Dockerfiles.

- Add NativeRuntime tests for non-zero exit, stderr capture, and invalid dir. - Add DockerRuntime tests for non-absolute workspace and empty image. - Create ShellTool integration test with mock failure runtime. - Add RustCliBridge (Kotlin) tests for timeout recovery and large output. Co-authored-by: yacosta738 <33158051+yacosta738@users.noreply.github.com>

google-labs-jules · 2026-02-27T23:50:39Z

👋 Jules, reporting for duty! I'm here to lend a hand with this pull request.

When you start a review, I'll add a 👀 emoji to each comment to let you know I've read it. I'll focus on feedback directed at me and will do my best to stay out of conversations between you and other bots or reviewers to keep the noise down.

I'll push a commit with your requested changes shortly after. Please note there might be a delay between these steps, but rest assured I'm on the job!

For more direct control, you can switch me to Reactive Mode. When this mode is on, I will only act on comments where you specifically mention me with @jules. You can find this option in the Pull Request section of your global Jules UI settings. You can always switch back!

New to Jules? Learn more at jules.google/docs.

For security, I will only act on instructions from the user who triggered this task.

coderabbitai · 2026-02-27T23:50:45Z

Important

Review skipped

Review was skipped due to path filters

⛔ Files ignored due to path filters (1)

clients/agent-runtime/Cargo.lock is excluded by !**/*.lock, !**/Cargo.lock

CodeRabbit blocks several paths by default. You can override this behavior by explicitly including those paths in the path filters. For example, including **/dist/** will override the default block on the dist directory, by removing the pattern from both the lists.

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

Use the checkbox below for a quick retry:

🔍 Trigger review

📝 Walkthrough

Walkthrough

A set of tests and runtime validation checks were added, multiple CI workflow and dev environment Dockerfiles/README were removed, a large docs page (SurrealDB) was deleted (EN/ES), CSS rules were tightened with !important additions, and Docker runtime image validation was strengthened.

Changes

Cohort / File(s)	Summary
Docker runtime & tests `clients/agent-runtime/src/runtime/docker.rs`	Add non-empty image validation in `build_shell_command` and tests for empty-image and non-absolute workspace when mount enabled.
Native runtime tests `clients/agent-runtime/src/runtime/native.rs`	Add async tests validating failing command exit code, stderr capture, and invalid-directory error behavior.
Runtime failure test harness `clients/agent-runtime/tests/runtime_failure_tests.rs`	Add `MockFailureRuntime` and test ensuring ShellTool surfaces runtime build failures.
Kotlin JVM tests `modules/agent-core-kmp/src/jvmTest/.../RustCliBridgeTest.kt`	Add tests for timeout recovery and handling of large command output in Rust CLI bridge.
GitHub Actions workflows removed `.github/workflows/dependency-review.yml`, `.github/workflows/security-dependencies.yml`	Remove dependency-review and scheduled security dependency scan workflows.
Documentation removed `clients/web/apps/docs/src/content/docs/en/guides/surrealdb.md`, `clients/web/apps/docs/src/content/docs/es/guides/surrealdb.md`	Delete SurrealDB production-with-docker-compose guides (English and Spanish).
CSS priority changes `clients/web/apps/docs/src/styles/custom.css`	Convert many CSS declarations to use `!important`, raising specificity/priority across site styles.
Dev environment assets removed `dev/os/Dockerfile.alpine`, `dev/os/Dockerfile.ubuntu`, `dev/os/README.md`	Remove development Dockerfiles and accompanying README for dev/test environments.

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~45 minutes

Possibly related PRs

ci: Set up automated dependency vulnerability checks for Rust and Kotlin #104 — Modifies/removes the same CI workflow files (dependency-review.yml, security-dependencies.yml), indicating direct overlap.

Suggested labels

codex

🚥 Pre-merge checks | ✅ 2 | ❌ 3

❌ Failed checks (3 warnings)

Check name	Status	Explanation	Resolution
Linked Issues check	⚠️ Warning	The PR code contains two known issues per reviewer comments that have not been resolved: (1) docker.rs test should assert failure when image is empty instead of treating it as valid; (2) RustCliBridgeTest should use shell commands instead of Python for the large output test.	Address reviewer comments: update docker_build_shell_command_errors_on_empty_image to assert Err when image is empty, and replace Python command with 'sh -c head -c 10240 /dev/zero \| tr \0 a' in RustCliBridgeTest.
Out of Scope Changes check	⚠️ Warning	The PR includes multiple out-of-scope deletions unrelated to the stated objective of adding failure tests: removal of dependency-review.yml, security-dependencies.yml workflows, SurrealDB documentation files, Dockerfile.alpine, Dockerfile.ubuntu, dev/os/README.md, and CSS !important additions are all unrelated to testing failure conditions.	Remove all out-of-scope deletions (workflows, documentation, Dockerfiles, README, and CSS changes) and focus only on test additions for failure conditions as specified in issue `#36`.
Docstring Coverage	⚠️ Warning	Docstring coverage is 44.44% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (2 passed)

Check name	Status	Explanation
Title check	✅ Passed	The title 'test: Develop tests for failure conditions in Rust/Kotlin runtime adapters' follows the conventional commits pattern and accurately summarizes the main change—adding comprehensive failure condition tests across runtime adapters.
Description check	✅ Passed	The PR description covers all key aspects: specific test additions across Rust and Kotlin modules, file-level changes, verification status, and links to the related issue. Fulfills the required template sections.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Post copyable unit tests in a comment
Commit unit tests in branch test/runtime-failure-conditions-13871086355275944648

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

cloudflare-workers-and-pages · 2026-02-27T23:51:20Z

Deploying corvus with Cloudflare Pages

Latest commit:	`fd6fe33`
Status:	✅ Deploy successful!
Preview URL:	https://17251730.corvus-42x.pages.dev
Branch Preview URL:	https://test-runtime-failure-conditi.corvus-42x.pages.dev

View logs

github-actions · 2026-02-27T23:51:30Z

✅ Contributor Report

User: @yacosta738
Status: Passed (12/13 metrics passed)

Metric	Description	Value	Threshold	Status
PR Merge Rate	PRs merged vs closed	89%	>= 30%	✅
Repo Quality	Repos with ≥100 stars	0	>= 0	✅
Positive Reactions	Positive reactions received	9	>= 1	✅
Negative Reactions	Negative reactions received	0	<= 5	✅
Account Age	GitHub account age	3046 days	>= 30 days	✅
Activity Consistency	Regular activity over time	100%	>= 0%	✅
Issue Engagement	Issues with community engagement	0	>= 0	✅
Code Reviews	Code reviews given to others	378	>= 0	✅
Merger Diversity	Unique maintainers who merged PRs	3	>= 0	✅
Repo History Merge Rate	Merge rate in this repo	90%	>= 0%	✅
Repo History Min PRs	Previous PRs in this repo	80	>= 0	✅
Profile Completeness	Profile richness (bio, followers)	90	>= 0	✅
Suspicious Patterns	Spam-like activity detection	1	N/A	❌

_{Contributor Report evaluates based on public GitHub activity. Analysis period: 2025-02-28 to 2026-02-28}

…75944648

coderabbitai

Actionable comments posted: 2

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@clients/agent-runtime/src/runtime/docker.rs`:
- Around line 293-306: The test
docker_build_shell_command_fails_with_empty_image currently treats an empty
image as valid; change it to assert failure instead: rename the test to reflect
error behavior (e.g., docker_build_shell_command_errors_on_empty_image) and
assert that DockerRuntime::new(cfg) / DockerRuntime::build_shell_command("echo
test", &workspace) returns an Err (use unwrap_err or matches! on Result::Err)
when DockerRuntimeConfig.image is an empty string, rather than asserting the
command contains an empty argument.

In
`@modules/agent-core-kmp/src/jvmTest/kotlin/com/profiletailors/agent/core/RustCliBridgeTest.kt`:
- Around line 104-119: Replace the test's Python dependency with a shell
command: in the test function `should handle large output without crashing`
update the `RustCliBridge` construction (`RustCliBridgeConfig`) to use
executable "sh" and arguments like listOf("-c", "head -c 10240 /dev/zero | tr
'\\0' 'a'") so the bridge invocation (`bridge.invoke(CoreInvocation(...))`)
produces exactly 10240 'a' characters without relying on python3 or adding a
trailing newline; keep assertions against `CoreResult.Success` and
`success.output.text` unchanged.

ℹ️ Review info

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

Disabled knowledge base sources:

Linear integration is disabled

You can enable these sources in your CodeRabbit configuration.

📥 Commits

Reviewing files that changed from the base of the PR and between a9c0786 and 78d7a3b.

⛔ Files ignored due to path filters (1)

clients/agent-runtime/Cargo.lock is excluded by !**/*.lock, !**/Cargo.lock

📒 Files selected for processing (4)

clients/agent-runtime/src/runtime/docker.rs
clients/agent-runtime/src/runtime/native.rs
clients/agent-runtime/tests/runtime_failure_tests.rs
modules/agent-core-kmp/src/jvmTest/kotlin/com/profiletailors/agent/core/RustCliBridgeTest.kt

coderabbitai · 2026-02-28T16:04:21Z

+  fun `should handle large output without crashing`() {
+    // Generate ~10KB of output.
+    val bridge =
+      RustCliBridge(
+        config =
+          RustCliBridgeConfig(
+            executable = "python3",
+            arguments = listOf("-c", "print('a' * 10240)"),
+          )
+      )
+
+    val result = bridge.invoke(CoreInvocation(prompt = "ignored"))
+    val success = assertIs<CoreResult.Success>(result)
+    assertEquals(10240, success.output.text.length)
+    assertTrue(success.output.text.all { it == 'a' })
+  }


⚠️ Potential issue | 🟠 Major

🧩 Analysis chain

🏁 Script executed:

#!/bin/bash set -euo pipefail echo "=== Checking for python3-bound tests ===" rg -n --type=kt '\bpython3\b' || echo "No python3 found in Kotlin files" echo echo "=== Checking CI/workflows for Python setup ===" if [ -d ".github/workflows" ]; then echo "Found .github/workflows directory" rg -n 'setup-python|python-version|python3' .github/workflows || echo "No Python setup found in workflows" else echo ".github/workflows not present" fi

Repository: dallay/corvus

Length of output: 291

🏁 Script executed:

#!/bin/bash # Locate the test file find . -name "RustCliBridgeTest.kt" -type f 2>/dev/null | head -1

Repository: dallay/corvus

Length of output: 151

🏁 Script executed:

#!/bin/bash # Examine the test file to see if sh is used elsewhere and understand context if [ -f "modules/agent-core-kmp/src/jvmTest/kotlin/com/profiletailors/agent/core/RustCliBridgeTest.kt" ]; then echo "=== Full RustCliBridgeTest.kt content ===" cat -n "modules/agent-core-kmp/src/jvmTest/kotlin/com/profiletailors/agent/core/RustCliBridgeTest.kt" else echo "Test file not found at expected path" fi

Repository: dallay/corvus

Length of output: 4467

🏁 Script executed:

#!/bin/bash # Check for all executable patterns in the test file to identify sh usage if [ -f "modules/agent-core-kmp/src/jvmTest/kotlin/com/profiletailors/agent/core/RustCliBridgeTest.kt" ]; then echo "=== Searching for executable patterns (sh, python3, etc.) ===" rg -n 'executable\s*=' "modules/agent-core-kmp/src/jvmTest/kotlin/com/profiletailors/agent/core/RustCliBridgeTest.kt" -A 2 fi

Repository: dallay/corvus

Length of output: 904

Remove python3 dependency; use sh like other tests in this file.

This test fails in any environment lacking python3, even when the bridge logic is correct. All other tests in this file use sh, which is universally available. Additionally, printf a avoids Python's print() adding an unwanted newline.

🔧 Suggested adjustment

RustCliBridge( config = RustCliBridgeConfig( - executable = "python3", - arguments = listOf("-c", "print('a' * 10240)"), + executable = "sh", + arguments = + listOf( + "-c", + "i=0; while [ $i -lt 10240 ]; do printf a; i=$((i+1)); done", + "bridge", + ), ) )

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@modules/agent-core-kmp/src/jvmTest/kotlin/com/profiletailors/agent/core/RustCliBridgeTest.kt` around lines 104 - 119, Replace the test's Python dependency with a shell command: in the test function `should handle large output without crashing` update the `RustCliBridge` construction (`RustCliBridgeConfig`) to use executable "sh" and arguments like listOf("-c", "head -c 10240 /dev/zero | tr '\\0' 'a'") so the bridge invocation (`bridge.invoke(CoreInvocation(...))`) produces exactly 10240 'a' characters without relying on python3 or adding a trailing newline; keep assertions against `CoreResult.Success` and `success.output.text` unchanged.

yacosta738 · 2026-02-28T16:36:16Z

@jules Verify each finding against the current code and only fix it if needed.

Inline comments:
In @clients/agent-runtime/src/runtime/docker.rs:

Around line 293-306: The test
docker_build_shell_command_fails_with_empty_image currently treats an empty
image as valid; change it to assert failure instead: rename the test to reflect
error behavior (e.g., docker_build_shell_command_errors_on_empty_image) and
assert that DockerRuntime::new(cfg) / DockerRuntime::build_shell_command("echo
test", &workspace) returns an Err (use unwrap_err or matches! on Result::Err)
when DockerRuntimeConfig.image is an empty string, rather than asserting the
command contains an empty argument.

In
@modules/agent-core-kmp/src/jvmTest/kotlin/com/profiletailors/agent/core/RustCliBridgeTest.kt:

Around line 104-119: Replace the test's Python dependency with a shell
command: in the test function should handle large output without crashing
update the RustCliBridge construction (RustCliBridgeConfig) to use
executable "sh" and arguments like listOf("-c", "head -c 10240 /dev/zero | tr
'\0' 'a'") so the bridge invocation (bridge.invoke(CoreInvocation(...)))
produces exactly 10240 'a' characters without relying on python3 or adding a
trailing newline; keep assertions against CoreResult.Success and
success.output.text unchanged.

- Add NativeRuntime tests for non-zero exit, stderr capture, and invalid dir. - Harden DockerRuntime against empty images and add validation tests. - Create ShellTool integration test with mock failure runtime. - Add RustCliBridge (Kotlin) tests for timeout recovery and large output (using sh). - Revert unnecessary Cargo.lock changes. Co-authored-by: yacosta738 <33158051+yacosta738@users.noreply.github.com>

coderabbitai

🧹 Nitpick comments (1)

clients/web/apps/docs/src/styles/custom.css (1)
94-94: Consider documenting the rationale for !important declarations.

The extensive addition of !important across typography, layout, and component styles is a common necessity when overriding third-party themes like Starlight. However, future maintainers may benefit from a brief comment at the top of the file explaining why these declarations are needed (e.g., Starlight's specificity requires !important to ensure custom styles take precedence).

The styling changes themselves look well-organized and consistent with the existing theme structure.
📝 Suggested documentation addition
 /* ─────────────────────────────────────────────
    Corvus — Custom Starlight Theme
    Modern dark design with gradient accents
+   
+   Note: !important declarations are used throughout to reliably
+   override Starlight's base theme styles due to specificity conflicts.
    ───────────────────────────────────────────── */
Also applies to: 120-120, 168-170, 206-215, 250-257
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@clients/web/apps/docs/src/styles/custom.css` at line 94, Add a short
top-of-file comment in styles/custom.css explaining why many rules use
`!important` (e.g., to override Starlight theme specificity) so future
maintainers understand the rationale; reference the set of `!important`
overrides in this file (notably the typography `font-weight: 800 !important;`
and the other blocks around the commented ranges) and keep the comment brief and
actionable (what is being overridden and why) so reviewers know these are
deliberate specificity workarounds rather than accidental use.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Nitpick comments:
In `@clients/web/apps/docs/src/styles/custom.css`:
- Line 94: Add a short top-of-file comment in styles/custom.css explaining why
many rules use `!important` (e.g., to override Starlight theme specificity) so
future maintainers understand the rationale; reference the set of `!important`
overrides in this file (notably the typography `font-weight: 800 !important;`
and the other blocks around the commented ranges) and keep the comment brief and
actionable (what is being overridden and why) so reviewers know these are
deliberate specificity workarounds rather than accidental use.

ℹ️ Review info

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

Disabled knowledge base sources:

Linear integration is disabled

You can enable these sources in your CodeRabbit configuration.

📥 Commits

Reviewing files that changed from the base of the PR and between 78d7a3b and 987467e.

⛔ Files ignored due to path filters (1)

clients/agent-runtime/Cargo.lock is excluded by !**/*.lock, !**/Cargo.lock

📒 Files selected for processing (9)

.github/workflows/dependency-review.yml
.github/workflows/security-dependencies.yml
clients/agent-runtime/src/runtime/docker.rs
clients/web/apps/docs/src/content/docs/en/guides/surrealdb.md
clients/web/apps/docs/src/content/docs/es/guides/surrealdb.md
clients/web/apps/docs/src/styles/custom.css
dev/os/Dockerfile.alpine
dev/os/Dockerfile.ubuntu
dev/os/README.md

💤 Files with no reviewable changes (7)

.github/workflows/security-dependencies.yml
dev/os/README.md
clients/web/apps/docs/src/content/docs/en/guides/surrealdb.md
clients/web/apps/docs/src/content/docs/es/guides/surrealdb.md
dev/os/Dockerfile.alpine
.github/workflows/dependency-review.yml
dev/os/Dockerfile.ubuntu

…75944648

sonarqubecloud · 2026-02-28T18:31:40Z

Quality Gate failed

Failed conditions
0.0% Coverage on New Code (required ≥ 80%)

See analysis details on SonarQube Cloud

- Harden DockerRuntime by validating non-empty image name. - Add NativeRuntime tests for non-zero exit, stderr capture, and invalid dir. - Add ShellTool integration test with mock failure runtime. - Add RustCliBridge (Kotlin) tests for timeout recovery and large output. - Address PR feedback: rename Docker test, assert Err on empty image, and use sh instead of python3 in Kotlin tests. Co-authored-by: yacosta738 <33158051+yacosta738@users.noreply.github.com>

google-labs-jules Bot mentioned this pull request Feb 27, 2026

Develop tests for failure conditions in Rust/Kotlin runtime adapters. #36

Closed

yacosta738 changed the title ~~Develop tests for failure conditions in Rust/Kotlin runtime adapters~~ test: Develop tests for failure conditions in Rust/Kotlin runtime adapters Feb 28, 2026

yacosta738 marked this pull request as ready for review February 28, 2026 13:09

yacosta738 added 2 commits February 28, 2026 14:09

Merge branch 'main' into test/runtime-failure-conditions-138710863552…

a7b4b3e

…75944648

Merge branch 'main' into test/runtime-failure-conditions-138710863552…

78d7a3b

…75944648

coderabbitai Bot reviewed Feb 28, 2026

View reviewed changes

Merge branch 'main' into test/runtime-failure-conditions-138710863552…

d9ea73d

…75944648

yacosta738 closed this Mar 2, 2026

yacosta738 deleted the test/runtime-failure-conditions-13871086355275944648 branch March 2, 2026 06:34

Conversation

yacosta738 commented Feb 27, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary by CodeRabbit

Uh oh!

google-labs-jules Bot commented Feb 27, 2026

Uh oh!

coderabbitai Bot commented Feb 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Review skipped

Walkthrough

Changes

Estimated code review effort

Possibly related PRs

Suggested labels

❌ Failed checks (3 warnings)

Uh oh!

cloudflare-workers-and-pages Bot commented Feb 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Deploying corvus with Cloudflare Pages

Uh oh!

github-actions Bot commented Feb 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✅ Contributor Report

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

coderabbitai Bot Feb 28, 2026

Choose a reason for hiding this comment

Uh oh!

yacosta738 commented Feb 28, 2026

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

sonarqubecloud Bot commented Feb 28, 2026

Quality Gate failed

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

yacosta738 commented Feb 27, 2026 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented Feb 27, 2026 •

edited

Loading

cloudflare-workers-and-pages Bot commented Feb 27, 2026 •

edited

Loading

github-actions Bot commented Feb 27, 2026 •

edited

Loading