Fix UFFD EEXIST handling for older kernels#17

Closed
ejc3 wants to merge 12 commits into fix/ci-simplify from fix/uffd-eexist

Conversation

@ejc3
Owner

@ejc3 ejc3 commented Dec 26, 2025

Summary

On older kernels (e.g., CI's 5.15 vs local 6.14), page fault coalescing is less aggressive. Multiple faults for the same page get queued, and when the second fault tries to copy, the kernel returns EEXIST because the page was already filled.

Our code was treating ALL copy errors as fatal, disconnecting the VM. This caused the egress stress tests to fail on CI but pass locally.

Changes:

  • Check for CopyFailed(EEXIST) and continue instead of returning an error
  • Log EEXIST at debug level since it's expected behavior

Evidence

CI logs confirmed the hypothesis:

error=CopyFailed(EEXIST)

References

Linux kernel documentation confirms this is expected:

"the kernel must cope with it returning -EEXIST from ioctl(UFFDIO_COPY) as expected"

Source: https://docs.kernel.org/admin-guide/mm/userfaultfd.html

Test plan

  • CI passes egress stress tests on older kernel
  • Local tests continue to pass on kernel 6.14

Dependencies

Based on #13 (fix/ci-simplify)

ejc3 added 6 commits December 26, 2025 01:42
On older kernels (e.g., CI's 5.15 vs local 6.14), page fault
coalescing is less aggressive, leading to multiple faults for
the same page being queued. When the second fault tries to copy,
it gets EEXIST because the page was already filled.

Our code was treating ALL copy errors as fatal, disconnecting
the VM. This is wrong - EEXIST just means "page already valid".

Fix: Check for CopyFailed(EEXIST) and continue instead of
returning an error. The Linux kernel documentation confirms
this is expected behavior:

  "the kernel must cope with it returning -EEXIST from
   ioctl(UFFDIO_COPY) as expected"

See: https://docs.kernel.org/admin-guide/mm/userfaultfd.html

Verified from CI logs:
  error=CopyFailed(EEXIST)

Tested: cargo check, cargo clippy, cargo fmt
Lint tests (fmt, clippy, audit, deny) run under test-root with
sudo via CARGO_TARGET_*_RUNNER. sudo's secure_path doesn't
include ~/.cargo/bin, so the cargo commands fail with ENOENT.

Added symlinks for cargo, cargo-fmt, and cargo-clippy in
/usr/local/bin alongside the existing cargo-audit and cargo-deny
symlinks.

Also fixed retention-days: the failure() function is not valid
outside if conditions, so it was changed to a fixed 14 days.
ejc3 added 3 commits December 26, 2025 03:24
The test was using `fcvm ls --json` and checking whether ANY VM was
healthy. When running in parallel with other tests, this caused false
positives: the health check would pass immediately if another test's VM
was healthy, even though this test's VM hadn't started yet.

Fix: Use `--pid` flag to query only the specific VM being tested.

Root cause analysis from CI logs:
- test_sigterm_cleanup_rootless took only 0.507s (should take ~27s)
- "VM is healthy after 120.538145ms" - way too fast
- pgrep found 17 firecracker processes from parallel tests
- Our VM's firecracker (PID 441147) hadn't even started yet when
  the health check passed (pgrep ran at 03:56:56.028, firecracker
  started at 03:56:56.684)
ejc3 added 2 commits December 26, 2025 05:25
Root cause analysis from CI run 20516048896:
- Early tests: boot in 19s, image pull in 28s = 47s total (success)
- Late tests: boot in 32s, image pull ongoing = >60s (timeout)

Resource contention from parallel VMs causes variable boot times.
Combined with 27-33s image pulls, late-starting tests exceed 60s.

Changes:
- poll_health_by_pid timeout: 60 → 120 seconds
- tokio::time::timeout for clone health: 60 → 120 seconds
- Health polling loops in signal tests: 60 → 120 seconds

Files updated:
- test_clone_connection.rs
- test_egress.rs
- test_egress_stress.rs
- test_port_forward.rs
- test_signal_cleanup.rs
- test_snapshot_clone.rs
When 17 pjdfstest_vm tests run in parallel via nextest, they all
check if localhost/pjdfstest image exists and try to build it
simultaneously. This causes podman overlay storage race conditions:

  error extracting layer: lgetxattr .../America/Winnipeg: no such file

Fix: Use fs2 file lock around the check+build section so only one
test builds the container while others wait.
@ejc3 ejc3 force-pushed the fix/uffd-eexist branch 2 times, most recently from ba1e9f5 to 983a36a Compare December 26, 2025 09:17
@ejc3 ejc3 mentioned this pull request Dec 26, 2025
@ejc3 ejc3 closed this Dec 26, 2025
@ejc3 ejc3 deleted the fix/uffd-eexist branch December 26, 2025 11:05