Skip to content

docs: refresh doc and start collecting troubleshooting items in a singular place#331

Merged
ericksoa merged 3 commits intomainfrom
doc/refresh
Mar 18, 2026
Merged

docs: refresh doc and start collecting troubleshooting items in a singular place#331
ericksoa merged 3 commits intomainfrom
doc/refresh

Conversation

@miyoungc
Copy link
Copy Markdown
Contributor

@miyoungc miyoungc commented Mar 18, 2026

Summary by CodeRabbit

  • Documentation
    • Added a Troubleshooting guide with remediation steps and examples for installation, onboarding, and runtime
    • Added PATH guidance for nvm/fnm users and notes for reopening or sourcing shells
    • Added cgroup v2 validation and Docker configuration checks with post-fix guidance to retry onboarding
    • Enforced RFC‑1123 sandbox naming with automatic lowercasing and updated onboarding/preflight flow
    • Clarified sandbox vs host status messaging and guidance to inspect host-side sandbox state
    • Consolidated prior Common Issues into the new Troubleshooting reference

@miyoungc miyoungc self-assigned this Mar 18, 2026
@coderabbitai
Copy link
Copy Markdown
Contributor

coderabbitai bot commented Mar 18, 2026

No actionable comments were generated in the recent review. 🎉

ℹ️ Recent review info
⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

Run ID: 2e0deefc-05a9-46a0-b347-4e147efd48ac

📥 Commits

Reviewing files that changed from the base of the PR and between 1ecee61 and 6251177.

📒 Files selected for processing (3)
  • docs/monitoring/monitor-sandbox-activity.md
  • docs/reference/commands.md
  • docs/reference/troubleshooting.md

📝 Walkthrough

Walkthrough

Added a new Troubleshooting reference page and linked it from the docs navigation and README; removed the duplicated Common Issues section from a monitoring doc; updated command reference and onboarding/preflight messaging (RFC 1123 sandbox naming, cgroup v2 Docker checks, host-side sandbox inspection guidance), and added PATH remediation notes for nvm/fnm users.

Changes

Cohort / File(s) Summary
Navigation & README
README.md, docs/index.md
Added a Troubleshooting entry to navigation and README; added remediation notes about nvm/fnm preventing installer PATH updates and how to source shell rc or reopen terminal.
New Troubleshooting Guide
docs/reference/troubleshooting.md
Added full troubleshooting page covering Installation (PATH, unsupported platforms), Onboarding (RFC 1123 sandbox name rules, cgroup v2 Docker fixes, Colima on macOS), and Runtime (sandbox status, lifecycle, timeouts, logs, network/agent issues) with commands and remediation steps.
Command Reference
docs/reference/commands.md
Reworked onboarding/preflight flow: prompt for sandbox name with RFC 1123 validation and auto-lowercasing, move preflight checks before gateway creation, add cgroup v2 Docker requirements and fix instructions, clarify inside-sandbox status messaging and recommend openshell sandbox list on the host.
Monitoring doc cleanup
docs/monitoring/monitor-sandbox-activity.md
Removed the duplicated "Common Issues" section and replaced local troubleshooting content with a link to the new Troubleshooting page.

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~20 minutes

Poem

I nibble at docs with a twitch and a hop,
Tucking loose tips in a neat little crop.
Paths fixed, names lowered, checks all in line,
🐇 Hooray — the guide’s stitched and now it will shine! ✨

🚥 Pre-merge checks | ✅ 3
✅ Passed checks (3 passed)
Check name Status Explanation
Description Check ✅ Passed Check skipped - CodeRabbit’s high-level summary is enabled.
Title check ✅ Passed The title accurately describes the main changes: refreshing documentation and consolidating troubleshooting information into a dedicated troubleshooting guide.
Docstring Coverage ✅ Passed No functions found in the changed files to evaluate docstring coverage. Skipping docstring coverage check.

✏️ Tip: You can configure your own custom pre-merge checks in the settings.

✨ Finishing Touches
🧪 Generate unit tests (beta)
  • Create PR with unit tests
  • Commit unit tests in branch doc/refresh
📝 Coding Plan
  • Generate coding plan for human review comments

Comment @coderabbitai help to get the list of available commands and usage tips.

Copy link
Copy Markdown
Contributor

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 1

🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@docs/reference/commands.md`:
- Around line 66-69: Replace the incorrect CLI invocation "openshell sandbox
status" with the correct command "openshell sandbox list" wherever it appears in
the docs (specifically update the sentence that currently reads 'Run `openshell
sandbox status` on the host for full details.' and the troubleshooting entry
that references the status command); search for the exact string "openshell
sandbox status" and change it to "openshell sandbox list", and update any
surrounding wording if needed to match the README usage.

ℹ️ Review info
⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

Run ID: ffc0e2e4-5320-403e-b459-50480e0ae0bd

📥 Commits

Reviewing files that changed from the base of the PR and between 1e23347 and 57d7106.

📒 Files selected for processing (5)
  • README.md
  • docs/index.md
  • docs/monitoring/monitor-sandbox-activity.md
  • docs/reference/commands.md
  • docs/reference/troubleshooting.md

Comment thread docs/reference/commands.md
Copy link
Copy Markdown
Contributor

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

♻️ Duplicate comments (2)
docs/reference/commands.md (1)

66-68: ⚠️ Potential issue | 🟡 Minor

Correct the OpenShell command per previous review.

The command openshell sandbox status on line 68 is not valid. According to the previous review verification, the correct command is openshell sandbox list for checking sandbox state on the host.

🔧 Proposed fix
 When running inside an active OpenShell sandbox, the status command detects the sandbox context and reports "active (inside sandbox)" instead of false negatives.
 Host-side sandbox state and inference configuration are not inspectable from inside the sandbox.
-Run `openshell sandbox status` on the host for full details.
+Run `openshell sandbox list` on the host for full details.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@docs/reference/commands.md` around lines 66 - 68, Update the documentation
text that instructs users to run the host-side command: replace the incorrect
command string "openshell sandbox status" with the correct "openshell sandbox
list" wherever it appears (the sentence that currently reads "Run `openshell
sandbox status` on the host for full details." should instead read "Run
`openshell sandbox list` on the host for full details."); ensure the
example/description around sandbox status detection and host inspection remains
unchanged except for this command correction.
docs/reference/troubleshooting.md (1)

92-92: ⚠️ Potential issue | 🟡 Minor

Correct the OpenShell command per previous review.

The command openshell sandbox status is not valid. According to the previous review verification, the correct command is openshell sandbox list for checking sandbox state.

🔧 Proposed fix
-Run `openshell sandbox status` on the host for full details.
+Run `openshell sandbox list` on the host for full details.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@docs/reference/troubleshooting.md` at line 92, Replace the invalid command
invocation "openshell sandbox status" with the correct command "openshell
sandbox list" in the docs; update the text line that currently reads Run
`openshell sandbox status` to instead read Run `openshell sandbox list` so the
troubleshooting instructions point to the valid command for checking sandbox
state.
🧹 Nitpick comments (3)
docs/reference/troubleshooting.md (3)

64-64: Clarify which DNS is propagating.

The phrase "gateway's DNS has not finished propagating" is vague. Consider specifying what DNS records or service discovery mechanism needs to propagate (e.g., internal service mesh DNS, gateway endpoint registration).

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@docs/reference/troubleshooting.md` at line 64, Update the sentence "gateway's
DNS has not finished propagating" in troubleshooting.md to specify exactly which
DNS or registration must propagate (e.g., internal service mesh DNS records,
gateway endpoint A/AAAA/CNAME entries, or gateway endpoint registration in
service discovery), and give a brief example such as "the gateway's internal
service mesh DNS record or external gateway endpoint A/CNAME record has not
finished propagating" so readers know whether to check service-mesh DNS,
external DNS entries, or gateway registration.

37-37: Simplify the platform description.

The phrasing "Linux Ubuntu 22.04 LTS or later" is redundant since Ubuntu is inherently a Linux distribution.

✏️ Suggested revision
-NemoClaw requires Linux Ubuntu 22.04 LTS or later.
+NemoClaw requires Ubuntu 22.04 LTS or later.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@docs/reference/troubleshooting.md` at line 37, Replace the redundant platform
phrase "Linux Ubuntu 22.04 LTS or later" in the troubleshooting text with a
simplified statement such as "Ubuntu 22.04 LTS or later" (or, if you want to
keep OS family, "Linux (Ubuntu 22.04 LTS or later)") so the platform requirement
is clear and not repetitive; update the sentence at the same location where the
current line reads "NemoClaw requires Linux Ubuntu 22.04 LTS or later." to the
chosen simplified phrasing.

50-51: Optional: MD014 markdownlint warnings about console prompt style.

Static analysis flags the $ prompts in console blocks without showing command output (MD014). This is typically a stylistic choice in documentation. If the project prefers showing only commands (without output), these warnings can be safely ignored. If output examples would help users, consider adding representative output after the commands.

Also applies to: 76-76, 100-100, 111-111, 122-122

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@docs/reference/troubleshooting.md` around lines 50 - 51, Remove or fix MD014
warnings in the console code blocks by either (a) removing the leading "$ "
prompt from the shown commands (e.g., "sudo nemoclaw setup-spark" and "nemoclaw
onboard") so only the commands remain, (b) adding representative command output
lines immediately after each command to justify keeping the "$ " prompt, or (c)
disabling the rule for those blocks with an inline markdownlint directive.
Update the affected blocks in docs/reference/troubleshooting.md accordingly so
the markdown linter no longer flags MD014.
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.

Duplicate comments:
In `@docs/reference/commands.md`:
- Around line 66-68: Update the documentation text that instructs users to run
the host-side command: replace the incorrect command string "openshell sandbox
status" with the correct "openshell sandbox list" wherever it appears (the
sentence that currently reads "Run `openshell sandbox status` on the host for
full details." should instead read "Run `openshell sandbox list` on the host for
full details."); ensure the example/description around sandbox status detection
and host inspection remains unchanged except for this command correction.

In `@docs/reference/troubleshooting.md`:
- Line 92: Replace the invalid command invocation "openshell sandbox status"
with the correct command "openshell sandbox list" in the docs; update the text
line that currently reads Run `openshell sandbox status` to instead read Run
`openshell sandbox list` so the troubleshooting instructions point to the valid
command for checking sandbox state.

---

Nitpick comments:
In `@docs/reference/troubleshooting.md`:
- Line 64: Update the sentence "gateway's DNS has not finished propagating" in
troubleshooting.md to specify exactly which DNS or registration must propagate
(e.g., internal service mesh DNS records, gateway endpoint A/AAAA/CNAME entries,
or gateway endpoint registration in service discovery), and give a brief example
such as "the gateway's internal service mesh DNS record or external gateway
endpoint A/CNAME record has not finished propagating" so readers know whether to
check service-mesh DNS, external DNS entries, or gateway registration.
- Line 37: Replace the redundant platform phrase "Linux Ubuntu 22.04 LTS or
later" in the troubleshooting text with a simplified statement such as "Ubuntu
22.04 LTS or later" (or, if you want to keep OS family, "Linux (Ubuntu 22.04 LTS
or later)") so the platform requirement is clear and not repetitive; update the
sentence at the same location where the current line reads "NemoClaw requires
Linux Ubuntu 22.04 LTS or later." to the chosen simplified phrasing.
- Around line 50-51: Remove or fix MD014 warnings in the console code blocks by
either (a) removing the leading "$ " prompt from the shown commands (e.g., "sudo
nemoclaw setup-spark" and "nemoclaw onboard") so only the commands remain, (b)
adding representative command output lines immediately after each command to
justify keeping the "$ " prompt, or (c) disabling the rule for those blocks with
an inline markdownlint directive. Update the affected blocks in
docs/reference/troubleshooting.md accordingly so the markdown linter no longer
flags MD014.

ℹ️ Review info
⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

Run ID: 5ae3d50e-d085-4e66-91d1-4edd281e98ee

📥 Commits

Reviewing files that changed from the base of the PR and between 57d7106 and 1ecee61.

📒 Files selected for processing (5)
  • README.md
  • docs/index.md
  • docs/monitoring/monitor-sandbox-activity.md
  • docs/reference/commands.md
  • docs/reference/troubleshooting.md
🚧 Files skipped from review as they are similar to previous changes (3)
  • README.md
  • docs/index.md
  • docs/monitoring/monitor-sandbox-activity.md

@miyoungc miyoungc added the documentation Improvements or additions to documentation label Mar 18, 2026
Copy link
Copy Markdown
Contributor

@ericksoa ericksoa left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm

@ericksoa ericksoa merged commit 417b4bf into main Mar 18, 2026
3 checks passed
Ryuketsukami pushed a commit to Ryuketsukami/NemoClaw that referenced this pull request Mar 24, 2026
…gular place (NVIDIA#331)

* doc refresh and start troubleshooting page

* refresh doc and start collecting troubleshooting

* openshell command fix
jessesanford pushed a commit to jessesanford/NemoClaw that referenced this pull request Mar 24, 2026
…gular place (NVIDIA#331)

* doc refresh and start troubleshooting page

* refresh doc and start collecting troubleshooting

* openshell command fix
mafueee pushed a commit to mafueee/NemoClaw that referenced this pull request Mar 28, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

documentation Improvements or additions to documentation

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants