docs: refresh doc and start collecting troubleshooting items in a singular place by miyoungc · Pull Request #331 · NVIDIA/NemoClaw

miyoungc · 2026-03-18T18:05:28Z

Summary by CodeRabbit

Documentation
- Added a Troubleshooting guide with remediation steps and examples for installation, onboarding, and runtime
- Added PATH guidance for nvm/fnm users and notes for reopening or sourcing shells
- Added cgroup v2 validation and Docker configuration checks with post-fix guidance to retry onboarding
- Enforced RFC‑1123 sandbox naming with automatic lowercasing and updated onboarding/preflight flow
- Clarified sandbox vs host status messaging and guidance to inspect host-side sandbox state
- Consolidated prior Common Issues into the new Troubleshooting reference

coderabbitai · 2026-03-18T18:05:41Z

No actionable comments were generated in the recent review. 🎉

ℹ️ Recent review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

Run ID: 2e0deefc-05a9-46a0-b347-4e147efd48ac

📥 Commits

Reviewing files that changed from the base of the PR and between 1ecee61 and 6251177.

📒 Files selected for processing (3)

docs/monitoring/monitor-sandbox-activity.md
docs/reference/commands.md
docs/reference/troubleshooting.md

📝 Walkthrough

Walkthrough

Added a new Troubleshooting reference page and linked it from the docs navigation and README; removed the duplicated Common Issues section from a monitoring doc; updated command reference and onboarding/preflight messaging (RFC 1123 sandbox naming, cgroup v2 Docker checks, host-side sandbox inspection guidance), and added PATH remediation notes for nvm/fnm users.

Changes

Cohort / File(s)	Summary
Navigation & README `README.md`, `docs/index.md`	Added a Troubleshooting entry to navigation and README; added remediation notes about nvm/fnm preventing installer PATH updates and how to source shell rc or reopen terminal.
New Troubleshooting Guide `docs/reference/troubleshooting.md`	Added full troubleshooting page covering Installation (PATH, unsupported platforms), Onboarding (RFC 1123 sandbox name rules, cgroup v2 Docker fixes, Colima on macOS), and Runtime (sandbox status, lifecycle, timeouts, logs, network/agent issues) with commands and remediation steps.
Command Reference `docs/reference/commands.md`	Reworked onboarding/preflight flow: prompt for sandbox name with RFC 1123 validation and auto-lowercasing, move preflight checks before gateway creation, add cgroup v2 Docker requirements and fix instructions, clarify inside-sandbox status messaging and recommend `openshell sandbox list` on the host.
Monitoring doc cleanup `docs/monitoring/monitor-sandbox-activity.md`	Removed the duplicated "Common Issues" section and replaced local troubleshooting content with a link to the new Troubleshooting page.

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~20 minutes

Poem

I nibble at docs with a twitch and a hop,
Tucking loose tips in a neat little crop.
Paths fixed, names lowered, checks all in line,
🐇 Hooray — the guide’s stitched and now it will shine! ✨

🚥 Pre-merge checks | ✅ 3

✅ Passed checks (3 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title accurately describes the main changes: refreshing documentation and consolidating troubleshooting information into a dedicated troubleshooting guide.
Docstring Coverage	✅ Passed	No functions found in the changed files to evaluate docstring coverage. Skipping docstring coverage check.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch doc/refresh

📝 Coding Plan

Generate coding plan for human review comments

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Actionable comments posted: 1

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@docs/reference/commands.md`:
- Around line 66-69: Replace the incorrect CLI invocation "openshell sandbox
status" with the correct command "openshell sandbox list" wherever it appears in
the docs (specifically update the sentence that currently reads 'Run `openshell
sandbox status` on the host for full details.' and the troubleshooting entry
that references the status command); search for the exact string "openshell
sandbox status" and change it to "openshell sandbox list", and update any
surrounding wording if needed to match the README usage.

ℹ️ Review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

Run ID: ffc0e2e4-5320-403e-b459-50480e0ae0bd

📥 Commits

Reviewing files that changed from the base of the PR and between 1e23347 and 57d7106.

📒 Files selected for processing (5)

README.md
docs/index.md
docs/monitoring/monitor-sandbox-activity.md
docs/reference/commands.md
docs/reference/troubleshooting.md

coderabbitai

♻️ Duplicate comments (2)

docs/reference/commands.md (1)

66-68: ⚠️ Potential issue | 🟡 Minor

Correct the OpenShell command per previous review.

The command openshell sandbox status on line 68 is not valid. According to the previous review verification, the correct command is openshell sandbox list for checking sandbox state on the host.

🔧 Proposed fix

 When running inside an active OpenShell sandbox, the status command detects the sandbox context and reports "active (inside sandbox)" instead of false negatives.
 Host-side sandbox state and inference configuration are not inspectable from inside the sandbox.
-Run `openshell sandbox status` on the host for full details.
+Run `openshell sandbox list` on the host for full details.

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@docs/reference/commands.md` around lines 66 - 68, Update the documentation
text that instructs users to run the host-side command: replace the incorrect
command string "openshell sandbox status" with the correct "openshell sandbox
list" wherever it appears (the sentence that currently reads "Run `openshell
sandbox status` on the host for full details." should instead read "Run
`openshell sandbox list` on the host for full details."); ensure the
example/description around sandbox status detection and host inspection remains
unchanged except for this command correction.

docs/reference/troubleshooting.md (1)

92-92: ⚠️ Potential issue | 🟡 Minor

Correct the OpenShell command per previous review.

The command openshell sandbox status is not valid. According to the previous review verification, the correct command is openshell sandbox list for checking sandbox state.
🔧 Proposed fix
-Run `openshell sandbox status` on the host for full details.
+Run `openshell sandbox list` on the host for full details.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@docs/reference/troubleshooting.md` at line 92, Replace the invalid command
invocation "openshell sandbox status" with the correct command "openshell
sandbox list" in the docs; update the text line that currently reads Run
`openshell sandbox status` to instead read Run `openshell sandbox list` so the
troubleshooting instructions point to the valid command for checking sandbox
state.

🧹 Nitpick comments (3)

docs/reference/troubleshooting.md (3)
64-64: Clarify which DNS is propagating.

The phrase "gateway's DNS has not finished propagating" is vague. Consider specifying what DNS records or service discovery mechanism needs to propagate (e.g., internal service mesh DNS, gateway endpoint registration).
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@docs/reference/troubleshooting.md` at line 64, Update the sentence "gateway's
DNS has not finished propagating" in troubleshooting.md to specify exactly which
DNS or registration must propagate (e.g., internal service mesh DNS records,
gateway endpoint A/AAAA/CNAME entries, or gateway endpoint registration in
service discovery), and give a brief example such as "the gateway's internal
service mesh DNS record or external gateway endpoint A/CNAME record has not
finished propagating" so readers know whether to check service-mesh DNS,
external DNS entries, or gateway registration.
37-37: Simplify the platform description.

The phrasing "Linux Ubuntu 22.04 LTS or later" is redundant since Ubuntu is inherently a Linux distribution.
✏️ Suggested revision
-NemoClaw requires Linux Ubuntu 22.04 LTS or later.
+NemoClaw requires Ubuntu 22.04 LTS or later.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@docs/reference/troubleshooting.md` at line 37, Replace the redundant platform
phrase "Linux Ubuntu 22.04 LTS or later" in the troubleshooting text with a
simplified statement such as "Ubuntu 22.04 LTS or later" (or, if you want to
keep OS family, "Linux (Ubuntu 22.04 LTS or later)") so the platform requirement
is clear and not repetitive; update the sentence at the same location where the
current line reads "NemoClaw requires Linux Ubuntu 22.04 LTS or later." to the
chosen simplified phrasing.
50-51: Optional: MD014 markdownlint warnings about console prompt style.

Static analysis flags the $ prompts in console blocks without showing command output (MD014). This is typically a stylistic choice in documentation. If the project prefers showing only commands (without output), these warnings can be safely ignored. If output examples would help users, consider adding representative output after the commands.

Also applies to: 76-76, 100-100, 111-111, 122-122
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@docs/reference/troubleshooting.md` around lines 50 - 51, Remove or fix MD014
warnings in the console code blocks by either (a) removing the leading "$ "
prompt from the shown commands (e.g., "sudo nemoclaw setup-spark" and "nemoclaw
onboard") so only the commands remain, (b) adding representative command output
lines immediately after each command to justify keeping the "$ " prompt, or (c)
disabling the rule for those blocks with an inline markdownlint directive.
Update the affected blocks in docs/reference/troubleshooting.md accordingly so
the markdown linter no longer flags MD014.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Duplicate comments:
In `@docs/reference/commands.md`:
- Around line 66-68: Update the documentation text that instructs users to run
the host-side command: replace the incorrect command string "openshell sandbox
status" with the correct "openshell sandbox list" wherever it appears (the
sentence that currently reads "Run `openshell sandbox status` on the host for
full details." should instead read "Run `openshell sandbox list` on the host for
full details."); ensure the example/description around sandbox status detection
and host inspection remains unchanged except for this command correction.

In `@docs/reference/troubleshooting.md`:
- Line 92: Replace the invalid command invocation "openshell sandbox status"
with the correct command "openshell sandbox list" in the docs; update the text
line that currently reads Run `openshell sandbox status` to instead read Run
`openshell sandbox list` so the troubleshooting instructions point to the valid
command for checking sandbox state.

---

Nitpick comments:
In `@docs/reference/troubleshooting.md`:
- Line 64: Update the sentence "gateway's DNS has not finished propagating" in
troubleshooting.md to specify exactly which DNS or registration must propagate
(e.g., internal service mesh DNS records, gateway endpoint A/AAAA/CNAME entries,
or gateway endpoint registration in service discovery), and give a brief example
such as "the gateway's internal service mesh DNS record or external gateway
endpoint A/CNAME record has not finished propagating" so readers know whether to
check service-mesh DNS, external DNS entries, or gateway registration.
- Line 37: Replace the redundant platform phrase "Linux Ubuntu 22.04 LTS or
later" in the troubleshooting text with a simplified statement such as "Ubuntu
22.04 LTS or later" (or, if you want to keep OS family, "Linux (Ubuntu 22.04 LTS
or later)") so the platform requirement is clear and not repetitive; update the
sentence at the same location where the current line reads "NemoClaw requires
Linux Ubuntu 22.04 LTS or later." to the chosen simplified phrasing.
- Around line 50-51: Remove or fix MD014 warnings in the console code blocks by
either (a) removing the leading "$ " prompt from the shown commands (e.g., "sudo
nemoclaw setup-spark" and "nemoclaw onboard") so only the commands remain, (b)
adding representative command output lines immediately after each command to
justify keeping the "$ " prompt, or (c) disabling the rule for those blocks with
an inline markdownlint directive. Update the affected blocks in
docs/reference/troubleshooting.md accordingly so the markdown linter no longer
flags MD014.

ℹ️ Review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

Run ID: 5ae3d50e-d085-4e66-91d1-4edd281e98ee

📥 Commits

Reviewing files that changed from the base of the PR and between 57d7106 and 1ecee61.

📒 Files selected for processing (5)

README.md
docs/index.md
docs/monitoring/monitor-sandbox-activity.md
docs/reference/commands.md
docs/reference/troubleshooting.md

🚧 Files skipped from review as they are similar to previous changes (3)

README.md
docs/index.md
docs/monitoring/monitor-sandbox-activity.md

ericksoa

lgtm

…gular place (NVIDIA#331) * doc refresh and start troubleshooting page * refresh doc and start collecting troubleshooting * openshell command fix

miyoungc self-assigned this Mar 18, 2026

miyoungc added 2 commits March 18, 2026 11:08

doc refresh and start troubleshooting page

97afc85

refresh doc and start collecting troubleshooting

1ecee61

miyoungc force-pushed the doc/refresh branch from 57d7106 to 1ecee61 Compare March 18, 2026 18:09

coderabbitai bot reviewed Mar 18, 2026

View reviewed changes

Comment thread docs/reference/commands.md

coderabbitai bot reviewed Mar 18, 2026

View reviewed changes

openshell command fix

6251177

miyoungc requested review from ericksoa and jacobtomlinson March 18, 2026 18:21

miyoungc added the documentation Improvements or additions to documentation label Mar 18, 2026

ericksoa approved these changes Mar 18, 2026

View reviewed changes

ericksoa merged commit 417b4bf into main Mar 18, 2026
3 checks passed

This was referenced Mar 20, 2026

feat: add sandbox export/import backup commands #161

Closed

chore: add docs-to-skills pipeline #607

Merged

mafueee pushed a commit to mafueee/NemoClaw that referenced this pull request Mar 28, 2026

fix(ci): remove legacy wheel publishing machinery (NVIDIA#331)

c334222

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs: refresh doc and start collecting troubleshooting items in a singular place#331

docs: refresh doc and start collecting troubleshooting items in a singular place#331
ericksoa merged 3 commits intomainfrom
doc/refresh

miyoungc commented Mar 18, 2026 •

edited by coderabbitai bot

Loading

Uh oh!

coderabbitai bot commented Mar 18, 2026 •

edited

Loading

Walkthrough

Changes

Estimated code review effort

Poem

Uh oh!

coderabbitai bot left a comment

Uh oh!

Uh oh!

coderabbitai bot left a comment

Uh oh!

ericksoa left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

miyoungc commented Mar 18, 2026 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary by CodeRabbit

Uh oh!

coderabbitai bot commented Mar 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Estimated code review effort

Poem

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

ericksoa left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

miyoungc commented Mar 18, 2026 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Mar 18, 2026 •

edited

Loading