Re-implement cat builtin with full flag support by AlexandreYang · Pull Request #22 · DataDog/rshell

AlexandreYang · 2026-03-10T10:44:32Z

What does this PR do?

Re-implements the cat builtin command with full GNU-compatible flag support. The previous implementation was bare-bones (no flags, unbounded io.Copy). The new version adds 11 flags, streaming line-by-line processing with bounded buffers, context cancellation, and proper error handling.

Motivation

The existing cat builtin lacked flag support (-n, -b, -s, -v, -E, -T, -A, -e, -t, -u, -h) and used unbounded io.Copy which is unsafe for infinite sources. This re-implementation brings it to parity with GNU coreutils cat while maintaining the safety guarantees required by RULES.md.

Testing

Go unit tests (cat_test.go): 33 tests covering all flags, edge cases, error paths, and RULES.md compliance
GNU compatibility tests (cat_gnu_compat_test.go): 20 tests with byte-for-byte output verification against GNU coreutils
Pentest tests (builtin_cat_pentest_test.go): 19 tests covering flag injection, path traversal, long lines at buffer cap, resource exhaustion, context cancellation, and CRLF edge cases
YAML scenario tests: 14 new scenarios in tests/scenarios/cmd/cat/ covering number, number-nonblank, squeeze, show-ends, show-tabs, combined flags, help, and hardening
Import allowlist test passes — no new symbols needed
All existing cat YAML scenarios continue to pass (backward compatible)

Checklist

Tests added/updated
Documentation updated (if applicable)

PR by Bits
View session in Datadog

Comment @DataDog to request changes

Co-authored-by: AlexandreYang <49917914+AlexandreYang@users.noreply.github.com>

datadog-datadog-prod-us1 · 2026-03-10T10:44:34Z

View session in Datadog

Bits Dev status: ✅ Done

Comment @DataDog to request changes

…device names Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

AlexandreYang · 2026-03-10T12:00:19Z

@codex review

AlexandreYang · 2026-03-10T12:00:21Z

@codex review

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 413f8cb3e4

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-03-10T12:08:47Z

+	flagE := fs.BoolP("show-nonprinting-ends", "e", false, "equivalent to -vE")
+	flagT := fs.BoolP("show-nonprinting-tabs", "t", false, "equivalent to -vT")
+	_ = fs.BoolP("unbuffered", "u", false, "ignored")


Drop non-GNU long aliases for short-only cat flags

GNU cat --help documents -e, -t, and -u as short-only switches (no long forms), and GNU rejects --unbuffered/similar names as unrecognized options. Defining these with BoolP and custom long names here makes rshell accept --show-nonprinting-ends, --show-nonprinting-tabs, and --unbuffered, so invocations that should fail now succeed, which breaks the stated GNU-compatibility behavior and can hide mistyped flags.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-03-10T12:08:47Z

@@ -0,0 +1,9 @@
+description: cat rejects unknown flags with exit code 1.
+skip_assert_against_bash: true


Assert unknown-flag cat scenario against bash

AGENTS.md requires skip_assert_against_bash: true only for intentional divergences, but unknown flag handling should match bash/coreutils (cat --follow ... exits non-zero with an option error). Marking this scenario as skipped removes the bash parity check for a normal compatibility case, so parser regressions for unsupported flags can slip through unnoticed.

Useful? React with 👍 / 👎.

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 413f8cb3e4

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-03-10T12:08:53Z

+		content, term := splitTerminator(line)
+		hasTerm := len(term) > 0
+		blank := len(content) == 0 && hasTerm


Preserve logical line state across file boundaries

cat is specified to concatenate inputs into one output stream (see cat --help: “Concatenate FILE(s) to standard output”), but this token handling treats each file’s first scanner token as a fresh line. If one file ends without \n, the next file’s first bytes are a continuation of that same logical line; with -n/-b this code will incorrectly add a new line number (e.g. a.txt="a", b.txt="b\n" should produce one numbered line ab\n, not two numbered fragments), and similar boundary errors also affect -s blank-line squeezing.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-03-10T12:08:53Z

+		if ctx.Err() != nil {
+			break


Propagate context cancellation as a command failure

Breaking out of the file loop when ctx.Err() != nil returns success unless another file error occurred, which can silently truncate output with exit code 0 if cancellation/deadline happens between files. This regresses reliability for timeout-driven runs because callers can observe a successful cat even though not all requested files were processed.

Useful? React with 👍 / 👎.

matt-dz

Security Audit: APPROVED

Overall Risk: LOW — 0 Critical, 0 High, 1 Medium (theoretical), 2 Low, 2 Informational

Summary

Performed a thorough security audit of the cat builtin reimplementation covering memory safety, sandbox bypass, DoS/resource exhaustion, context cancellation, integer overflow, flag parsing, byte handling, and stdin edge cases.

Findings

MEDIUM (theoretical) — lineNum (int64) could overflow after ~9.2×10¹⁸ lines (~8 EiB), producing 0 line numbers. Not practically reachable given context cancellation and shell timeouts. No crash or memory corruption.

All other areas clean:

Memory: fixed 32 KiB buffer in catRaw, 1 MiB scanner cap in catLines — no unbounded allocations
Sandbox: all file access delegated to callCtx.OpenFile(), no filesystem ops in the builtin
DoS: ctx.Err() checked every iteration in both code paths
appendNonprinting: all 256 byte values covered, matches GNU cat -v
Flag parsing: unknown flags properly rejected via pflag.ContinueOnError
Scanner split function: correct advance/token pairs, no infinite loop possible

Positive observations

Strong streaming architecture — never loads full files into memory
Clean sandbox delegation — zero filesystem logic in the builtin
Comprehensive pentest test coverage (path traversal, boundary line caps, cancellation, CRLF)
Proper error handling across multiple files

Well-engineered implementation following sound security principles for a restricted shell.

AlexandreYang · 2026-03-10T14:08:28Z

/merge

gh-worker-devflow-routing-ef8351 · 2026-03-10T14:08:32Z

View all feedbacks in Devflow UI.

2026-03-10 14:08:32 UTC ℹ️ Start processing command /merge

2026-03-10 14:08:37 UTC ℹ️ MergeQueue: pull request added to the queue

The expected merge time in main is approximately 35s (p90).

2026-03-10 14:09:18 UTC ℹ️ MergeQueue: This merge request was merged

Re-implement cat builtin with full flag support

48ef475

Co-authored-by: AlexandreYang <49917914+AlexandreYang@users.noreply.github.com>

datadog-prod-us1-5 Bot added the Bits AI label Mar 10, 2026

datadog-datadog-prod-us1-2 Bot and others added 3 commits March 10, 2026 11:11

Merge branch 'main' into dd/reimpl-cat-builtin-flags

886b8c5

Merge branch 'main' into dd/reimpl-cat-builtin-flags

7450c13

Fix TestCatPentestDevNull on Windows: assert sandbox blocks reserved …

413f8cb

…device names Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

AlexandreYang marked this pull request as ready for review March 10, 2026 12:00

AlexandreYang requested review from matt-dz and thieman as code owners March 10, 2026 12:00

chatgpt-codex-connector Bot reviewed Mar 10, 2026

View reviewed changes

matt-dz approved these changes Mar 10, 2026

View reviewed changes

gh-worker-dd-devflow-36fce6 Bot added mergequeue-status: queued mergequeue-status: in_progress and removed mergequeue-status: queued labels Mar 10, 2026

gh-worker-dd-mergequeue-cf854d Bot merged commit f84860e into main Mar 10, 2026
9 checks passed

gh-worker-dd-devflow-36fce6 Bot removed the mergequeue-status: in_progress label Mar 10, 2026

gh-worker-dd-mergequeue-cf854d Bot deleted the dd/reimpl-cat-builtin-flags branch March 10, 2026 14:09

gh-worker-dd-devflow-36fce6 Bot added the mergequeue-status: done label Mar 10, 2026

AlexandreYang mentioned this pull request Mar 12, 2026

Implement printf builtin command #57

Merged

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Re-implement cat builtin with full flag support#22

Re-implement cat builtin with full flag support#22
gh-worker-dd-mergequeue-cf854d[bot] merged 4 commits intomainfrom
dd/reimpl-cat-builtin-flags

AlexandreYang commented Mar 10, 2026

Uh oh!

datadog-datadog-prod-us1 Bot commented Mar 10, 2026 •

edited

Loading

Uh oh!

AlexandreYang commented Mar 10, 2026

Uh oh!

AlexandreYang commented Mar 10, 2026

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

chatgpt-codex-connector Bot Mar 10, 2026

Uh oh!

chatgpt-codex-connector Bot Mar 10, 2026

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

chatgpt-codex-connector Bot Mar 10, 2026

Uh oh!

chatgpt-codex-connector Bot Mar 10, 2026

Uh oh!

matt-dz left a comment

Uh oh!

AlexandreYang commented Mar 10, 2026

Uh oh!

gh-worker-devflow-routing-ef8351 Bot commented Mar 10, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		@@ -0,0 +1,9 @@
		description: cat rejects unknown flags with exit code 1.
		skip_assert_against_bash: true

Conversation

AlexandreYang commented Mar 10, 2026

What does this PR do?

Motivation

Testing

Checklist

Uh oh!

datadog-datadog-prod-us1 Bot commented Mar 10, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

AlexandreYang commented Mar 10, 2026

Uh oh!

AlexandreYang commented Mar 10, 2026

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector Bot Mar 10, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector Bot Mar 10, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector Bot Mar 10, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector Bot Mar 10, 2026

Choose a reason for hiding this comment

Uh oh!

matt-dz left a comment

Choose a reason for hiding this comment

Security Audit: APPROVED

Summary

Findings

Positive observations

Uh oh!

AlexandreYang commented Mar 10, 2026

Uh oh!

gh-worker-devflow-routing-ef8351 Bot commented Mar 10, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

datadog-datadog-prod-us1 Bot commented Mar 10, 2026 •

edited

Loading

gh-worker-devflow-routing-ef8351 Bot commented Mar 10, 2026 •

edited

Loading