drivers/pipes: return after short write if buffer is full #14667

Cynerd · 2024-11-06T08:48:00Z

Summary

The write should return even in case of O_NONBLOCK if at least some bytes were written.

The previous state where always all bytes were written was breaking a common combination with poll, because poll would signal POLLOUT and some bytes would really be consumed but write could still block afterwards. That would prevent from execution returning to the poll loop again.

None the less it is also the standard C library behavior for the write function.

Impact

This could have impack on any code that uses pipes and fifos and relies on this non-standard behavior or write always writing all bytes. But it is non-standard behavior and thus considered to be a bug in my eyes.

Testing

Tested with the following code that after this change works:

// Must be longer than CONFIG_DEV_PIPE_SIZE
const char *text =
        "Lorem ipsum odor amet, consectetuer adipiscing elit. Eget litora laoreet ornare elementum pellentesque class penatibus eleifend morbi. Conubia cras accumsan blandit at fames mauris varius. Nisl c
const size_t tlen = strlen(text);
printf("tlen %zd\n", tlen);
int fds[2];
assert(pipe2(fds, 0) == 0);
struct pollfd pfd[2] = {
        {
                .fd = fds[0],
                .events = POLLIN | POLLHUP,
        },
        {
                .fd = fds[1],
                .events = POLLOUT,
        },
};
size_t off = 0;
while (true) {
        if (poll(pfd, 2, -1) == -1) {
                printf("Poll fail %s\n", strerror(errno));
                exit(1);
        }
        if (pfd[1].revents & POLLOUT) {
                ssize_t wlen = write(fds[1], text + off, tlen - off);
                if (wlen < 0) {
                        printf("Write fail %s\n", strerror(errno));
                        exit(1);
                }
                printf("Written %zd\n", wlen);
                off += wlen;
                if (off >= tlen)
                        close(fds[1]);
                continue;
        }
        if (pfd[0].revents & POLLIN) {
                char buf[BUFSIZ];
                ssize_t rlen = read(fds[0], buf, BUFSIZ);
                if (rlen < 0) {
                        printf("Read fail %s\n", strerror(errno));
                        exit(1);
                }
                fwrite(buf, 1, rlen, stdout);
        }
        if (pfd[0].revents & POLLHUP) {
                putc('\n', stdout);
                exit(0);
        }
}

The write should return even in case of O_NONBLOCK if at least some bytes were written. The previous state where always all bytes were written was breaking a common combination with poll, because poll would signal POLLOUT and some bytes would really be consumed but write could still block afterwards. That would prevent from execution returning to the poll loop again. None the less it is also the standard C library behavior for the write function.

nuttxpr · 2024-11-06T08:50:04Z

[Experimental Bot, please feedback here]

Yes, this PR appears to meet the NuttX requirements, although some sections could be more explicit.

Strengths:

Clear Summary: The summary explains the problem (blocking write with O_NONBLOCK when partial writes are possible), the solution (returning after a partial write), and the rationale (conforming to standard C library behavior and fixing the interaction with poll).
Impact Description: The impact section identifies potential compatibility issues with code relying on the previous, non-standard behavior.
Test Case: Provides a concrete test case demonstrating the issue and its resolution. This is crucial for verifying the fix.

Areas for Improvement:

Impact Specificity: While the impact section mentions potential compatibility issues, it would be stronger with more details. Which specific parts of NuttX or NuttX apps might be affected? If none are known, explicitly stating that would be helpful. The current phrasing "This could have impact..." is vague. Be more definitive if possible.
Testing Details: While the test code is provided, the "Testing logs before change" and "Testing logs after change" sections are empty. Include the actual output of the test program before and after the change. This concretely demonstrates the fix's effect. Also specify the Build Host and Target details as requested.
Impact on Documentation: If this change affects how write behaves with pipes/FIFOs, the documentation should be updated to reflect the new behavior. The Impact section should specify whether documentation updates are included in the PR.
Impact on Security and Compatibility: While it's likely there are no security implications and backward compatibility is the main concern (already addressed), explicitly stating "NO" with a brief justification for security and a more detailed explanation for compatibility (e.g., "Backward compatibility may be affected for applications relying on the non-standard behavior...") would strengthen the PR.

Example of improved Testing Section:

## Testing

I confirm that changes are verified on local setup and works as intended:
* Build Host(s): Linux (Ubuntu 20.04), x86_64, GCC 9.4.0
* Target(s): sim:qemu-x86_64


Testing logs before change:

tlen 295
Written 65536
Poll fail Interrupted system call <-- Demonstrates the blocking issue


Testing logs after change:

tlen 295
Written 65536
Written 229
Lorem ipsum odor amet, consectetuer adipiscing elit. Eget litora laoreet ornare elementum pellentesque class penatibus eleifend morbi. Conubia cras accumsan blandit at fames mauris varius. Nisl c

By making these improvements, the PR will be clearer, easier to review, and demonstrate a higher level of due diligence.

lupyuen · 2024-11-07T02:39:34Z

Sorry @Cynerd: CI Test in arm-02 is failing with Pipe Timeout, I wonder if it's due to this PR?

Configuration/Tool: sabre-6quad/citest,CONFIG_ARM_TOOLCHAIN_GNU_EABI
__________________________________ test_pipe ___________________________________
p = <utils.common.connectNuttx object at 0x7f35223[161](https://github.com/NuttX/nuttx/actions/runs/11713931118/job/32627704349#step:7:162)a0>
    def test_pipe(p):
        ret = p.sendCommand("pipe", "redirect_reader: Returning success", timeout=60)
>       assert ret == 0
E       assert -1 == 0
E         +-1
E         -0

/github/workspace/sources/nuttx/tools/ci/testrun/script/test_example/test_example.py:41: AssertionError
----------------------------- Captured stdout call -----------------------------
+------------------------------------------- Catch Exception -------------------------------------------+
+----------------------------------------------- Result ------------------------------------------------+
| Command     : 'pipe'                                                                                  |
| Expect value: ['redirect_reader: Returning success']                                                  |
| Timeout     : 60s                                                                                     |
| Test result : Timeout

https://github.com/NuttX/nuttx/actions/runs/11713931118/job/32627704349#step:7:174

Update: CI Test is also super slow in sim-01 and risc-v-05, taking 6 hours (before getting killed by GitHub):

Cynerd · 2024-11-07T11:31:27Z

Sorry @Cynerd: CI Test in arm-02 is failing with Pipe Timeout, I wonder if it's due to this PR?

It might be. I will look into that. I suspect that it is because of some invalid expectations for some components using the pipe. As I wrote in the "Impact": it might cause issues if code that uses pipecommon relies on write working in a non-standard way.

Edit: The pipe example fix should be easy. I will test it and submit PR with fix. We will see if it will help with timeout as well.

Edit2: @lupyuen apache/nuttx-apps#2829

guohao15 · 2024-11-21T09:02:25Z

hello @Cynerd there is a test failed in libuv [fs_partial_read] after this PR

https://github.com/libuv/libuv/blob/v1.x/test/test-fs.c#L3764
the result expect to be equal to nbytes （1000 which is less than the pipe buffer size 2048)

and i find this explanation about write api in https://pubs.opengroup.org/onlinepubs/9799919799/

`Complete/partial/deferred: A write request:

ret = write(fildes, buf, nbyte);
may return:

Complete
ret=nbyte
Partial
ret<nbyte
This shall never happen if nbyte<={PIPE_BUF}. If it does happen (with nbyte>{PIPE_BUF}), this volume of POSIX.1-2024 does not guarantee atomicity, even if ret<={PIPE_BUF}, because atomicity is guaranteed according to the amount requested, not the amount written.`

does this mean that if the nbytes < buffer_size, than the write should be block when the O_NONBLOCK flag is not set

Cynerd · 2024-11-21T12:03:04Z

hello @Cynerd there is a test failed in libuv [fs_partial_read] after this PR

https://github.com/libuv/libuv/blob/v1.x/test/test-fs.c#L3764 the result expect to be equal to nbytes （1000 which is less than the pipe buffer size 2048)

and i find this explanation about write api in https://pubs.opengroup.org/onlinepubs/9799919799/

`Complete/partial/deferred: A write request:

ret = write(fildes, buf, nbyte); may return:

Complete ret=nbyte Partial ret<nbyte This shall never happen if nbyte<={PIPE_BUF}. If it does happen (with nbyte>{PIPE_BUF}), this volume of POSIX.1-2024 does not guarantee atomicity, even if ret<={PIPE_BUF}, because atomicity is guaranteed according to the amount requested, not the amount written.`

does this mean that if the nbytes < buffer_size, than the write should be block when the O_NONBLOCK flag is not set

Yes, you are right. My change goes against POSIX for pipes and fifos. I even modified my test and run it on Linux and with sufficiently large buffer size it gets stuck as well:

  size_t tlen = 100000;
  char text[tlen];
  for (size_t i = 0; i < tlen; i++)
    text[i] = 'x';
  int fds[2];
  assert(pipe2(fds, 0) == 0);
  struct pollfd pfd[2] = {
      {
          .fd = fds[0],
          .events = POLLIN | POLLHUP,
      },
      {
          .fd = fds[1],
          .events = POLLOUT,
      },
  };
  size_t off = 0;
  while (true) {
    if (poll(pfd, 2, -1) == -1) {
      printf("Poll fail %s\n", strerror(errno));
      exit(1);
    }
    if (pfd[1].revents & POLLOUT) {
      ssize_t wlen = write(fds[1], text + off, tlen - off);
      if (wlen < 0) {
        printf("Write fail %s\n", strerror(errno));
        exit(1);
      }
      printf("Written %zd\n", wlen);
      off += wlen;
      if (off >= tlen)
        close(fds[1]);
      continue;
    }
    if (pfd[0].revents & POLLIN) {
      char buf[BUFSIZ];
      ssize_t rlen = read(fds[0], buf, BUFSIZ);
      if (rlen < 0) {
        printf("Read fail %s\n", strerror(errno));
        exit(1);
      }
      fwrite(buf, 1, rlen, stdout);
    }
    if (pfd[0].revents & POLLHUP) {
      putc('\n', stdout);
      exit(0);
    }
  }

I came to this from the socket (specifically local socket) side, and thus, I failed to check the POSIX specification for the pipe. Therefore, what is wrong here is not the behavior of pipe implementation but that unix local sockets are backed by them in NuttX.

Thus, this change should be reverted as well as "fix" in apps for pipe test, and fix must be instead at the socket level.

guohao15 · 2024-11-26T09:06:08Z

Could you please help revert these two patches first? @Cynerd

xiaoxiang781216 · 2024-11-26T09:10:07Z

I revert here: #14950

github-actions bot added Area: Drivers Drivers issues Size: M The size of the change in this PR is medium labels Nov 6, 2024

xiaoxiang781216 approved these changes Nov 6, 2024

View reviewed changes

xiaoxiang781216 merged commit d0680fd into apache:master Nov 6, 2024

Cynerd mentioned this pull request Nov 7, 2024

examples/pipe: fix write usage apache/nuttx-apps#2829

Merged

xiaoxiang781216 mentioned this pull request Nov 26, 2024

Revert "drivers/pipes: return after short write if buffer is full" #14950

Merged

michallenc mentioned this pull request Jun 27, 2025

[BUG] Unix datagram sockets seems broken #16555

Closed

1 task

michallenc mentioned this pull request Jul 7, 2025

[FEATURE] Polling support for Unix datagram sockets #16688

Open

1 task

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

drivers/pipes: return after short write if buffer is full #14667

drivers/pipes: return after short write if buffer is full #14667

Uh oh!

Cynerd commented Nov 6, 2024

Uh oh!

nuttxpr commented Nov 6, 2024

Uh oh!

lupyuen commented Nov 7, 2024 •

edited

Loading

Uh oh!

Cynerd commented Nov 7, 2024 •

edited

Loading

Uh oh!

guohao15 commented Nov 21, 2024

Uh oh!

Cynerd commented Nov 21, 2024

Uh oh!

guohao15 commented Nov 26, 2024

Uh oh!

xiaoxiang781216 commented Nov 26, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

drivers/pipes: return after short write if buffer is full #14667

drivers/pipes: return after short write if buffer is full #14667

Uh oh!

Conversation

Cynerd commented Nov 6, 2024

Summary

Impact

Testing

Uh oh!

nuttxpr commented Nov 6, 2024

Uh oh!

lupyuen commented Nov 7, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Cynerd commented Nov 7, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

guohao15 commented Nov 21, 2024

Uh oh!

Cynerd commented Nov 21, 2024

Uh oh!

guohao15 commented Nov 26, 2024

Uh oh!

xiaoxiang781216 commented Nov 26, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

lupyuen commented Nov 7, 2024 •

edited

Loading

Cynerd commented Nov 7, 2024 •

edited

Loading