Conversation

@conectado (Owner) commented Jul 9, 2025

refs #17

@conectado conectado force-pushed the concurrency-patterns-article branch from 669f5fa to 59c1785 on July 9, 2025 23:35
@cloudflare-workers-and-pages (bot) commented Jul 9, 2025

Deploying taping-memory with Cloudflare Pages

Latest commit: 7a436f7
Status: ✅  Deploy successful!
Preview URL: https://10698634.taping-memory.pages.dev
Branch Preview URL: https://concurrency-patterns-article.taping-memory.pages.dev


@conectado conectado mentioned this pull request Jul 9, 2025
@conectado conectado force-pushed the concurrency-patterns-article branch from e9ff83e to 66b2b67 on July 16, 2025 01:43
@thomaseizinger (Contributor) commented Jul 25, 2025

Relevant: firezone/firezone#10003

I think the learning here is that one needs to be mindful of which components are part of these loops.

@conectado (Owner, Author) commented

> Relevant: firezone/firezone#10003
>
> I think the learning here is that one needs to be mindful of which components are part of these loops.

@thomaseizinger Yeah! That's a great example, and it's also the way to do it: first implement it as a single task, then split it into multiple tasks when benchmarks suggest it's a good idea. I'll try to make it clearer in the conclusion that having multiple tasks can be very beneficial.

But in this particular case, there's something a bit weird, and I might be missing some context: most of the CPU time consumed by the phoenix channel is due to tracing-related stuff?

@thomaseizinger (Contributor) commented Jul 25, 2025

> But in this particular case, there's something a bit weird, and I might be missing some context: most of the CPU time consumed by the phoenix channel is due to tracing-related stuff?

Yeah. I am not sure if it is a bug in tracing right now (I've found this: tokio-rs/tracing#3345), but it seems that even just having trace! macros in the hot path can be quite expensive, even if they are not active.
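
To make the shape of the problem concrete, here is a minimal sketch (hypothetical `handle_packet`, not Firezone code) of what I mean by a trace! in the hot path:

```rust
// Hypothetical hot path: the event fires for every packet, so even when TRACE
// is disabled, the per-call "is this event enabled?" check still runs here.
use tracing::trace;

fn handle_packet(buf: &[u8]) {
    trace!(len = buf.len(), "handling packet");

    // ... decrypt / route the packet ...
}

fn main() {
    // No subscriber is installed, so the event never fires, but the
    // enabled-check above still executes a million times.
    for _ in 0..1_000_000 {
        handle_packet(&[0u8; 16]);
    }

    // Compiling TRACE out entirely (e.g. via the tracing crate's
    // `release_max_level_*` cargo features) removes the cost altogether.
}
```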

@thomaseizinger (Contributor) commented

Just gave this a read, very detailed explanation of the various trade-offs! I liked the section about error handling in particular.

I think sans-IO works especially well for completely UDP-based stuff because timeouts are your only way to react to any IO failures anyway.

Integrating that with TCP is a bit more difficult. For one, it is stream-based, so if performance matters a lot, you may not want to send around heap-allocated messages all the time but directly operate on borrowed data. In the UDP case, using buffer pools helps here, but for TCP that is more difficult due to the variation in message sizes.
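
Rough sketch of the buffer-pool point (a made-up `BufferPool`, not from any crate): with UDP every datagram fits in a fixed-size buffer, so buffers can be recycled without allocating; a TCP stream has no such fixed framing.

```rust
use std::collections::VecDeque;

const MAX_DATAGRAM: usize = 1500; // MTU-sized upper bound for a single datagram

struct BufferPool {
    free: VecDeque<Vec<u8>>,
}

impl BufferPool {
    fn new(n: usize) -> Self {
        Self {
            free: (0..n).map(|_| vec![0u8; MAX_DATAGRAM]).collect(),
        }
    }

    /// Hand out a recycled buffer, or allocate a fresh one if the pool is empty.
    fn acquire(&mut self) -> Vec<u8> {
        self.free
            .pop_front()
            .unwrap_or_else(|| vec![0u8; MAX_DATAGRAM])
    }

    /// Return a buffer once the datagram has been processed.
    fn release(&mut self, buf: Vec<u8>) {
        self.free.push_back(buf);
    }
}

fn main() {
    let mut pool = BufferPool::new(4);
    let buf = pool.acquire();
    // ... recv_from() into `buf`, hand &buf[..n] to the sans-IO state ...
    pool.release(buf);
}
```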

Two, the two-way communication in the case of errors is very annoying with channels, as you demonstrated very well.

One thing that I would recommend you mention is the complexity of task wake-ups. As elegant as they are in some ways, hand-rolled futures are very error-prone, and I would not recommend them unless all other options are actually not workable. In Firezone, all IO (apart from phoenix-channel) is now actually using async, but it is all in separate threads, using channels to interact with one big sans-IO state machine, and I am very happy with it. Task wake-up bugs are a very painful reality. (As an extension of that, so are state machine bugs in poll_timeout, actually.)
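
Very roughly, the shape looks like this (names made up, not the actual Firezone code):

```rust
// IO runs on its own threads and talks to one central sans-IO state machine
// over channels; the core itself never touches a socket.
use std::sync::mpsc;
use std::thread;

enum Event {
    PacketReceived(Vec<u8>),
    Shutdown,
}

enum Command {
    SendPacket(Vec<u8>),
}

/// The single-threaded, IO-free core: pure state transitions.
struct StateMachine;

impl StateMachine {
    fn handle(&mut self, event: Event) -> Option<Command> {
        match event {
            Event::PacketReceived(p) => Some(Command::SendPacket(p)), // echo, just for the sketch
            Event::Shutdown => None,
        }
    }
}

fn main() {
    let (event_tx, event_rx) = mpsc::channel::<Event>();
    let (cmd_tx, cmd_rx) = mpsc::channel::<Command>();

    // IO thread: in the real thing this would be an async runtime doing socket
    // reads/writes; here it just produces one event and drains commands.
    let io = thread::spawn(move || {
        event_tx.send(Event::PacketReceived(vec![1, 2, 3])).unwrap();
        event_tx.send(Event::Shutdown).unwrap();

        for cmd in cmd_rx {
            match cmd {
                Command::SendPacket(p) => println!("would write {} bytes to the socket", p.len()),
            }
        }
    });

    // Core thread (here: main): drives the sans-IO state machine.
    let mut sm = StateMachine;
    for event in event_rx {
        match sm.handle(event) {
            Some(cmd) => cmd_tx.send(cmd).unwrap(),
            None => break,
        }
    }
    drop(cmd_tx); // closes the command channel so the IO thread exits

    io.join().unwrap();
}
```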

As a bottom line, I think sticking to async where you can is good, followed by building sans-IO stuff. Also, spawning tasks is always to be done with caution, I think. For one, you often break back-pressure if you e.g. spawn a task for each incoming thing. Two, spawning tasks, similar to channels, breaks the stack trace and thus error reporting. So structured concurrency is better there.
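
A toy illustration of the back-pressure point (tokio and a made-up `handle` assumed): spawning a task per item accepts work without bound, while awaiting the work inline on a bounded channel makes producers slow down when the consumer can't keep up.

```rust
use tokio::sync::mpsc;

async fn handle(item: u32) {
    // ... real work would go here ...
    let _ = item;
}

#[allow(dead_code)]
async fn unbounded(mut rx: mpsc::Receiver<u32>) {
    while let Some(item) = rx.recv().await {
        // No back-pressure: every item becomes a detached task, and any error
        // inside it surfaces (if at all) far away from this loop.
        tokio::spawn(handle(item));
    }
}

async fn with_backpressure(mut rx: mpsc::Receiver<u32>) {
    while let Some(item) = rx.recv().await {
        // Back-pressure: the bounded channel feeding `rx` fills up while we are
        // busy, so senders naturally wait. Errors also stay on this call path
        // instead of vanishing into a spawned task.
        handle(item).await;
    }
}

#[tokio::main]
async fn main() {
    let (tx, rx) = mpsc::channel(16); // bounded: senders wait when full
    tokio::spawn(async move {
        for i in 0..32 {
            let _ = tx.send(i).await;
        }
    });
    with_backpressure(rx).await;
}
```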

My 2c, great article on the whole! Happy to do some detailed editing on it if you'd like before you publish it :)

@conectado (Owner, Author) commented

> Just gave this a read, very detailed explanation of the various trade-offs! I liked the section about error handling in particular.

Thanks <3 and thanks for the feedback!

> I think sans-IO works especially well for completely UDP-based stuff because timeouts are your only way to react to any IO failures anyway.
>
> Integrating that with TCP is a bit more difficult. For one, it is stream-based, so if performance matters a lot, you may not want to send around heap-allocated messages all the time but directly operate on borrowed data. In the UDP case, using buffer pools helps here, but for TCP that is more difficult due to the variation in message sizes.

You could still have a "performant" sans-IO approach without heap-allocating messages if you always immediately react to messages instead of buffering them within your sans-IO state?

This isn't trivial, but what I mean is that this challenge would exist regardless of using sans-IO; TCP is quite complicated for performance-critical applications.
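
Something like this is what I have in mind (a made-up `Connection` type, just to show the shape): the sans-IO state borrows the incoming bytes, reacts immediately, and writes its reply into a caller-provided buffer, so nothing has to be heap-allocated or buffered inside the state itself.

```rust
struct Connection {
    packets_seen: u64,
}

impl Connection {
    /// Handle one incoming segment/datagram and produce at most one reply.
    /// Returns the number of bytes written into `out`.
    fn handle_input(&mut self, input: &[u8], out: &mut [u8]) -> usize {
        self.packets_seen += 1;

        // React to the borrowed bytes right away instead of storing them.
        let n = input.len().min(out.len());
        out[..n].copy_from_slice(&input[..n]); // echo, just for the sketch
        n
    }
}

fn main() {
    let mut conn = Connection { packets_seen: 0 };
    let mut out = [0u8; 1500];

    // The IO layer owns the buffers; the state machine only ever borrows them.
    let n = conn.handle_input(b"hello", &mut out);
    println!("would send {} bytes", n);
}
```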

> Two, the two-way communication in the case of errors is very annoying with channels, as you demonstrated very well.

This is a problem with sans-IO?

> One thing that I would recommend you mention is the complexity of task wake-ups. As elegant as they are in some ways, hand-rolled futures are very error-prone, and I would not recommend them unless all other options are actually not workable. In Firezone, all IO (apart from phoenix-channel) is now actually using async, but it is all in separate threads, using channels to interact with one big sans-IO state machine, and I am very happy with it. Task wake-up bugs are a very painful reality. (As an extension of that, so are state machine bugs in poll_timeout, actually.)

Yeah! I definitely need to mention this, and it's the biggest downside; I'll take a look at how you're doing it in Firezone now. I didn't think it was such a big problem that you'd rather use async. For me, the biggest benefit of hand-rolling futures is that you get the error at the call site, so there's no need to keep track of which IO object the error came from.
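
Roughly what I mean (a hand-wavy sketch against tokio's `UdpSocket`, not code from the article):

```rust
use std::io;
use std::task::{Context, Poll};

use tokio::net::UdpSocket;

struct Node {
    relay: UdpSocket,
    peer: UdpSocket,
}

impl Node {
    fn poll_recv(&mut self, cx: &mut Context<'_>, buf: &mut [u8]) -> Poll<io::Result<usize>> {
        let mut read_buf = tokio::io::ReadBuf::new(buf);

        // An error returned here can only be about `self.relay`...
        if let Poll::Ready(res) = self.relay.poll_recv(cx, &mut read_buf) {
            res?;
            return Poll::Ready(Ok(read_buf.filled().len()));
        }

        // ...and an error here can only be about `self.peer`. No extra
        // bookkeeping is needed to know which IO object failed.
        if let Poll::Ready(res) = self.peer.poll_recv(cx, &mut read_buf) {
            res?;
            return Poll::Ready(Ok(read_buf.filled().len()));
        }

        Poll::Pending
    }
}
```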

> As a bottom line, I think sticking to async where you can is good, followed by building sans-IO stuff. Also, spawning tasks is always to be done with caution, I think. For one, you often break back-pressure if you e.g. spawn a task for each incoming thing. Two, spawning tasks, similar to channels, breaks the stack trace and thus error reporting. So structured concurrency is better there.

> My 2c, great article on the whole! Happy to do some detailed editing on it if you'd like before you publish it :)

Thanks so much, I'll let you know when it's ready for a detailed review 😁

@thomaseizinger (Contributor) commented

> > Two, the two-way communication in the case of errors is very annoying with channels, as you demonstrated very well.
>
> This is a problem with sans-IO?

For high-performance sans-IO, yes, because at least the way I am doing it now is to have IO in separate threads so that IO can make progress while the state machine is working (i.e. decrypting / encrypting).

@thomaseizinger (Contributor) commented

> I didn't think it was such a big problem that you'd rather use async.

The problem is that it is so hard to get it right and the bugs only happen sporadically. Eliminating this class of bugs where possible instills a lot more confidence.

@conectado conectado marked this pull request as ready for review August 5, 2025 16:26
@thomaseizinger (Contributor) left a comment


Great stuff. I left a few notes!

@conectado conectado force-pushed the concurrency-patterns-article branch from 32e0e9b to 7a436f7 on August 13, 2025 16:07
@conectado conectado merged commit c72f5d8 into main Aug 13, 2025
2 checks passed
@conectado conectado deleted the concurrency-patterns-article branch August 13, 2025 16:51