Async PostgreSQL API by itowlson · Pull Request #3404 · spinframework/spin

itowlson · 2026-02-19T00:32:19Z

The eventual intent is to asyncify all Spin APIs that involve I/O (which is, I think, all of them). I'm not expecting to learn all my lessons from this one step, but I'm putting it up for review so I can learn at least some lessons. Then I can make only new mistakes on the next tranche APIs.

(Tested manually: I think automated tests have to be done via conformance-tests, so may have to be deferred.)

alexcrichton · 2026-02-23T23:49:05Z

crates/wasi-async/src/stream.rs

+
+        if finish {
+            return Poll::Ready(Ok(StreamResult::Cancelled));
+        }


This is a bit subtle, but this'll want to go right before Poll::Pending is returned below. The general idea is that this means that the guest has cancelled a read so the host does whatever it can to try to complete the read, but if it can't then in the end it returns that it's cancelled instead of pending. Here the implementatnio is pretty simple wehre there's no buffering so cancellation doesn't need any further handling though, so just need to move this into the Poll::Pending case below.

alexcrichton · 2026-02-23T23:50:34Z

crates/wasi-async/src/future.rs

There's a built-in impl of FutureProducer for F: Future, so that might be best to use instead (requires wrapping a future's result in a wasmtime::Result<T>)

alexcrichton · 2026-02-23T23:59:00Z

crates/factor-outbound-pg/src/client.rs

+                    Ok(r) => r,
+                    Err(e) => {
+                        let err = query_failed(e);
+                        rows_tx.send(Err(err)).await.unwrap();


The .unwrap() here (and below in other sends) will want to be handled gracefully to break out of this loop. An error here means that the guest closed the stream without reading all of the results, which is expected to be a normal occurrence, so all the unwraps here I think can just be break

alexcrichton · 2026-02-23T23:59:40Z

wit/deps/spin-postgres@4.2.0/postgres.wit

+    /// Open a connection to the Postgres instance at `address`.
+    open: static func(address: string) -> result<connection, error>;
+
+    /// Open a connection to the Postgres instance at `address`.
+    @since(version = 4.2.0)
+    open-async: static async func(address: string) -> result<connection, error>;


The old open function is compat with 4.0.0? And/or exploration of sync/async bindings signatures?

I'm not sure what you're asking here. The open function is the one from 4.0.0: it hasn't changed. What I've (tentatively) done is add a separate open-async, which has the same behaviour but is async. Are you politely hinting that I have chosen... poorly? If so, could you elaborate? Thanks!

Oh sorry, that makes sense. Basically I was curious why there's two functions here vs just having the async func one, but backwards-compat is a solid reason!

An alternative to consider here might be a separate resource connection-async which then itself doesn't need backward-compatible methods. 🤷

dicej

LGTM; thanks for working on this!

Test-wise, you could follow this example of defining a test component that uses the new postgres API, then add a test here which uses it, then make sure you have Postgres running locally, and finally run it locally using e.g. cargo build --release && RUST_LOG=info cargo run --manifest-path tests/runtime-tests/Cargo.toml --no-default-features -- $(pwd)/target/release/spin. I don't know offhand what else is needed to ensure that test runs in CI, but that would at least be a start.

dicej · 2026-02-23T23:40:46Z

crates/factor-outbound-pg/src/client.rs

+                let Some(row) = stm.next().await else {
+                    break;
+                };
+                // TODO: figure out how to deal with errors here - I think there is like a FutureReader<Error> pattern?


What you're doing here looks reasonable to me, FWIW. Not sure we still need a TODO comment here.

The future<result<_, error>> thing you're thinking of is a thing, and I'll discuss it in another comment, but I don't think it applies to this level of abstraction.

dicej · 2026-02-23T23:47:33Z

crates/wasi-async/src/future.rs

+    rx: tokio::sync::oneshot::Receiver<T>,
+}
+
+impl<D, T: 'static + Send> wasmtime::component::FutureProducer<D> for FutureProducer<T> {


I'm curious if this could be reused instead.

Oh that works very nicely! I can construct the FutureReader off the rx directly rather than having a custom producer in the way. Thanks to you and Alex: I should have had more faith!

dicej · 2026-02-23T23:56:00Z

wit/deps/spin-postgres@4.2.0/postgres.wit

+
+    /// Query the database.
+    @since(version = 4.2.0)
+    query-async: async func(statement: string, params: list<parameter-value>) -> result<tuple<future<list<column>>, stream<result<row, error>>>, error>;


The current best practice for fallible streams is to return e.g. tuple<stream<row>, future<result<_, error>> such that the sending side will close the stream early on error and write the error to the future. That emulates a planned stream<T, E> type in a future edition of the component model.

The idea is that you might want to forward your stream<row> elsewhere but possibly change the error type by mapping the future to some other type. In this case it probably doesn't matter much, and the return type here is already pretty hairy, so I don't think it necessarily needs to change, but I wanted to mention it anyway.

My fear with that was that the stream would end and the user would go "well, that's all folks" rather than going "ooh now I better check this other thing to make sure that was a 'normal' end rather than an error." Making it so that stream next() produced an honest-to-goodness error seemed safer. I guess this could be worked around in bindings though.

I think for now I'm inclined to leave it as is, but definitely happy to take further guidance on this.

There's somewhat related discussion that happened here, but for WASI we ended up settling on:

Instead of result<(..., future<result<(), E>>), E> to instead only return (..., future<result<(), E>>). Basically sacrifice the immediate-ness of the outer result and only return a stream/future pair.

While you're right it's a bit more difficult to use, to use (stream<T>, future<result<(), E>>) instead of stream<result<...>>.

Here I'd recommend dropping the async part of async func. That way this function would return a stream/future immediately and those would be resolved in the background.

Personally I'd say to stick to the WASI principles here since whatever ends up done for WASI will be equally applicable here. The rough theory as well is that raw usage of these bindings will be somewhat rare and instead will be done through some form of library or similar, which may help mitigate the has-a-footgun property

Following the WASI conventions sounds good. I will rework. Thank you both for the guidance and discussion!

@alexcrichton Okay I have been looking at this and I'm afraid I'm struggling. Here is what I am currently trying:

query-async: async func( statement: string, params: list<parameter-value>) -> tuple< future<list<column>>, stream<row>, future<result<_, error>> >;

Now in my test harness the first thing I do is print out the number of columns:

let (cols, mut rows, err) = conn.query_async("select j from jtest".into(), vec![]).await; let cols = cols.await; println!("THERE'S {} COLS", cols.len()); // then go on to process `rows`

But... if an error occurs, cols will never resolve. And I can't err.await first because that won't resolve until after all rows have been streamed. I guess I could select on the cols and err futures but egad that feels complicated and heavyweight for what was previously a .await? on the query.

Is that your intention, or am I misunderstanding how the WASI folks see this working? If I've misunderstood, could you provide a little bit more guidance please on the intended signature (and usage if not obvious)? Thanks (and sorry)!

(I realise that you reckon the function should be synchronous but it wasn't obvious to me how to do that with the current Spin plumbing so I have punted on that for now. But I will ~~badger someone about it later~~ come back to it later - I am sure this is a Spin factors + Ivan ignorance thing.)

Here is where I landed in the guest:

let (cols, mut rows, err) = conn.query_async("select j from jtest".into(), vec![]).await; let mut cols = std::pin::pin!(cols.into_future()); let mut err = std::pin::pin!(err.into_future()); let selorama = futures::future::select(&mut cols, &mut err).await; let cols = match selorama { futures::future::Either::Left((cols, _err)) => cols, futures::future::Either::Right((err, cols)) => match err { Ok(_) => cols.await, // or panic/bail? this seems shouldn't-happen Err(e) => anyhow::bail!(e), } }; println!("THERE'S {} COLS", cols.len());

I dunno if this is anywhere close to what is envisaged. But I guess it could be encapsulated in a sufficiently beefy wrapper library if so. Row iteration is definitely nicer - there is a check of err after the stream ends but otherwise simpler.

Ah ok no yeah this is subtly, but significantly so, different from WASI where there's an additional initial state of columns before the subsequent transmission of rows. WASI things so far have all been "a long list of T followed by an error" where this is "a single T, then a bunch of U, then maybe an error".

For WASI idiom-wise the intent is that things stop resolving (e.g. the stream of T) which is a signal to go take a look at the error. I agree this won't work if the columns never resolve since the way this should, in theory, work is that the guest waits for columns, then a bunch of rows, and if at any point anything stops the error is consluted. Basically select shouldn't be required here, or else IMO it's not the right API to expose to the guest.

Would it be possible to have the columns get resolved with an empty list if an error happens? Or some other sort of sentinel value? That way if columns had an error it would mean that it'd resolve with nothing, rows wouldn't resolve with anything, and the guest could check the error and see what happened.

Another possible signature, if quite gnarly, could be:

query-async: async func(...) -> result< tuple<list<column>, stream<row>, future<result<_, error>>>, error, >;

where here the double-async-ness or double-error-ness goes back closer to what you had originally (sorry if this is whiplash) but is intentional to indicate that fetching the columns is the first layer of result and the second layer or results is the row-and-its-error.

I guess I could make columns resolve to an option, with None being the "go look at the error" indicator, but... The double-async one does have the restriction that the host has to resolve columns before rows, but in this case that isn't a problem. None of these feel very lovely eh! I'll have a tinker, but so far double-async seems the most obvious.

Signed-off-by: itowlson <ivan.towlson@fermyon.com>

itowlson requested review from alexcrichton and dicej February 19, 2026 00:32

itowlson marked this pull request as draft February 19, 2026 00:32

itowlson force-pushed the pg-async-streaming branch from 0902d17 to 961bea8 Compare February 23, 2026 23:13

alexcrichton reviewed Feb 23, 2026

View reviewed changes

dicej reviewed Feb 24, 2026

View reviewed changes

itowlson requested review from alexcrichton and dicej February 24, 2026 02:29

itowlson added 4 commits February 26, 2026 10:54

Async PostgreSQL API

51535d2

Signed-off-by: itowlson <ivan.towlson@fermyon.com>

Feedback from review

07b0a42

Signed-off-by: itowlson <ivan.towlson@fermyon.com>

Unlovely

849606b

Signed-off-by: itowlson <ivan.towlson@fermyon.com>

This is less unlovely

286455c

Signed-off-by: itowlson <ivan.towlson@fermyon.com>

itowlson force-pushed the pg-async-streaming branch from a9e807d to 286455c Compare February 25, 2026 22:02

Conversation

itowlson commented Feb 19, 2026

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

dicej left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants