
Conversation

@rgarcia (Contributor) commented Aug 18, 2025

This PR makes some big modifications to the headless and headful images to support saving Chromium user data across runs.

  1. Switches to supervisord
  • Both images now use supervisord to manage processes. This gives us the ability to issue stops and restarts to Chromium when reconfiguring it.
  2. Additions to the API server:
  • WebSocket proxy: the API server watches the Chromium log file and automatically configures a WebSocket proxy on port 9222 that forwards to the (autogenerated, random on every start) DevTools endpoint. This lets us simply assume that port 9222 speaks the DevTools protocol instead of having to figure out which path to connect to.
  • /process API: allows executing a command synchronously or asynchronously.
  • /logs/stream: allows tailing a log file.
  • /fs additions: download_dir_zip (download a directory as a zip), upload (upload many individual files), upload_zip (upload a zip and extract it into a directory; see the extraction sketch below).
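For context on what "extract it into a directory" has to guard against, here is a minimal sketch of a zip-slip-safe extractor in Go. This is not the actual server/lib/ziputil code; the package, function name, and signature are assumptions for illustration — only the safety pattern itself is standard.

```go
package ziputil

import (
	"archive/zip"
	"fmt"
	"io"
	"os"
	"path/filepath"
	"strings"
)

// ExtractTo unpacks src into destDir, rejecting entries that would
// escape destDir via ".." components (the "zip slip" vulnerability).
func ExtractTo(src, destDir string) error {
	r, err := zip.OpenReader(src)
	if err != nil {
		return err
	}
	defer r.Close()

	for _, f := range r.File {
		target := filepath.Join(destDir, f.Name)
		// filepath.Join cleans the path, so a malicious "../x" entry
		// resolves outside destDir; reject anything that does.
		if !strings.HasPrefix(target, filepath.Clean(destDir)+string(os.PathSeparator)) {
			return fmt.Errorf("illegal path in archive: %s", f.Name)
		}
		if f.FileInfo().IsDir() {
			if err := os.MkdirAll(target, f.Mode()); err != nil {
				return err
			}
			continue
		}
		if err := os.MkdirAll(filepath.Dir(target), 0o755); err != nil {
			return err
		}
		rc, err := f.Open()
		if err != nil {
			return err
		}
		out, err := os.OpenFile(target, os.O_WRONLY|os.O_CREATE|os.O_TRUNC, f.Mode())
		if err != nil {
			rc.Close()
			return err
		}
		if _, err := io.Copy(out, rc); err != nil {
			out.Close()
			rc.Close()
			return err
		}
		out.Close()
		rc.Close()
	}
	return nil
}
```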

Testing:

  • Full e2e test of the Docker image: server/e2e/e2e_chromium_test.go contains a test that spins up the image, browses around to various sites that modify cookies and local storage, and then tries two things: a simple restart of the Chromium process, and then a full shutdown of the Docker container, a restart, and restoration of the user data folder (a simplified sketch follows below).

This test is also configured to run in CI, which necessitated building the Docker images in CI. Hopefully this unlocks some more testing goodness in the future.
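A heavily simplified sketch of that persistence check, using playwright-go (the summary below notes the test suite uses Playwright). The restartContainer helper is a hypothetical stand-in for the real stop/restore/start logic; only connecting over CDP to the fixed port 9222 follows directly from this PR.

```go
package e2e

import (
	"testing"

	"github.com/playwright-community/playwright-go"
)

func TestChromiumUserDataPersists(t *testing.T) {
	pw, err := playwright.Run()
	if err != nil {
		t.Fatal(err)
	}
	defer pw.Stop()

	// The API server's proxy means we can always dial 9222 directly,
	// without discovering Chromium's randomized DevTools path first.
	browser, err := pw.Chromium.ConnectOverCDP("ws://localhost:9222")
	if err != nil {
		t.Fatal(err)
	}
	page, err := browser.Contexts()[0].NewPage()
	if err != nil {
		t.Fatal(err)
	}
	if _, err := page.Goto("https://example.com"); err != nil {
		t.Fatal(err)
	}
	if _, err := page.Evaluate(`localStorage.setItem("marker", "survived")`); err != nil {
		t.Fatal(err)
	}

	restartContainer(t)

	browser, err = pw.Chromium.ConnectOverCDP("ws://localhost:9222")
	if err != nil {
		t.Fatal(err)
	}
	page, err = browser.Contexts()[0].NewPage()
	if err != nil {
		t.Fatal(err)
	}
	if _, err := page.Goto("https://example.com"); err != nil {
		t.Fatal(err)
	}
	got, err := page.Evaluate(`localStorage.getItem("marker")`)
	if err != nil {
		t.Fatal(err)
	}
	if got != "survived" {
		t.Fatalf("user data did not persist: got %v", got)
	}
}

// restartContainer is hypothetical: the real test stops the container,
// restores the user data directory, and starts it again.
func restartContainer(t *testing.T) {
	t.Helper()
	// docker stop / restore user-data dir / docker start ...
}
```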


TL;DR

Enabled persistent user data for Chromium in Docker images and significantly enhanced the API server with new capabilities for process management, file system operations, log streaming, and a dynamic DevTools proxy.

Why we made these changes

To allow Chromium browser instances in containerized environments to save and reuse user data (like cookies and local storage) across restarts and container shutdowns. This required robust process orchestration, new API functionalities for interacting with the container's filesystem and processes, and a simplified way to connect to Chromium's debugging interface.

What changed?

  • Container Orchestration (headful/headless images): Switched to supervisord for managing Chromium, Xorg, D-Bus, and the API server, replacing custom shell scripts for process control. Introduced new, standardized start-chromium.sh and start-xvfb.sh scripts. Streamlined Dockerfile builds and removed legacy VNC components.
  • API Server (kernel-images-api):
    • Process Management: Added /process endpoints for synchronous/asynchronous command execution, I/O streaming, and process termination.
    • File System Operations: Expanded /fs API with UploadFiles (multiple file uploads), UploadZip (secure zip extraction), and DownloadDirZip (directory zipping).
    • Log Streaming: Implemented /logs/stream (Server-Sent Events) for real-time log file tailing (a handler sketch follows this list).
    • Dynamic DevTools Proxy: Created a WebSocket proxy on 0.0.0.0:9222 that dynamically discovers and forwards connections to Chromium's DevTools URL by monitoring its logs (a discovery sketch also follows this list).
  • Persistent User Data: Configured Chromium images to include and use default user data, enabling state persistence across sessions and container restarts.
  • Testing & Development Support: Developed a comprehensive end-to-end (E2E) test suite (server/e2e/e2e_chromium_test.go) using Playwright to validate Chromium user data persistence. Integrated Docker image builds into CI workflows and added new utility packages (server/lib/ziputil, server/lib/devtoolsproxy). Updated OpenAPI specification.
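To make the log-streaming mechanics concrete, here is a minimal sketch of an SSE handler that tails a file. The handler name and the path query parameter are assumptions; the real /logs/stream implementation may differ in its API shape.

```go
package api

import (
	"bufio"
	"fmt"
	"io"
	"net/http"
	"os"
	"strings"
	"time"
)

// streamLog tails a log file as Server-Sent Events until the client
// disconnects, roughly like `tail -f`.
func streamLog(w http.ResponseWriter, r *http.Request) {
	flusher, ok := w.(http.Flusher)
	if !ok {
		http.Error(w, "streaming unsupported", http.StatusInternalServerError)
		return
	}
	f, err := os.Open(r.URL.Query().Get("path")) // parameter name is an assumption
	if err != nil {
		http.Error(w, err.Error(), http.StatusNotFound)
		return
	}
	defer f.Close()

	w.Header().Set("Content-Type", "text/event-stream")
	w.Header().Set("Cache-Control", "no-cache")

	reader := bufio.NewReader(f)
	for {
		line, err := reader.ReadString('\n')
		if len(line) > 0 {
			// A partial line at EOF may arrive split across two events;
			// glossed over here for brevity.
			fmt.Fprintf(w, "data: %s\n\n", strings.TrimRight(line, "\n"))
			flusher.Flush()
		}
		if err == io.EOF {
			// Wait for the file to grow, or for the client to leave.
			select {
			case <-r.Context().Done():
				return
			case <-time.After(200 * time.Millisecond):
			}
		} else if err != nil {
			return
		}
	}
}
```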
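And a minimal sketch of the discovery half of the DevTools proxy: Chromium prints a "DevTools listening on ws://..." line on startup, and watching the log for it yields the target URL. The function name and polling approach are assumptions; the real server/lib/devtoolsproxy presumably also re-runs discovery after each Chromium restart, since the port and path change every time.

```go
package devtoolsproxy

import (
	"bufio"
	"fmt"
	"os"
	"regexp"
	"time"
)

// Chromium logs its randomized endpoint on startup, e.g.:
//   DevTools listening on ws://127.0.0.1:37251/devtools/browser/<uuid>
var devtoolsRe = regexp.MustCompile(`DevTools listening on (ws://\S+)`)

// discoverDevToolsURL polls the Chromium log until the line appears.
// The log may not exist yet right after a (re)start, so open errors
// are retried rather than returned.
func discoverDevToolsURL(logPath string, timeout time.Duration) (string, error) {
	deadline := time.Now().Add(timeout)
	for time.Now().Before(deadline) {
		if f, err := os.Open(logPath); err == nil {
			scanner := bufio.NewScanner(f)
			for scanner.Scan() {
				if m := devtoolsRe.FindStringSubmatch(scanner.Text()); m != nil {
					f.Close()
					return m[1], nil
				}
			}
			f.Close()
		}
		time.Sleep(200 * time.Millisecond)
	}
	return "", fmt.Errorf("no DevTools endpoint found in %s within %s", logPath, timeout)
}
```

Each incoming connection on :9222 can then be forwarded to the discovered URL.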

Description generated by Mesa.

@rgarcia rgarcia requested a review from Sayan- August 18, 2025 13:59

@mesa-dot-dev bot left a comment

Performed full review of fe02e69...8756a62

86 files reviewed | 7 comments

@Sayan- Sayan- mentioned this pull request Aug 18, 2025
cursor[bot] left a comment (marked as outdated).

@Sayan- (Contributor) left a comment

this is a meaty change and I'll be honest that I didn't review every single line. overall:

  • nuking the user data we've saved sounds good. I'm not sure if there's a set of CDP calls we should be making at startup to get things into a specific state
  • using pid instead of name for the process apis sgtm
  • more automated testing is 💯
  • chromium hot reload lgtm. I am curious if/how this could expose us to race conditions we're not thinking of atm
  • end to end integration makes sense for where we want to go overall

one question: does the ws library we use implement heartbeats or is that something we need to be responsible for?

generally speaking, we're doing quite a bit of I/O and processing in our image now. I think it's necessary at the moment, but it would be helpful to measure whether it causes issues down the line

@matthewjmarangoni (Contributor) commented

diff --git a/images/chromium-headful/image-chromium/entrypoint.sh b/images/chromium-headful/image-chromium/entrypoint.sh
index c0a5e67..f6e53e3 100755
--- a/images/chromium-headful/image-chromium/entrypoint.sh
+++ b/images/chromium-headful/image-chromium/entrypoint.sh
@@ -2,7 +2,6 @@
 set -e
 
 ./start_all.sh
-./novnc_startup.sh
 
 python http_server.py > /tmp/server_logs.txt 2>&1 &
 
diff --git a/images/chromium-headful/image-chromium/novnc_startup.sh b/images/chromium-headful/image-chromium/novnc_startup.sh
deleted file mode 100755
index 053e559..0000000
--- a/images/chromium-headful/image-chromium/novnc_startup.sh
+++ /dev/null
@@ -1,21 +0,0 @@
-#!/bin/bash
-echo "starting noVNC"
-
-# Start noVNC with explicit websocket settings
-/opt/noVNC/utils/novnc_proxy \
-    --vnc 0.0.0.0:5900 \
-    --listen 6080 \
-    --web /opt/noVNC \
-    > /tmp/novnc.log 2>&1 &
-
-# Wait for noVNC to start
-timeout=10
-while [ $timeout -gt 0 ]; do
-    if netstat -tuln | grep -q ":6080 "; then
-        break
-    fi
-    sleep 1
-    ((timeout--))
-done
-
-echo "noVNC started successfully"
diff --git a/images/chromium-headful/run-docker.sh b/images/chromium-headful/run-docker.sh
index 5adbd8a..bb65d0f 100755
--- a/images/chromium-headful/run-docker.sh
+++ b/images/chromium-headful/run-docker.sh
@@ -47,7 +47,7 @@ if [[ "${WITH_KERNEL_IMAGES_API:-}" == "true" ]]; then
   RUN_ARGS+=( -e WITH_KERNEL_IMAGES_API=true )
 fi
 
-# noVNC vs WebRTC port mapping
+# WebRTC port mapping
 if [[ "${ENABLE_WEBRTC:-}" == "true" ]]; then
   echo "Running container with WebRTC"
   RUN_ARGS+=( -p 8080:8080 )
@@ -59,9 +59,6 @@ if [[ "${ENABLE_WEBRTC:-}" == "true" ]]; then
     RUN_ARGS+=( -e NEKO_WEBRTC_NAT1TO1=127.0.0.1 )
     RUN_ARGS+=( -p 56000-56100:56000-56100/udp )
   fi
-else
-  echo "Running container with noVNC"
-  RUN_ARGS+=( -p 8080:6080 )
 fi
 
 docker rm -f "$NAME" 2>/dev/null || true

Without a contrary use case, paring out noVNC moves Neko/WebRTC into the default role. If that's the preference, then ENABLE_WEBRTC=true should be the default state, or the switch should be removed if unused. An instance can be found in chromium-headful/wrapper.sh.

@rgarcia (Contributor, Author) commented Aug 20, 2025

@matthewjmarangoni I think there might be cases where we want to run without the overhead of WebRTC, so I lean towards leaving ENABLE_WEBRTC in

@rgarcia rgarcia merged commit 7429755 into main Aug 20, 2025
4 checks passed
@rgarcia rgarcia deleted the raf/kernel-229-feature-request-api-to-save-browser-contexts branch August 20, 2025 23:54
@matthewjmarangoni (Contributor) commented

Understood @rgarcia. Regarding the patch: I didn't provide context, but it appeared noVNC was being removed and the patch cleaned up some trailing references. I've opened a PR applying that patch, but if it's irrelevant please just close it! See #62.
