Conversation
```rust
    ptr
}

unsafe fn realloc(&self, ptr: *mut u8, old_layout: Layout, new_size: usize) -> *mut u8 {
```
We should be reporting the deallocation as well.
Suggested change:

```diff
  unsafe fn realloc(&self, ptr: *mut u8, old_layout: Layout, new_size: usize) -> *mut u8 {
+     self.tracy_dealloc(ptr);
```
I'm following the example of tracy-client here.
It made sense to me since the ptr uniquely identifies the allocation until it gets freed. So if the client sees multiple alloc events on the same ptr, it must mean a realloc. (Whether the Tracy client actually does this, I don't know.)
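To illustrate the reasoning with a minimal, self-contained sketch (a toy event recorder, not the real Tracy client, which receives these events over IPC): because a pointer uniquely identifies a live allocation, a realloc can be modeled as a free of the old pointer followed by an alloc of the new one.

```rust
use std::collections::HashSet;

/// Toy stand-in for the Tracy client: tracks which pointers are live.
/// (Hypothetical; only meant to show why realloc should report the free.)
#[derive(Default)]
struct ToyClient {
    live: HashSet<usize>,
}

impl ToyClient {
    fn on_alloc(&mut self, ptr: usize) {
        // A second alloc on a still-live ptr would be ambiguous, which is
        // why realloc should report the old pointer's free first.
        assert!(self.live.insert(ptr), "alloc on already-live ptr");
    }
    fn on_free(&mut self, ptr: usize) {
        assert!(self.live.remove(&ptr), "free of unknown ptr");
    }
    /// Model realloc as free(old) followed by alloc(new).
    fn on_realloc(&mut self, old: usize, new: usize) {
        self.on_free(old);
        self.on_alloc(new);
    }
}

fn main() {
    let mut c = ToyClient::default();
    c.on_alloc(0x1000);
    c.on_realloc(0x1000, 0x2000); // old ptr retired, new ptr live
    assert!(c.live.contains(&0x2000) && !c.live.contains(&0x1000));
    println!("ok");
}
```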
nvm, I'm blind. tracy-client does emit a free right here:
```rust
    .current
    .fetch_add(size, Ordering::SeqCst)
    .wrapping_add(size);
self.max.fetch_max(current, Ordering::SeqCst);
```
Nit: between the fetch_add and the fetch_max, another thread could update max, causing the local current calculation to be stale. We can avoid this by using fetch_update:
Suggested change:

```diff
- self.max.fetch_max(current, Ordering::SeqCst);
+ self.max.fetch_update(Ordering::SeqCst, Ordering::SeqCst, |max_val| {
+     let new_current = self.current.load(Ordering::SeqCst);
+     (new_current > max_val).then_some(new_current)
+ }).ok();
```
Thought about this, it should be fine! The current value computed here is always accurate at that sequence point. Some thread might compute an earlier or a later value of current, but the max operation is a join, so it doesn't matter whether it gets sequenced as

```rust
self.max.fetch_max(current_old, Ordering::SeqCst);
self.max.fetch_max(current_new, Ordering::SeqCst);
```

or as

```rust
self.max.fetch_max(current_new, Ordering::SeqCst);
self.max.fetch_max(current_old, Ordering::SeqCst);
```

The end result is going to be that max contains max(old_max, current_old, current_new). Similarly if there are more parallel threads.
The one exception to this may be if a concurrent thread resets the maximum.
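A quick self-contained check of the join argument (illustrative only; the thread count and values are made up): racing `fetch_max` calls in any interleaving still leave the maximum of all published values.

```rust
use std::sync::atomic::{AtomicUsize, Ordering};
use std::sync::Arc;
use std::thread;

fn main() {
    let max = Arc::new(AtomicUsize::new(0));
    let mut handles = Vec::new();
    // Each thread publishes its own "current" values in arbitrary order;
    // fetch_max is commutative and idempotent (a join), so the final
    // result is the max over all values regardless of scheduling.
    for t in 0..8usize {
        let max = Arc::clone(&max);
        handles.push(thread::spawn(move || {
            for v in [t * 10 + 3, t * 10 + 1, t * 10 + 7] {
                max.fetch_max(v, Ordering::SeqCst);
            }
        }));
    }
    for h in handles {
        h.join().unwrap();
    }
    // Largest value published by any thread is 7 * 10 + 7 = 77.
    assert_eq!(max.load(Ordering::SeqCst), 77);
    println!("max = {}", max.load(Ordering::SeqCst));
}
```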
```rust
/// enable tracy allocation tracking with provided stack depth
#[cfg(feature = "tracy")]
#[argh(option)]
pub tracy_allocations: Option<usize>,
```
UX improvement
Suggested change:

```diff
- pub tracy_allocations: Option<usize>,
+ #[argh(option, default = "0")]
+ pub tracy_allocations: usize,
```
and in main.rs:

```rust
if args.tracy {
    if args.tracy_allocations > 0 {
        info!("Tracy enabled with allocation tracking (depth {}).", args.tracy_allocations);
        ALLOCATOR.enable_tracy(args.tracy_allocations);
    } else {
        info!("Tracy enabled (no allocation tracking).");
    }
}
```
There's actually a difference between 'off' and 'zero depth'. With zero depth we still collect allocation events, but we do not include stack traces (in fact, there is special code in the Allocator to handle this). With 'off' we do not collect allocation events in Tracy at all.
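A hedged sketch of how the `Option<usize>` keeps the three cases distinct (the function and strings here are illustrative stand-ins, not the project's actual code):

```rust
/// Illustrative: None = tracking off, Some(0) = allocation events
/// without stack traces, Some(n) = allocation events with n-deep traces.
fn describe(tracy_allocations: Option<usize>) -> &'static str {
    match tracy_allocations {
        None => "no allocation events collected",
        Some(0) => "allocation events, no stack traces",
        Some(_) => "allocation events with stack traces",
    }
}

fn main() {
    assert_eq!(describe(None), "no allocation events collected");
    assert_eq!(describe(Some(0)), "allocation events, no stack traces");
    assert_eq!(describe(Some(100)), "allocation events with stack traces");
    println!("ok");
}
```

Collapsing the flag to a plain `usize` with `default = "0"` would merge the first two cases, which is exactly the distinction being defended here.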
```rust
unsafe {
    tracy_client_sys::___tracy_shutdown_profiler();
    //
    // @recmo: This is not safe. tracing_tracy may still
```
I think to handle this we can do one of two things:

1. Remove it
   - Pro: No crashes.
   - Con: Lose last ~100ms of trace data.
2. Grace period (~100 ms)
   - Pro: Usually works.
   - Con: Still has a race condition, just a smaller window.
> Con: Lose last ~100ms of trace data.
It's worse, unfortunately. There could be an unbounded backlog of data still to be transmitted. This actually happened all the time before this PR, when we did 100-deep stack traces on each allocation (times 90M allocations is gigabytes of data to transfer over what seems to be a very poorly implemented IPC bus).
I checked, but there doesn't seem to be a 'flush' function that keeps the Tracy client alive but clears the backlog.
I guess the best solution here is to manually implement TRACY_NO_EXIT with something like "Keeping alive for Tracy connections. Press any key to exit."
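A rough sketch of that manual keep-alive (the function name and wording are hypothetical; the real TRACY_NO_EXIT behavior lives inside the C++ client):

```rust
use std::io::{BufRead, Write};

/// Hypothetical keep-alive: block until the user presses Enter, giving
/// the Tracy client time to drain its transmit backlog before exit.
fn wait_for_exit<R: BufRead, W: Write>(mut input: R, mut out: W) {
    writeln!(out, "Keeping alive for Tracy connections. Press Enter to exit.").unwrap();
    let mut line = String::new();
    // read_line also returns on EOF (Ok(0)), so a closed stdin won't hang.
    let _ = input.read_line(&mut line);
}

fn main() {
    wait_for_exit(std::io::stdin().lock(), std::io::stderr().lock());
    // Only after this would ___tracy_shutdown_profiler() be called.
}
```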
LGTM




Before, the default features disabled all logging. To enable even basic logging you had to enable `profiling`, which also enabled a very expensive Tracy stack trace for each allocation, blowing up runtime by 4x. This changes `tracing_tracy` to avoid a version mismatch.