feat: add #simulate command by rnbguy · Pull Request #6 · verse-lab/veil

rnbguy · 2026-03-16T03:05:01Z

Adds random-walk state exploration to Veil: #simulate runs random traces and checks the invariants at each step.

It finds shallow invariant violations faster than #model_check (exhaustive BFS), but it is not complete: a run that finds no violation does not show that the invariants hold.

Usage

-- basic
#simulate {}

-- with type/theory instantiation
#simulate { node := Fin 4 } { nextNode := fun n => n + 1 }

-- with config
#simulate {} (seed := 42, maxTraces := 1000, maxSteps := 50)
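
The config fields above (seed, maxTraces, maxSteps) suggest a bounded random walk. Below is a minimal, language-neutral sketch of that idea in Python. It is not Veil's implementation: the function names are hypothetical, and the jug puzzle is my own encoding of the well-known Die Hard water-jug problem, not the DieHard example file from the benchmark.

```python
import random

# Hypothetical encoding of the Die Hard jug puzzle.
# A state is (small, big): a 3-gallon jug and a 5-gallon jug.
INIT = (0, 0)

def successors(state):
    """All states reachable in one action (some actions are no-ops)."""
    small, big = state
    pour_sb = min(small, 5 - big)   # amount moved when pouring small -> big
    pour_bs = min(big, 3 - small)   # amount moved when pouring big -> small
    return [
        (3, big), (small, 5),                # fill either jug
        (0, big), (small, 0),                # empty either jug
        (small - pour_sb, big + pour_sb),    # pour small into big
        (small + pour_bs, big - pour_bs),    # pour big into small
    ]

def invariant(state):
    # Deliberately violable property: the big jug never holds exactly 4.
    return state[1] != 4

def simulate(seed=42, max_traces=1000, max_steps=50):
    """Run up to max_traces random walks of at most max_steps steps each.

    Returns the first trace that ends in an invariant violation,
    or None if no violation was seen within the budget.
    """
    rng = random.Random(seed)  # fixed seed makes a run reproducible
    for _ in range(max_traces):
        state = INIT
        trace = [state]
        if not invariant(state):
            return trace
        for _ in range(max_steps):
            state = rng.choice(successors(state))  # one random action
            trace.append(state)
            if not invariant(state):               # check after every step
                return trace
    return None

if __name__ == "__main__":
    print(simulate())
```

Calling `simulate(seed=42, max_traces=1000, max_steps=50)` mirrors the config line above. A `None` result only means that no violation was seen within the budget.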

Benchmark

Search time only (Lean loading overhead subtracted) on my machine:

Example	`#model_check interpreted`	`#simulate`
DieHard	1.4s	111ms
RiverCrossing	2.4s	5ms
BuggyCircularBuffer	0.9s	1ms
Traffic	1.2s	2ms
MutexViolation	22s	378ms

All five have known violations.

Disclaimer

Contains LLM generated code.

dranov · 2026-03-16T03:54:56Z

Thank you, @rnbguy! This looks good.

I'll have time to look at this more closely and merge it later in the week, after the OOPSLA deadline. (For future maintainability, I want to make sure #simulate and #model_check share as much code as possible.)

However, I'm wondering whether you're running into a bug with #model_check. We have two modes of operation for the model checker: (1) compiled and (2) interpreted.

By default, the way #model_check is supposed to work is it runs the model checker in interpreted mode while it does the compilation in the background (which can take quite long, as you're seeing). If the interpreted mode finds a violation, that gets displayed — there's no waiting for compilation to finish.
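
The default behaviour described above (report the interpreted result without waiting for compilation) can be sketched as two concurrent tasks where the first to finish wins. This is only an illustration in Python with hypothetical function names and made-up delays. It does not show how Veil schedules or cancels the two modes.

```python
import concurrent.futures as cf
import time

def interpreted_search():
    # Stand-in for the interpreted checker: starts at once, finds a violation quickly.
    time.sleep(0.05)
    return "violation found (interpreted)"

def compile_then_search():
    # Stand-in for the compiled checker: pays a long compilation cost first.
    time.sleep(0.5)
    return "violation found (compiled)"

def model_check():
    """Start both modes and return (mode, result) for whichever finishes first."""
    pool = cf.ThreadPoolExecutor(max_workers=2)
    futures = {
        pool.submit(interpreted_search): "interpreted",
        pool.submit(compile_then_search): "compiled",
    }
    done, _pending = cf.wait(futures, return_when=cf.FIRST_COMPLETED)
    winner = next(iter(done))
    # Do not block on the slower task; in this sketch it simply runs to
    # completion in its own thread and its result is ignored.
    pool.shutdown(wait=False)
    return futures[winner], winner.result()

if __name__ == "__main__":
    print(model_check())
```

With these delays the interpreted task finishes first, so its result is reported while the simulated compilation is still running.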

For me, #model_check finds a violation within 1 second for every benchmark in your table. The timing you're seeing makes me think that somehow only the compiled mode runs for you.

What do you see when you run #model_check? Is it something like this? (This shows the interpreted model checker running — states are being explored — whilst compilation happens in the background.)

rnbguy · 2026-03-16T04:50:51Z

hey @dranov ! Good luck with the OOPSLA deadline 🍀 I am just playing around with Veil 😄 so there is no rush.

You're correct. I was using the CLI (lake lean <example>.lean), so I'm sure the timing included the compilation too.

I just re-ran with #model_check interpreted {} {} and also validated the numbers in VS Code.

Example	`#model_check interpreted`	`#simulate`
DieHard	1.4s	111ms
RiverCrossing	2.4s	5ms
BuggyCircularBuffer	0.9s	1ms
Traffic	1.2s	2ms
MutexViolation	22s	378ms

Thanks for taking the time to point this out. 🙌🏼

dranov · 2026-03-25T02:23:21Z

@rnbguy Apologies for the delay. I'll let @zqy1018 handle integrating this. He developed and is in charge of the model checker in Veil.

We'd want #simulate to have a soundness proof, similar to the soundness and completeness proof of #model_check's new version, and that might require a rewrite. @zqy1018 will look into it.

zqy1018 · 2026-04-14T02:47:33Z

Hi @rnbguy! Sorry for the delay on my side. I saw there have been some recent commits, are you still planning to add more changes, or is it ready for review?

rnbguy · 2026-04-14T15:01:23Z

hey @zqy1018, thanks for the message. give me 2 days to go over my changes one final time. after dranov's comment, I started working on the soundness proof for #simulate. I will make the PR ready for review when I am done.

rnbguy · 2026-04-16T16:33:18Z

hey @zqy1018, the PR is ready for review.

#simulate now proves soundness of the reported path.
I reused the #model_check types for #simulate. Please check if they make sense.
I added compiled and interpreted modes for #simulate, just like #model_check.
Please check whether the #simulate config options look alright.
I added three #simulate-friendly examples. Let me know if you have any questions about them.
I added multiple tests for #simulate. Let me know if it's okay to keep them.

I also re-ran the benchmark from above.

Example	`#model_check interpreted`	`#simulate interpreted`
DieHard	1s	0.5s
RiverCrossing	1s	0.2s
BuggyCircularBuffer	0.7s	0.2s
Traffic	1.1s	0.3s
MutexViolation	18s	1.2s
CheckpointLeaseFailover	t/o at 1m	1.3s
LeaseKeepaliveRace	t/o at 1m	2.3s
ReliableBroadcast	t/o at 1m	14.5s

zqy1018 · 2026-04-17T09:19:43Z

Great, thanks! I'll take a look now.

rnbguy added 2 commits March 16, 2026 04:04

add #simulate command for random-walk state exploration

434f5e1

lazy trace recording with replay-on-violation

f15aca5

rnbguy force-pushed the feat/simulate branch 2 times, most recently from 211e20b to f822ebe Compare March 16, 2026 05:10

rnbguy added 2 commits March 16, 2026 06:24

try/catch error handling with seed reporting in simulate loop

df4f8e5

share helpers between #model_check and #simulate, add widget support

543cce4

rnbguy force-pushed the feat/simulate branch from f822ebe to 543cce4 Compare March 16, 2026 05:37

restore docstrings and inline comments removed during refactor

1e9c481

rnbguy force-pushed the feat/simulate branch from d8c6b28 to 1e9c481 Compare March 16, 2026 05:47

rnbguy added 5 commits March 16, 2026 18:37

uncomment violationIsError option in MutexViolation example

2f12dd7

simplify assertion failure handling to single-pass in simulateOnceLoop

c066bee

remove totalSteps tracking from simulate pipeline

53cbc95

remove unused displayResultWidget, clean up formatting

e2783dd

add SharedCounter example where #simulate outperforms #model_check

542db5c

rnbguy added 13 commits April 12, 2026 05:22

Merge branch 'veil-2.0-preview' into feat/simulate

b5c3bef

fix: build

2c1598e

fix: qualify compilation status constructors

8201073

feat: align command architecture with #model_check

a8e9419

test: cover parity regressions

ed7f810

refactor: share executed path semantics

534c9f2

refactor: share pure and runtime trace loops

2fc5d47

feat: use runtime runner for command execution

96d39f0

refactor: clean result rendering semantics

7ad5f60

refactor: split simulation into modular files

12adb4c

feat: support assumptions checks

55e15e4

feat: add simulation-native results and progress

7f58696

refactor: add theorem-level soundness bridges

6075426

rnbguy added 3 commits April 12, 2026 21:34

refactor: prove runtime soundness directly

78542c0

refactor: make simulate theorem assumptions-aware

8ca3b6f

refactor: align simulate theorem boundary with command semantics

a379849

rnbguy marked this pull request as draft April 14, 2026 15:00

rnbguy added 20 commits April 15, 2026 01:07

fix: persist final progress metrics

64f76af

fix: encode trace budget termination

f2656c2

fix: use a single result log path

025d752

test: cover emitted mode behavior

1fde08c

refactor: drop unused path helpers

d81587c

refactor: remove unused simulate names

dd97553

chore: remove new proof warnings

d09d7d2

fix: make trace limits part of core results

dcf6735

fix: align display trace-limit counts

41f2602

test: add simulate violation mode regression

e9ad7d3

fix(model-checker): isolate compiled command instances

797f9f6

fix(simulate): keep default compilation running

7140a15

fix(simulate): tighten handoff cancellation and parity

c27a938

fix(examples): replace SharedCounter with lease race examples

ed23a67

fix(examples): add simulate-friendly reliable broadcast

3f9605f

fix(simulate): preserve explicit default config values

9971ba1

fix(simulate): show chosen seed in output

6da6208

fix(simulate): short-circuit empty initial states

48fdf21

test(simulate): make interpreted mode explicit

312d8e2

chore: reduce noises

17228a1

rnbguy marked this pull request as ready for review April 16, 2026 16:26

fix(simulate): align violation soundness with runtime semantics

fd86b56

rnbguy wants to merge 55 commits into verse-lab:veil-2.0-preview from rnbguy:feat/simulate

3 participants
