Add proof validation to tactic mode #63

augustepoiroux · 2025-01-03T11:49:02Z

This PR introduces proof validation in tactic mode (see #44).

Highlights:

Introduce a getProofStatus method inspired by LeanDojo validateProof method. It is called by createProofStepReponse after each tactic application.
- Tweak introduced to achieve this: add to ProofSnapshot a rootGoals attribute (type: List MVarId) containing the initial goals of the declaration before applying any tactic.
Add a proofStatus attribute to ProofStepResponse showing after each tactic the status of the proof up to this point.
- Update + add new tests accordingly
(bonus feature) Add a rootGoals attribute to CommandOption to extract initial goals and return (synthetic) sorries from each declaration in a command. It differs from allTactics in that it only extracts initial goals. Useful to test automated provers on existing Lean projects. It is similar to what LeanDojo can achieve after the tracing/parsing step.

Issues:

Code in this PR correctly catches the apply theorem itself issue reported in [BUG] REPL accepts incorrect proofs #44 and returns an error (see test/self_proof_apply_check.in). However, it doesn't catch this issue when using the exact? tactic. From what I understand, it seems the exact? tactic should first fail at evaluation (runString method) but doesn't.
Example:
```
theorem ex : False := by
   exact?
```
In Lean (and LeanDojo), it returns: `exact?\` could not close the goal. Try `apply?` to see partial suggestions.
However, in Lean REPL in tactic mode, when we run:
```
{"cmd": "theorem ex : False := sorry"}

{"proofState": 0, "tactic": "exact?"}
```
we get: Try this: exact _root_.ex and the tactic successfully applies.
And I am surprised that getProofStatus doesn't catch this either. My best guess is that the proof state is decoupled from the declaration at some point. Not sure if this is indeed the case though.

…ations root goal - Useful for people experimenting with automated provers. Example usage: run a lean file from the MiniF2F benchmark, and get sorries even if there already exist proofs. Similar to LeanDojo, but without having to trace the project. - Fix: add rootGoals for proof states extracted by the `tactics` method

augustepoiroux force-pushed the proof_validation branch from caa1de0 to f9dc2fb Compare January 8, 2025 08:44

RexWzh mentioned this pull request Feb 20, 2025

[BUG] REPL accepts incorrect proofs #44

Open

augustepoiroux added 4 commits April 4, 2025 09:18

Initial fix (incomplete)

596dcae

Add proofStatus field in ProofStepResponse + update tests

1631b1f

Add new tests + update old ones with the new goals accomplished message

cf14c9a

augustepoiroux force-pushed the proof_validation branch from f9dc2fb to cf14c9a Compare April 9, 2025 14:51

augustepoiroux added 2 commits April 10, 2025 09:24

Remove exact? test

10175f6

Improve docstrings

447a4f3

kim-em merged commit 4a7e8af into leanprover-community:master Apr 10, 2025
1 check passed

augustepoiroux mentioned this pull request Apr 10, 2025

Fix missing local context + valid proofs wrongfully rejected in tactic mode #82

Merged

Kripner mentioned this pull request Apr 16, 2025

Add proof step verification #85

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add proof validation to tactic mode #63

Add proof validation to tactic mode #63

Uh oh!

augustepoiroux commented Jan 3, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Add proof validation to tactic mode #63

Add proof validation to tactic mode #63

Uh oh!

Conversation

augustepoiroux commented Jan 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

augustepoiroux commented Jan 3, 2025 •

edited

Loading