Missing Assertion Correctness Implementation in Evaluation Pipeline #12

The TestEval paper lists three criteria for evaluating test case correctness:

1. Syntactic correctness
2. Execution correctness
3. Assertion correctness

The current implementation in `eval_overall.py` evaluates only the first two:

- Syntactic correctness, using `compile(testcase,'<string>','exec')`
- Execution correctness, using the `execute()` function, which returns `True` both when the test runs to completion and when it raises an `AssertionError`
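To make the behavior described above concrete, here is a minimal sketch of the two existing checks. The `compile(...)` call is the one quoted from `eval_overall.py`; `is_syntactically_correct` and `execute_sketch` are hypothetical names, and `execute_sketch` is only a stand-in that mimics the described return values. The real `execute()` is not shown in this issue and presumably adds timeouts or isolation.

```python
def is_syntactically_correct(testcase: str) -> bool:
    """Return True if the test case source compiles."""
    try:
        compile(testcase, '<string>', 'exec')
        return True
    except (SyntaxError, ValueError):
        return False


def execute_sketch(testcase: str) -> bool:
    """Hypothetical stand-in for execute(), not the repository's code.

    Returns True when the test runs to completion OR raises AssertionError,
    and False for any other exception, as described in this issue.
    """
    try:
        exec(testcase, {})  # fresh globals; no sandboxing or timeout here
        return True
    except AssertionError:
        return True  # a failed assertion still counts as "executed"
    except Exception:
        return False


passing = "assert 1 + 1 == 2"
failing = "assert 1 + 1 == 3"   # wrong expected value
crashing = "undefined_name()"   # raises NameError
broken = "assert (1 + 1 == 2"   # unbalanced parenthesis
```

Under this sketch, `passing` and `failing` get identical results from both checks, so a test with a wrong expected value cannot be told apart from a correct one.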

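However, assertion correctness, the third criterion in the paper, is not implemented. Because `execute()` returns `True` for assertion errors, a test case whose assertions fail is indistinguishable from one whose assertions pass.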
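One possible way to add the missing metric is sketched below. It assumes that "assertion correctness" means the test case runs to completion with no `AssertionError`; that reading should be checked against the paper. `evaluate_testcase` and the result keys are hypothetical names, not part of `eval_overall.py`, and the sketch calls `exec` directly without the isolation or timeouts a real pipeline would need.

```python
def evaluate_testcase(testcase: str) -> dict:
    """Classify a test case against the three criteria (illustrative only)."""
    result = {"syntax": False, "execution": False, "assertion": False}

    # 1. Syntactic correctness: the source must compile.
    try:
        code = compile(testcase, '<string>', 'exec')
    except (SyntaxError, ValueError):
        return result
    result["syntax"] = True

    # 2. and 3. Run the test and separate assertion failures from other errors.
    try:
        exec(code, {})  # assumption: no sandbox or timeout in this sketch
    except AssertionError:
        # The test executed, but at least one assertion was wrong.
        result["execution"] = True
        return result
    except Exception:
        # Any other exception means the test did not execute correctly.
        return result

    result["execution"] = True
    result["assertion"] = True
    return result
```

With this split, `evaluate_testcase("assert 1 + 1 == 3")` reports execution as correct but assertion as incorrect, which is the distinction the current pipeline does not make.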
