Skip to content

swefficiency v1 implementation#423

Open
18jeffreyma wants to merge 2 commits intoOpenHands:mainfrom
18jeffreyma:main
Open

swefficiency v1 implementation#423
18jeffreyma wants to merge 2 commits intoOpenHands:mainfrom
18jeffreyma:main

Conversation

@18jeffreyma
Copy link

(as title)

tested locally with

SKIP_BUILD=0 uv run swefficiency-infer /home/ubuntu/benchmarks/.llm_config/gemini3flash.json --dataset swefficiency/swefficiency --workspace docker --num-workers 4 --num-cpus-per-worker 4 --mem-limit 32g --n-limit 10


Follow these steps to improve performance:
1. As a first step, activate the testbed environment by running:
. /opt/miniconda3/etc/profile.d/conda.sh ; conda activate testbed
Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

curious on this one, maybe worth publicizing this detail to other benchmark mainttainers: initial runs on SWE-fficiency were spuriously failing since agents didn't have the conda environement already active

@18jeffreyma
Copy link
Author

@enyst would you mind taking a look when you have time?

should be a straightforward port of:
https://github.com/OpenHands/OpenHands/pull/11716/changes

Copy link

@enyst enyst left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is so cool, thank you!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants

Comments