SysLLMatic: Large Language Models are Software System Optimizers

About

This repository contains the artifacts for the project SysLLMatic: Large Language Models are Software System Optimizers. It includes implementation details, instructions to reproduce results, and experimental data.

Our artifact includes the following

Item	Description	Corresponding content in the paper	Path
Pattern Catalog	The catalog including 43 performance optimization patterns	§4, Figure 2-3, Table 2	pattern_catalog
Implementation	The implementation of SysLLMatic	§5, Figure 4-6	src
Benchmarks	The benchmarks we used in evaluation	§6-B	humaneval, scimark, dacapo
Eval	The evaluation scripts and results	§7, Figure 7-15, Table 6-12	eval

Environment Requirement

This artifact requires a machine with the following capabilities to support RAPL (Running Average Power Limit) and read MSR (Model-Specific Registers):

Hardware

Intel Processor: Machine with Intel processors supporting RAPL (Sandy Bridge or newer).
MSR Support: Machine must allow access to MSRs.

Operating System

Linux-based OS (e.g., Ubuntu 16.04+).
Linux Kernel Version 3.13+ required for RAPL support.
Root Access: MSRs can only be accessed with root/superuser privileges.

Software

msr-tools: Install for reading MSRs:
```
sudo apt-get install msr-tools
```

Environment Setup

Clone the repository:

git clone <repository-link>
cd <project-directory>

Install the required dependencies using the Makefile
```
make setup
```
Create .env file in the root directory Add the following:
```
API_KEY=your_openai_api_key_here
USER_PREFIX=$(pwd)
```
Then source your env with
```
. .env
```
Compile performance measurement module In the MEASURE directory, run:
```
make
```

Running the pipeline

Run the main script from the project root (/sysllmatic) Run HumanEval_CPP benchmark

python3 src/main.py --benchmark HumanEval --llm gpt-4o --self_optimization_step 2 --num_programs 2

Run SciMark benchmark

python3 src/main.py --benchmark SciMark --llm gpt-4o --self_optimization_step 2

Run Dacapo benchmark Prebuild the target application following the Dacapobench official instruction, then run:

python3 src/main.py --benchmark Dacapobench --llm gpt-4.1 --self_optimization_step 2 --application_name biojava

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.vscode		.vscode
MEASURE		MEASURE
async-profiler @ ee75d80		async-profiler @ ee75d80
benchmark_dacapo		benchmark_dacapo
benchmark_human_eval		benchmark_human_eval
benchmark_scimark		benchmark_scimark
eval		eval
pattern_catalog		pattern_catalog
src		src
.DS_Store		.DS_Store
.gitignore		.gitignore
.gitmodules		.gitmodules
Makefile		Makefile
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SysLLMatic: Large Language Models are Software System Optimizers

About

Table of Contents

Environment Requirement

Environment Setup

Running the pipeline

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

SysLLMatic: Large Language Models are Software System Optimizers

About

Table of Contents

Environment Requirement

Environment Setup

Running the pipeline

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages