This benchmark investigates the idea of using LLMs as interpreters for programming languages.
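To make the idea concrete, here is a minimal sketch of prompting a model to act as an interpreter: the model is given a program and asked to predict its output. The prompt wording and helper function below are illustrative only, not the benchmark's actual prompt format.

```python
# Sketch of the LLM-as-interpreter idea: hand the model a program and ask it
# to "execute" the code and report the final output. The wording here is a
# hypothetical example, not the prompt used in PLSemanticsBench.
def build_interpreter_prompt(program: str) -> str:
    return (
        "You are an interpreter for the following program. "
        "Execute it step by step and report only its final output.\n\n"
        f"```\n{program}\n```"
    )

program = "x = 3\ny = x * 4\nprint(y)"
prompt = build_interpreter_prompt(program)
print(prompt)
```

The returned string would then be sent to a model; judging the model's answer against the program's true output is what such a benchmark measures.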
- Conda is used for managing dependencies and creating a virtual environment to run the experiments.
- Navigate to the root directory containing `env.yaml` and execute the command below to install project dependencies:

  ```bash
  conda env create -f env.yaml
  ```

- In the root directory, execute the command below to activate the created virtual environment:

  ```bash
  conda activate llm-interpreter
  ```
- `data/`: directory containing the raw data in `.jsonl`
- `results/`: models' generated results will be written to this directory
- `model_configs/`: contains the config files for model inference, including hyper-parameters
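Since both the raw data and the generated results use the JSON Lines format (one JSON object per line), a record file can be read with a few lines of standard-library Python. The file name and record fields below are hypothetical examples, not the benchmark's actual schema.

```python
import json
from pathlib import Path

# Hypothetical example record; the real field names in data/ and results/
# may differ.
record = {"task": "output-prediction", "model": "gpt-4o", "prediction": "12"}

# Write the record in JSON Lines format: one JSON object per line.
path = Path("results_example.jsonl")
with path.open("w") as f:
    f.write(json.dumps(record) + "\n")

# Read every record back by parsing each line independently.
with path.open() as f:
    records = [json.loads(line) for line in f]
print(records[0]["model"])
```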
We provide the code to run gpt-4o models on all four tasks in PLSemanticsBench.
The results will be written to the `results/` directory.
```bash
# the default model is gpt-4o
python main.py
```

Please use the following citation if you found our work useful.
```bibtex
@inproceedings{plsemanticsbench,
title={PLSemanticsBench: Large Language Models are Bad Programming Language Interpreters},
author={Jiyang Zhang and Aditya Thimmaiah and Samuel Yuan and Junyi Jessy Li and Milos Gligoric},
booktitle={},
year={2025},
url={}
}
```