This is the code repository for our NeurIPS 2025 paper "On Evaluating LLM Alignment by Evaluating LLMs as Judges". This repository contains the necessary scripts and data to evaluate the alignment of large language models (LLMs) using the AlignEval framework.
- `README.md`: This file.
- `data/`: Contains the AlignEval datasets used for evaluation.
- `prompts/`: Contains the prompt templates used for evaluation.
- `results/`: Contains the results of the evaluation.
- `aligneval.py`: The main script for running the evaluation.
- `get_predictions.py`: A script for generating predictions using the LLMs.
- `get_predictions.sh`: A shell script for generating predictions using the LLMs.
- Running `aligneval.py` will evaluate the alignment of LLMs using the AlignEval datasets.
- Running `get_predictions.py` will generate predictions using the LLMs (see the example invocations after this list).
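A minimal sketch of a typical workflow is shown below. The exact command-line arguments accepted by the scripts are not documented in this section, so the invocations here are assumptions; consult the scripts (or `get_predictions.sh`) for the supported options before running.

```bash
# Sketch of a typical workflow (arguments omitted; check the scripts for options).

# Step 1: generate model predictions on the AlignEval datasets
# (alternatively, use the provided wrapper: bash get_predictions.sh).
python get_predictions.py

# Step 2: evaluate alignment from the generated predictions;
# outputs are written under results/.
python aligneval.py
```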