Bethge Lab

foolbox Public

A Python toolbox to create adversarial examples that fool neural networks in PyTorch, TensorFlow, and JAX

CiteME Public

CiteME is a benchmark that tests how well language models can find the papers cited in scientific texts.

Python · 48 stars · 5 forks

model-vs-human Public

Benchmark your model on out-of-distribution datasets with carefully collected human comparison data (NeurIPS 2021 Oral)

Python · 358 stars · 54 forks

robust-detection-benchmark Public

Code, data and benchmark from the paper "Benchmarking Robustness in Object Detection: Autonomous Driving when Winter is Coming" (NeurIPS 2019 ML4AD)

Jupyter Notebook · 191 stars · 24 forks

imagecorruptions Public

Python package to corrupt arbitrary images.

Python · 458 stars · 72 forks

stylize-datasets Public

A script that applies the AdaIN style transfer method to arbitrary datasets

Python · 165 stars · 38 forks
