Add create-cronjob benchmark task to k8s-bench #359

wussh · 2025-06-18T09:44:06Z

Description

This pull request adds a new benchmark task to k8s-bench that tests kubectl-ai's ability to create a Kubernetes CronJob resource. This expands test coverage to include more Kubernetes resource types.

Changes

Created a new benchmark task in k8s-bench/tasks/create-cronjob/
Implemented task.yaml with a prompt to create a CronJob that runs at midnight
Added setup.sh, cleanup.sh, and verify.sh scripts for the benchmark
Designed verify.sh to be robust and handle different valid implementations

Benefits

Tests kubectl-ai's ability to work with CronJob resources
Validates understanding of cron schedule syntax
Adds a medium-difficulty benchmark to the existing suite
Follows the best practices outlined in the k8s-bench contribution guide

Testing

The benchmark has been prepared according to the format of other benchmark tasks and includes comprehensive verification logic.

Checklist

Task follows the structure of existing benchmark tasks
Task includes proper setup, cleanup, and verification scripts
Verification script handles multiple valid implementation approaches
Scripts are executable

noahlwest · 2025-07-31T19:01:25Z

k8s-bench/tasks/create-cronjob/setup.sh

@@ -0,0 +1,3 @@
+#!/usr/bin/env bash
+# Create the namespace for the test
+kubectl create namespace create-cronjob-test 


Can we adjust the namespace to just create-cronjob? Including test, eval, etc. can affect how the model handles the request.

noahlwest · 2025-07-31T19:13:37Z

k8s-bench/tasks/create-cronjob/verify.sh

+
+# Verify schedule is set to midnight
+# Accept either 0 0 * * * or @daily
+SCHEDULE=$(kubectl get cronjob data-backup -n create-cronjob-test -o jsonpath='{.spec.schedule}')


minor nitpick: Is it reasonable to do a single kubectl get call, save the output, and check against that for the verification checks? The intent you have here is clear, so if that optimization would make things less readable or too tricky, feel free to disregard this comment.

droot · 2025-11-12T19:06:11Z

Since we are moving k8s-bench to its own repo, closing this out.

wussh and others added 2 commits June 18, 2025 16:40

Add create-cronjob task with setup, cleanup, and verification scripts

e0fe27c

Merge branch 'GoogleCloudPlatform:main' into main

5fd4b33

mikebz requested a review from noahlwest July 30, 2025 23:10

noahlwest requested changes Jul 31, 2025

View reviewed changes

droot closed this Nov 12, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add create-cronjob benchmark task to k8s-bench #359

Add create-cronjob benchmark task to k8s-bench #359

Uh oh!

wussh commented Jun 18, 2025 •

edited by noahlwest

Loading

Uh oh!

noahlwest Jul 31, 2025

Uh oh!

noahlwest Jul 31, 2025

Uh oh!

droot commented Nov 12, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Add create-cronjob benchmark task to k8s-bench #359

Add create-cronjob benchmark task to k8s-bench #359

Uh oh!

Conversation

wussh commented Jun 18, 2025 • edited by noahlwest Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Changes

Benefits

Testing

Checklist

Uh oh!

noahlwest Jul 31, 2025

Choose a reason for hiding this comment

Uh oh!

noahlwest Jul 31, 2025

Choose a reason for hiding this comment

Uh oh!

droot commented Nov 12, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

wussh commented Jun 18, 2025 •

edited by noahlwest

Loading