Skip to content

Conversation

@yeshsurya
Copy link
Contributor

@yeshsurya yeshsurya commented Dec 1, 2025

azure_python_grader.py
Link to sanity run :

grpo 2 steps : here

@github-actions
Copy link

github-actions bot commented Dec 1, 2025

Test Results for assets-test

0 tests   0 ✅  0s ⏱️
0 suites  0 💤
0 files    0 ❌

Results for commit 5fa0bee.

♻️ This comment has been updated with latest results.

@yeshsurya yeshsurya changed the title [feat]: [draft-awaiting pm review] RFT trainer component and simplified model import [feat]: RFT trainer component and simplified model import Jan 5, 2026
display_name: Component Model Import
description: Component to import HuggingFace models or AML registered models.

environment: azureml://registries/azureml/environments/acft-hf-nlp-gpu/versions/107
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Will this env acft-hf-nlp-data-import work here?

@babu-namburi
Copy link
Contributor

@yeshsurya
I see another issue
Received unexpected config parameter automatic_object_spilling_enabled

I am testing locally to flush all the issues.

@babu-namburi
Copy link
Contributor

May be we should catch these early on

AssertionError: normalized ppo_mini_batch_size 128 should be divisible by ppo_micro_batch_size_per_gpu 10

@babu-namburi
Copy link
Contributor

-- �[36m(pid=15902)�[0m ERROR 01-07 05:57:20 [config.py:32] Failed to import Triton kernels. Please make sure your triton version is compatible. Error: module 'triton.language' has no attribute 'constexpr_function'�[32m [repeated 18x across cluster]�[0m

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants