Skip to content

Conversation

@YanSong97
Copy link
Collaborator

@YanSong97 YanSong97 commented Nov 5, 2024

LLM self-refining during tree searching #42

TODO:

  1. Merge critic_MATH env to MATH
  2. Configurate step tag
  3. Solve action_his and prm length mismatch problem

add full_answer property to base_env;
p.evaluate_problem also output metadata;
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants