Collection of LLM completions for reasoning-gym task datasets

open-thought/reasoning-gym-eval

reasoning-gym Evaluation

This repository stores evaluation results for reasoning-gym datasets, including the LLM outputs.

Progress and LLM accuracy metrics are tracked on our main Google Spreadsheet.

Team

Contact / Contributing / Sponsoring

  • You can reach the eval team in the #reasoning-gym channel of the GPU-Mode Discord server.
  • We would welcome donations in the form of OpenRouter API keys (or keys for other inference API providers)!
