Skip to content

OpenCSGs/opencsg-r1

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

OPENCSG R1

Table of Contents

  1. Overview
  2. Installation
  3. Training models

Overview

The goal of this repo is to build the different dataset or methods for trainning r1-like models.The project is simple by design and mostly consists of:

  • src: contains the scripts to train and evaluate models on different datasets and trainning methods:
    • full_train_grpo.py: trains a model with GRPO by using full-parameters training.
    • lora_train_grpo: performs a simple SFT of a model on a dataset.
  • scripts: contains easy-to-run commands for each step in the R1 pipeline leveraging the scripts above.
  • inference: contains some code for model tests.

Installation

pip install -r requirements.txt

Training models

# full parameter trainning method with grpo
bash scripts/full_train_grpo.sh

# lora trainning method with grpo
bash scripts/lora_train_grpo.sh

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published