OPENCSG R1

Table of Contents

Overview
Installation
Training models

Overview

The goal of this repo is to build the different dataset or methods for trainning r1-like models.The project is simple by design and mostly consists of:

src: contains the scripts to train and evaluate models on different datasets and trainning methods:
- full_train_grpo.py: trains a model with GRPO by using full-parameters training.
- lora_train_grpo: performs a simple SFT of a model on a dataset.
scripts: contains easy-to-run commands for each step in the R1 pipeline leveraging the scripts above.
inference: contains some code for model tests.

Installation

pip install -r requirements.txt

Training models

# full parameter trainning method with grpo
bash scripts/full_train_grpo.sh

# lora trainning method with grpo
bash scripts/lora_train_grpo.sh

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
configs		configs
inference		inference
scripts		scripts
src		src
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

OPENCSG R1

Overview

Installation

Training models

About

Releases

Packages

Languages

OpenCSGs/opencsg-r1

Folders and files

Latest commit

History

Repository files navigation

OPENCSG R1

Overview

Installation

Training models

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages