Step-by-Step

This example quantizes the microsoft/codebert-base model fine-tuned on the code defect detection task.

Prerequisite

1. Environment

pip install neural-compressor
pip install -r requirements.txt

Note: Refer to the list of validated ONNX Runtime versions.
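
A quick way to confirm the environment is ready is to import the installed packages and print their versions (a minimal sanity check; the full dependency list lives in requirements.txt):

```python
# Verify that the core packages are importable and report their versions.
import neural_compressor
import onnxruntime

print("neural-compressor:", neural_compressor.__version__)
print("onnxruntime:", onnxruntime.__version__)
```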

2. Prepare Dataset

Run the prepare_data.sh script to download the dataset into the dataset folder and pre-process it:

bash prepare_data.sh
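
To verify the download and pre-processing, the resulting JSON Lines files can be inspected directly. This is a minimal sketch assuming the files are written as ./dataset/train.jsonl and ./dataset/valid.jsonl (the paths used by the fine-tuning command below); the field names depend on the pre-processing script:

```python
# Peek at the first record of each prepared split to confirm the files exist
# and contain valid JSON Lines.
import json

for split in ("train", "valid"):
    path = f"./dataset/{split}.jsonl"
    with open(path) as f:
        record = json.loads(f.readline())
    print(f"{split}: keys = {sorted(record.keys())}")
```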

3. Prepare Model

Fine-tune the model on the code defect detection task.

bash run_fine_tuning.sh --train_dataset_location=./dataset/train.jsonl --dataset_location=./dataset/valid.jsonl  --fine_tune

Export model to ONNX format.

# By default, the input model path is `checkpoint-best-acc/`.
python prepare_model.py  --input_model=./checkpoint-best-acc  --output_model=./codebert-exported-onnx
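
Before quantizing, the exported graph can be validated with the ONNX checker. This is a minimal sketch assuming the export path from the command above; adjust it if prepare_model.py writes the file under a different name or extension:

```python
# Structurally validate the exported model and list its input names.
import onnx

model_path = "./codebert-exported-onnx"   # assumed export path from the command above
model = onnx.load(model_path)
onnx.checker.check_model(model)           # raises if the graph is malformed
print("inputs:", [i.name for i in model.graph.input])
```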

Run

1. Quantization

Static quantization with QOperator format:

# input_model: model path as *.onnx
bash run_quant.sh --input_model=/path/to/model \
                  --output_model=/path/to/model_tune \
                  --dataset_location=/path/to/data
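
Conceptually, run_quant.sh drives static post-training quantization through the Neural Compressor Python API. The sketch below is illustrative only and assumes the 2.x PostTrainingQuantConfig / quantization.fit API; the calibration dataloader and evaluation function are placeholders for what the actual entry script builds from the prepared dataset:

```python
from neural_compressor import quantization
from neural_compressor.config import PostTrainingQuantConfig

def quantize(model_path, output_path, calib_dataloader, eval_func):
    """Statically quantize an ONNX model and save it in QOperator format."""
    config = PostTrainingQuantConfig(approach="static", quant_format="QOperator")
    q_model = quantization.fit(
        model=model_path,
        conf=config,
        calib_dataloader=calib_dataloader,  # batches from the prepared validation set
        eval_func=eval_func,                # returns the accuracy driving the tuning loop
    )
    q_model.save(output_path)
```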

2. Benchmark

# input_model: model path as *.onnx; mode: performance or accuracy
bash run_benchmark.sh --input_model=/path/to/model \
                      --dataset_location=/path/to/data \
                      --batch_size=batch_size \
                      --mode=performance
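
Outside of run_benchmark.sh, a rough standalone latency number for the original or quantized model can also be obtained directly with ONNX Runtime. This is an illustrative sketch using dummy tokenized inputs (all-zero int64 tensors), not the script's actual benchmarking logic, which additionally reports accuracy on the real dataset:

```python
# Time forward passes of an exported or quantized model with dummy inputs.
import time
import numpy as np
import onnxruntime as ort

def measure_latency(model_path, batch_size=1, seq_len=128, runs=50):
    sess = ort.InferenceSession(model_path, providers=["CPUExecutionProvider"])
    # Build zero-filled int64 tensors for every model input (e.g. input_ids, attention_mask).
    feeds = {
        i.name: np.zeros((batch_size, seq_len), dtype=np.int64)
        for i in sess.get_inputs()
    }
    for _ in range(5):              # warm-up runs
        sess.run(None, feeds)
    start = time.time()
    for _ in range(runs):
        sess.run(None, feeds)
    print(f"avg latency: {(time.time() - start) / runs * 1000:.2f} ms")
```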