bash clean_megatron-deepspeed.sh
bash build_megatron-deepspeed.sh
bash install_megatron-deepspeed.sh
cd ci && bash run_ci_tests_one_node.sh
First, you should make sure something below.
- The CI Test in 1 node can pass in master node container.
- Copy master node container environment to other node servers.
- Make sure the account name, contrainer name is the same in different node servers.
- Set up password free login between the master node container and other node servers.
Second, set your node server info. You can set up like:
## The account in server
export HOST_NAME="username"
## Severs IP, begin with the master node server IP, and split by ","
export ADDR_ARRAY="10.111.222.1,10.111.222.2"
## Container name
export CONTAINER_NAME="megatron-deepspeed"
Third, run.
cd ci && bash run_ci_tests_multi_node.sh
bash dataset/download_dataset.sh
bash dataset/download_vocab.sh
cd examples/aquila && bash run_aquila_7b_node1_bf16.sh
First, you should make sure something below.
- The pretrain in 1 node run successfully in master node container.
- Copy master node container environment to other node servers.
- Make sure the account name, contrainer name is the same in different node servers.
- Set up password free login between the master node container and other node servers.
- Make megatron-deepspeed repo and dataset at same path in different node servers.
Second, set your node server info. You can set up like:
## The account in server
export HOST_NAME="username"
## Severs IP, begin with the master node server IP, and split by ","
export ADDR_ARRAY="10.111.222.1,10.111.222.2"
## Container name
export CONTAINER_NAME="megatron-deepspeed"
Third, run.
cd examples/aquila && bash run_aquila_7b_node2_bf16.sh
If you have an your own jsonl text dataset, and you want to use it in Megatron-Deepspeed, you can do something to change it to proper format.
For example, look file megatron-deepspeed/dataset/convert_llama2tokenizer_dataset.sh
, you need to change some main parameter by your own, there contain:
- tokenizer-type
- tokenizer-model or (vocab-file and merge-file)
- json-keys
- input
- output-prefix