This repository has been archived by the owner on Aug 30, 2024. It is now read-only.

Loading checkpoint shards takes too long #251

Open

irjawais opened this issue May 9, 2024 · 2 comments
irjawais commented May 9, 2024

When I load the "meta-llama/Meta-Llama-3-8B-Instruct" model like this:

```python
from transformers import AutoTokenizer, TextStreamer
from intel_extension_for_transformers.transformers import AutoModelForCausalLM

model_name = "meta-llama/Meta-Llama-3-8B-Instruct"  # Hugging Face model_id or local model
tokenizer = AutoTokenizer.from_pretrained(model_name, trust_remote_code=True)
streamer = TextStreamer(tokenizer)
model = AutoModelForCausalLM.from_pretrained(model_name, load_in_4bit=True)
```

the process hangs, and the only way to recover is to restart the instance.

Is there an issue with my spec?

My instance spec: Ubuntu, 32 GB RAM.

irjawais (Author) commented May 9, 2024

```
warnings.warn(
Loading checkpoint shards:  75%|█████████████████████████████████████████████████████████████████████████████████████████████████████████          | 3/4 [01:53<00:37, 37.72s/it]
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/usr/local/lib/python3.10/dist-packages/intel_extension_for_transformers/transformers/modeling/modeling_auto.py", line 593, in from_pretrained
    model.init(  # pylint: disable=E1123
  File "/usr/local/lib/python3.10/dist-packages/neural_speed/__init__.py", line 182, in init
    assert os.path.exists(fp32_bin), "Fail to convert pytorch model"
AssertionError: Fail to convert pytorch model
```
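
The assertion says the fp32 intermediate file that neural_speed writes before quantization (`fp32_bin`) was never produced, which points at the conversion step dying rather than merely being slow. A back-of-the-envelope sizing check (illustrative arithmetic, not taken from the logs) shows how tight fp32 conversion of an 8B model is on a 32 GB instance:

```python
# Back-of-the-envelope sizing (illustrative, not measured): an 8B-parameter
# model materialized in fp32 during conversion needs roughly
# 8e9 params * 4 bytes/param, before the source checkpoint and Python overhead.
n_params = 8e9
fp32_gib = n_params * 4 / 1024**3
print(f"fp32 weights alone: {fp32_gib:.1f} GiB")  # ~29.8 GiB of a 32 GB box
```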

intellinjun (Contributor) commented

@irjawais
Can you check the memory usage while the model is being converted? From your description, it seems there may be insufficient memory.
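
One way to do that check is to sample memory from a background thread while the load runs. A minimal sketch, assuming `psutil` is installed (everything here apart from the `from_pretrained` call from the original snippet is illustrative):

```python
# Minimal memory-monitoring sketch (assumes `pip install psutil`).
# Samples this process's RSS and system-wide available memory every few
# seconds in a daemon thread while the model loads and converts.
import threading

import psutil

def log_memory(stop_event: threading.Event, interval_s: float = 5.0) -> None:
    proc = psutil.Process()
    while not stop_event.is_set():
        rss_gib = proc.memory_info().rss / 1024**3
        avail_gib = psutil.virtual_memory().available / 1024**3
        print(f"process RSS: {rss_gib:.1f} GiB | system available: {avail_gib:.1f} GiB")
        stop_event.wait(interval_s)

stop = threading.Event()
threading.Thread(target=log_memory, args=(stop,), daemon=True).start()

# ... run the AutoModelForCausalLM.from_pretrained(...) call here ...

stop.set()
```

If system-available memory collapses toward zero just before the hang, the Linux OOM killer is likely terminating the conversion, which would leave `fp32_bin` missing and trigger the assertion above.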
