This repository has been archived by the owner on Aug 30, 2024. It is now read-only.
In the example for adding to gptneox_mem_req, I see that n_layers comes from num_hidden_layers in the config.json file, but where do the 512, 512, and 1024 come from? A comment in the document would help.
I was looking to extend the existing bloom capability to handle https://huggingface.co/bigscience/bloom, but it's not obvious to me how to choose the right scratch sizes from the config.json.
Hi @hpcpony, sorry for the confusion. The model_scratch is a kind of kernel workspace used during the model eval process, and the values are rough estimates. When adding a new model, my approach is to look for a reference model with a similar parameter count and reuse its values. If you use our Python API, these buffers are automatically enlarged for larger batch sizes and ctx_size. cc @Zhenzhong1 and @a32543254.
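The advice above — copy the scratch sizes from a reference model of similar size — can be sketched as a small lookup keyed by layer count. Everything here (the struct name, the function, the per-tier values) is an illustrative assumption in the style of the mem_req tables, not the repository's actual code; the 512/512/1024 MB figures are empirical workspace estimates, not values computed from config.json fields:

```cpp
#include <cstddef>

// Hypothetical sketch of a per-model scratch-size table in the style of
// gptneox_mem_req. The MB values are rough empirical estimates of the
// kernel workspace needed during eval, chosen by analogy to models of
// similar parameter count rather than derived from config.json.
constexpr std::size_t MB = 1024 * 1024;

struct model_scratch {
  std::size_t scratch0; // workspace for intermediate tensors
  std::size_t scratch1; // second workspace buffer
  std::size_t eval;     // memory for the eval graph itself
};

// Illustrative: pick the tier of the closest reference model.
// n_layers would come from num_hidden_layers in config.json.
inline model_scratch pick_scratch(int n_layers) {
  if (n_layers <= 32) return {512 * MB, 512 * MB, 1024 * MB};   // ~7B class
  if (n_layers <= 40) return {1024 * MB, 1024 * MB, 2048 * MB}; // ~13B class
  return {3072 * MB, 2048 * MB, 4096 * MB};                     // larger models
}
```

Since the values are estimates rather than exact requirements, erring on the side of the next-larger tier is a safe starting point when no close reference model exists.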