You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Well you can calculate it via: 13b times 16 Bit (f16) = 26 GB.
Accelerate will probably try to page some of the layers, if you exceed your 16 GB and get stuck there. Theoretically it's possible to stream the layers in, but i think neither GGML or this project has implemented that yet for GPT 2.
I'm trying to convert it on 16gb RAM but converting process seems to last forever.
The text was updated successfully, but these errors were encountered: