-
Notifications
You must be signed in to change notification settings - Fork 133
Issues: alibaba/Pai-Megatron-Patch
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
qwen2.5-coder dense报错:AssertionError: Find unsupported keys in checkpoint
#489
opened Mar 4, 2025 by
elimsjxr
训练75模型,TP8,PP4,4张A800sft微调训练,总是报错torch.distributed.DistStoreError: Socket Timeout
#468
opened Feb 19, 2025 by
Chunfeng1994
为什么llama-3.1权重转换时70B需要cpu_options="--use-cpu-initialization",不会速度过慢吗
#465
opened Feb 13, 2025 by
kkkeepgoing
[Question] Reason for excluding
fusion
in deepseek-v2 training
#446
opened Jan 24, 2025 by
wavy-jung
ProTip!
Mix and match filters to narrow down what you’re looking for.