-
Notifications
You must be signed in to change notification settings - Fork 1.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
The paper on the Bone structure has been updated #2312
base: main
Are you sure you want to change the base?
Conversation
Thanks for the update. I haven't compared the new paper, but checking the PR: Is DiSHA a method that could be used on its own, but is not available in PEFT, or is it the union of Bone and Bat? What I'd like to avoid is for users to be confused when they want to use DiSHA but don't find any method with that name in PEFT. Maybe this could be clarified in the description. |
Simply put, DiSHA is the overall framework, while Bone and Bat are just subsets of it. So I even want to rename Bone to DiSHA, and then choose either Bone or Bat during init_weights. |
When it comes to the PEFT code, let's avoid any renaming, as this would break backwards compatibility. |
So how should I modify the description of Bone? Because DiSHA explains in detail how Bone came about, the theoretical support provided in the older Bone paper is insufficient. |
I think rewriting the descriptions of the method as you did is okay, as the existing code is not affected by it. I just think you should highlight that in the PEFT code base, the method is still referred to as In theory, we can also allow renaming the method in code. However, for that, we would need a long deprecation period, so that users who use |
The new version of the paper is more reliable and easier for readers to understand.
https://arxiv.org/abs/2409.15371