Documentation for LoRAConfig. #2212

brynhayder · 2024-11-12T16:06:40Z

Lines 112 to 113 in 162d7e5

    
    initialization scaled by the LoRA rank for linear and layers. Setting the initialization to False leads to
    completely random initialization and is discouraged. Pass `'loftq'` to use LoftQ initialization. Pass

The documentation for `False` is not clear. Presumably 'completely random' means the arrays will be uninitialized and hence contain whatever happens to be in the relevant memory locations?


BenjaminBossan · 2024-11-12T22:44:45Z

To explain further: The default implementation initializes the LoRA A parameter randomly and the LoRA B parameter to zeros. This results in LoRA being an identity transform at initialization, which can help with training. When setting init_lora_weights=False, the LoRA B weight is instead also randomly initialized, resulting in a non-identity transform.

For real LoRA training, you almost never want that, which is why we discourage it. However, the weights are not initialized as random memory as in torch.empty, which seems to be what you suspected.
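The difference between the two modes can be sketched in a few lines of plain Python. This is only an illustration of the idea described above, not PEFT's actual code: the helper names (`init_lora`, `lora_delta`), the `zero_init_b` flag, and the Gaussian distribution with scale 0.02 are all assumptions made for the example.

```python
import random


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def random_matrix(rows, cols, rng, scale=0.02):
    """Matrix of Gaussian samples (the distribution is an illustrative assumption)."""
    return [[rng.gauss(0.0, scale) for _ in range(cols)] for _ in range(rows)]


def zeros(rows, cols):
    return [[0.0] * cols for _ in range(rows)]


def init_lora(in_features, out_features, rank, zero_init_b, rng):
    """Hypothetical helper: A is always random; B is zeros or random."""
    lora_a = random_matrix(rank, in_features, rng)
    if zero_init_b:
        lora_b = zeros(out_features, rank)  # default-style init
    else:
        lora_b = random_matrix(out_features, rank, rng)  # like init_lora_weights=False
    return lora_a, lora_b


def lora_delta(lora_a, lora_b):
    """Update added on top of the frozen base weight: B @ A."""
    return matmul(lora_b, lora_a)


rng = random.Random(0)
a_default, b_default = init_lora(4, 3, 2, zero_init_b=True, rng=rng)
a_random, b_random = init_lora(4, 3, 2, zero_init_b=False, rng=rng)

delta_default = lora_delta(a_default, b_default)
delta_random = lora_delta(a_random, b_random)

# With B = 0 the adapter adds nothing, so the layer behaves like the base layer.
default_is_noop = all(v == 0.0 for row in delta_default for v in row)
# With random B the adapter changes the output from the very first step.
random_is_noop = all(v == 0.0 for row in delta_random for v in row)
print(default_is_noop, random_is_noop)
```

In both cases the values are drawn from a well-defined distribution; neither mode reads uninitialized memory.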

github-actions · 2024-12-13T15:04:00Z

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

brynhayder · 2025-01-06T11:43:51Z

> To explain further: The default implementation initializes the LoRA A parameter randomly and the LoRA B parameter to zeros. This results in LoRA being an identity transform at initialization, which can help with training. When setting init_lora_weights=False, the LoRA B weight is instead also randomly initialized, resulting in a non-identity transform.
>
> For real LoRA training, you almost never want that, which is why we discourage it. However, the weights are not initialized as random memory as in torch.empty, which seems to be what you suspected.

Thanks. Is it possible to update the docs to explain what you've just said? I think this should be clear in the docs.

github-actions bot closed this as completed Dec 22, 2024

BenjaminBossan reopened this Jan 6, 2025
