
Clarification on 3-Step Training Approach and Commands for Uni-MoE v2 #9

Open
Bhagyashreet20 opened this issue Jun 25, 2024 · 2 comments

Comments

@Bhagyashreet20

I like the innovative three-step approach to training MLLMs. It intrigued me, and I was going through the scripts trying to replicate the three-step training technique to train my own model. However, I have a few questions:

  1. Is it possible to replicate all three training steps with the scripts in the uni-moe-v2 folder?
  2. Could you share the command to train uni-moe-v2-speech? There are only inference and eval scripts.
  3. Regarding the three-step training approach and the released model checkpoints: Uni-MoE 8-expert base is the result of step 1, Uni_MoE 8-expert experts is the model after step 2, and Uni_MoE 8-expert finetune is the model after step 3. Is my understanding correct?
@expapa
Collaborator

expapa commented Jun 26, 2024

Thanks for your attention to and support of our model! Here are some replies; I hope they are helpful:

  1. Sorry, we are not releasing the training scripts for the first two stages, but these stages can be replicated by removing the MoE structure from the code.
  2. Sure, the training script will be uploaded soon; please check back for it.
  3. Actually, the projector weights and Q-Former weights are all updated during the first, second, and third stages. So Uni-MoE 8-expert base is the base model from which we train all our stages; Uni_MoE 8-expert experts is the stage-2 result, containing the MLPs from the stage-2 models; and Uni_MoE 8-expert finetune holds the LoRA weights plus the actual Q-Former and projector weights for the MoE model.
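To make the stage-3 checkpoint description above concrete: LoRA adapters store low-rank update matrices rather than full weights. Below is a minimal, hypothetical sketch (not the repository's actual loading code; the function name and shapes are made up for illustration) of the standard LoRA merge, where a finetuned weight is recovered as W' = W + (alpha / rank) * B @ A:

```python
import numpy as np

def merge_lora(base_weight, lora_A, lora_B, alpha, rank):
    """Merge a LoRA update into a base weight matrix.

    The finetuned weight is W' = W + (alpha / rank) * B @ A, where
    A (rank x in_features) and B (out_features x rank) are the
    low-rank adapter matrices stored in the finetune checkpoint.
    """
    return base_weight + (alpha / rank) * (lora_B @ lora_A)

# Toy example with made-up shapes; a real checkpoint contains
# one (A, B) pair per adapted layer.
rng = np.random.default_rng(0)
W = rng.standard_normal((8, 8))   # base weight, e.g. one attention projection
A = rng.standard_normal((2, 8))   # LoRA A, rank 2
B = rng.standard_normal((8, 2))   # LoRA B
W_merged = merge_lora(W, A, B, alpha=16, rank=2)
print(W_merged.shape)  # (8, 8) -- same shape as the base weight
```

Components like the Q-Former and projector, by contrast, are saved as full weights in the finetune checkpoint and simply replace the corresponding base-model tensors.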

@Bhagyashreet20
Author

Cool, thanks!
