
For finetuning, add an alternative to LlamaFactory #134

Open
zhangfaen wants to merge 2 commits into main

Conversation

@zhangfaen

For finetuning, add an alternative to LlamaFactory

@zhangfaen
Author

https://github.com/zhangfaen/finetune-Qwen2-VL has 50+ stars and has been reposted many times on Twitter.
I think it would be good to let more people know about and use QwenLM/Qwen2-VL.
Would you please merge this PR? Many thanks.

@zhangfaen
Author

Thank you, deeksha! I hope this PR will be merged into the main branch soon.

@CarlHuangNuc

Do you have a plan to add continued pretraining support?

@zhangfaen
Author

Do you have a plan to add continued pretraining support?

I think fine-tuning itself is a kind of continued pretraining.

@CarlHuangNuc

Do you have a plan to add continued pretraining support?

I think fine-tuning itself is a kind of continued pretraining.

Yes, you are right. I want to add some new tasks in stage 2 (multi-task pretraining) and then do SFT in stage 3, and I wonder whether your PR can make that easier for our project. The biggest obstacle is that we cannot obtain the stage-1 model output from the Qwen2-VL team, which has only released the stage-3 model (Qwen2-VL-7B-Instruct).

@zhangfaen
Author

Below is from the Qwen2-VL tech report:

Following Qwen-VL (Bai et al., 2023b), we adopt a three-stage training methodology. In the first stage, we
focus exclusively on training the Vision Transformer (ViT) component, utilizing a vast corpus of image-text
pairs to enhance semantic understanding within the Large Language Model (LLM). In the second stage, we
unfreeze all parameters and train with a wider range of data for more comprehensive learning. In the final
stage, we lock the ViT parameters and perform exclusive fine-tuning of the LLM using instructional datasets.

It seems stage 2 simply optimizes all parameters. For your purpose, I think it should be fine to use https://github.com/zhangfaen/finetune-Qwen2-VL and prepare your own data.

In case you want to do stage 3, you can just add a few lines to my training script, for example:

model.vision_model.requires_grad_(False)    # freeze the parameters of this module so they are not updated
model.language_model.requires_grad_(False)  # freeze the parameters of this module so they are not updated

logger.info(f"total params for LoRA training: {sum(p.numel() for p in model.parameters())}")
logger.info(f"total trainable params for LoRA training: {sum(p.numel() for p in model.parameters() if p.requires_grad)}")

@CarlHuangNuc

Thanks, good idea.

@CarlHuangNuc

Why did Qwen-VL release both Qwen-VL and Qwen-VL-Chat, while Qwen2-VL only releases Qwen2-VL-Instruct and does not release a base Qwen2-VL, which would presumably be the pretrained model?

@ahmedheakl

I agree with @CarlHuangNuc.

Can you guys please provide your pretraining code? @deekshaaneja
