ValueError: Cannot merge LORA layers when the model is gptq quantized #1318

goy-jin · 2024-09-23T01:58:44Z

是否已有关于该错误的issue或讨论？ | Is there an existing issue / discussion for this?

我已经搜索过已有的issues和讨论 | I have searched the existing issues / discussions

该问题是否在FAQ中有解答？ | Is there an existing answer for this in FAQ?

我已经搜索过FAQ | I have searched FAQ

当前行为 | Current Behavior

微调Qwen_1.8b_chat_int4模型，分别使用lora和qlora方法，合并模型时报错
ValueError: Cannot merge LORA layers when the model is gptq quantized

期望行为 | Expected Behavior

解决该问题

复现方法 | Steps To Reproduce

python qwen_lora_merge.py

from peft import AutoPeftModelForCausalLM
from transformers import AutoTokenizer

path_to_adapter="/home/ren/Finetuning/Qwen-1.8-chat/"
new_model_directory="/home/ren/Finetuning/llm_model/Qwen-1_8B-Chat-Int4_law"

model = AutoPeftModelForCausalLM.from_pretrained(

path_to_adapter, # path to the output directory

device_map="auto",

trust_remote_code=True

).eval()merged_model = model.merge_and_unload()

max_shard_size and safe serialization are not necessary.

They respectively work for sharding checkpoint and save the model to safetensors

merged_model.save_pretrained(new_model_directory, max_shard_size="2048MB", safe_serialization=True)

运行环境 | Environment

- OS:Ubuntu 20.04
- Python:3.10
- Transformers:4.37.2
- PyTorch:2.2.1
- CUDA (`python -c 'import torch; print(torch.version.cuda)'`):11.8

备注 | Anything else?

no

danyow-cheung · 2024-09-24T15:38:50Z

如果你觉得这样一步到位的方式让你很不安心或者影响你接入下游应用，你可以选择先合并并存储模型（LoRA支持合并，Q-LoRA不支持），再用常规方式读取你的新模型，示例如下：

qlora不支持合并

github-actions · 2024-10-25T08:07:15Z

This issue has been automatically marked as inactive due to lack of recent activity. Should you believe it remains unresolved and warrants attention, kindly leave a comment on this thread.
此问题由于长期未有新进展而被系统自动标记为不活跃。如果您认为它仍有待解决，请在此帖下方留言以补充信息。

github-actions bot added the inactive label Oct 25, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ValueError: Cannot merge LORA layers when the model is gptq quantized #1318

ValueError: Cannot merge LORA layers when the model is gptq quantized #1318

goy-jin commented Sep 23, 2024

danyow-cheung commented Sep 24, 2024

github-actions bot commented Oct 25, 2024

ValueError: Cannot merge LORA layers when the model is gptq quantized #1318

ValueError: Cannot merge LORA layers when the model is gptq quantized #1318

Comments

goy-jin commented Sep 23, 2024

是否已有关于该错误的issue或讨论？ | Is there an existing issue / discussion for this?

该问题是否在FAQ中有解答？ | Is there an existing answer for this in FAQ?

当前行为 | Current Behavior

期望行为 | Expected Behavior

复现方法 | Steps To Reproduce

max_shard_size and safe serialization are not necessary.

They respectively work for sharding checkpoint and save the model to safetensors

运行环境 | Environment

备注 | Anything else?

danyow-cheung commented Sep 24, 2024

github-actions bot commented Oct 25, 2024