Fix https://github.com/huggingface/peft/issues/1974#issue-2437471248

#74 opened by Finger-ebic

The issue is also posted at https://github.com/THUDM/GLM-4/issues/493. THUDM/glm-4-9b-chat uses custom code, which is not compatible with PromptEncoder. As the error indicates, this line fails:

        batch_size, seq_length = input_ids.shape
    AttributeError: 'NoneType' object has no attribute 'shape'

This is because the prompt encoder does not pass input_ids; it passes inputs_embeds directly. I could patch the issue by falling back to inputs_embeds, editing this line:

https://huggingface.co/THUDM/glm-4-9b-chat/blob/c24133cef34ff7a7010f1e97c113effdead0966b/modeling_chatglm.py#L875

        if input_ids is not None:
            batch_size, seq_length = input_ids.shape
        else:
            batch_size, seq_length, _ = inputs_embeds.shape
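To illustrate why the fallback works, here is a minimal standalone sketch of the patched shape logic. The function name and the tensor stand-ins are hypothetical (only the `.shape` attribute matters for this sketch); the real code operates on PyTorch tensors inside modeling_chatglm.py:

```python
from types import SimpleNamespace

def infer_batch_and_seq(input_ids=None, inputs_embeds=None):
    """Mirror of the patched logic: prefer input_ids, but fall back to
    inputs_embeds, which is what PEFT's prompt encoder passes."""
    if input_ids is not None:
        batch_size, seq_length = input_ids.shape
    else:
        # inputs_embeds carries an extra trailing hidden-size dimension
        batch_size, seq_length, _ = inputs_embeds.shape
    return batch_size, seq_length

# Tensor stand-ins for the sketch (only .shape is needed here)
ids = SimpleNamespace(shape=(2, 16))           # (batch, seq)
embeds = SimpleNamespace(shape=(2, 16, 4096))  # (batch, seq, hidden)

print(infer_batch_and_seq(input_ids=ids))         # (2, 16)
print(infer_batch_and_seq(inputs_embeds=embeds))  # (2, 16)
```

Both paths recover the same `(batch_size, seq_length)`, so downstream code that only needs those two values is unaffected by which input form arrives.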

To supplement the problem scenario: this issue occurs when using PEFT for p-tuning fine-tuning.
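For reference, a configuration sketch of the p-tuning setup that triggers this code path; the hyperparameter values are illustrative only, not taken from the original report:

```python
# Hypothetical p-tuning setup; hyperparameters are illustrative.
from transformers import AutoModelForCausalLM
from peft import PromptEncoderConfig, get_peft_model

model = AutoModelForCausalLM.from_pretrained(
    "THUDM/glm-4-9b-chat", trust_remote_code=True
)
peft_config = PromptEncoderConfig(
    task_type="CAUSAL_LM",
    num_virtual_tokens=20,
    encoder_hidden_size=128,
)
model = get_peft_model(model, peft_config)

# During forward, PEFT concatenates the virtual-token embeddings with the
# input embeddings and calls the base model with inputs_embeds only, so
# input_ids arrives as None inside modeling_chatglm.py.
```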


