Error with dtype=torch.float16.

#10

by Cheshire94 - opened Aug 4, 2023

Discussion

Cheshire94

Aug 4, 2023

•

edited Aug 4, 2023

In Qwen-7B-Chat/qwen_generation_utils.py
line 349： scores[i, self.eos_token_id] = float(2**30)

When using torch_dtype=torch.float16, this line above trigger the following error:
RuntimeError: value cannot be converted to type at::Half without overflow

Maybe consider changing it to torch.finfo(torch.float16).max when the dtype is set to torch.float16 and keeping it to float(2**30) for torch.float32?

logicwong

Qwen org Aug 4, 2023

We manually set it to float(2**15), try again?

jklj077

Qwen org Aug 8, 2023

Thank you for raising this problem!

It has been fixed in main. I'll just close this for now. If the problem persists, please open a new issue.

jklj077 changed discussion status to closed Aug 8, 2023

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment