Multiple GPUs for inference error
same error
same error
I'm very sorry, but this problem has troubled me for a long time.
Different devices or different numbers of GPUs always trigger this issue in various ways.
I have a silly workaround, which involves modifying the source code of utils.py in the transformers library, manually moving the tensors to the same device.
If there is a better method, please let me know!
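This isn't the actual patch, just a minimal sketch of the idea behind it: before an op that mixes tensors living on different GPUs, move the operands onto one device. The helper name and tensors below are hypothetical.

```python
import torch

def to_same_device(*tensors, device=None):
    """Move every tensor onto one device (the first tensor's, unless given)."""
    device = device or tensors[0].device
    return tuple(t.to(device) for t in tensors)

# Example: align two tensors that ended up on different GPUs before combining them.
a = torch.randn(2, 4, device="cuda:0")
b = torch.randn(2, 4, device="cuda:1")
a, b = to_same_device(a, b)
out = a + b  # both operands now live on cuda:0, so the op no longer fails
```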
Could you share this please:
"I have a silly workaround, which involves modifying the source code of utils.py in the transformers library, manually moving the tensors to the same device."
@czczup
As a silly person myself, I would also be interested to know about your changes in utils.py. Could you maybe post a diff? Thanks.
Please refer to the new readme code. By placing the input and output layers of the LLM on a single device, it should now work without needing to modify utils.py, and this issue should no longer occur.
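For reference, here is a minimal sketch of that kind of device map, assuming a vision-language model with submodules named `vision_model`, `mlp1`, and `language_model`; the checkpoint name and layer count are placeholders, so please use the exact code from the README.

```python
import torch
from transformers import AutoModel

path = "OpenGVLab/InternVL-Chat-V1-5"   # placeholder checkpoint name
num_layers = 48                          # assumed; read it from the model config
num_gpus = torch.cuda.device_count()

# Pin the vision encoder and the LLM's embedding / final norm / lm_head to GPU 0
# so hidden states and logits never straddle devices; spread the decoder layers
# over the available GPUs.
device_map = {
    "vision_model": 0,
    "mlp1": 0,
    "language_model.model.embed_tokens": 0,
    "language_model.model.norm": 0,
    "language_model.lm_head": 0,
}
for i in range(num_layers):
    device_map[f"language_model.model.layers.{i}"] = i % num_gpus

model = AutoModel.from_pretrained(
    path,
    torch_dtype=torch.bfloat16,
    trust_remote_code=True,
    device_map=device_map,
).eval()
```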
Hey bud! Just saw that you updated the readme! Wow it works! Thanks a ton man!! You rock!
It works, thanks!