Can we run this in FP16 instead of FP32?

#3
by vince62s - opened

Hi Ricardo,
Would it make sense to release a checkpoint in FP16? Would the accuracy change?

Answering my own question: converting to FP16 only takes two code changes, model.half() and in_features.to(torch.float16), and it makes inference roughly twice as fast with about half the RAM, for the same scores.
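For context, the general PyTorch pattern behind those two changes is just this (a toy sketch, nothing COMET-specific):

```python
import torch

# FP16 mainly pays off on GPU; half-precision matmuls may not be supported on older CPU/PyTorch builds
device = "cuda" if torch.cuda.is_available() else "cpu"

# Toy stand-in for the estimator head: .half() converts the weights to FP16
model = torch.nn.Linear(1024, 1).eval().half().to(device)

# Inputs must be cast to the same dtype, otherwise PyTorch raises a dtype-mismatch error
features = torch.randn(8, 1024, device=device).to(torch.float16)

with torch.no_grad():
    scores = model(features)

print(scores.dtype)  # torch.float16
```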

Hi @vince62s, did you change the model to half precision and cast the input on your side of the code, or did you do it inside the unbabel-comet Python package?

I modified the comet code; only two lines need to change, as mentioned in my previous message.

@vince62s which files did you change?

In score.py you need to add .half() to the line: model = load_from_checkpoint(model_path).half()
In feed_forward.py you need to change the last line to: return self.ff(in_features.to(torch.float16))
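Putting the two edits side by side (the surrounding code may differ slightly between unbabel-comet versions, so treat this as a sketch rather than an exact patch):

```python
# score.py -- convert all model weights to FP16 right after loading the checkpoint
model = load_from_checkpoint(model_path).half()

# feed_forward.py -- cast the incoming features so they match the FP16 weights
return self.ff(in_features.to(torch.float16))
```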

Confirmed, it works. Thanks so much!
Maybe a PR to Unbabel's repo would be appreciated?
