Half precision

by ljhwild - opened Jun 20, 2024

Discussion

ljhwild

Jun 20, 2024

Is this compatible out of the box with half precision or quantizations as opposed to unbables library implementation?

vince62s

Owner Jun 20, 2024

as you can see the model size is 7GB which for a 3.5G params is FP16.
but you can achive the same with the Unbabel model by changing two lines of code.

vince62s changed discussion status to closed Jun 20, 2024

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

Your need to confirm your account before you can post a new comment.

· Sign up or log in to comment