Question

by WesPro - opened Oct 26, 2024

Oct 26, 2024

Hi, I was just wondering what the point is behind slerping the same model with itself? Is getting noticeably better/different or is the only difference the dtype?

xi0v

Oct 26, 2024

Hi, I was just wondering what the point is behind slerping the same model with itself? Is getting noticeably better/different or is the only difference the dtype?

Probably to change the dtype of the model to F32. Which for some People yields "better" outputs compared to BF/FP16. VRAM Usage is higher when using F32 though, which is why people prefer BF/FP16.

allknowingroger

Owner Oct 27, 2024

•

edited Oct 27, 2024

CUDA optimised for 32 floating numbers
https://m.youtube.com/watch?v=h9Z4oGN89MU&t=9m23s

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment