Conversion to pytorch

#1 by alpindale - opened

Thanks for the fix. Can you share the script you used for converting the F16 GGUF to PyTorch?

Owner
  1. build llama.cpp, then requantize to pure F16: `./quantize --pure --allow-requantize ./miqu-1-70b.q5_K_M.gguf 1` (ftype 1 = F16)
  2. apply a.patch, then run `python3 convert.py --dump ./miqu-1-70b.q5_K_M.gguf`
  3. run https://gist.github.com/152334H/27d4181ce3641cec335131b971584ddd (a rough sketch of what this step amounts to follows below)
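
For readers without the gist open, here is a minimal sketch of what step 3 amounts to, not the author's actual script: read the pure-F16 GGUF produced in step 1 with the `GGUFReader` from llama.cpp's gguf-py package and collect the tensors into a PyTorch state dict. The file names are assumptions, and the keys come out as GGUF tensor names (e.g. `blk.0.attn_q.weight`), which still need remapping to HF names; the hack quoted in the next post is one way to build that remapping.

```python
# Minimal sketch, assuming llama.cpp's gguf-py package and a pure-F16 input.
import torch
from gguf import GGUFReader

reader = GGUFReader("./miqu-1-70b.f16.gguf")  # hypothetical name for the step-1 output
state_dict = {}
for t in reader.tensors:
    # t.data is a numpy view over the mmap'd file; copy it so each tensor
    # owns its memory, then reshape to PyTorch order (GGUF dims are reversed).
    arr = torch.from_numpy(t.data.copy())
    state_dict[t.name] = arr.reshape(tuple(reversed(t.shape.tolist())))

torch.save(state_dict, "pytorch_model.bin")
```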
152334H changed discussion status to closed
Owner

behaviour is hardcoded for the llama2-70b shape, so be wary of that

also, the result of `d = {v[1]: k for k, v in tm.get_tensor_name_map(tm.MODEL_ARCH.LLAMA, LAYERS).mapping.items()}` is an unstable hack that will change in future llama.cpp versions
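
For context, a sketch of that hack, with `tm` assumed to be llama.cpp's `gguf.tensor_mapping` module and `LAYERS` the block count (80 for a llama2-70b shape). `mapping` goes from candidate source names (HF names among them) to `(MODEL_TENSOR, gguf_name)` pairs, so inverting it silently keeps whichever candidate was inserted last, and that insertion order can change between llama.cpp versions:

```python
# Sketch of the inversion hack; assumes llama.cpp's gguf-py package.
from gguf import tensor_mapping as tm

LAYERS = 80  # assumption: block count for a llama2-70b-shaped model
name_map = tm.get_tensor_name_map(tm.MODEL_ARCH.LLAMA, LAYERS)

# mapping: candidate source name -> (MODEL_TENSOR, canonical gguf base name).
# The dict comprehension inverts it, keeping only the last candidate seen
# for each gguf name -- which one that is depends on the gguf-py version.
d = {v[1]: k for k, v in name_map.mapping.items()}
print(d["blk.0.attn_q"])  # one of the source-name candidates for that tensor
```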

Interesting. Now I'm praying that I can train/merge qlora and quant to exl2 without issues lol.
