Convert checkpoint files to float16

#6
by mkardas - opened
No description provided.
mkardas changed pull request status to open

How can I implement this?

What are you trying to achieve?

The 1.3b model uses most of my 8 GB of VRAM, so large requests push it over pretty quickly. I was hoping this would cut the memory use down.

You can load your model with:

import torch
from transformers import OPTForCausalLM

# Load the weights directly in float16 and let Accelerate place the
# model on the available GPU(s) (requires `accelerate` to be installed).
model = OPTForCausalLM.from_pretrained(
    "facebook/galactica-1.3b",
    torch_dtype=torch.float16,
    device_map="auto",
)
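
In float16 the weights take about 2 bytes per parameter, so roughly 2.6 GB for the 1.3b model (activations for long requests come on top of that). If you want to check, here's a rough sketch that prints the footprint and runs a generation; the prompt is just an illustrative example in the style of the model card:

from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("facebook/galactica-1.3b")

# Size of the loaded weights in bytes (~2.6 GB in float16).
print(model.get_memory_footprint())

# Example prompt; longer prompts and outputs use more VRAM.
inputs = tokenizer("The Transformer architecture [START_REF]", return_tensors="pt").to(model.device)
outputs = model.generate(inputs.input_ids, max_new_tokens=60)
print(tokenizer.decode(outputs[0]))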
mkardas changed pull request status to merged
