PierreMesure/sentence-bert-swedish-cased-gguf

This is a GGUF conversion of KBLab/sentence-bert-swedish-cased (F32).

I used llama.cpp's script (convert_hf_to_gguf.py):

python convert_hf_to_gguf.py --outtype f32 ./sentence-bert-swedish-cased --outfile ./sentence-bert-swedish-cased.F32.gguf

Usage

You can use this with any tool building on llama.cpp. I made this GGUF to import it in Ollama.

  1. Create a Modelfile:

    FROM ./sentence-bert-swedish-cased.F32.gguf
    

    Or in one command:

    git clone https://huggingface.co/PierreMesure/sentence-bert-swedish-cased-gguf
    cd sentence-bert-swedish-cased-gguf/
    echo 'FROM ./sentence-bert-swedish-cased.F32.gguf' > Modelfile
    
  2. Import with Ollama

    ollama create sentence-bert-swedish-cased
    
Downloads last month
42
GGUF
Model size
124M params
Architecture
bert

32-bit

Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for PierreMesure/sentence-bert-swedish-cased-gguf

Quantized
(1)
this model