visheratin committed
Commit 87c2778 • 1 Parent(s): 0341e0d
Update README.md
README.md CHANGED
@@ -4,7 +4,17 @@ datasets:
 - visheratin/laion-coco-nllb
 ---
 
-
+## Model Summary
+
+NLLB-CLIP is a model that combines a text encoder from the [NLLB model](https://huggingface.co/facebook/nllb-200-distilled-600M) and an image encoder from the
+standard [CLIP](https://huggingface.co/openai/clip-vit-base-patch32). This allows us to extend the model capabilities
+to the 201 languages of Flores-200. NLLB-CLIP achieves state-of-the-art results on the [Crossmodal-3600](https://google.github.io/crossmodal-3600/) dataset by performing
+particularly well on low-resource languages. You can find more details about the model in the [paper](https://arxiv.org/abs/2309.01859).
+
+## How to use
+
+The model [repo](https://huggingface.co/visheratin/nllb-clip-base/tree/main) contains the model code files that allow using NLLB-CLIP like any other model from the hub.
+The interface is also compatible with CLIP models. Example code is below:
 
 ```
 from transformers import AutoTokenizer, CLIPProcessor
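The diff context cuts the example off after the first import. For reference, here is a minimal sketch of zero-shot image-text matching with NLLB-CLIP. It assumes the repo's custom code loads through `AutoModel` with `trust_remote_code=True` and that the forward pass is CLIP-compatible (returning `logits_per_image`), as the README states; the image URL and candidate captions are illustrative, not from the original example.

```
import requests
from PIL import Image
from transformers import AutoModel, AutoTokenizer, CLIPProcessor

# Tokenizer from the NLLB text encoder; image preprocessing from standard CLIP.
tokenizer = AutoTokenizer.from_pretrained("facebook/nllb-200-distilled-600M")
processor = CLIPProcessor.from_pretrained("openai/clip-vit-base-patch32")

# trust_remote_code=True pulls in the model code files shipped in the repo.
model = AutoModel.from_pretrained("visheratin/nllb-clip-base", trust_remote_code=True)

# Illustrative inputs: any image plus candidate captions in any Flores-200 language.
url = "http://images.cocodataset.org/val2017/000000039769.jpg"
image = Image.open(requests.get(url, stream=True).raw)
image_inputs = processor(images=image, return_tensors="pt")
text_inputs = tokenizer(
    ["a photo of a cat", "a photo of a dog"],
    padding=True,
    return_tensors="pt",
)

# CLIP-compatible forward pass: similarity logits between the image and each caption.
outputs = model(
    input_ids=text_inputs.input_ids,
    attention_mask=text_inputs.attention_mask,
    pixel_values=image_inputs.pixel_values,
)
print(outputs.logits_per_image.softmax(dim=-1))
```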