Upload README.md with huggingface_hub
README.md
CHANGED
@@ -21,7 +21,7 @@ model-index:
 
 # X-CLIP (base-sized model)
 
-X-CLIP model (base-sized, patch resolution of
+X-CLIP model (base-sized, patch resolution of 16) trained fully-supervised on [Kinetics-400](https://www.deepmind.com/open-source/kinetics). It was introduced in the paper [Expanding Language-Image Pretrained Models for General Video Recognition](https://arxiv.org/abs/2208.02816) by Ni et al. and first released in [this repository](https://github.com/microsoft/VideoX/tree/master/X-CLIP).
 
 This model was trained using 8 frames per video, at a resolution of 224x224.
 
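
The README text in this hunk says the model was trained on 8 frames per video at 224x224. A minimal sketch of what that means when preparing a clip, assuming evenly spaced frame sampling: the README does not say how frames are chosen, so the sampling strategy and the helper names `sample_frame_indices` and `expected_clip_shape` are hypothetical and purely illustrative.

```python
# Values taken from the README text above.
NUM_FRAMES = 8      # frames per video
FRAME_SIZE = 224    # height and width in pixels


def sample_frame_indices(total_frames: int, num_frames: int = NUM_FRAMES) -> list[int]:
    """Pick `num_frames` indices spread evenly over a video.

    Assumption: uniform spacing from the first to the last frame.
    This is an illustration, not the sampling used by the authors.
    """
    if total_frames <= 0 or num_frames <= 0:
        raise ValueError("total_frames and num_frames must be positive")
    if num_frames == 1:
        return [0]
    step = (total_frames - 1) / (num_frames - 1)
    return [round(i * step) for i in range(num_frames)]


def expected_clip_shape(channels: int = 3) -> tuple[int, int, int, int]:
    """Shape of one sampled clip as (frames, channels, height, width)."""
    return (NUM_FRAMES, channels, FRAME_SIZE, FRAME_SIZE)


if __name__ == "__main__":
    print(sample_frame_indices(32))
    print(expected_clip_shape())
```

If a video has fewer than 8 frames, this sketch repeats some indices rather than failing; a real pipeline might pad or skip such clips instead.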