Zero-Shot Image Classification
Transformers
Safetensors
clip
Inference Endpoints

CLIP ViT-L/14 finetune: SAE-informed adversarial training

image/png

  • Interesting things with adversarial robustness to try: Right-click and download individual images: Image 1 -- Image 2 -- Image 3 image/png
  • Upload each into zero-shot [hopefully available soon on the right here->]
  • Try labels (class names): a photo of a cat, a photo of a dog, a photo of a text
  • Repeat the same with e.g. my GmP models models and see what happens. =)
  • I'm really hoping the HF format .safetensors conversion didn't mess anything up (it happens!); just in case it did, or if there's no inference API available to use:
  • I put a script that will do the same thing (on the not-converted model) on my GitHub repo. Plus, you can just reproduce the fine-tune yourself, as that code is also available! 🤗
  • 👉 All training info & code: github.com/zer0int/CLIP-SAE-finetune
  • Buy me a coffee

image/png

Downloads last month
325
Safetensors
Model size
428M params
Tensor type
F32
·
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for zer0int/CLIP-SAE-ViT-L-14

Finetuned
(52)
this model
Finetunes
1 model

Datasets used to train zer0int/CLIP-SAE-ViT-L-14