Usage rights?

#5
by ndurner - opened

Hi,

the model pages on HF state that:

The models are available in float32, bfloat16 and float16 format for research purposes only.

The "for research purposes" portion is confusing because:

  • the Gemma license, which is linked as "Terms", does not have this provision
  • README.md on Github does have such a provision, but only for Transfer Checkpoints:
    "We provide checkpoints transferred to most of the tasks we evaluated transfer on [...] for academic research purposes only.". This could imply that the base model and the "mix" checkpoints are free to use for non-academic work (subject to the Gemma license), but contradicts the HF page for e.g. paligemma-3b-mix-448 which does have the "for research purposes" statement.

Please clarify.

Thanks,

Nils

Asked the other way round: is it correct that only "downstream" trained models are just for academic purposes, such as:

  • google/paligemma-3b-ft-ocrvqa-896
  • google/paligemma-3b-ft-docvqa-896
  • google/paligemma-3b-ft-infovqa-896

?

Hi Nils,

If you consider non-research deployments, we recommend using the paligemma-3b-pt-{224|448|896} checkpoints, and fine-tune very quickly on your customized datasets.

PaliGemma fine-tune is pretty lightweight, that you could try our colab at:
https://colab.sandbox.google.com/github/google-research/big_vision/blob/main/big_vision/configs/proj/paligemma/finetune_paligemma.ipynb

I also found this fine-tune example tutoriol from Twitter for reference: https://blog.roboflow.com/how-to-fine-tune-paligemma/

Best,
Xiaohua

I also created https://huggingface.co/google/paligemma-3b-mix-224/discussions/7, I think the License: gemma property then needs to be fixed to make clear that it is licensed under different (only research) terms.

Sign up or log in to comment