Ciao,please check that the model architecture and tokenizers are included in official transformers... so we can check with lama.cpp or OpenVINO to create Quantizations
· Sign up or log in to comment