Patching hf bug that creates wrong cache length if only inputs_embeds are passed to the model (#19) cc9521a verified itlevy tomer-nv committed on Oct 13, 2024
DeciLMForCausalLM(DeciLMPreTrainedModel, GenerationMixin) for v4.50 (#16) 3209eec verified itlevy committed on Sep 30, 2024