Itay Levy
itlevy
AI & ML interests
None yet
Recent Activity
authored
a paper
about 1 month ago
Puzzle: Distillation-Based NAS for Inference-Optimized LLMs
Organizations
itlevy's activity
Patching hf bug that creates wrong cache length if only inputs_embeds are passed to the model
#19 opened 3 months ago
by
tomer-nv
DeciLMForCausalLM(DeciLMPreTrainedModel, GenerationMixin) for v4.50
#16 opened 3 months ago
by
itlevy
add batch_size attribute to VariableCache
#15 opened 3 months ago
by
itlevy
nvidia-open-model-license
#14 opened 3 months ago
by
itlevy
nvidia-open-model-license
#13 opened 3 months ago
by
itlevy
nvidia-open-model-license
#12 opened 3 months ago
by
itlevy
v4.46 support
#7 opened 3 months ago
by
itlevy
loading as llama model
1
#4 opened 3 months ago
by
KnutJaegersberg
v4.45 support
#6 opened 3 months ago
by
itlevy
fixed flash_attention backward_compat
#3 opened 3 months ago
by
itlevy
flash_attention_utils_backward_compat
#2 opened 3 months ago
by
itlevy
flash_attention_utils_backward_compat
#2 opened 3 months ago
by
itlevy