Adding `safetensors` variant of this model
#3 opened 5 days ago
by
SFconvertbot
Allow for attention weights to be extracted.
#2 opened 8 days ago
by
FJFehr
Included gradient checkpointing
#1 opened 8 days ago
by
FJFehr