support use_flash_attn in from_pretrained
#2
by
michael-guenther
- opened
This adds a shortcut to enable flash attention and xformer attention.
michael-guenther
changed pull request status to
open
gmastrapas
changed pull request status to
merged