support use_flash_attn in from_pretrained

#2

This adds a shortcut for enabling flash attention and xFormers attention: `use_flash_attn` can now be passed directly to `from_pretrained`.
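As a rough illustration of the pattern, here is a minimal sketch of how a `from_pretrained` override can accept such a flag and forward it to the model config. `ToyConfig` and `ToyModel` are hypothetical stand-ins, not the classes changed in this PR, and the sketch only covers `use_flash_attn`, since the PR description does not name the option used for xFormers attention.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ToyConfig:
    # Hypothetical config; the real config class in this repository may differ.
    use_flash_attn: bool = False


class ToyModel:
    def __init__(self, config: ToyConfig):
        self.config = config

    @classmethod
    def from_pretrained(
        cls,
        name: str,
        use_flash_attn: Optional[bool] = None,
        config: Optional[ToyConfig] = None,
    ) -> "ToyModel":
        # `name` is unused here; a real implementation would load weights from it.
        config = config if config is not None else ToyConfig()
        # Only override the config when the caller passed the flag explicitly,
        # so the value stored in the config remains the default.
        if use_flash_attn is not None:
            config.use_flash_attn = use_flash_attn
        return cls(config)


model = ToyModel.from_pretrained("any-name", use_flash_attn=True)
```

Using `None` as the default lets the call site distinguish "not specified" from an explicit `False`, which is one common way to keep the config value authoritative unless the caller overrides it.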

michael-guenther changed pull request status to open
gmastrapas changed pull request status to merged