Update detail about Triton Flash Attention with ALiBi implementation 8a9076d jacobfulano committed on Jan 3
Change attention_probs_dropout_prob to 0.1 so that FlashAttention/triton dependencies are avoided ed2a544 jacobfulano committed on Dec 27, 2023