Change attention_probs_dropout_prob to 0.1 so that FlashAttention/triton dependencies are avoided
ed2a544
jacobfulano
commited on