When torch.nn.functional.scaled_dot_product_attention calls _scaled_dot_product_attention_math, the model reports an error
4
#3 opened 4 months ago
by
Quasimodo0808
Add task tag
#2 opened 5 months ago
by
merve
CogVLM2 3rd party sft support
#1 opened 5 months ago
by
tastelikefeet