Enhance speed by using nn.layernorm and nn.groupnorm (triton-lang/triton#5712) 0b5291c verified zhiyuan8 committed 2 days ago