Improved precision / reduced frequency of nan outputs, allow bf16 t5, f32 rmsnorm, larger clamp
f708e90
aredden
commited on