bglearning
Set use_memory_efficient_attention to False and modify config.json of 2_Dense_8192
1bd9d2a
raw
history blame contribute delete
122 Bytes
{
"in_features": 1024,
"out_features": 1024,
"bias": true,
"activation_function": "torch.nn.modules.linear.Identity"
}