Allow for attention weights to be extracted.

[Browse files]

There is a small bug in indexing which didn't allow me to get the attentions out. I think this change fixes it.
- modeling_codesage.py +1 -1
modeling_codesage.py
CHANGED
```diff
@@ -149,7 +149,7 @@ class CodeSageBlock(nn.Module):
         feed_forward_hidden_states = self.mlp(hidden_states)
         hidden_states = residual + feed_forward_hidden_states

-        outputs = (hidden_states,) + outputs
+        outputs = (hidden_states,) + outputs
         return outputs  # hidden_states, present, (attentions)
```

[NOTE: extraction artifact — in the original side-by-side diff the removed ("-") and added ("+") lines differ, but they render identically here, so the exact change was lost. Per the commit message ("a small bug in indexing"), the removed line most likely sliced the tuple (e.g. `outputs[1:]` or `outputs[2:]`), which dropped the attention weights from the returned `(hidden_states, present, (attentions))` tuple; the fix prepends `hidden_states` to the full `outputs` tuple so the attentions are preserved. Verify against the repository's actual diff.]