nan values in the hidden_states

#5
by puar-playground - opened

This checkpoint produces NaN values in the hidden_states. It also sometimes gets stuck during inference.

OpenGVLab org

Hello, thank you for your feedback. Could you please provide a simple example so that I can reproduce the issue? I haven't encountered this model outputting NaN hidden states.
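
For anyone putting together a reproduction, a small check like the one below can help narrow down which layer first shows NaN values. This is only a sketch: `find_nan_layers` is a hypothetical helper, not part of the model code, and it assumes each layer's hidden state has already been converted to nested Python lists (for example with `.tolist()` on the tensor).

```python
import math


def find_nan_layers(hidden_states):
    """Return the indices of layers that contain at least one NaN.

    Assumption: `hidden_states` is a sequence with one entry per layer,
    each entry being a nested list of floats.
    """
    def has_nan(value):
        # Recurse through nested lists/tuples down to individual floats.
        if isinstance(value, (list, tuple)):
            return any(has_nan(item) for item in value)
        return isinstance(value, float) and math.isnan(value)

    return [index for index, layer in enumerate(hidden_states) if has_nan(layer)]


# Toy data standing in for real hidden states: the second layer has a NaN.
toy_hidden_states = [[[0.1, 0.2]], [[float("nan"), 0.3]], [[0.5, 0.6]]]
print(find_nan_layers(toy_hidden_states))
```

Posting the first affected layer index together with the input that triggers it would make the issue easier to reproduce.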

line 1119, in forward
position_ids = position_ids.view(-1, seq_length).long()
RuntimeError: shape '[-1, 0]' is invalid for input of size 846

I keep getting this error.
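
For context, the message says that seq_length evaluated to 0 while position_ids held 846 elements, so the -1 dimension cannot be inferred. The sketch below is a simplified pure-Python model of that shape check, not PyTorch's actual implementation; `infer_view_shape` is a hypothetical name used only for illustration.

```python
def infer_view_shape(numel, shape):
    """Resolve a single -1 entry in `shape` for an input with `numel` elements.

    Simplified model of the rule behind `view(-1, seq_length)`: the product
    of the known dimensions must be non-zero and must divide `numel`.
    """
    if shape.count(-1) > 1:
        raise ValueError("only one dimension can be inferred")

    known = 1
    for dim in shape:
        if dim != -1:
            known *= dim

    if -1 in shape:
        if known == 0 or numel % known != 0:
            raise ValueError(
                f"shape '{list(shape)}' is invalid for input of size {numel}"
            )
        return [numel // known if dim == -1 else dim for dim in shape]

    if known != numel:
        raise ValueError(
            f"shape '{list(shape)}' is invalid for input of size {numel}"
        )
    return list(shape)


# A non-zero sequence length resolves cleanly.
print(infer_view_shape(846, [-1, 846]))

# A sequence length of 0 fails in the same way as the traceback above.
try:
    infer_view_shape(846, [-1, 0])
except ValueError as error:
    print(error)
```

So the thing to check is why seq_length is 0 when forward is called, for example by printing the shape of the inputs right before the call.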

OpenGVLab org

Thanks for the feedback. This problem should have been fixed as of yesterday; please try again.
