建议forward函数参数增加full_attention_mask

#35
by zkwhandan - opened

建议把ChatGLMForConditionalGeneration.forward的输入参数中中增加full_attention_mask参数,然后传递给transformer,这样就可以更灵活的去进行多轮对话的训练了

Sign up or log in to comment