Gradient Checkpointing for HF Trainer

by acon96 - opened

Wire up the existing gradient checkpointing logic to work with the transformers Trainer.

Looking forward to the merge of this PR!

It would be great if you could merge this.

Microsoft org

Hello everyone!

We have an ongoing PR that will solve this issue.


gugarosa changed pull request status to closed
