
🤗 Language model initialized from mT5 and trained for an additional 100K steps on the Prefix LM objective using mC4 data.
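The Prefix LM objective adapts a span-corruption model for left-to-right generation: each unlabeled sequence is split into a fully visible prefix (the input) and a continuation the model must predict. A minimal sketch of this data construction, assuming token lists as input (the function name and random split policy are illustrative, not taken from the T5X codebase):

```python
import random

def make_prefix_lm_example(tokens, rng=None):
    """Split a token sequence into a (prefix, continuation) pair.

    The prefix is fed to the encoder with full visibility; the
    continuation is the decoder's generation target.
    """
    rng = rng or random.Random(0)
    # Choose a split point so both sides are non-empty.
    split = rng.randint(1, len(tokens) - 1)
    return tokens[:split], tokens[split:]

# Example: turn one unlabeled sequence into an (input, target) pair.
inputs, targets = make_prefix_lm_example(list("hello world"))
```

During LM adaptation, the pretraining corpus (here mC4) is streamed through this kind of split, so no labeled data is needed.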

Paper: Overcoming Catastrophic Forgetting in Zero-Shot Cross-Lingual Generation

Authors: Tu Vu, Aditya Barua, Brian Lester, Daniel Cer, Mohit Iyyer, Noah Constant

PyTorch port of the original Flax checkpoint from the Google T5X repository.

Model size: 3.74B params (F32 tensors, Safetensors format).