GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection Paper โข 2403.03507 โข Published Mar 6 โข 182 โข 15
SaulLM-7B: A pioneering Large Language Model for Law Paper โข 2403.03883 โข Published Mar 6 โข 74 โข 5
PersianMind: A Cross-Lingual Persian-English Large Language Model Paper โข 2401.06466 โข Published Jan 12 โข 3 โข 2
GRATH: Gradual Self-Truthifying for Large Language Models Paper โข 2401.12292 โข Published Jan 22 โข 2 โข 2