LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models
Paper: arXiv 2309.12307
Note Trains a LoRA while shifting local attention (shifted sparse attention, S²-Attn); achieves better perplexity than training-free long-context methods. A sketch of the attention pattern follows below.
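A minimal PyTorch sketch of the S²-Attn idea behind that note, under stated assumptions: the sequence is split into equal groups, attention runs only within each group, and half the heads are rolled by half a group so information crosses group borders. The causal mask and the LoRA adapters themselves are omitted for brevity, and all names here are illustrative rather than the paper's code.

```python
import torch
import torch.nn.functional as F

def s2_attention(q, k, v, group_size):
    """Shifted sparse attention sketch. q, k, v: (batch, seq, heads, dim)."""
    B, S, H, D = q.shape
    assert S % group_size == 0, "seq length must be divisible by group_size"

    def shift(t, offset):
        # Roll the second half of the heads along the sequence axis so
        # their groups straddle the original group borders.
        t = t.clone()
        t[:, :, H // 2:] = t[:, :, H // 2:].roll(offset, dims=1)
        return t

    q, k, v = (shift(t, -group_size // 2) for t in (q, k, v))

    def group(t):
        # Fold groups into the batch dim: (B*num_groups, H, group, D).
        return t.reshape(B * S // group_size, group_size, H, D).transpose(1, 2)

    # Standard attention, but only within each group (causal mask omitted).
    out = F.scaled_dot_product_attention(group(q), group(k), group(v))
    out = out.transpose(1, 2).reshape(B, S, H, D)

    # Undo the shift so token positions line up again.
    return shift(out, group_size // 2)

# Usage: full-attention cost O(S^2) drops to roughly O(S * group_size).
q = k = v = torch.randn(2, 8192, 8, 64)
print(s2_attention(q, k, v, group_size=2048).shape)  # (2, 8192, 8, 64)
```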
Note Explains what may cause instability in transformer training. Learning-rate (LR) sensitivity is a good metric for training stability, and the explanation of why loss spikes occur is also very interesting.
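A hypothetical sketch of an LR-sensitivity metric in the spirit of that note, assuming the common definition of the average gap between each run's final loss and the best run's loss across a learning-rate sweep, with divergent losses capped. `train_and_eval` is an assumed stand-in for a real training run, not an API from the paper.

```python
import numpy as np

def lr_sensitivity(train_and_eval, lrs, loss_cap=10.0):
    """Mean excess final loss over the best run across an LR sweep."""
    # Cap divergent losses so one blown-up run doesn't dominate the metric.
    losses = np.minimum([train_and_eval(lr) for lr in lrs], loss_cap)
    # ~0 when loss is flat across LRs (stable); large when many LRs
    # diverge or badly underperform the best setting (unstable).
    return float(np.mean(losses - losses.min()))

# Usage with a toy stand-in: a quadratic "loss landscape" in log-LR,
# minimized near lr = 3e-3.
toy = lambda lr: (np.log10(lr) + 2.5) ** 2 + 2.0
print(lr_sensitivity(toy, np.logspace(-4, -1, 7)))
```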