LLaMA-7B + Landmark Attention

This repo hosts the weight diff between the original LLaMA 7B model and LLaMA 7B trained with landmark attention for 15,000 steps on RedPajama. Please visit the GitHub repository for instructions on how to recover the full weights and how to use them.

GitHub repository: https://github.com/epfml/landmark-attention
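
To illustrate what a weight diff is, here is a minimal sketch of the general idea, assuming the diff is a plain element-wise difference (trained minus original) stored under the same parameter names. The function `apply_weight_diff` and the toy parameter names are hypothetical and are not taken from the repository; the repository's own recovery script is the authoritative procedure and may handle details this sketch ignores.

```python
def apply_weight_diff(base, diff):
    """Recover weights as base + diff, parameter by parameter.

    Assumption: both arguments map parameter names to flat lists of floats,
    and the diff was computed as (trained - base). Real checkpoints hold
    tensors; lists keep this sketch dependency-free.
    """
    if base.keys() != diff.keys():
        raise ValueError("base and diff must contain the same parameter names")
    recovered = {}
    for name, base_values in base.items():
        diff_values = diff[name]
        if len(base_values) != len(diff_values):
            raise ValueError(f"size mismatch for parameter {name!r}")
        # Element-wise addition undoes the subtraction used to build the diff.
        recovered[name] = [b + d for b, d in zip(base_values, diff_values)]
    return recovered


# Toy example with hypothetical parameter names and values.
base = {"layer.weight": [1.0, 2.0], "layer.bias": [0.5]}
diff = {"layer.weight": [0.25, -0.5], "layer.bias": [0.0]}
recovered = apply_weight_diff(base, diff)
```

Distributing only the diff means the full weights can be rebuilt only by someone who already has the original LLaMA 7B weights.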
