---
base_model:
- huggyllama/llama-7b
language:
- en
library_name: transformers
license: other
pipeline_tag: text-generation
---

This repository contains the weights for the LLaMA-7b model. The model is distributed under a non-commercial license (see the `LICENSE` file). Use this repository only if you have been granted access to the model by filling out [this form](https://docs.google.com/forms/d/e/1FAIpQLSfqNECQnMkycAp2jP4Z9TFX0cGR4uf7b_fBxjY_OjhJILlKGA/viewform?usp=send_form) but either lost your copy of the weights or had trouble converting them to the Transformers format.

# Model Card

## Model Details

### Model Description

This is the model card of a 🤗 Transformers model that has been pushed to the Hub. This model card has been automatically generated.

- **Developed by:** Meta, Changwoo Lee, Soo Min Kwon, Qing Qu, Hun-Seok Kim
- **Model type:** Text generation
- **Language(s) (NLP):** English
- **License:** This model inherits the LLaMA license (see `LICENSE`).
- **Finetuned from model:** [huggyllama/llama-7b](https://huggingface.co/huggyllama/llama-7b)

### Model Sources

- **Repository:** https://github.com/changwoolee/BLAST
- **Paper:** Changwoo Lee, Soo Min Kwon, Qing Qu, and Hun-Seok Kim. "BLAST: Block-Level Adaptive Structured Matrices for Efficient Deep Neural Network Inference." NeurIPS 2024.

## How to Get Started with the Model

Use the code below to get started with the model.

[More Information Needed]

## Citation

**BibTeX:**

```
@inproceedings{
  lee2024blast,
  title={{BLAST}: Block-Level Adaptive Structured Matrices for Efficient Deep Neural Network Inference},
  author={Lee, Changwoo and Kwon, Soo Min and Qu, Qing and Kim, Hun-Seok},
  booktitle={The Thirty-eighth Annual Conference on Neural Information Processing Systems},
  year={2024},
}
```
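As a starting point, here is a minimal sketch of loading these weights with 🤗 Transformers. The model ID `huggyllama/llama-7b` is the base model listed in the metadata above and is an assumption here; substitute this repository's own ID if you are loading the weights from this repo.

```python
# Minimal sketch: load LLaMA-7b weights with 🤗 Transformers and run greedy generation.
# Assumption: "huggyllama/llama-7b" stands in for this repository's ID.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "huggyllama/llama-7b"  # replace with this repo's ID if different

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,  # half precision keeps a 7B model near ~14 GB
    device_map="auto",          # requires the `accelerate` package
)

prompt = "The theory of relativity states that"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output_ids = model.generate(**inputs, max_new_tokens=32, do_sample=False)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```

Loading in `float16` with `device_map="auto"` is a common choice for 7B-scale checkpoints; full `float32` loading roughly doubles the memory footprint.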