MBZUAI
/

MobiLlama-05B

@@ -14,11 +14,11 @@ datasets:
 <center><img src="MobileLLaMa.png" alt="mobillama logo" width="300"/></center>
-## Model Summary
 MobiLlama-05B is a Small Language Model with **0.5 billion** parameters. It was trained using the Amber data sources [Amber-Dataset](https://huggingface.co/datasets/LLM360/AmberDatasets).
-[Github](https://github.com/mbzuai-oryx/MobiLlama)
 ## Model Description
@@ -26,23 +26,13 @@ MobiLlama-05B is a Small Language Model with **0.5 billion** parameters. It was
 - **Language(s) (NLP):** English
 - **License:** Apache 2.0
 - **Resources for more information:**
-  - [Training Code](https://github.com/LLM360/amber-train)
   - [Data Preparation](https://github.com/LLM360/amber-data-prep)
   - [Fully processed Amber pretraining data](https://huggingface.co/datasets/LLM360/AmberDatasets)
 ## How to Use
-MobiLlama-05B has been integrated in the development version (4.37.0.dev) of `transformers`. Until the official version is released through `pip`, ensure that you are doing one of the following:
-* When loading the model, ensure that `trust_remote_code=True` is passed as an argument of the `from_pretrained()` function.
-* Update your local `transformers` to the development version: `pip uninstall -y transformers && pip install git+https://github.com/huggingface/transformers`. The previous command is an alternative to cloning and installing from the source.
-The current `transformers` version can be verified with: `pip list | grep transformers`.
-To load a specific checkpoint, simply pass a revision with a value between `"ckpt_000"` and `"ckpt_358"`. If no revision is provided, it will load `"ckpt_359"`, which is the final checkpoint.
 ```python
 from transformers import AutoModelForCausalLM, AutoTokenizer
@@ -56,22 +46,6 @@ print(tokenizer.batch_decode(outputs[:, input_ids.shape[1]:-1])[0].strip())
 ```
-## Evaluation
-| Evaluation Benchmark | MobiLlama-0.5B | MobiLlama-0.8B | MobiLlama-1.2B |
-| ----------- | ----------- | ----------- | ----------- |
-| HellaSwag | 0.5252 | 0.5409 | 0.6299 |
-| MMLU | 0.2645 | 0.2692 | 0.2423 |
-| Arc Challenge | 0.2952 | 0.3020 | 0.3455 |
-| TruthfulQA | 0.3805 | 0.3848 | 0.3557 |
-| CrowsPairs | 0.6403 | 0.6482 | 0.6812 |
-| PIQA | 0.7203 | 0.7317 | 0.7529 |
-| Race | 0.3368 | 0.3337 | 0.3531 |
-| SIQA | 0.4022 | 0.4160 | 0.4196 |
-| Winogrande | 0.5753 | 0.5745 | 0.6108 |
 ## Hyperparameters
 | Hyperparameter      | Value |
 | ----------- | ----------- |
@@ -84,6 +58,22 @@ print(tokenizer.batch_decode(outputs[:, input_ids.shape[1]:-1])[0].strip())
 | Max Seq Length   | 2048        |
 | Vocab Size | 32000 |
 ## Intended Uses
 Given the nature of the training data, the MobiLlama-05B model is best suited for prompts using the QA format, the chat format, and the code format.

 <center><img src="MobileLLaMa.png" alt="mobillama logo" width="300"/></center>
 MobiLlama-05B is a Small Language Model with **0.5 billion** parameters. It was trained using the Amber data sources [Amber-Dataset](https://huggingface.co/datasets/LLM360/AmberDatasets).
+## Model Summary
+"Bigger the better" has been the predominant trend in recent Large Language Models (LLMs) development. However, LLMs do not suit well for scenarios that require on-device processing, energy efficiency, low memory footprint, and response efficiency. These requisites are crucial for privacy, security, and sustainable deployment. This paper explores the ‘less is more’ paradigm by addressing the challenge of designing accurate yet efficient Small Language Models (SLMs) for resource-constrained devices. Our primary contribution is the introduction of an accurate and fully transparent open-source 0.5 billion (0.5B) parameter SLM, named MobiLlama, catering to the specific needs of resource-constrained computing with an emphasis on enhanced performance with reduced resource demands. MobiLlama is a SLM design that initiates from a larger model and applies a careful parameter sharing scheme to reduce both the pre-training and the deployment cost. Our work strives to not only bridge the gap in open-source SLMs but also ensures full transparency, where complete training data pipeline, training code, model weights, and over 300 checkpoints along with evaluation codes are available on our [Github](https://github.com/mbzuai-oryx/MobiLlama).
 ## Model Description
 - **Language(s) (NLP):** English
 - **License:** Apache 2.0
 - **Resources for more information:**
+  - [Training Code](https://github.com/mbzuai-oryx/MobiLlama)
   - [Data Preparation](https://github.com/LLM360/amber-data-prep)
   - [Fully processed Amber pretraining data](https://huggingface.co/datasets/LLM360/AmberDatasets)
 ## How to Use
 ```python
 from transformers import AutoModelForCausalLM, AutoTokenizer
 ```
 ## Hyperparameters
 | Hyperparameter      | Value |
 | ----------- | ----------- |
 | Max Seq Length   | 2048        |
 | Vocab Size | 32000 |
+## Evaluation
+| Evaluation Benchmark | MobiLlama-0.5B | MobiLlama-0.8B | MobiLlama-1.2B |
+| ----------- | ----------- | ----------- | ----------- |
+| HellaSwag | 52.52 | 54.09 | 62.99 |
+| MMLU | 26.45 | 26.92 | 24.23 |
+| Arc Challenge | 29.52 | 30.20 | 34.55 |
+| TruthfulQA | 38.05 | 38.48 | 35.57 |
+| CrowsPairs | 64.03 | 64.82 | 68.12 |
+| PIQA | 72.03 | 73.17 | 75.29 |
+| Race | 33.68 | 33.37 | 35.31 |
+| SIQA | 40.22 | 41.60 | 41.96 |
+| Winogrande | 57.53 | 57.45 | 61.08 |
 ## Intended Uses
 Given the nature of the training data, the MobiLlama-05B model is best suited for prompts using the QA format, the chat format, and the code format.