0xroyce committed
Commit: d318419
Parent: 1e18b1e

Update README.md

Files changed (1):
  1. README.md +11 -8
README.md CHANGED
@@ -10,7 +10,7 @@ tags:
  - conversational
  pipeline_tag: text-generation
  inference: false
- model_creator: petrroyce
+ model_creator: 0xroyce
  model_type: LLaMA
  ---
 
@@ -24,7 +24,7 @@ Valkyrie-Llama-3.1-8B-bnb-4bit is an advanced language model fine-tuned on a mix
  - **Model Size**: 8 Billion Parameters
  - **Quantization**: 4-bit (bnb, bitsandbytes)
  - **Architecture**: Transformer-based
- - **Creator**: [petrroyce](https://huggingface.co/petrroyce)
+ - **Creator**: [0xroyce](https://huggingface.co/0xroyce)
  - **License**: [Apache 2.0](https://www.apache.org/licenses/LICENSE-2.0)
 
  ## Training
@@ -36,7 +36,7 @@ Valkyrie-Llama-3.1-8B-bnb-4bit was fine-tuned on a curated dataset containing di
  - Diverse web content
  - Academic articles
 
- The training was conducted on high-performance GPUs with a focus on balancing model accuracy and inference efficiency. The 4-bit quantization allows for deployment in environments with limited computational resources without a significant loss in model performance.
+ The fine-tuning process leveraged Unsloth.ai for optimizing model performance, ensuring a well-balanced approach to both accuracy and efficiency. The 4-bit quantization allows for deployment in environments with limited computational resources without a significant loss in model performance.
 
  ## Intended Use
 
@@ -65,14 +65,15 @@ You can load and use the model with the following code:
  ```python
  from transformers import AutoModelForCausalLM, AutoTokenizer
 
- tokenizer = AutoTokenizer.from_pretrained("petrroyce/Valkyrie-Llama-3.1-8B-bnb-4bit")
- model = AutoModelForCausalLM.from_pretrained("petrroyce/Valkyrie-Llama-3.1-8B-bnb-4bit")
+ tokenizer = AutoTokenizer.from_pretrained("0xroyce/Valkyrie-Llama-3.1-8B-bnb-4bit")
+ model = AutoModelForCausalLM.from_pretrained("0xroyce/Valkyrie-Llama-3.1-8B-bnb-4bit")
 
  input_text = "Your text here"
  input_ids = tokenizer(input_text, return_tensors="pt").input_ids
 
  output = model.generate(input_ids, max_length=50)
  print(tokenizer.decode(output[0], skip_special_tokens=True))
+ ```
 
  ## Ethical Considerations
 
@@ -82,13 +83,15 @@ The Valkyrie-Llama-3.1-8B-bnb-4bit model, like all large language models, can ge
 
  If you use this model in your research or applications, please cite it as follows:
 
- @misc{petrroyce2024valkyrie,
- author = {petrroyce},
+ ```bibtex
+ @misc{0xroyce2024valkyrie,
+ author = {0xroyce},
  title = {Valkyrie-Llama-3.1-8B-bnb-4bit},
  year = {2024},
  publisher = {Hugging Face},
- howpublished = {\url{https://huggingface.co/petrroyce/Valkyrie-Llama-3.1-8B-bnb-4bit}},
+ howpublished = {\url{https://huggingface.co/0xroyce/Valkyrie-Llama-3.1-8B-bnb-4bit}},
  }
+ ```
 
  ## Acknowledgements
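
The revised training note credits Unsloth.ai for the fine-tune but does not include training code. As a rough illustration of what such a setup typically looks like, here is a hypothetical QLoRA sketch using Unsloth's `FastLanguageModel`; the base checkpoint name, sequence length, and LoRA hyperparameters are assumptions, and this is not the author's actual training code.

```python
# Hypothetical Unsloth QLoRA setup; NOT the author's training code.
from unsloth import FastLanguageModel

# Load a 4-bit Llama 3.1 8B base checkpoint (name and context length are assumptions).
model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="unsloth/Meta-Llama-3.1-8B-bnb-4bit",
    max_seq_length=2048,
    load_in_4bit=True,
)

# Attach LoRA adapters so only a small fraction of the weights is trained.
model = FastLanguageModel.get_peft_model(
    model,
    r=16,                                   # assumed LoRA rank
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj",
                    "gate_proj", "up_proj", "down_proj"],
    lora_alpha=16,
    lora_dropout=0,
)
```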
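The updated usage snippet loads the 4-bit (bitsandbytes) checkpoint with a bare `from_pretrained` call. For the constrained-hardware deployment the card describes, a more explicit variant is sketched below; the `BitsAndBytesConfig` values (`nf4`, bfloat16 compute) and `device_map="auto"` are illustrative assumptions rather than settings confirmed by the card, and may be redundant if the checkpoint already embeds its quantization config.

```python
# Sketch: loading the 4-bit checkpoint with an explicit bitsandbytes config.
# Quantization settings below are illustrative assumptions, not values from the card.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

model_id = "0xroyce/Valkyrie-Llama-3.1-8B-bnb-4bit"

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,                      # keep weights in 4-bit to fit small GPUs
    bnb_4bit_quant_type="nf4",              # assumed quantization type
    bnb_4bit_compute_dtype=torch.bfloat16,  # assumed compute dtype
)

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    quantization_config=bnb_config,
    device_map="auto",  # spread layers across available GPU(s) and CPU
)

inputs = tokenizer("Your text here", return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=50)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```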