zeeshanali01/TinyLlama-1.1B-Chat-v0.3-GGUF

zeeshanali01 committed on May 1, 2024

Commit 06947e9 · verified · 1 Parent(s): 5b94420

Update README.md

Files changed (1)

README.md +10 -6

@@ -1,16 +1,15 @@
----
-language:
-- en
-tags:
-- code
----
 # Quantized_by: Zeeshan
 # Tinyllama 1.1B Chat v0.3 - GGUF
 - Model creator: [TinyLlama](https://huggingface.co/TinyLlama)
 - Original model: [Tinyllama 1.1B Chat v0.3](https://huggingface.co/TinyLlama/TinyLlama-1.1B-Chat-v0.3)
 <!-- description start -->
 ## Description
 This repo contains GGUF format model files for [TinyLlama's Tinyllama 1.1B Chat v0.3](https://huggingface.co/TinyLlama/TinyLlama-1.1B-Chat-v0.3).
 <!-- description end -->
 <!-- README_GGUF.md-about-gguf start -->
 ### About GGUF
@@ -32,6 +31,8 @@ Here is an incomplete list of clients and libraries that are known to support GG
 <!-- README_GGUF.md-about-gguf end -->
 <!-- repositories-available start -->
 <!-- README_GGUF.md-how-to-download start -->
 ## How to download GGUF files
@@ -74,9 +75,12 @@ Do check the [TinyLlama](https://github.com/jzhang38/TinyLlama) github page for
 # Install transformers from source - only needed for versions <= v4.34
 # pip install git+https://github.com/huggingface/transformers.git
 # pip install accelerate
 import torch
 from transformers import pipeline
 pipe = pipeline("text-generation", model="TinyLlama/TinyLlama-1.1B-Chat-v0.3", torch_dtype=torch.bfloat16, device_map="auto")
 # We use the tokenizer's chat template to format each message - see https://huggingface.co/docs/transformers/main/en/chat_templating
 messages = [
     {
