MaxJeblick
/

llama2-0b-unit-test

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

MaxJeblick commited on Oct 25, 2023

Commit

a9262f5

•

1 Parent(s): 9ac0a41

Create README.md

Files changed (1) hide show

README.md +28 -0

README.md ADDED Viewed

	@@ -0,0 +1,28 @@

+Small dummy LLama2-type Model useable for Unit/Integration tests.
+Ensure that model input ids are < 100, see code below.
+```python
+from transformers import AutoConfig, AutoTokenizer, AutoModelForCausalLM
+repo_name = "MaxJeblick/llama2-0b-unit-test"
+model_name = "h2oai/h2ogpt-4096-llama2-7b-chat"
+config = AutoConfig.from_pretrained(model_name)
+config.hidden_size = 12
+config.max_position_embeddings = 32
+config.intermediate_size = 24
+config.num_attention_heads = 2
+config.num_hidden_layers = 2
+config.num_key_value_heads = 2
+config.vocab_size = 100
+tokenizer = AutoTokenizer.from_pretrained(model_name)
+model = AutoModelForCausalLM.from_config(config)
+print(model.num_parameters())  # 5340
+model.push_to_hub(repo_name, private=False)
+tokenizer.push_to_hub(repo_name, private=False)
+config.push_to_hub(repo_name, private=False)
+```