mayacinka committed on
Commit 0a3f56a · verified · 1 Parent(s): 395e87b

Update README.md

Files changed (1)
  1. README.md +27 -0
README.md CHANGED
@@ -13,6 +13,8 @@ license: apache-2.0
 
 This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
 
+Code credit: [this excellent medium blog](https://medium.com/towards-data-science/merge-large-language-models-with-mergekit-2118fb392b54)
+
 ## Merge Details
 ### Merge Method
 
@@ -44,4 +46,29 @@ base_model: CultriX/NeuralTrix-7B-dpo
 parameters:
   int8_mask: true
 dtype: bfloat16
+```
+
+# Inference
+
+```python
+# pip install transformers
+
+from transformers import AutoTokenizer
+import transformers
+import torch
+
+model = "mayacinka/NeuralZephyr-Beagle-7B"
+messages = [{"role": "user", "content": "What is a large language model?"}]
+
+tokenizer = AutoTokenizer.from_pretrained(model)
+prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
+pipeline = transformers.pipeline(
+    "text-generation",
+    model=model,
+    torch_dtype=torch.float16,
+    device_map="auto",
+)
+
+outputs = pipeline(prompt, max_new_tokens=256, do_sample=True, temperature=0.7, top_k=50, top_p=0.95)
+print(outputs[0]["generated_text"])
 ```
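
Note on the inference snippet added in this commit: with a plain string prompt, the `text-generation` pipeline returns the prompt followed by the completion in `generated_text` by default, so the final `print` echoes the templated prompt as well. Below is a minimal sketch of one way to keep only the model's reply. The helper name `completion_only` and the example strings are hypothetical, and the chat markers shown are illustrative rather than this model's actual template.

```python
def completion_only(generated_text: str, prompt: str) -> str:
    """Return the text after the prompt, or the input unchanged if the prompt is not a prefix."""
    if generated_text.startswith(prompt):
        return generated_text[len(prompt):].lstrip()
    return generated_text


# Hypothetical stand-ins for `prompt` and outputs[0]["generated_text"] from the snippet above.
example_prompt = "<|user|>\nWhat is a large language model?\n<|assistant|>\n"
example_output = example_prompt + "A large language model is a neural network trained on text."

reply = completion_only(example_output, example_prompt)
print(reply)
```

Passing `return_full_text=False` in the pipeline call is an alternative that avoids the post-processing step.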