asusevski
/

mistraloo-sft

PEFT

Safetensors

Model card Files Files and versions Community

asusevski commited on Jan 20, 2024

Commit

e6ed4f2

verified ·

1 Parent(s): 1e689ef

Upload model

Browse files

Files changed (3) hide show

README.md +37 -56
adapter_config.json +4 -4
adapter_model.safetensors +1 -1

README.md CHANGED Viewed

@@ -5,8 +5,8 @@ base_model: mistralai/Mistral-7B-v0.1
 # Model Card for Model ID
-LoRA model trained for ~11 hours on r/uwaterloo data.
-Only trained on top-level comments with the most upvotes on each post.
 ## Model Details
@@ -17,33 +17,49 @@ Only trained on top-level comments with the most upvotes on each post.
-- **Developed by:** Anthony Susevski and Alvin Li
-- **Model type:** LoRA
-- **Language(s) (NLP):** English
-- **License:** 	mit
-- **Finetuned from model [optional]:** mistralai/Mistral-7B-v0.1
 ## Uses
-Pass a post title and a post text(optional) in the style of a Reddit post into the below prompt.
-```
-prompt = f"""
-Below is an instruction that describes a task, paired with an input that provides further context. Write a response that appropriately completes the request.
-### Instruction:
-Respond to the reddit post in the style of a University of Waterloo student.
-### Input:
-{post_title}
-{post_text}
-### Response:
-```
 ## Bias, Risks, and Limitations
-No alignment training as of yet -- only SFT.
 ### Recommendations
@@ -55,42 +71,7 @@ Users (both direct and downstream) should be made aware of the risks, biases and
 Use the code below to get started with the model.
-```
-from transformers import AutoTokenizer, AutoModelForCausalLM
-import torch
-from peft import PeftModel, PeftConfig
-peft_model_id = "asusevski/mistraloo-sft"
-peft_config = PeftConfig.from_pretrained(peft_model_id)
-model = AutoModelForCausalLM.from_pretrained(peft_config.base_model_name_or_path)
-model = PeftModel.from_pretrained(model, peft_model_id).to(device)
-model.eval()
-tokenizer = AutoTokenizer.from_pretrained(
-    peft_config.base_model_name_or_path,
-    add_bos_token=True
-)
-post_title = "my example post title"
-post_text = "my example post text"
-prompt = f"""
-Below is an instruction that describes a task, paired with an input that provides further context. Write a response that appropriately completes the request.
-### Instruction:
-Respond to the reddit post in the style of a University of Waterloo student.
-### Input:
-{post_title}
-{post_text}
-### Response:
-"""
-model_input = tokenizer(prompt, return_tensors="pt").to(device)
-with torch.no_grad():
-    model_output = model.generate(**model_input, max_new_tokens=256, repetition_penalty=1.15)[0]
-output = tokenizer.decode(model_output, skip_special_tokens=True)
-```
 ## Training Details

 # Model Card for Model ID
+<!-- Provide a quick summary of what the model is/does. -->
 ## Model Details
+- **Developed by:** [More Information Needed]
+- **Funded by [optional]:** [More Information Needed]
+- **Shared by [optional]:** [More Information Needed]
+- **Model type:** [More Information Needed]
+- **Language(s) (NLP):** [More Information Needed]
+- **License:** [More Information Needed]
+- **Finetuned from model [optional]:** [More Information Needed]
+### Model Sources [optional]
+<!-- Provide the basic links for the model. -->
+- **Repository:** [More Information Needed]
+- **Paper [optional]:** [More Information Needed]
+- **Demo [optional]:** [More Information Needed]
 ## Uses
+<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
+### Direct Use
+<!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->
+[More Information Needed]
+### Downstream Use [optional]
+<!-- This section is for the model use when fine-tuned for a task, or when plugged into a larger ecosystem/app -->
+[More Information Needed]
+### Out-of-Scope Use
+<!-- This section addresses misuse, malicious use, and uses that the model will not work well for. -->
+[More Information Needed]
 ## Bias, Risks, and Limitations
+<!-- This section is meant to convey both technical and sociotechnical limitations. -->
+[More Information Needed]
 ### Recommendations
 Use the code below to get started with the model.
+[More Information Needed]
 ## Training Details

adapter_config.json CHANGED Viewed

@@ -19,14 +19,14 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "lm_head",
     "v_proj",
     "k_proj",
     "o_proj",
     "q_proj",
-    "gate_proj",
-    "up_proj",
-    "down_proj"
   ],
   "task_type": "CAUSAL_LM"
 }

   "rank_pattern": {},
   "revision": null,
   "target_modules": [
+    "down_proj",
     "v_proj",
     "k_proj",
+    "gate_proj",
     "o_proj",
     "q_proj",
+    "lm_head",
+    "up_proj"
   ],
   "task_type": "CAUSAL_LM"
 }

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:59e8ebe1499dfa8217ff170ae957904d71f19013871f7337ad1d1b67823da6ac
 size 600059184

 version https://git-lfs.github.com/spec/v1
+oid sha256:d382fd5844478693e1257ab3ea5bfb1fddc4b35eccee64f433955b11a21d04e0
 size 600059184