AdaptLLM
/

medicine-LLM-13B

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

AdaptLLM commited on Jan 2

Commit

9871a1a

•

1 Parent(s): 41b951f

Update README.md

Files changed (1) hide show

README.md +5 -5

README.md CHANGED Viewed

@@ -45,8 +45,8 @@ For example, to chat with the biomedicine model:
 ```python
 from transformers import AutoModelForCausalLM, AutoTokenizer
-model = AutoModelForCausalLM.from_pretrained("AdaptLLM/medicine-chat")
-tokenizer = AutoTokenizer.from_pretrained("AdaptLLM/medicine-chat", use_fast=False)
 # Put your input here:
 user_input = '''Question: Which of the following is an example of monosomy?
@@ -58,11 +58,11 @@ Options:
 Please provide your choice first and then provide explanations if possible.'''
-# We use the prompt template of LLaMA-2-Chat demo
-prompt = f"<s>[INST] <<SYS>>\nYou are a helpful, respectful and honest assistant. Always answer as helpfully as possible, while being safe.  Your answers should not include any harmful, unethical, racist, sexist, toxic, dangerous, or illegal content. Please ensure that your responses are socially unbiased and positive in nature.\n\nIf a question does not make any sense, or is not factually coherent, explain why instead of answering something not correct. If you don't know the answer to a question, please don't share false information.\n<</SYS>>\n\n{user_input} [/INST]"
 inputs = tokenizer(prompt, return_tensors="pt", add_special_tokens=False).input_ids.to(model.device)
-outputs = model.generate(input_ids=inputs, max_length=4096)[0]
 answer_start = int(inputs.shape[-1])
 pred = tokenizer.decode(outputs[answer_start:], skip_special_tokens=True)

 ```python
 from transformers import AutoModelForCausalLM, AutoTokenizer
+model = AutoModelForCausalLM.from_pretrained("AdaptLLM/medicine-LLM-13B")
+tokenizer = AutoTokenizer.from_pretrained("AdaptLLM/medicine-LLM-13B", use_fast=False)
 # Put your input here:
 user_input = '''Question: Which of the following is an example of monosomy?
 Please provide your choice first and then provide explanations if possible.'''
+# Simply use your input as the prompt
+prompt = user_input
 inputs = tokenizer(prompt, return_tensors="pt", add_special_tokens=False).input_ids.to(model.device)
+outputs = model.generate(input_ids=inputs, max_length=2048)[0]
 answer_start = int(inputs.shape[-1])
 pred = tokenizer.decode(outputs[answer_start:], skip_special_tokens=True)