---
language:
- en
pipeline_tag: text-generation
tags:
- upstage
- solar
- pytorch
extra_gated_prompt: >-
  Terms and Conditions


  1. You shall not redistribute the original pre-trained model.


  2. You are granted permission to use this model for your own fine-tuning purposes.


  3. You may open-source the resulting fine-tuned model with any license, including for commercial use.
extra_gated_fields:
  First Name: text
  Last Name: text
  Country: country
  Organization/Company: text
  geo: ip_location
  By clicking Submit below, I accept the terms of the license and acknowledge that the information I provide will be collected, stored, processed, and shared in accordance with the Upstage Privacy Policy: checkbox
extra_gated_button_content: Submit
---

# solar-pro-preview-pretrained

solar-pro-preview-pretrained is a pre-trained base model developed by Upstage.

## Usage

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Load the tokenizer and the model from the Hugging Face Hub.
tokenizer = AutoTokenizer.from_pretrained("upstage/solar-pro-preview-pretrained")
model = AutoModelForCausalLM.from_pretrained(
    "upstage/solar-pro-preview-pretrained",
    device_map="cuda:0",      # place the model on the first GPU
    torch_dtype="auto",       # use the dtype saved in the checkpoint
    trust_remote_code=True,   # the repository ships custom modeling code
)
```
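
Once loaded, the model can be sanity-checked with plain text completion. The snippet below is a minimal sketch: the prompt is illustrative, and since this is a base model you should expect completion-style continuations rather than chat-style answers.

```python
# Quick completion check; the prompt is illustrative, not from the model card.
prompt = "Upstage is an AI company that"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```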

## Fine-tuning

If you want to use the model for chat, please fine-tune it first. Refer to the following chat template when fine-tuning.

```python
# Generating text for a multi-turn interaction.
# For multi-turn conversations, build the context as a list of messages:

context = [
    {"role": "system", "content": "You are Solar, an AI bot by Upstage, loved by many people."},
    {"role": "user", "content": "Hi, there!"},
    {"role": "assistant", "content": "Hello, how can I help you?"},
    {"role": "user", "content": "Send me a message of support."},
]

# Render the conversation with the chat template and append the generation
# prompt so the model responds as the assistant.
prompt = tokenizer.apply_chat_template(context, tokenize=False, add_generation_prompt=True)

print("# Input")
print(prompt)

inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, use_cache=True, max_new_tokens=4096)

print("# Output")
# Decode only the newly generated tokens, skipping the echoed prompt.
print(tokenizer.decode(outputs[0, inputs["input_ids"].shape[-1]:]))
```
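
The same template also applies when preparing supervised fine-tuning data: rendering a full conversation without `add_generation_prompt` yields a training string that ends with the assistant's answer. The sketch below is only an illustration; the conversation content and the idea of feeding the string to a standard causal-LM training loop are assumptions, not an official recipe.

```python
# Minimal sketch of formatting one supervised fine-tuning example with the
# chat template above. The conversation below is illustrative.
train_conversation = [
    {"role": "system", "content": "You are Solar, an AI bot by Upstage, loved by many people."},
    {"role": "user", "content": "Send me a message of support."},
    {"role": "assistant", "content": "You are doing great. Keep going!"},
]

# Without add_generation_prompt, the assistant turn is rendered as part of the
# string, so it can serve as the training target.
train_text = tokenizer.apply_chat_template(train_conversation, tokenize=False)
print(train_text)

# Tokenized, this string can be passed to a standard causal-LM training loop.
train_ids = tokenizer(train_text, return_tensors="pt")["input_ids"]
```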