MRNH
/

llama-2-7b-coder

Text2Text Generation

text-generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

llama-2-7b-coder / README.md

MRNH's picture

Update README.md

4346eac verified 9 months ago

|

history blame contribute delete

1.49 kB

	---
	language:
	- en
	pipeline_tag: text2text-generation
	metrics:
	- f1
	tags:
	- english
	- sql
	---

	This is a fine-tuned version of LLAMA2 trained (7b) on spider, sql-create-context.

	To initialize the model:


	bnb_config = BitsAndBytesConfig(
	load_in_4bit=use_4bit,
	bnb_4bit_quant_type=bnb_4bit_quant_type,
	bnb_4bit_compute_dtype=compute_dtype,
	bnb_4bit_use_double_quant=use_nested_quant,
	)

	model = AutoModelForCausalLM.from_pretrained(
	model_name,
	quantization_config=bnb_config,
	device_map=device_map,
	trust_remote_code=True
	)


	Use the tokenizer:


	tokenizer = AutoTokenizer.from_pretrained(model_name, trust_remote_code=True)
	tokenizer.pad_token = tokenizer.eos_token
	tokenizer.padding_side = "right"

	To get the prompt:

	dataset = dataset.map(
	lambda example: {
	"input": "### Instruction: \nYou are a powerful text-to-SQL model. \
	Your job is to answer questions about a database. You are given \
	a question and context regarding one or more tables. \n\nYou must \
	output the SQL query that answers the question. \
	\n\n \
	### Dialect:\n\nsqlite\n\n \
	### question:\n\n"+ example["question"]+" \
	\n\n### Context:\n\n"+example["context"],
	"answer": example["answer"]
	}
	)


	To generate text using the model:

	output = model.generate(input["input_ids"])