---
license: apache-2.0
datasets:
- w8ay/security-paper-datasets
- TigerResearch/tigerbot-zhihu-zh-10k
pipeline_tag: text-generation
language:
- zh
---
GitHub: https://github.com/Clouditera/secgpt
## Usage
Commercial models usually impose ethical restrictions on cybersecurity questions, so this model was trained on cybersecurity data. It is based on qwen14b, with 14 billion parameters, and needs at least 30 GB of GPU memory to run; 35 GB is recommended.
Dependencies (install with `pip install transformers peft`):
- transformers
- peft
**Model loading**
```python
from transformers import AutoModelForCausalLM, AutoTokenizer
import torch
from peft import PeftModel  # only needed if you also load LoRA adapters

device = 'auto'
tokenizer = AutoTokenizer.from_pretrained("w8ay/secgpt1_5", trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    "w8ay/secgpt1_5",
    trust_remote_code=True,
    device_map=device,          # place layers across available GPUs automatically
    torch_dtype=torch.float16,  # half precision to fit the 30 GB VRAM budget
)
print("Model loaded successfully")
```
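If 30 GB of VRAM is not available, a quantized load can reduce the footprint. This is a minimal sketch, not part of the original card; it assumes the `bitsandbytes` package is installed and that the model remains usable at 8-bit precision.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

# 8-bit weights roughly halve memory versus float16 (assumes bitsandbytes is installed)
quant_config = BitsAndBytesConfig(load_in_8bit=True)

tokenizer = AutoTokenizer.from_pretrained("w8ay/secgpt1_5", trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    "w8ay/secgpt1_5",
    trust_remote_code=True,
    device_map="auto",
    quantization_config=quant_config,
)
```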
**Inference**
```python
def reformat_sft(instruction, input):
    # Wrap the query in the Alpaca-style prompt template used during fine-tuning
    if input:
        prefix = (
            "Below is an instruction that describes a task, paired with an input that provides further context. "
            "Write a response that appropriately completes the request.\n"
            f"### Instruction:\n{instruction}\n\n### Input:\n{input}\n\n### Response:"
        )
    else:
        prefix = (
            "Below is an instruction that describes a task. "
            "Write a response that appropriately completes the request.\n"
            f"### Instruction:\n{instruction}\n\n### Response:"
        )
    return prefix

query = '''介绍sqlmap如何使用'''  # "Explain how to use sqlmap"
query = reformat_sft(query, '')

generation_kwargs = {
    "top_p": 0.7,
    "temperature": 0.3,
    "max_new_tokens": 2000,
    "do_sample": True,
    "repetition_penalty": 1.1,
}

inputs = tokenizer.encode(query, return_tensors='pt', truncation=True)
inputs = inputs.cuda()  # move the input ids to the GPU
generate = model.generate(input_ids=inputs, **generation_kwargs)
output = tokenizer.decode(generate[0])
print(output)
```
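For interactive use, the output can also be streamed token by token instead of decoded all at once. This is a sketch, not from the original card; it uses transformers' built-in `TextStreamer` with the same prompt helper and generation settings as above.

```python
from transformers import TextStreamer

# Print tokens to stdout as they are generated, skipping the echoed prompt
streamer = TextStreamer(tokenizer, skip_prompt=True, skip_special_tokens=True)

inputs = tokenizer.encode(reformat_sft('''介绍sqlmap如何使用''', ''), return_tensors='pt').cuda()
model.generate(input_ids=inputs, streamer=streamer, **generation_kwargs)
```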