Update README.md

6712245 verified 8 months ago

5.04 kB

	---
	license: apache-2.0
	base_model: yanolja/EEVE-Korean-10.8B-v1.0
	tags:
	- generated_from_trainer
	model-index:
	- name: yanolja/EEVE-Korean-Instruct-10.8B-v1.0
	results: []
	---

	<p align="left">
	<img src="https://huggingface.co/yanolja/EEVE-Korean-Instruct-10.8B-v1.0/resolve/main/eeve_logo.webp" width="50%"/>
	<p>

	# EEVE-Korean-Instruct-10.8B-v1.0

	## Join Our Community on Discord!

	If you're passionate about the field of Large Language Models and wish to exchange knowledge and insights, we warmly invite you to join our Discord server. It's worth noting that Korean is the primary language used in this server. The landscape of LLM is evolving rapidly, and without active sharing, our collective knowledge risks becoming outdated swiftly. Let's collaborate and drive greater impact together! Join us here: [Discord Link](https://discord.gg/b27bAHg95m).

	## Our Dedicated Team (Alphabetical Order)
	\| Research \| Engineering \| Product Management \| UX Design \|
	\|-----------------\|-----------------\|--------------------\|--------------
	\| Myeongho Jeong \| Geon Kim \| Bokyung Huh \| Eunsue Choi \|
	\| Seungduk Kim \| Rifqi Alfi \| \| \|
	\| Seungtaek Choi \| Sanghoon Han \| \| \|
	\| \| Suhyun Kang \| \| \|

	## About the Model

	This model is a fine-tuned version of [yanolja/EEVE-Korean-10.8B-v1.0](https://huggingface.co/yanolja/EEVE-Korean-10.8B-v1.0), which is a Korean vocabulary-extended version of [upstage/SOLAR-10.7B-v1.0](https://huggingface.co/upstage/SOLAR-10.7B-v1.0). Specifically, we employed Direct Preference Optimization (DPO) based on [LLaMA-Factory](https://github.com/hiyouga/LLaMA-Factory).

	## Prompt Template
	```
	A chat between a curious user and an artificial intelligence assistant. The assistant gives helpful, detailed, and polite answers to the user's questions.
	Human: {prompt}
	Assistant:
	```
	## How to Use it
	```python
	from transformers import AutoTokenizer
	from transformers import AutoModelForCausalLM

	model = AutoModelForCausalLM.from_pretrained("yanolja/EEVE-Korean-Instruct-10.8B-v1.0")
	tokenizer = AutoTokenizer.from_pretrained("yanolja/EEVE-Korean-Instruct-10.8B-v1.0")

	prompt_template = "A chat between a curious user and an artificial intelligence assistant. The assistant gives helpful, detailed, and polite answers to the user's questions.\nHuman: {prompt}\nAssistant:\n"
	text = '한국의 수도는 어디인가요? 아래 선택지 중 골라주세요.\n\n(A) 경성\n(B) 부산\n(C) 평양\n(D) 서울\n(E) 전주'
	model_inputs = tokenizer(prompt_template.format(prompt=text), return_tensors='pt')

	outputs = model.generate(**model_inputs, max_new_tokens=256)
	output_text = tokenizer.batch_decode(outputs, skip_special_tokens=True)[0]
	print(output_text)
	```

	### Example Output
	```
	A chat between a curious user and an artificial intelligence assistant. The assistant gives helpful, detailed, and polite answers to the user's questions.
	Human: 한국의 수도는 어디인가요? 아래 선택지 중 골라주세요.

	(A) 경성
	(B) 부산
	(C) 평양
	(D) 서울
	(E) 전주
	Assistant:
	(D) 서울이 한국의 수도입니다. 서울은 나라의 북동부에 위치해 있으며, 정치, 경제, 문화의 중심지입니다. 약 1,000만 명이 넘는 인구를 가진 세계에서 가장 큰 도시 중 하나입니다. 서울은 높은 빌딩, 현대적인 인프라, 활기 문화 장면으로 유명합니다. 또한, 많은 역사적 명소와 박물관이 있어 방문객들에게 풍부한 문화 체험을 제공합니다.
	```

	### Training Data
	- Korean-translated version of [Open-Orca/SlimOrca-Dedup](https://huggingface.co/datasets/Open-Orca/SlimOrca-Dedup)
	- Korean-translated version of [argilla/ultrafeedback-binarized-preferences-cleaned](https://huggingface.co/datasets/argilla/ultrafeedback-binarized-preferences-cleaned)
	- No other dataset was used

	## Citation

	```
	@misc{cui2023ultrafeedback,
	title={UltraFeedback: Boosting Language Models with High-quality Feedback},
	author={Ganqu Cui and Lifan Yuan and Ning Ding and Guanming Yao and Wei Zhu and Yuan Ni and Guotong Xie and Zhiyuan Liu and Maosong Sun},
	year={2023},
	eprint={2310.01377},
	archivePrefix={arXiv},
	primaryClass={cs.CL}
	}
	```
	```
	@misc{SlimOrcaDedup,
	title = {SlimOrca Dedup: A Deduplicated Subset of SlimOrca},
	author = {Wing Lian and Guan Wang and Bleys Goodson and Eugene Pentland and Austin Cook and Chanvichet Vong and "Teknium" and Nathan Hoos},
	year = {2023},
	publisher = {HuggingFace},
	url = {https://huggingface.co/datasets/Open-Orca/SlimOrca-Dedup/}
	}
	```
	```
	@misc{mukherjee2023orca,
	title={Orca: Progressive Learning from Complex Explanation Traces of GPT-4},
	author={Subhabrata Mukherjee and Arindam Mitra and Ganesh Jawahar and Sahaj Agarwal and Hamid Palangi and Ahmed Awadallah},
	year={2023},
	eprint={2306.02707},
	archivePrefix={arXiv},
	primaryClass={cs.CL}
	}
	```