mt-ko-en-scitech / README.md
mjk0618's picture
Update README.md
bbe6078
metadata
language:
  - ko
  - en
metrics:
  - bleu
pipeline_tag: translation
tags:
  - science
  - technology

Model Overview

This model is fine-tuned model of "Helsinki-NLP/opus-mt-ko-en"
The model is trained with 1,198,943 Korean-English sentence pairs which mainly contains science, technology terms.

Load Model

from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

tokenizer = AutoTokenizer.from_pretrained("mjk0618/mt-ko-en-scitech")
model = AutoModelForSeq2SeqLM.from_pretrained("mjk0618/mt-ko-en-scitech")

How to use

# After loading model
sentence = "์ธ๊ณต์ง€๋Šฅ์€ ์ธ๊ฐ„์˜ ํ•™์Šต๋Šฅ๋ ฅ, ์ถ”๋ก ๋Šฅ๋ ฅ, ์ง€๊ฐ๋Šฅ๋ ฅ์„ ์ธ๊ณต์ ์œผ๋กœ ๊ตฌํ˜„ํ•˜๋ ค๋Š” ์ปดํ“จํ„ฐ ๊ณผํ•™์˜ ์„ธ๋ถ€๋ถ„์•ผ ์ค‘ ํ•˜๋‚˜์ด๋‹ค"

inputs = tokenizer(sentence, return_tensors="pt").input_ids
outputs = model.generate(inputs)[0]
translated_sentence = tokenizer.decode(outputs, skip_special_tokens=True)

print(translated_sentence)
# Artificial intelligence is one of the details of computer science that artifically implements human learning ability, reasoning ability, and perception ability.