savasy
/

mt0-large-Turkish-qa

Question Answering

text-generation

Model card Files Files and versions Community

savasy commited on Jun 21, 2023

Commit

b4fb709

•

1 Parent(s): ec5aa14

Create README.md

Files changed (1) hide show

README.md +70 -0

README.md ADDED Viewed

	@@ -0,0 +1,70 @@

+# Turkish Question Answering model based on mt0-large
+In this model, I fine-tune *mT0-large* model with the following Turkish QA datasets
+* https://huggingface.co/bigscience/mt0-large
+* https://github.com/okanvk/Turkish-Reading-Comprehension-Question-Answering-Dataset
+The model is tuned within parameter-efficient fine-tuning, which is PEFT LORA. So we need to install peft modules. Please check
+* https://github.com/huggingface/peft
+The training set size is around 11K QAs.
+Example usage for single inference is as follows:
+```
+from peft import PeftModel, PeftConfig
+peft_model_path="savasy/mt0-large-Turkish-qa"
+config = PeftConfig.from_pretrained(peft_model_path)
+model = AutoModelForSeq2SeqLM.from_pretrained(
+    config.base_model_name_or_path)
+# Load the Lora model
+inference_model = PeftModel.from_pretrained(model, peft_model_path)
+inference_model.eval()
+import numpy as np
+inference_model.to("cuda")
+test_input = '''Mustafa adını babası Ali Rıza Efendi kendi dedesinin adı olduğundan dolayı vermiştir. Çünkü Ali Rıza Efendi'nin babasının adı olan
+ Ahmed adı ağabeylerinden birisine verilmişti. Mustafa'ya neden Kemal isminin verildiğine yönelik ise çeşitli iddialar vardır.
+ Afet İnan, bu ismi ona matematik öğretmeni Üsküplü Mustafa Efendi'nin Kemal adının anlamında olduğu gibi onun "mükemmel ve olgun"
+ olduğunu göstermek için verdiğini söylemiştir. (source: wikipedia) .
+Mustafa'nın dedesinin ismi nedir ?
+ '''
+with torch.no_grad():
+  inputs = tokenizer(test_input, return_tensors="pt", padding=True).to("cuda")
+  generated_ids = inference_model.generate(**inputs)
+  outputs = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)
+outputs```
+-> [Ahmed]
+The usage for batch mode is as follows:
+```
+from peft import PeftModel, PeftConfig
+peft_model_path="savasy/mt0-large-Turkish-qa"
+config = PeftConfig.from_pretrained(peft_model_path)
+model = AutoModelForSeq2SeqLM.from_pretrained(
+    config.base_model_name_or_path)
+# Load the Lora model
+inference_model = PeftModel.from_pretrained(model, peft_model_path)
+inference_model.eval()
+inference_model.to("cuda")
+test_inputs = ["","",""] #  a list of texts. A text must have Content followed by a Question
+preds=[]
+data_loader= DataLoader(test_inputs,batch_size=8)
+from tqdm import tqdm
+with torch.no_grad():
+  for batch in tqdm(data_loader):
+    inputs = tokenizer(batch, return_tensors="pt", padding=True).to("cuda")
+    generated_ids = model.generate(**inputs)
+    outputs = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)
+    preds+=outputs
+```
+# compare preds with your expected ground-truth results