	---
	tags:
	- fastai
	- text-translation

	language: ml

	widget:
	- text: "കേൾക്കുന്ന എല്ലാ കാര്യങ്ങളും എനിക്കു മനസിലായില്ല"
	  example_title: "Malayalam Seq2Seq translation"


	---

	# Malayalam - English ULMFiT translation model (work in progress)


	[![Malayalam: Kaggle notebook](https://img.shields.io/badge/മലയാളം%20-notebook-green.svg)](https://www.kaggle.com/code/rajeshradhakrishnan/ml-ulmfit-seq2seq-translation)


	---

	# malayalam-ULMFit-Seq2Seq (Translation model)

	The malayalam-ULMFit-Seq2Seq model is pre-trained on [Malyalam_Language_Model_ULMFiT](https://github.com/goru001/nlp-for-malyalam/blob/master/language-model/Malyalam_Language_Model_ULMFiT.ipynb), a language model built with [fastai](https://docs.fast.ai/text.data.html).

	The language model is tokenized with SentencePiece using a vocabulary size of 10000 and is uploaded to a [Kaggle dataset](https://www.kaggle.com/datasets/rajeshradhakrishnan/ulmfit-fastai).

	## Usage

	```
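	# Notebook cell: install huggingface_hub with the fastai extra
	!pip install -Uqq "huggingface_hub[fastai]"

	from huggingface_hub import from_pretrained_fastai

	# Repository that hosts this model on the Hugging Face Hub
	repo_id = "hugginglearners/malayalam-ULMFit-Seq2Seq"
	learner = from_pretrained_fastai(repo_id)

	# Malayalam source sentence and its reference English translation
	original_xtext = 'കേൾക്കുന്ന എല്ലാ കാര്യങ്ങളും എനിക്കു മനസിലായില്ല'
	original_ytext = 'I didnt understand all this'
	predicted_text = learner.predict(original_xtext)
	print(f'original text: {original_xtext}')
	print(f'original answer: {original_ytext}')
	print(f'predicted text: {predicted_text}')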
	```
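
The usage snippet prints the prediction next to the reference translation for a visual check. For a rough numeric comparison, a whitespace-token F1 overlap can be computed with the standard library alone. This is only a minimal sketch and not a metric reported for this model: the helper name `token_f1` is hypothetical, and it assumes the prediction has already been reduced to a plain string.

```python
from collections import Counter


def token_f1(prediction: str, reference: str) -> float:
    """Whitespace-token F1 overlap between a prediction and a reference.

    Hypothetical helper for illustration; not part of fastai or huggingface_hub.
    """
    pred_tokens = prediction.lower().split()
    ref_tokens = reference.lower().split()
    if not pred_tokens or not ref_tokens:
        return 0.0
    # Multiset intersection: a shared token counts at most as often as it
    # appears in both strings.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


# Reference string taken from the usage snippet; the candidate strings
# are made up for illustration.
reference = "I didnt understand all this"
print(token_f1("I didnt understand all this", reference))
print(token_f1("I understand this", reference))
```

A score near 1 means the prediction shares most of its tokens with the reference. Token overlap ignores word order and synonyms, so it is only a coarse signal next to established translation metrics.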

	## Intended uses & limitations

	The model is not fine-tuned to state-of-the-art accuracy.

	## Training and evaluation data

	[Malayalam Samanantar dataset (English - Malayalam), uploaded to Kaggle](https://www.kaggle.com/datasets/rajeshradhakrishnan/ulmfit-fastai)