license: apache-2.0 | |
datasets: | |
- McGill-NLP/FaithDial | |
language: | |
- en | |
metrics: | |
- bleu | |
- bertscore | |
- accuracy | |
pipeline_tag: conversational | |
T3 stands for Terribly Tiny Transformers that are an efficient way of creating tiny distilled (student) models for hallucination-free LLM models in parameter-constrained environment (edge devices). | |
The base model is a T3 adaptation of T5 model. The paradigm of T3 models can be extended to all types of models ( encoder only, decoder only & seq2seq) |